Genome-wide meta-analysis identifies 11 new loci for anthropometric traits and provides insights into genetic architecture.
Journal: 2013/June - Nature Genetics
ISSN: 1546-1718
Abstract:
Approaches exploiting trait distribution extremes may be used to identify loci associated with common traits, but it is unknown whether these loci are generalizable to the broader population. In a genome-wide search for loci associated with the upper versus the lower 5th percentiles of body mass index, height and waist-to-hip ratio, as well as clinical classes of obesity, including up to 263,407 individuals of European ancestry, we identified 4 new loci (IGFBP4, H6PD, RSRC1 and PPP2R2A) influencing height detected in the distribution tails and 7 new loci (HNF4G, RPTOR, GNAT2, MRPS33P4, ADCY9, HS6ST3 and ZZZ3) for clinical classes of obesity. Further, we find a large overlap in genetic structure and the distribution of variants between traits based on extremes and the general population and little etiological heterogeneity between obesity subgroups.
Relations:
Content
Citations
(200)
References
(54)
Conditions
(2)
Genes
(129)
Organisms
(1)
Processes
(4)
Affiliates
(1)
Similar articles
Articles by the same authors
Discussion board
Nature genetics. Apr/30/2013; 45(5): 501-512
Published online Apr/6/2013

Genome-wide meta-analysis identifies 11 new loci for anthropometric traits and provides insights into genetic architecture

+313 authors

Abstract

Approaches exploiting extremes of the trait distribution may reveal novel loci for common traits, but it is unknown whether such loci are generalizable to the general population. In a genome-wide search for loci associated with upper vs. lower 5th percentiles of body mass index, height and waist-hip ratio, as well as clinical classes of obesity including up to 263,407 European individuals, we identified four new loci (IGFBP4, H6PD, RSRC1, PPP2R2A) influencing height detected in the tails and seven new loci (HNF4G, RPTOR, GNAT2, MRPS33P4, ADCY9, HS6ST3, ZZZ3) for clinical classes of obesity. Further, we show that there is large overlap in terms of genetic structure and distribution of variants between traits based on extremes and the general population and little etiologic heterogeneity between obesity subgroups.

Twin studies have established a strong heritable component to body mass index (BMI; h2~40-70%),1,2 and height (h2~70-90%).3 Previous meta-analyses of genome-wide association studies (GWAS) have identified 36 genetic loci associated with BMI,4-6 14 loci with waist-hip ratio adjusted for BMI (WHR) reflecting fat distribution,7,8 and 180 loci with height9, and contributed to our understanding of the genetic architecture of complex traits. However, established loci for complex traits only account for a small proportion of trait heritability, as discussed recently.10,11 Some postulated explanations for this include undiscovered low frequency variants with larger effects, imperfect tagging of causal variants, epistasis, gene-environment interaction, and phenotype heterogeneity. This has led to increasing interest in approaches exploiting extremes of the trait distribution, where there may be less locus heterogeneity, greater genetic contribution, and enrichment for highly penetrant variants. Utilization of extremes has also been proposed to improve cost-efficiency, since effect sizes may be larger, fewer subjects may be needed for genotyping, and a smaller proportion of the variance may be attributable to environmental factors. Indeed, several prior studies have used extreme designs to discovery novel loci for various complex traits, such as obesity and lipid fractions using microarray genotyping12-16 or sequencing methods.17-20 However, the few previous studies that have systematically addressed differences between genetic architecture of the overall distribution with extremes for complex traits have been small,21-23 and hence, it remains largely unknown whether genetic loci affecting the extremes are generalizable to the general population.

Studies of extremely obese individuals have reported thirteen loci at or near genome-wide significance (P<5×10-7),14-16,22-26 but not all have shown evidence of association with BMI in the general population.4,27 For example, variants in PCSK1 (rs6232) and PTER have been convincingly associated with severe obesity,14,25 but have at best shown nominal evidence of association with BMI in large-scale meta-analyses.4,28 Although it is possible that other genetic or environmental factors modify the manifestation of these variants producing an extreme phenotype only in selected individuals, it is also conceivable that the extremes are, at least in part, etiologically distinct. Within the extremes of the distribution, there may be etiologically discrete subgroups or enrichment for less common causal variants.19 Although analyzing the full distribution is generally more powerful, in cases where there is heterogeneity, analyzing extremes by case-control design may offer superior power.29

The extremes for anthropometric traits, particularly BMI, have been defined in numerous ways, including using tails of the full population distribution (e.g. >95th or >97th percentile) and absolute cutpoints (e.g. ≥40 kg/m2) based on clinical or standard references, and some studies have used a combination of definitions for their discovery and replication. The common denominator for studies addressing ’extremes’ (herein used as a more generic term) is that they have dichotomized the trait distribution and analyzed data using a case-control design. Studies suggest that the percentile cutpoint choice and ascertainment strategy utilized may impact the observed risk and subsequent power;30,31 however, the consequences of these extreme definitions on discovery and characterization of loci for complex traits have not been systematically evaluated. In the present study, we have used the term ‘tails’ to describe analyses comparing the upper and lower 5th percentiles of the trait distributions; ‘clinical classes of obesity’ to describe analyses where controls were subjects with BMI <25 kg/m2 and cases were defined as BMI ≥25 kg/m2 for overweight, BMI ≥30 kg/m2 for obesity class I, BMI ≥35 kg/m2 for obesity class II, and BMI ≥40kg/m2 for obesity class III32; and ‘extremely obese’ to describe studies using different sampling designs for selecting their extremely obese cases and controls.

The overall aim of the present study was to use and compare different distribution cutoffs for identification of genetic loci of anthropometric traits. The two specific aims were: 1) to systematically compare findings using these cutoffs with those from the full population distribution, as well as with studies utilizing a different ascertainment strategy; and 2) to draw inferences about the value of these different approaches for sampling within a population-based study. Our focus was primarily on BMI, which is a major risk factor for multiple chronic diseases and of important public health significance,33 but we also examined height and waist-hip ratio adjusted for BMI (WHR; as a measure of body fat distribution) to verify if our findings could be generalized to other traits. To address these aims, we performed a genome-wide search for genetic determinants of the tails (defined as the upper vs. lower 5th percentile of the trait distribution) of BMI, height and WHR and for comparison, clinical classes of obesity drawn from populations within the GIANT (Genetic Investigation of ANthropometric Traits) consortium. Association analyses were conducted in a study base (or sampling frame) of up to 168,267 individuals with follow-up of the 273 most significantly associated loci in a study base of up to 109,703 additional individuals. Further, systematic comparisons were conducted to assess differences in genetic inheritance and distribution of risk variants between the extremes and general population for these anthropometric traits.

Results

To first evaluate the contribution of common SNPs to the tails and clinical classes of obesity and discover new loci, we conducted meta-analyses of GWAS of six obesity-related traits (tails of BMI and WHR, overweight, obesity class I, II and III), as well as tails of height, utilizing results for ~2.8 million genotyped or imputed SNPs. Stage 1 analyses included 51 studies with study bases of 158,864 (BMI), 168,267 (height) and 100,605 (WHR) individuals of European ancestry (see Supplementary Table 1 for number of cases and controls per phenotype; Supplementary 2-5 for study characteristics). We observed an enrichment of SNPs with small P-values compared to the null distribution for all seven traits (Q-Q plots, Supplementary Fig. 1-2). The excess was diminished after exclusion of loci previously established for the overall distributions or extremes of these traits, but some enrichment remained, especially for tails of height and to a lesser extent for overweight, obesity class I and II. In total, 69 loci (defined as separated by at least 1 Mb) were associated at P<5×10-8 with at least one trait (Supplementary Fig. 3-4).

To identify and validate loci for these traits, SNPs for which associations reached P<5×10-6 in the stage 1 analyses were taken forward for follow-up (stage 2) in 12 studies with in silico GWAS data and 24 studies with Metabochip data with study bases of 109,703 (BMI), 107,740 (height) and 75,220 (WHR) (Supplementary Tables 1-5).

BMI-Related Traits

Seventeen SNPs were taken forward to stage 2 in up to 4,900 and 4,891 individuals from the upper and lower tails of BMI, respectively. Ten SNPs reached genome-wide significance (P<5×10-8) in the joint meta-analysis of stage 1 and stage 2, but all had been previously identified as loci associated with BMI in the general population.4 A total of 118 SNPs were included in stage 2 for clinical classes of obesity, which included up to 1,162 cases and 22,307 controls for obesity class III, and 65,332 cases and 39,294 controls for overweight. Of the 62 SNPs that showed P<5×10-8 in the joint meta-analyses for at least one obesity class (Supplementary Table 6), seven were novel, explaining an additional 0.09% of the variability in BMI (Supplementary Table 7). These included one locus for overweight (RPTOR), three loci for obesity class I (GNAT2, MRPS33P4, ADCY9), two loci for obesity class II (HS6ST3, ZZZ3), and one locus associated with both overweight and obesity class I (HNF4G) (Table 1, Supplementary Fig. 5-7). Although these loci were identified for specific clinical classes of obesity, all novel loci showed consistent effect direction across the tails of BMI and the other class of obesity, and most P-values were significant (P<0.007, Bonferroni-corrected for 7 SNPs), except for obesity class III and the tails of BMI (presumably due to lower statistical power for these traits; Table 2).

Among the novel obesity loci, at least four are located near genes of high biological relevance. In particular, rs7503807 for overweight, is located within the regulatory associated protein of the MTOR, complex 1 gene (RPTOR), which regulates cell growth in response to nutrient and insulin levels,34 and within 500 kb of the BAI1-associated protein 2 (BAIAP2), which encodes a brain-specific angiogenesis inhibitor (BAI1)-binding protein that regulates insulin uptake in the central nervous system. The overweight and obesity class I SNP rs4735692 is located downstream of the hepatocyte nuclear factor 4-gamma gene (HNF4G). Mutations in HNF4A, a closely related gene that forms a heterodimer with HNF4G to activate gene transcription,35 cause maturity onset diabetes of the young type 1,36 and a common variant near HNF4A was found to be associated with type 2 diabetes (T2D) in east Asians.37 The obesity class I SNP rs2531995 is located within adenylate cyclase 9 (ADCY9), which catalyzes the formation of cyclic AMP from ATP. This SNP was found to be associated with ADCY9 expression in several tissue types (Supplementary Table 8). Loci near other adenylate cyclase genes have been associated with several T2D-related traits, such as glucose homeostasis and susceptibility to T2D (ADCY5).38,39 The obesity class II SNP rs17024258 is located 207kb from the lipid-related gene sortilin (SORT1), which is expressed in multiple cell types and has been reported to be involved in insulin responsiveness in adipose cells.40 Decreased levels of sortilin have been observed in adipose tissues of morbidly obese humans and mice, and in skeletal muscle of obese mice.41 A more comprehensive summary of the biological relevance of the genes nearest to all novel loci is given in the Supplementary Note.

Tails of Height

A total of 134 SNPs from stage 1 were taken forward to stage 2 in up to 4,872 and 4,831 individuals from the upper and lower tails of height, respectively. Of the 95 SNPs that reached P<5×10-8 in the joint meta-analysis of stage 1 and stage 2 (Supplementary Table 6), four novel loci (IGFBP4, H6PD, RSRC1, PPP2R2A) were identified for tails of height (Table 1, Supplementary Fig. 8). The contribution of the four loci to the overall height variability was ≤0.02% (Supplementary Table 7).

Two of the novel loci are located near genes that seem particularly relevant to height. rs584438 is located approximately 500 bp upstream of IGFBP4, which codes for insulin-like growth factor binding protein 4, and is in linkage disequilibrium (r2=0.87) with another SNP (rs598892) that results in a synonymous amino acid change in IGFBP4. IGFBP4 binds to IGF1 and IGF2,42 which have an important role in childhood growth. In blood, this same SNP showed a significant association with the expression of TNS4 (Supplementary Table 8), which interacts with beta-catenin,43 a critical component of the canonical Wnt pathway related to bone formation.44 The height SNP rs2362965 lies 285 kb from SHOX2, a homolog to the X-linked, pseudoautosomal SHOX (short stature homeobox) gene family, which plays a major role in skeletal limb development.

Tails of Waist-Hip Ratio

Ten SNPs were taken forward to stage 2 in 3,351 and 3,352 individuals from the upper and lower tails of WHR, respectively. The four SNPs that reached genome-wide significance (P<5×10-8; Supplementary Table 6) have been previously identified as WHR loci in the general population.7

Comparisons of novel and known loci on the tails, obesity classes, and full distribution

We assessed the impact of our novel loci on the full distribution of these anthropometric traits using data from studies included in stage 1 and stage 2. In the full distribution, evidence of association (P<0.005, Bonferroni-corrected for 11 SNPs) with consistent effect direction was observed with BMI for all novel obesity-related trait loci and with height for all novel loci identified for tails of height (Table 2). None of the loci were associated with WHR, suggesting that these obesity loci are primarily associated with overall adiposity, rather than with fat distribution.

Within GIANT, we previously identified 32 loci associated with BMI.4 There is considerable overlap of samples with the current study, so it is not unexpected that we observed that the effects of all established BMI loci were directionally consistent between the prior study of overall BMI and the obesity-related traits in the present study (Supplementary Table 9). Twenty-seven out of 32 SNPs were significantly associated with the tails of BMI (P<0.0016, Bonferroni-corrected). Although only half of the SNPs were significantly associated with obesity class 3, presumably due to the smaller sample size and reduced power, the majority of SNPs were significantly associated with obesity class 2 and all with obesity class 1 and overweight.

Impact of ascertainment strategy on discovered and known loci

Effect of our novel loci in other studies of extremely obese

Both empirical16 and theoretical work29 has shown that genetic architecture may differ, the more extreme the selection, suggesting that the ascertainment strategy may impact observed results.31 To evaluate impact of ascertainment strategy, we also performed in silico look-ups of all SNPs we found to be associated with BMI-related traits in five studies that applied other ascertainment strategies for defining extremely obese (Supplementary Tables 2-5, bottom panel; total ncases=6,848; ncontrols=7,023). Four studies recruited participants from specialized clinics or hospitals based on absolute or percentile-derived cutoffs, and one study utilized liability-based (women) and standard-based (men) percentile cut-points. We performed a meta-analysis of these five studies and observed directionally consistent associations for all BMI-associated SNPs (Supplementary Table 10). The effect sizes in these extreme obesity studies were similar to those observed for tails of BMI in our analysis (Pheterogeneity>0.007 for all SNPs, Bonferroni-corrected). Four out of seven novel obesity-related loci displayed significance at P<0.007 (Bonferroni-corrected) in these extremely obese studies.

Effect of loci previously identified in extremely obese samples in our study

Previous studies of extreme childhood and/or adult obesity using different ascertainment strategies have reported genome-wide significant or near genome-wide significant associations (P<5×10-7) with FTO, MC4R, TMEM18, FAIM2, TNKS, HOXB5, OLFM4, NPC1, MAF, PTER, SDCCAG8, PCSK1 (rs6235 and rs6232) and KCNMA1.14-16,22-26 With the exception of PCSK1 (rs6232) for tails of BMI and MAF for tails of BMI and obesity class II, all associations showed consistent directions of effect across the BMI-related outcomes (Supplementary Table 11). Of the 13 loci, replication at a significance level of P<0.004 (Bonferroni-corrected) was observed for four SNPs (FTO, MC4R, TMEM18, FAIM2) for the tails of BMI and all clinical classes of obesity Two loci, MAF and KCNMA1, which have thus far only been reported for extreme obesity, were not significantly associated with any of our traits at either a Bonferroni-corrected or nominal significance threshold (P<0.05).

Empirical power comparison of the extremes and full distribution

If the extremes have different genetic inheritance or are etiologically more homogenous than the full distribution, analyzing extremes or tails of the distribution by case-control design may offer superior power. To test this empirically, we conducted meta-analyses of the full distributions of BMI and height with all studies included in stage 1 and stage 2. Only two (IGFBP4 and H6PD) out of four novel loci for tails of height reached genome-wide significance (P<5×10-8) using the full height distribution (Table 2). Four (GNAT2, ZZZ3, HNF4G, and RPTOR) out of seven novel loci identified for clinical classes of obesity achieved genome-wide significance for the full BMI distribution. The remaining loci had P-values <5×10-5 in the full distribution and thus, would likely have been detected with a larger sample size.

Systematic comparisons of the genetic inheritance and distribution of SNPs between the tails and full distribution

To investigate differences in genetic architecture between the tails and full distributions, we estimated whether the observed genetic effects in tails of BMI, height and WHR were different from what would be expected based on the full distributions of corresponding traits. To do this, we first estimated the expected effect for each SNP in the tails based on the full distribution in each study and then meta-analyzed the expected associations across studies. The Q-Q plots of P-values testing differences between the observed and expected (Fig. 1 and Supplementary Fig. 9) did not show any enrichment, indicating that effect sizes observed in tails and those expected based on the overall distribution were similar. Further, comparable results were observed for the 32 SNPs previously associated with BMI in Speliotes et al4, as well as for previously published and novel extreme obesity loci (Supplementary Table 12).

To further compare genetic inheritance of the tails with the full distribution, we used a ‘polygene approach.’45 The meta-analysis results of tails and full distribution were used to create two polygenetic scores (by summing the number of risk alleles at each SNP) in six studies (Supplementary Table 13). We found that the polygene score based on the full BMI distribution consistently explained more of the variance than the score based on the tails (e.g. 15.3% vs. 6.4% at P<0.05) (Fig. 2, Supplementary Table 14). Similar results were observed for height and WHR (Supplementary Fig. 10). On liability scale, the variance explained by the two polygene scores was similar for different BMI-related outcomes (Supplementary Fig. 11) and different percentile cutpoints used to define the tails (data not shown), suggesting that the fraction of the overall variance explained by SNPs is not influenced by the outcome categorization, but by the ability to accurately rank and estimate the beta coefficients of the association, which is better achieved by using the entire study population instead of the tails. Our results also indicate that genetic determinants for the tails are similar to those for the full distribution and that common variant loci contribute to extreme phenotypes. However, it should be noted that our analyses of the upper and lower 5 percentiles of the distribution (tails) does not necessarily extend to more extreme cut-offs, such as the top and bottom 1st percentiles.

Allelic heterogeneity at known and discovered loci

To explore enrichment for allelic heterogeneity in the tails and clinical classes of obesity, we performed conditional analyses using a method recently described by Yang et al.46 In these analyses, we found secondary signals that reached genome-wide significance (P<5×10-8) at 17 loci, including one locus for tails of BMI (FTO), 13 loci for tails of height (PTCH1 [two signals], GHSR, EDEM2, C6orf106, CRADD, EFEMP1, HHIP, FBXW11, NPR3, C2orf52, BCKDHB, EFR3B), one for tails of WHR (RSPO3), two for overweight (MC4R, FANCL), and one for obesity class I (FANCL; Supplementary Table 15). Whereas the secondary signals for tails of BMI (FTO) and WHR (RSPO3), and overweight and obesity class I (FANCL) have not been established previously, all 13 height loci identified here, as well as the MC4R locus have previously been shown to have allelic heterogeneity in the general population,7,9 suggesting that there is no enrichment in the tails for secondary signals (Supplementary Fig. 12-14).

We also looked for evidence of enrichment of unobserved low-frequency variants by conducting haplotype analyses within known and novel loci, since haplotypes constructed from common SNPs may tag low-frequency variants that are enriched in the tails of the trait distributions, but are rarer in the general population. Using genotype data from the largest studies, three signals of association were observed for tails of height that exceeded conservative prior odds of association of one in 30,000: ID4 (Bayes factor: 118,839), LIN28B (Bayes factor: 105,478) and DLEU7 (Bayes factor: 66,599) (Supplementary Table 16). However, for all three loci, association signals were characterized by two clusters of haplotypes (both common and rare) and were not consistent with an enrichment of unobserved low-frequency causal variants in the distribution tails.

Discussion

In our meta-analysis of genome-wide association studies of up to 263,407 individuals of European ancestry, we identified 165 loci associated with tails (upper vs. lower 5th percentile) of BMI, height, and WHR and/or clinical classes of obesity. Eleven of these loci have not previously been associated with anthropometric traits. Several of the novel loci were located near strong biological candidate genes, such as IGFBP4 and SHOX2 for tails of height, and HNF4G and ADCY9 for overweight/obesity class I, suggesting future areas of research. Although by using different distribution cutoffs we discovered additional loci that would not have been identified as genome-wide significant using the full distribution of the same study samples, there is no evidence to suggest that the clinical classes of obesity are etiologically distinct, and the majority of evidence indicates that the extremes share many of the same loci with the general population.

To assess the impact of different distribution cutpoints on genetic variants associated with the extremes, we chose to evaluate the 5% tails of the distribution and clinical classes of obesity, specifically obesity classes II and III. Although others have ascertained extremes differently, all variants associated with obesity-related traits in our meta-analysis were found to have directionally consistent results in five independent studies of extremely obese samples. Of the 13 loci previously identified as associated with extreme obesity,14-16,22-26 nearly all (except PCSK1 rs6232 and MAF) showed a consistent direction of effect for the tails of BMI. Only two loci (MAF and KCNMA1), originally identified for early-onset and morbid adult obesity,14,26 failed to replicate for any of our BMI-related outcomes. While it is possible that we had insufficient power if there was a substantial winner’s curse present in the initial publications, it is also conceivable that these susceptibility loci are population-specific, only contribute to risk at younger ages,47 represent false positive findings, or tag rare causal variants that are difficult to detect in population-based samples.

Since our study was based on GWAS data, we were not well suited to address the role of rare variants in extreme traits. Although the haplotype-based analyses revealed strong associations of haplotypes in three genes with tails of height, which could suggest that they are tagged by rare variants, such putative variants could not be established using our approach. The suggestion that rare variants could be more important in extremes of complex traits needs to be addressed using other designs, such as resequencing projects or using the new Exome Chip microarrays that are currently being analyzed in many large study samples.

Our systematic comparisons between extremes and full distribution yielded several important insights that also may be informative for other complex traits. When comparing observed genetic effects in tails with expected effects extrapolated from overall distributions of corresponding traits, we did not observe any systematic differences. Further, we showed that the polygene score based on the full distribution explained a larger proportion of variance than the score based on the tails. Taken together with the finding that half of our novel loci were associated at genome-wide significant level in the overall distribution, this implies that there is limited etiologic heterogeneity in these anthropometric traits. Our analysis shows that while some common variants can have larger effects in the extremes, these effects as a whole are not larger than expected based on the effects in the overall distribution. Further, while rare variants specific to the extremes may still exist, the extremes share most of the common loci with the overall distribution.

Conclusions that can be drawn from these observations are that when having access to data for the full distribution, case-control analyses using extremes can be useful to find additional loci. Although the analyzing the full distribution is generally more powerful, small amounts of heterogeneity in the distribution may allow for the identification of additional loci by analyzing the data using different cut-points, such as the tails. Further, as in most cases when resources are limited, our results indicate that a strategy with selection of individuals from the extremes for genetic analyses could be a cost-effective approach and will likely yield loci that are relevant and largely generalizable to the full population. Compatible with those of recent, smaller studies,21-23 our results show convincingly that this theoretically appealing approach also holds empirically.

In conclusion, in our large GWAS meta-analysis including up to 263,407 individuals, we identified four new loci influencing height detected at the tails, as well as seven new loci for clinical classes of obesity. Consistent with theoretical predictions and previous smaller studies, our results show that there is a large overlap in terms of genetic structure and distribution of variants between traits based on different distribution cutoffs with those from population-level studies, but additional insight may still be gained from evaluating the extremes. Our results are informative for designing future genetic studies of obesity as well as other complex traits.

Online Methods

Detailed methods descriptions are available in the Supplementary Note.

Discovery and joint meta-analyses

Study design

We conducted a two-stage study for the tails of three anthropometric traits (BMI, WHR, and height) and four clinical classes of obesity (overweight and obesity classes I, II, and III), followed by a combined analysis of the two stages. Stage 1 consisted of a meta-analysis of GWAS utilizing data from a study base (or sampling frame) of up to 168,267 adult individuals of European ancestry from 51 studies participating in the Genetic Investigation of ANthropometric Traits (GIANT) consortium (Supplementary Tables 1-5). In stage 2, 273 SNPs with P-values < 5×10-6 were followed up in up to 109,703 additional individuals of European descent, which included 67,243 individuals from 24 studies with data from the Metabochip (a custom-designed array of ~200,000 SNPs with prior evidence of suggestive association with metabolic traits), and 42,460 individuals from 12 studies with in silico replication GWAS data (Supplemental Tables 1-5). This gave us a study base of up to 276,007 individuals of European descent for the joint meta-analysis of stage 1 and stage 2. For full details about the discovery and replication stages, analysis of data, and meta-analyses, see Supplementary Note.

Phenotype definitions

The tails of the three anthropometric traits (BMI, height, and WHR) were defined as the upper 5th percentile (cases) and lower 5th percentile (controls) of the distribution stratified by sex and disease status after controlling for the following covariates: age, age2 and principal components for BMI; age and principal components for height; and age, age2, BMI and principal components for WHR. For the clinical obesity classes, cases were defined as BMI ≥25 kg/m2 for overweight, BMI ≥30 kg/m2 for obesity class I, BMI ≥35 kg/m2 for obesity class II, and BMI ≥40 kg/m2 for obesity class 3. Controls were subjects with BMI <25 kg/m2. A minimum of 30 cases and 30 controls for each study-specific stratum was required.

Association analyses and meta-analyses

Each study conducted single marker association analyses assuming an additive genetic model taking genotype imputation uncertainty into account. Analyses were stratified by sex (except for studies with related individuals where analyses accounted for family structure) and disease status for studies that ascertained participants based on a relevant disease (e.g., diabetes). Before meta-analyzing data, results from each study were extensively reviewed using standardized quality control procedures to identify potential problems, such as strand issues, discrepancies between the reported standard errors and p-values, and allele frequency differences. SNPs with poor imputation quality scores or estimated minor allele count ≤ 20 (i.e. 2 × N × minor allele frequency) in each stratum (men/women or pooled for family-based studies) of each study were removed from analysis. For the discovery stage 1, each stratum- and study-specific GWAS was corrected for genomic control. Meta-analyses were performed for each phenotype in METAL48 using the fixed effects inverse variance method based on the β estimates and standard errors from each study. The results of the discovery meta-analysis were followed by an additional genomic control correction. Similar methods were employed for the replication and joint discovery and replication analysis.

Association testing in extremely obese studies

We tested the association of all SNPs reaching P<5×10-8 in the joint analysis of stage 1 and stage 2 results for the BMI-related traits, in five studies of extremely obese individuals (Supplementary Tables 2-5). For the four case-control studies (French Extreme Obesity Study, Essen Case-Control GWAS, GEO and GOYA), a fixed effects inverse variance method was used to meta-analyze the results. A fifth study (Essen Obesity Trio GWAS) that has a nuclear family structure was meta-analyzed with the four case-control studies using a weighted z-score method that takes into account the direction but not the magnitude of the association.

Systematic comparison of the genetic structure between tails and overall distribution

For these analyses, we included all GWAS studies that provided genome-wide results for both the full distribution and tails of BMI, height and WHR. First, we used the results for the full distribution to calculate, for each genotype, the expected number of individuals in the upper and lower 5% tails. We used these values to perform a logistic regression, comparing the upper and lower tails, and obtained the ‘expected beta’ and ‘expected standard error’. Second, we tested the differences between the ‘expected betas’ and the ‘observed betas’ obtained from the meta-analyses of the tails of the distributions. The standard error of the differences was estimated as: sqrt[expected standard error ˆ2 + observed standard error ˆ2 - 2*0.65*(expected standard error* observed standard error)], where 0.65 is the correlation between ‘expected betas’ and ‘observed betas’ obtained from TWINGENE by bootstrapping. Finally, differences between ‘expected betas’ and ‘observed betas’ were meta-analyzed using the inverse variance method in METAL.

Polygene comparison of genetic determinants of the BMI tails and overall distribution

Within each trait (BMI, height and WHR), we aimed to compare variance explained in tails of the trait by two genetic scores (polygene scores) obtained from (1) the meta-analyses of the tails of the trait and (2) the meta-analyses of the full distribution. To make the scores comparable, we limited the polygene score construction to the studies that provided genome-wide meta-analysis results for both tails and overall distribution. After LD filtering (using r2≥0.05 and 1 Mb distance) and excluding SNPs present in <50% of samples, we created polygene scores, as weighted sum of risk alleles, using the method proposed by the International Schizophrenia Consortium.45 For the BMI analysis, the association between the polygene scores and tails of BMI was investigated in four samples of extremely obese and two independent cohort studies using the same definition of tails (Supplementary Table 13). Only the two independent cohorts were used for the height and WHR analysis. To estimate the phenotypic variance explained, we fit logistic or linear regression models including age, sex, study-specific covariates and the polygene score as predictors, and tails of the trait or overall trait as outcomes, in separate models. The phenotypic variance explained by the polygene scores was defined as the difference in R2 (linear regression) or Nagelkerke R2 (logistic regression) between these models and a basic model including only age, sex and study-specific covariates as predictors.

Secondary signals analysis

To identify potential secondary signals, we utilized the approximate conditional and joint analysis proposed by Yang et al,46 which uses summary-level statistics and the LD structure from a reference sample to approximate conditional p-values. The meta-analysis results for each trait were analyzed separately with LD correction between SNPs estimated from 6,654 unrelated individuals from the ARIC cohort.

Haplotype-based analyses

Using data from ten of the largest studies, we tested the association between the tails of height, BMI and WHR and haplotypes across each established and novel locus separately for males and females within each study, using GENEBPM.49 The haplotypes were estimated from GWAS SNP data by means of an expectation-maximization algorithm and then clustered according to their allelic similarity. Within a logistic regression-modelling framework, haplotypes within the same cluster were assigned the same allelic effect, reducing the required number of parameters. Markov-chain Monte Carlo techniques were employed to sample over the space of haplotype clusters and regression model parameters. Evidence in favour of a haplotype association with the trait was assessed by summing log10 Bayes factors across studies.

eQTL analyses

We examined the cis associations between SNPs that reached genome-wide significance (P < 5 × 10-8) and expression of nearby genes in multiple tissues from 5 studies described previously: 1) subcutaneous adipose tissue (n=603) and whole blood (n=747) from deCode50; 2) lymphoblastoid cell lines (n=830) from a childhood asthma study51; 3) liver (n=707), subcutaneous fat (n=870) and omental fat (n=916) tissue from a bariatric surgery study52; 4) subcutaneous abdominal (N=52) and gluteal (N=62) adipose tissue and whole blood (n=65) from MolOBB53; and 5) cortical brain tissue (n=193) survey study.54 SNPs were tested for cis associations with transcripts within 500 kb or 1 Mb, assuming an additive effect of the BMI allele or using an ANOVA test with study-specific p-value thresholds used to account for multiple testing. Conditional analyses were performed for all expression data, except for cortical tissue, by conditioning the trait-associated SNP on the most significant cis-associated SNP for that particular gene transcript and vice versa.

Supplementary Material

Figure 1

Q-Q plot of the −log10 p-values for the difference between the observed association for the tails of BMI and expected association based on the overall BMI distribution.

Figure 2

Variance in extreme obesity explained by common genetic variants

The phenotypic variance explained is higher when SNPs with lower degrees of significance are included in the polygenetic prediction model. The y-axis represents the proportion of variance explained (Nagelkerke R2) of extreme obesity in six studies not included in the discovery meta-analysis. In panel A, the prediction model was based on the results from the stage I meta-analysis of tails of BMI. The thicker lines represent the weighted average; 95% confidence intervals are reported as double-headed arrows. In panel B, the prediction model was based on BMI from the full distribution (modified version of the previous GIANT meta-analysis by Speliotes et al4). * Essen Obesity Study was not adjusted by age.

Table 1

Novel loci reaching genome-wide significance (P < 5 × 10-8) for the tails of anthropometric traits and clinical classes of obesity

Stage 1Stage 2*Stage 1 + Stage 2

SNPChrPositionNearby geneEffect
allele
Other
allele
Effect
allele freq
No.
cases
No.
controls
ORPNo.
cases
No.
controls
ORPNo.
cases
No.
controls
ORP
Height tails
rs5844381735852698IGFBP4ca0.617783078501.181.11E-09181418141.190.001964496641.185.22E-12
rs666250919240191H6PDtc0.1463546254611.232.21E-06361535661.233.37E-05907790271.233.19E-10
rs23629653159592073RSRC1/SHOX2ta0.50798979931.141.45E-07481947751.100.00212808127681.122.14E-09
rs1594829826261994PPP2R2Act0.7688669366971.185.51E-07416641151.110.0110859108121.153.88E-08

Obesity Class 2
rs79893361395815549HS6ST3ag0.47049825621141.125.88E-091664171131.040.2511489792261.101.06E-08
rs17381664177820919ZZZ3ct0.39239833621141.117.61E-085351338411.050.0415184959551.092.85E-08

Obesity Class 1
rs170242581109948844GNAT2tc0.036418662384271.231.41E-068956154711.281.12E-0627618538981.258.66E-12
rs4735692876778218HNF4Gag0.583432675656971.075.03E-0822086383521.040.005547611040491.062.48E-09
rs130411262050526403MRPS33P4tc0.717932020640151.073.05E-0722088375951.040.007541081016101.062.16E-08
rs2531995163953468ADCY9tc0.614632433655421.063.17E-066680166021.070.00439113821441.074.04E-08

Overweight
rs4735692876778218HNF4Gag0.583992703656981.056.13E-0965323392901.030.0031580261049881.043.51E-10
rs75038071776205706RPTORac0.565492855657231.044.20E-0664535388131.030.00091573901045361.041.98E-08

*Stage 2 consists of studies with either GWAS and Metabochip data. Not all SNPs were present on Metabochip.

Table 2

Association results for novel genome-wide significant (P < 5 × 10-8) SNPs with height and obesity-related traits

SNPGeneEffect
allele
Other
allele
BMI tailsObesity class 3Obesity class 2Obesity class 1OverweightBMI
(continuous)a
Height tailsHeight
(continuous)a

ORPORPORPORPORPEffectP-valueORPEffectP-value
Height tails
rs584438IGFBP4ca0.980.521.020.641.010.471.000.751.000.590.0050.221.185.22E-120.0259.43E-11
rs6662509H6PDtc1.000.951.110.071.010.830.990.340.990.35-0.0060.271.233.19E-100.0317.76E-12
rs2362965RSRC1/SHOX2ta0.950.020.970.250.980.200.990.370.990.21-0.0070.051.122.14E-090.0177.07E-08
rs1594829PPP2R2Act1.030.331.060.111.010.581.000.821.010.320.0040.331.153.88E-080.0164.29E-05

Obesity Class 2
rs7989336HS6ST3ag1.090.00011.110.00061.101.06E-081.049.38E-051.042.33E-060.0168.80E-061.000.89-0.0010.71
rs17381664ZZZ3ct1.080.00011.125.41E-051.092.85E-081.056.80E-081.042.23E-070.0222.50E-111.060.0050.0100.004

Obesity Class 1
rs17024258GNAT2tc1.270.021.450.0021.267.73E-051.258.66E-121.131.41E-080.0674.34E-141.210.080.0100.36
rs4735692HNF4Gag1.091.97E-051.080.0061.050.00051.062.48E-091.043.51E-100.0199.94E-101.020.500.0060.10
rs13041126MRPS33P4tc1.080.0011.050.161.060.00021.062.16E-081.041.43E-060.0178.52E-071.020.560.0020.65
rs2531995ADCY9tc1.060.011.090.0061.060.0011.074.04E-081.035.57E-050.0216.58E-081.070.0050.0181.87E-06

Overweight
rs4735692HNF4Gag1.091.97E-051.080.0061.050.00051.062.48E-091.043.51E-100.0199.94E-101.020.500.0060.10
rs7503807RPTORac1.087.07E-051.139.44E-061.072.46E-061.051.12E-071.041.98E-080.0203.00E-100.990.55-0.0010.85

aThe beta represents the difference in standardized effects.

Appendix

Author contributions

Steering committee (oversaw the consortium)

Goncalo R Abecasis, Themistocles Assimes, Ines Barroso, Sonja Berndt, Michael Boehnke, Ingrid Borecki, Panos Deloukas, Caroline Fox, Tim Frayling, Leif Groop, Talin Haritunian, Iris Heid, David Hunter, Erik Ingelsson, Robert C Kaplan, Ruth JF Loos, Mark McCarthy, Karen Mohlke, Kari E North, Jeffrey R O’Connell, David Schlessinger, David Strachan, Unnur Thorsteinsdottir, Cornelia van Duijn

Writing group (drafted and edited manuscript)

Sonja I Berndt, Mary F Feitosa, Andrea Ganna, Stefan Gustafsson, Erik Ingelsson, Anne E Justice, Cecilia M Lindgren, Ruth JF Loos, Reedik Mägi, Mark McCarthy, David Meyre, Keri L Monda, Andrew P Morris, Kari E North, André Scherag, Elizabeth K Speliotes, Eleanor Wheeler, Cristen J Willer

Data cleaning and preparation

Sonja I Berndt, Damien C Croteau-Chonka, Felix R Day, Tõnu Esko, Tove Fall, Teresa Ferreira, Stefan Gustafsson, Iris Heid, Erik Ingelsson, Anne U Jackson, Hana Lango Allen, Cecilia M Lindgren, Jian’an Luan, Reedik Mägi, Joshua C Randall, André Scherag, Elizabeth K Speliotes, Gudmar Thorleifsson, Sailaja Vedantam, Thomas W Winkler, Andrew R Wood

Statistical Advisors

Sang Hong Lee, Benjamin M Neale, Yudi Pawitan, Peter M Visscher, Jian Yang, Dan-Yu Lin, Yi-Juan Hu

Gene-expression (eQTL) analyses

Liming Liang, William O Cookson, Miriam F Moffatt, Goncalo R Abecasis, Valgerdur Steinthorsdottir, Gudmar Thorleifsson, Josine L Min, George Nicholson, Fredrik Karpe, Mark I McCarthy, Eric E Schadt

Project design, management and coordination of contributing studies

Stage 1 – Genome-wide association studies

(ADVANCE) Themistocles L Assimes, Carlos Iribarren; (AGES) Vilmundur Gudnason, Tamara B Harris, Lenore J Launer; (ARIC) Eric Boerwinkle, Kari E North; (B58C) David P Strachan; (BRIGHT study) Mark J Caulfield, Patricia B Munroe; (CAPS) Erik Ingelsson; (CHS) Barbara McKnight, Bruce M Psaty; (CoLaus) Vincent Mooser, Peter Vollenweider, Gérard Waeber; (COROGENE) Markku S Nieminen, Juha Sinisalo; (deCODE) Kari Stefansson, Unnur Thorsteinsdottir; (DGI) Leif C Groop, Joel N Hirschhorn; (EGCUT) Andres Metspalu; (EPIC) Kay-Tee Khaw, Ruth JF Loos, Nicholas J Wareham; (ERF) Ben A Oostra, Cornelia M van Duijn; (FamHS) Ingrid B Borecki, Michael A Province; (Fenland) Ruth JF Loos, Nicholas J Wareham; (FRAM) L A Cupples, Caroline S Fox; (FUSION GWAS) Michael Boehnke, Karen L Mohlke; (Genmets) Antti Jula, Samuli Ripatti, Veikko Salomaa; (GerMIFS1) Jeanette Erdmann, Heribert Schunkert; (GerMIFS2) Christian Hengstenberg, Klaus Stark; (GOOD) Claes Ohlsson; (HBCS) Johan G Eriksson; (KORA S3) H.- E Wichmann; (KORA S4) Christian Gieger, Thomas Illig, Wolfgang Koenig, Annette Peters; (MGS) Pablo V Gejman, Douglas F Levinson; (MICROS (SOUTH TYROL)) Peter P Pramstaller; (MIGEN) Joel N Hirschhorn, Sekar Kathiresan; (NESDA) Brenda Penninx; (NFBC 1966) Marjo-Riitta Jarvelin; (NHS) Lu Qi; (Nijmegen Biomedical Study) Lambertus A Kiemeney; (NSPHS) Ulf Gyllensten; (NTR) Dorret I Boomsma; (ORCADES) James F Wilson, Alan F Wright; (PLCO) Stephen J Chanock, Sonja I Berndt; (PROCARDIS) Martin Farrall, Hugh Watkins; (RS-I) Fernando Rivadeneira, André G Uitterlinden; (RUNMC) Lambertus A. Kiemeney; (SardiNIA) David Schlessinger; (SASBAC) Erik Ingelsson; (SHIP) Henri Wallaschofski; (Sorbs) Michael Stumvoll, Anke Tönjes; (TwinsUK) Tim D Spector; (VIS) Igor Rudan; (WGHS) Paul M Ridker; (WTCC-T2D) Mark I McCarthy; (WTCCC-CAD) Anthony J Balmforth, Alistair S Hall, Nilesh J Samani; (YFS) Mika Kähönen, Terho Lehtimäki, Olli Raitakari, Jorma Viikari

Stage 2 – Metabochip/in silico replication

(AMC-PAS) Kees G Hovingh; (B58C) Chris Power; (BHS) Lyle J Palmer; (DILGOM) Kari Kuulasmaa, Veikko Salomaa; (DPS) Matti Uusitupa; (DR’s EXTRA) Timo A Lakka, Rainer Rauramaa; (EPIC, Fenland and Ely) Claudia Langenberg, Ruth JF Loos, Nicholas J Wareham; (FIN-D2D 2007) Sirkka M Keinanen-Kiukaanniemi, Timo E Saaristo; (GLACIER) Paul W Franks; (Go-DARTS (Dundee)) Andrew D Morris, Colin NA Palmer; (HNR) Karl-Heinz Jöckel; (HUNT 2) Kristian Hveem; (Hypergenes) Daniele Cusi; (IMPROVE) Ulf de Faire, Anders Hamsten, Elena Tremoli; (KORA S3) Iris Heid; (LifeLines Cohort Study) Harold Snieder, Melanie M Van der Klauw, Bruce HR Wolffenbuttel; (LURIC) Bernhard O Boehm, Winfried März, Bernhard R Winkelmann; (METSIM) Johanna Kuusisto, Markku Laakso; (MORGAM) Philippe Amouyel, Paolo Brambilla, Marco M Ferrario, Jean Ferrières, Frank Kee, David-Alexandre Tregouet, Jarmo Virtamo; (NSHD) Diana Kuh; (PIVUS) Erik Ingelsson; (PLCO2) Sonja I Berndt, Stephen J Chanock; (PREVEND) Pim van der Harst; (QIMR) Nicholas G Martin, Grant W Montgomery, Andrew Heath, Pamela Madden; (RS-II) Albert Hofman, Joyce BJ van Meurs; (RS-III) Cornelia M Van Duijn, Jacqueline CM Witteman; (Swedish Twin Reg.) Erik Ingelsson; (THISEAS / AMCPAS / CARDIOGENICS) Panos Deloukas; (THISEAS) George V Dedoussis; (TRAILS) Albertine J Oldehinkel; (Tromsø 4) Inger Njølstad; (TWINGENE) Erik Ingelsson; (ULSAM) Erik Ingelsson; (Whitehall II) Aroon Hingorani, Mika Kivimäki; (WTCC-T2D) Mark I McCarthy, Cecilia M Lindgren

Other contributing studies: clinically extremes

(French Extreme Obesity Study) David Meyre, Philippe Froguel; (GEO-IT) Anna Maria Di Blasio; (Essen Obesity Study, Essen Case-Control & Essen Obesity Trio GWAS) Johannes Hebebrand, Anke Hinney; (GOYA) Thorkild IA Sørensen, Ellen A Nohr

Genotyping of contributing studies

Stage 1 – Genome-wide association studies

(ADVANCE) Devin Absher; (ARIC) Eric Boerwinkle; (B58C) Wendy L McArdle; (CAPS) Henrik Grönberg; (CHS) Talin Haritunians; (CoLaus) Vincent Mooser; (COROGENE) Markus Perola; (EGCUT) Tõnu Esko, Lili Milani; (EPIC) Inês Barroso; (ERF) Ben A Oostra, Cornelia M van Duijn; (FamHS) Ingrid B Borecki, Michael A Province, Aldi T Kraja; (Fenland) Jian’an Luan; (Genmets) Samuli Ripatti; (GOOD) Claes Ohlsson, John-Olov Jansson, Mattias Lorentzon; (HBCS) Aarno Palotie, Elisabeh Widén; (MGS) Pablo V Gejman, Alan R Sanders; (MICROS (SOUTH TYROL)) Andrew A Hicks; (NHS) Frank B Hu, David Hunter; (NSPHS) Åsa Johansson; (NTR and NESDA) Jouke- Jan Hottenga; (ORCADES) James F Wilson; (PLCO) Stephen J Chanock, Kevin B Jacobs; (RS-I) Fernando Rivadeneira, André G Uitterlinden, Karol Estrada, Carolina Medina-Gomez; (SardiNIA) Mariano Dei; (SASBAC) Per Hall, Jianjun Liu; (SHIP) Georg Homuth; (TwinsUK) Massimo Mangino, So-Youn Shin, Nicole Soranzo; (VIS) Caroline Hayward, Veronique Vitart; (WGHS) Daniel I Chasman; (WTCC-T2D) Mark I McCarthy; (WTCCC-CAD) Anthony J Balmforth, Alistair S Hall, Nilesh J Samani; (YFS) Terho Lehtimäki

Stage 2 – Metabochip/in silico replication

(BHS) Lyle J Palmer, John Beilby; (CARDIOGENICS) Sarah Edkins, Sarah E Hunt; (DPS) Amy J Swift; (DR’s EXTRA) Mustafa Atalay; (EPIC, Fenland and Ely) Jian’an Luan, Ken K Ong; (FIN-D2D 2007) Peter S Chines; (FUSION) Francis S Collins, Jouko Saramies, Jaakko Tuomilehto; (GLACIER) Inês Barroso, Sarah Edkins; (Go-DARTS (Dundee)) Colin NA Palmer; (HNR) Thomas W Mühleisen; (HUNT 2) Narisu Narisu; (Hypergenes) Francesca Frau; (KORA S3) Harald Grallert; (KORA S4) Thomas Illig; (LifeLines Cohort Study) Harold Snieder, Bruce HR Wolffenbuttel, Marcel Bruinenberg. Lude Franke; (NSHD) Diana Kuh, Ken K Ong, Andrew Wong; (PIVUS) Erik Ingelsson, Lars Lind; (PLCO2) Stephen J Chanock, Kevin B Jacobs, Zhaoming Wang; (PREVEND) Pim van der Harst, Folkert W Asselbergs; (QIMR) Nicholas G Martin, Grant W Montgomery, Andrew C Heath, Pamela A Madden; (RS-II) Marjolein Peters, Mariano Dei; (Swedish Twin Reg.) Erik Ingelsson, Patrik K Magnusson, Nancy Pedersen; (THISEAS / AMCPAS / CARDIOGENICS) Kathleen Stirrups; (TRAILS) Albertine J Oldehinkel, Ilja M Nolte, Jana V Van Vliet-Ostaptchouk; (Tromsø 4) Lori L Bonnycastle; (TWINGENE) Erik Ingelsson, Anders Hamsten, Nancy Pedersen; (ULSAM) Erik Ingelsson; (Whitehall II) Claudia Langenberg; (WTCC-T2D) Mark I McCarthy

Other contributing studies: clinically extremes

(French Extreme Obesity Study) David Meyre, Philippe Froguel; (GEO-IT) Davide Gentilini; (GOYA) Lavinia Paternoster, David M Evans

Phenotyping of contributing studies

Stage 1 – Genome-wide association studies

(ARIC) Eric Boerwinkle; (B58C) David P Strachan; (BRIGHT study) John M Connell; (CAPS) Henrik Grönberg; (CHS) Bruce M Psaty; (CoLaus) Peter Vollenweider, Gérard Waeber; (COROGENE) Juha Sinisalo, Marja-Liisa Lokki; (EGCUT) Andres Metspalu, Krista Fischer; (EPIC) Ruth JF Loos; (ERF) Ben A Oostra, Cornelia M van Duijn; (FamHS) Ingrid B Borecki, Michael A Province, Mary F Feitosa; (Fenland) Ruth JF Loos; (FRAM) Caroline S Fox; (Genmets) Antti Jula, Veikko Salomaa; (GerMIFS1) Stefan Schreiber; (GerMIFS2) Annette Peters; (GOOD) Claes Ohlsson, John-Olov Jansson, Mattias Lorentzon, Liesbeth Vandenput; (MGS) Pablo V Gejman, Alan R Sanders, Douglas F Levinson; (NFBC 1966) Anna-Liisa Hartikainen, Jaana H Laitinen, Anneli Pouta; (NHS) Lu Qi; (Nijmegen Biomedical Study) Femmie de Vegt, Martin den Heijer, Sita H Vermeulen; (NSPHS) Åsa Johansson, Ulf Gyllensten; (NTR and NESDA) Gonneke Willemsen; (ORCADES) Harry Campbell, Sarah H Wild; (PLCO) Sonja I Berndt; (RS-I) Fernando Rivadeneira, André G Uitterlinden; (SASBAC) Per Hall; (SHIP) Sabine Schipf; (Sorbs) Anke Tönjes; (TwinsUK) Massimo Mangino, Tim D Spector; (VIS) Ozren Polasek; (WTCC-T2D) Mark I McCarthy; (WTCCC-CAD) Anthony J Balmforth, Alistair S Hall, Nilesh J Samani; (YFS) Mika Kähönen, Olli Raitakari, Jorma Viikari

Stage 2 – Metabochip/in silico replication

(AMC-PAS) Hanneke Basart, Mieke D Trip; (B58C) Chris Power, Elina Hyppönen; (BHS) Lyle J Palmer, John Beilby, Arthur W Musk; (DPS) Jaana Lindström; (EPIC, Fenland and Ely) Ruth JF Loos; (GLACIER) Paul W Franks, Dmitry Shungin; (Go-DARTS (Dundee)) Colin NA Palmer, Andrew D Morris; (Hypergenes) Daniele Cusi, Paolo Manunta; (KORA S3) Barbara Thorand; (KORA S4) Annette Peters; (LifeLines Cohort Study) Bruce HR Wolffenbuttel, Melanie M Van der Klauw; (METSIM) Alena Stančáková, Pablo V Gejman; (NSHD) Diana Kuh; (PIVUS) Erik Ingelsson, Lars Lind; (PLCO2) Sonja I Berndt; (PREVEND) Gerjan Navis; (QIMR) Nicholas G Martin, Andrew Heath, Pamela Madden; (RS-II) M Carola Zillikens; (RS-III) Jacqueline CM Witteman; (Swedish Twin Reg.) Erik Ingelsson, Patrik K Magnusson, Nancy Pedersen; (THISEAS) Maria Dimitriou, Eirini V Theodoraki; (TRAILS) Ronald P Stolk; (TWINGENE) Erik Ingelsson, Nancy Pedersen; (ULSAM) Erik Ingelsson; (Whitehall II) Meena Kumari; (WTCC-T2D) Mark I McCarthy

Other contributing studies: clinically extremes

(Essen Obesity Study, Essen Case-Control GWAS & Essen Obesity Trio GWAS) Johannes Hebebrand, Anke Hinney; (GEO-IT) Antonio Liuzzi, Stefano Signorini; (GOYA) Thorkild IA Sørensen, Ellen A Nohr

Analyses of contributing studies

Stage 1 – Genome-wide association studies

(ADVANCE) Lindsay L Waite; (AGES) Albert V Smith; (ARIC) Kari E North, Anne E Justice, Keri L Monda; (B58C) David P Strachan; (BRIGHT study) Toby Johnson; (CAPS) Erik Ingelsson, Reedik Mägi; (CHS) Barbara McKnight, Guo Li; (CoLaus) Diana Marek; (COROGENE) Markus Perola; (deCODE) Valgerdur Steinthorsdottir, Gudmar Thorleifsson; (DGI) Elizabeth K Speliotes, Sailaja Vedantam; (EGCUT) Krista Fischer, Tõnu Esko, Evelin Mihailov; (EPIC) Jing Hua Zhao; (ERF) Najaf Amin; (FamHS) Mary F Feitosa; (Fenland) Jian’an Luan; (FRAM) L A Cupples, Nancy L Heard-Costa, Julius S Ngwa; (Genmets) Ida Surakka; (GerMIFS1) Michael Preuss; (GerMIFS2) Inke R König; (GOOD) Claes Ohlsson, John-Olov Jansson, Mattias Lorentzon, Liesbeth Vandenput; (HBCS) Niina Eklund; (KORA S3) Claudia Lamina; (KORA S4) Eva Albrecht; (MGS) Douglas F Levinson, Jianxin Shi; (MICROS (SOUTH TYROL)) Jennifer E Huffman, Åsa Johansson; (MIGEN) Elizabeth K Speliotes, Sailaja Vedantam; (NFBC 1966) Anneli Pouta, Reedik Magi, Joshua C Randall; (NHS) Lu Qi, Tsegaselassie Workalemahu; (NSPHS) Åsa Johansson; (NTR and NESDA) Jouke- Jan Hottenga, Marleen H. de Moor; (ORCADES) Åsa Johansson; (PLCO) Kevin B Jacobs, Sonja I Berndt; (PROCARDIS) Martin Farrall, Anuj Goel, John F Peden; (RS-I) Fernando Rivadeneira, Karol Estrada, Carolina Medina-Gomez; (SardiNIA) Jennifer L Bragg-Gresham, Serena Sanna; (SASBAC) Erik Ingelsson, Reedik Mägi; (SEARCH) Jonathan Tyrer; (SHIP) Alexander Teumer; (Sorbs) Reedik Mägi, Inga Prokopenko; (TwinsUK) Massimo Mangino, Nicole Soranzo; (VIS) Åsa Johansson; (WGHS) Daniel I Chasman; (WTCC-T2D) Cecilia M Lindgren, Reedik Magi, Joshua C Randall; (WTCCC-CAD) Reedik Magi, Joshua C Randall; (WTCCC-NBS (UKBS-CC)) Antony P Attwood, Reedik Magi, Joshua C Randall, Jennifer G Sambrook, Jonathan C Stephens; (YFS) Olli Raitakari, Terho Lehtimäki

Stage 2 – Metabochip/in silico replication

(B58C) Elina Hyppönen, Teresa Ferreira; (BHS) Gemma Cadby; (DILGOM) Kati Kristiansson; (DPS) Anne U Jackson; (DR’s EXTRA) Anne U Jackson; (EPIC, Fenland and Ely) Jian’an Luan, Ken K Ong; (FIN-D2D 2007) Anne U Jackson; (FUSION) Anne U Jackson; (GLACIER) Paul W Franks, Dmitry Shungin; (HNR) Sonali Pechlivanis, Carolin Pütter; (HUNT 2) Anne U Jackson; (Hypergenes) Francesca Frau, Zoltán Kutalik; (IMPROVE) Rona J Strawbridge; (KORA S3) Iris Heid, Thomas W Winkler; (KORA S4) Martina Müller-Nurasyid; (LifeLines Cohort Study) Marcel Bruinenberg, Lude Franke; (LURIC) Marcus E Kleber; (METSIM) Anne U Jackson; (NSHD) Andrew Wong, Jian’an Luan; (PIVUS) Erik Ingelsson, Stefan Gustafsson; (PLCO2) Sonja I Berndt, Zhaoming Wang; (PREVEND) Pim van der Harst, Irene Mateo Leach; (QIMR) Sarah E Medland, Jian Yang; (RS-II) Marjolein Peters; (Swedish Twin Reg.) Erik Ingelsson, Stefan Gustafsson; (THISEAS / AMCPAS / CARDIOGENICS) Stavroula Kanoni; (TRAILS) Ilja M Nolte, Jana V Van Vliet-Ostaptchouk; (Tromsø 4) Anne U Jackson; (TWINGENE) Erik Ingelsson, Stefan Gustafsson; (ULSAM) Erik Ingelsson, Andrea Ganna, Stefan Gustafsson; (WTCC-T2D) Reedik Magi, Teresa Ferreira

Other contributing studies: clinically extremes

(French Extreme Obesity Study) David Meyre, Cecile Lecoeur, Boris Skrobek; (GEO-IT) Anna Maria Di Blasio, Davide Gentilini; (Essen Obesity Study, Essen Case-Control GWAS & Essen Obesity Trio GWAS) André Scherag, Ivonne Jarick; (GOYA) Lavinia Paternoster, David M Evans

Footnotes

Competing interest statements

Kari Stefansson, Valgerdur Steinthorsdottir, Gudmar Thorleifsson and Unnur Thorsteinsdottir are employed by deCODE Genetics with stock/stock options in the company. Inês Barroso and spouse own stock in Incyte Ltd and GlaxoSmithKline. Eric E. Schadt is Chief Scientific Officier, Pacific Biosciences. Thorkild IA Sørensen has collaborated with Nestlé Research Centre and DSM, The Netherlands, on metabolomics and genomics of obesity and related traits, respectively.

Acknowledgments

A full list of acknowledgments appears in the Supplementary Note.

Aarno Koskelo Foundation; Academy of Finland; Agency for Science, Technology and Research of Singapore; Australian National Health and Medical Research Council; Australian Research Council; BDA Research; BioSHaRE Consortium; British Heart Foundation; Cedars-Sinai Board of Governors’ Chair in Medical Genetics; Centre for Clinical Research at the University of Leipzig; Centre of Excellence in Genomics and University of Tartu; Chief Scientist Office of the Scottish Government; City of Kuopio and Social Insurance Institution of Finland; Department of Educational Assistance, University and Research of the Autonomous Province of Bolzano; Donald W. Reynolds Foundation; Dutch Ministry for Health, Welfare and Sports; Dutch Ministry of Education, Culture and Science; Dutch BBRMI-NL; Dutch Brain Foundation; Dutch Centre for Medical Systems Biology; Dutch Diabetes Research Foundation; Dutch Government Economic Structure Enhancing Fund; Dutch Inter University Cardiology Institute; Dutch Kidney Foundation; Dutch Ministry of Economic Affairs; Dutch Ministry of Justice; Dutch Research Institute for Diseases in the Elderly; Eleanor Nichols endowments; Emil Aaltonen Foundation; Erasmus Medical Center and Erasmus University; Estonian Government; European Commission; European Regional Development Fund; European Research Council; European Science Foundation; Faculty of Biology and Medicine of Lausanne; Finland’s Slot Machine Association; Finnish Cultural Foundation; Finnish Diabetes Research Foundation; Finnish Foundation for Cardiovascular Research; Finnish Funding Agency for Technology and Innovation; Finnish Heart Association; Finnish Medical Society; Finnish Ministry of Education and Culture; Finnish Ministry of Health and Social Affairs; Finnish National Institute for Health and Welfare; Finnish Social Insurance Institution; Finska Läkaresällskapet; Folkhälsan Research Foundation; Foundation for Life and Health in Finland; French Ministry of Research; French National Research Agency; Genetic Association Information Network; German Diabetes Association; German Federal Ministry of Education and Research; German Ministry of Cultural Affairs; German National Genome Research Network; German Research Foundation; GlaxoSmithKline; Göteborg Medical Society; Greek General Secretary of Research and Technology; Gyllenberg Foundation; Health Care Centers in Vasa, Närpes and Korsholm; Heinz Nixdorf Foundation; Helmholtz Zentrum München, German Research Center for Environmental Health; Icelandic Heart Association; Icelandic Parliament; Intramural Research Program of the Division of Cancer Epidemiology and Genetics, National Cancer Institute, NIH; Italian Ministry of Education, Universities and Research; Italian Ministry of Health; Juho Vainio Foundation; Juvenile Diabetes Research Foundation International; Knut and Alice Wallenberg Foundation; Kuopio, Tampere and Turku University Hospital Medical Funds; Leducq Foundation; Lundberg Foundation; March of Dimes; Munich Center of Health Sciences as part of LMUinnovativ; Municipal Health Care Center and Hospital in Jakobstad; Municipality of Rotterdam; Närpes Health Care Foundation; National Alliance for Research on Schizophrenia and Depression Young Investigator Awards; Netherlands Genomics Initiative; Netherlands organization for health research and development; NHSBT; National Institute of Health; Nordic Center of Cardiovascular Research; Nordic Center of Excellence in Disease Genetics; Nordic Centre of Excellence on Systems biology in controlled dietary interventions and cohort studies; Northern Netherlands Collaboration of Provinces; Novo Nordisk Foundation; Ollqvist Foundation; Orion-Farmos Research Foundation; Paavo Nurmi Foundation; Päivikki and Sakari Sohlberg Foundation; Perklén Foundation; Petrus and Augusta Hedlunds Foundation; Province of Groningen; Republic of Croatia Ministry of Science, Education and Sport; Reynold’s Foundation; Royal Society; Samfundet Folkhälsan; Signe and Ane Gyllenberg foundation; Sigrid Juselius Foundation; Social Ministry of the Federal State of Mecklenburg-West Pomerania; Sophia Foundation for Medical Research; South Tyrolean Sparkasse Foundation; Southern California Diabetes Endocrinology Research Center; Stockholm County Council; Strategic Cardiovascular Program of Karolinska Institutet; Strategic support for epidemiological research at Karolinska Institutet; Susan G. Komen Breast Cancer Foundation; Swedish Ministry for Higher Education; Swedish Cancer Society; Swedish Cultural Foundation in Finland; Swedish Diabetes Association; Swedish Foundation for Strategic Research; Swedish Heart-Lung Foundation; Swedish Medical Research Council; Swedish Ministry of Education; Swedish Research Council; Swedish Royal Academy of Science; Swedish Society for Medical Research; Swedish Society of Medicine; Swiss National Science Foundation; Tampere Tuberculosis Foundation; The Great Wine Estates of the Margaret River region of Western Australia; The Paul Michael Donovan Charitable Foundation; Torsten and Ragnar Söderberg Foundation; UK Cancer Research; UK Diabetes Association; UK Heart Foundation; UK Medical Research Council; UK National Institute for Health Research, Biomedical Research Centre; UK West Anglia Primary and Community Care; University Medical Center Groningen and University of Groningen; Västra Götaland Foundation; VU University: Institute for Health and Care Research and Neuroscience Campus Amsterdam; Wellcome Trust; Yrjö Jahnsson Foundation.

References

  • 1. MaesHHNealeMCEavesLJGenetic and environmental factors in relative body weight and human adiposityBehav Genet27325511997[PubMed][Google Scholar]
  • 2. StunkardAJFochTTHrubecZA twin study of human obesityJama2565141986[PubMed][Google Scholar]
  • 3. SilventoinenKHeritability of adult body height: a comparative study of twin cohorts in eight countriesTwin research63994082003[PubMed][Google Scholar]
  • 4. SpeliotesEKAssociation analyses of 249,796 individuals reveal 18 new loci associated with body mass indexNat Genet42937482010[PubMed][Google Scholar]
  • 5. OkadaYCommon variants at CDKAL1 and KLF9 are associated with body mass index in east Asian populationsNat Genet4430262012[PubMed][Google Scholar]
  • 6. WenWMeta-analysis identifies common variants associated with body mass index in east AsiansNat Genet44307112012[PubMed][Google Scholar]
  • 7. HeidIMMeta-analysis identifies 13 new loci associated with waist-hip ratio and reveals sexual dimorphism in the genetic basis of fat distributionNat Genet42949602010[PubMed][Google Scholar]
  • 8. LindgrenCMGenome-wide association scan meta-analysis identifies three Loci influencing adiposity and fat distributionPLoS Genet5e10005082009[PubMed][Google Scholar]
  • 9. Lango AllenHHundreds of variants clustered in genomic loci and biological pathways affect human heightNature46783282010[PubMed][Google Scholar]
  • 10. LeeSHWrayNRGoddardMEVisscherPMEstimating missing heritability for disease from genome-wide association studiesAm J Hum Genet882943052011[PubMed][Google Scholar]
  • 11. ZukOHechterESunyaevSRLanderESThe mystery of missing heritability: Genetic interactions create phantom heritabilityProc Natl Acad Sci U S A2012[Google Scholar]
  • 12. DuncanELGenome-wide association study using extreme truncate selection identifies novel genes affecting bone mineral density and fracture riskPLoS Genet7e10013722011[PubMed][Google Scholar]
  • 13. EdmondsonACDense genotyping of candidate gene loci identifies variants associated with high-density lipoprotein cholesterolCirc Cardiovasc Genet4145552011[PubMed][Google Scholar]
  • 14. MeyreDGenome-wide association study for early-onset and morbid adult obesity identifies three new risk loci in European populationsNat Genet4115792009[PubMed][Google Scholar]
  • 15. ScheragATwo new Loci for body-weight regulation identified in a joint analysis of genome-wide association studies for early-onset extreme obesity in French and german study groupsPLoS Genet6e10009162010[PubMed][Google Scholar]
  • 16. BradfieldJPA genome-wide association meta-analysis identifies new childhood obesity lociNat Genet44526312012[PubMed][Google Scholar]
  • 17. CohenJCMultiple rare alleles contribute to low plasma levels of HDL cholesterolScience305869722004[PubMed][Google Scholar]
  • 18. EmondMJExome sequencing of extreme phenotypes identifies DCTN4 as a modifier of chronic Pseudomonas aeruginosa infection in cystic fibrosisNat Genet4488692012[PubMed][Google Scholar]
  • 19. HarismendyOPopulation sequencing of two endocannabinoid metabolic genes identifies rare and common regulatory variants associated with extreme obesity and metabolite levelGenome Biol11R1182010[PubMed][Google Scholar]
  • 20. RomeoSPopulation-based resequencing of ANGPTL4 uncovers variations that reduce triglycerides and increase HDLNat Genet3951362007[PubMed][Google Scholar]
  • 21. ChanYCommon variants show predicted polygenic effects on height in the tails of the distribution, except in extremely short individualsPLoS Genet7e10024392011[PubMed][Google Scholar]
  • 22. CotsapasCCommon body mass index-associated variants confer risk of extreme obesityHum Mol Genet18350272009[PubMed][Google Scholar]
  • 23. PaternosterLGenome-wide population-based association study of extremely overweight young adults--the GOYA studyPLoS One6e243032011[PubMed][Google Scholar]
  • 24. HinneyAGenome wide association (GWA) study for early onset extreme obesity supports the role of fat mass and obesity associated gene (FTO) variantsPLoS One2e13612007[PubMed][Google Scholar]
  • 25. BenzinouMCommon nonsynonymous variants in PCSK1 confer risk of obesityNat Genet4094352008[PubMed][Google Scholar]
  • 26. JiaoHGenome wide association study identifies KCNMA1 contributing to human obesityBMC Med Genomics4512011[PubMed][Google Scholar]
  • 27. den HoedMEvaluation of common genetic variants identified by GWAS for early onset and morbid obesity in population-based samplesInt J Obes (Lond)2012E-pub ahead of print[Google Scholar]
  • 28. WillerCJSix new loci associated with body mass index highlight a neuronal influence on body weight regulationNat Genet4125342009[PubMed][Google Scholar]
  • 29. PutterCMissing heritability in the tails of quantitative traits? A simulation study on the impact of slightly altered true genetic modelsHum Hered72173812011[PubMed][Google Scholar]
  • 30. WilliamsPTQuantile-specific penetrance of genes affecting lipoproteins, adiposity and heightPLoS One7e287642012[PubMed][Google Scholar]
  • 31. GueyLTPower in the phenotypic extremes: a simulation study of power in discovery and replication of rare variantsGenet Epidemiol2011E-pub ahead of print[Google Scholar]
  • 32. WHOObesity: preventing and managing the global epidemic. Report of a WHO ConsultationWHO Technical Report Series 894World Health OrganizationGeneva2000[Google Scholar]
  • 33. KumanyikaSKPopulation-based prevention of obesity: the need for comprehensive promotion of healthful eating, physical activity, and energy balance: a scientific statement from American Heart Association Council on Epidemiology and Prevention, Interdisciplinary Committee for Prevention (formerly the expert panel on population and prevention science)Circulation118428642008[PubMed][Google Scholar]
  • 34. SarbassovDDSabatiniDMRedox regulation of the nutrient-sensitive raptor-mTOR pathway and complexJ Biol Chem2803950592005[PubMed][Google Scholar]
  • 35. DaigoKProteomic analysis of native hepatocyte nuclear factor-4alpha (HNF4alpha) isoforms, phosphorylation status, and interactive cofactorsJ Biol Chem286674862011[PubMed][Google Scholar]
  • 36. NakajimaHHepatocyte nuclear factor-4 alpha gene mutations in Japanese non-insulin dependent diabetes mellitus (NIDDM) patientsRes Commun Mol Pathol Pharmacol94327301996[PubMed][Google Scholar]
  • 37. ChoYSMeta-analysis of genome-wide association studies identifies eight new loci for type 2 diabetes in east AsiansNat Genet4467722012[PubMed][Google Scholar]
  • 38. DupuisJNew genetic loci implicated in fasting glucose homeostasis and their impact on type 2 diabetes riskNat Genet42105162010[PubMed][Google Scholar]
  • 39. SaxenaRGenetic variation in GIPR influences the glucose and insulin responses to an oral glucose challengeNat Genet4214282010[PubMed][Google Scholar]
  • 40. ShiJKandrorKVSortilin is essential and sufficient for the formation of Glut4 storage vesicles in 3T3-L1 adipocytesDev Cell9991082005[PubMed][Google Scholar]
  • 41. KaddaiVInvolvement of TNF-alpha in abnormal adipocyte and muscle sortilin expression in obese mice and humansDiabetologia52932402009[PubMed][Google Scholar]
  • 42. ZhangMParacrine overexpression of IGFBP-4 in osteoblasts of transgenic mice decreases bone turnover and causes global growth retardationJournal of bone and mineral research18836432003[PubMed][Google Scholar]
  • 43. LiaoYCChenNTShihYPDongYLoSHUp-regulation of C-terminal tensin-like molecule promotes the tumorigenicity of colon cancer through beta-cateninCancer Res69456362009[PubMed][Google Scholar]
  • 44. MilatFNgKWIs Wnt signalling the final common pathway leading to bone formation?Mol Cell Endocrinol31052622009[PubMed][Google Scholar]
  • 45. PurcellSMCommon polygenic variation contributes to risk of schizophrenia and bipolar disorderNature460748522009[PubMed][Google Scholar]
  • 46. YangJConditional and joint multiple-SNP analysis of GWAS summary statistics identifies additional variants influencing complex traitsNat Genet44369752012[PubMed][Google Scholar]
  • 47. KilpelainenTOBinghamSAKhawKTWarehamNJLoosRJAssociation of variants in the PCSK1 gene with obesity in the EPIC-Norfolk studyHum Mol Genet1834965012009[PubMed][Google Scholar]
  • 48. WillerCJLiYAbecasisGRMETAL: fast and efficient meta-analysis of genomewide association scansBioinformatics26219012010[PubMed][Google Scholar]
  • 49. MorrisAPDirect analysis of unphased SNP genotype data in population-based association studies via Bayesian partition modelling of haplotypesGenet Epidemiol29911072005[PubMed][Google Scholar]
  • 50. EmilssonVGenetics of gene expression and its effect on diseaseNature45242382008[PubMed][Google Scholar]
  • 51. DixonALA genome-wide association study of global gene expressionNat Genet39120272007[PubMed][Google Scholar]
  • 52. ZhongHYangXKaplanLMMolonyCSchadtEEIntegrating pathway analysis and genetics of gene expression for genome-wide association studiesAm J Hum Genet86581912010[PubMed][Google Scholar]
  • 53. MinJLCoexpression Network Analysis in Abdominal and Gluteal Adipose Tissue Reveals Regulatory Genetic Loci for Metabolic Syndrome and Related PhenotypesPLoS Genet8e10025052012[PubMed][Google Scholar]
  • 54. MyersAJA survey of genetic human cortical gene expressionNat Genet39149492007[PubMed][Google Scholar]
Collaboration tool especially designed for Life Science professionals.Drag-and-drop any entity to your messages.