Multitrait genome association analysis identifies new susceptibility genes for human anthropometric variation in the GCAT cohort

Iván Galván-Femenía; Mireia Obón-Santacana; David Piñeyro; Marta Guindo-Martinez; Xavier Duran; Anna Carreras; Raquel Pluvinet; Juan Velasco; Laia Ramos; Susanna Aussó; J M Mercader; Lluis Puig; Manuel Perucho; David Torrents; Victor Moreno; Lauro Sumoy; Rafael de Cid

doi:10.1136/jmedgenet-2018-105437

Article Text

Complex traits

Original article

Multitrait genome association analysis identifies new susceptibility genes for human anthropometric variation in the GCAT cohort

Iván Galván-Femenía1,
Mireia Obón-Santacana1,2,
David Piñeyro3,
Marta Guindo-Martinez4,
Xavier Duran1,
Anna Carreras1,
Raquel Pluvinet3,
Juan Velasco1,
Laia Ramos3,
Susanna Aussó3,
J M Mercader5,6,
Lluis Puig7,
Manuel Perucho8,
David Torrents4,9,
Victor Moreno2,10,
Lauro Sumoy3,
http://orcid.org/0000-0003-3579-6777Rafael de Cid1

¹ GenomesForLife-GCAT Lab Group, Program of Predictive and Personalized Medicine of Cancer (PMPPC), Germans Trias i Pujol Research Institute (IGTP), Crta. de Can Ruti, Badalona, Catalunya, Spain
² Unit of Biomarkers and Susceptibility, Cancer Prevention and Control Program, Catalan Institute of Oncology (ICO), IDIBELL and CIBERESP, Barcelona, Spain
³ High Content Genomics and Bioinformatics Unit, Program of Predictive and Personalized Medicine of Cancer (PMPPC), Germans Trias i Pujol Research Institute (IGTP), Badalona, Catalunya, Spain
⁴ Life Sciences - Computational Genomics, Barcelona Supercomputing Center (BSC-CNS), Joint BSC-CRG-IRB Research Program in Computational Biology, Barcelona, Spain
⁵ Programs in Metabolism and Medical & Population Genetics, Broad Institute of Harvard and MIT, Cambridge, Massachusetts, US
⁶ Diabetes Unit and Center for Human Genetic Research, Massachusetts General Hospital, Boston, Massachusetts, US
⁷ Blood Division, Banc de Sang i Teixits, Barcelona, Spain
⁸ Cancer Genetics and Epigenetics Group, Program of Predictive and Personalized Medicine of Cancer (PMPPC), Germans Trias i Pujol Research Institute (IGTP), Badalona, Catalunya, Spain
⁹ ICREA, Catalan Institution for Research and Advanced Studies, Barcelona, Catalunya, Spain
¹⁰ Department of Clinical Sciences, Faculty of Medicine, University of Barcelona, Barcelona, Spain

Correspondence to Dr Rafael de Cid, GCAT lab Group, Program of Predictive and Personalized Medicine of Cancer (PMPPC), Germans Trias i Pujol Research Institute (IGTP), Crta. de Can Ruti, Badalona 08916, Spain; rdecid{at}igtp.cat

Abstract

Background Heritability estimates have revealed an important contribution of SNP variants for most common traits; however, SNP analysis by single-trait genome-wide association studies (GWAS) has failed to uncover their impact. In this study, we applied a multitrait GWAS approach to discover additional factor of the missing heritability of human anthropometric variation.

Methods We analysed 205 traits, including diseases identified at baseline in the GCAT cohort (Genomes For Life- Cohort study of the Genomes of Catalonia) (n=4988), a Mediterranean adult population-based cohort study from the south of Europe. We estimated SNP heritability contribution and single-trait GWAS for all traits from 15 million SNP variants. Then, we applied a multitrait-related approach to study genome-wide association to anthropometric measures in a two-stage meta-analysis with the UK Biobank cohort (n=336 107).

Results Heritability estimates (eg, skin colour, alcohol consumption, smoking habit, body mass index, educational level or height) revealed an important contribution of SNP variants, ranging from 18% to 77%. Single-trait analysis identified 1785 SNPs with genome-wide significance threshold. From these, several previously reported single-trait hits were confirmed in our sample with LINC01432 (p=1.9×10⁻⁹) variants associated with male baldness, LDLR variants with hyperlipidaemia (ICD-9:272) (p=9.4×10⁻¹⁰) and variants in IRF4 (p=2.8×10⁻⁵⁷), SLC45A2 (p=2.2×10⁻¹³⁰), HERC2 (p=2.8×10⁻¹⁷⁶), OCA2 (p=2.4×10⁻¹²¹) and MC1R (p=7.7×10⁻²²) associated with hair, eye and skin colour, freckling, tanning capacity and sun burning sensitivity and the Fitzpatrick phototype score, all highly correlated cross-phenotypes. Multitrait meta-analysis of anthropometric variation validated 27 loci in a two-stage meta-analysis with a large British ancestry cohort, six of which are newly reported here (p value threshold <5×10⁻⁹) at ZRANB2-AS2, PIK3R1, EPHA7, MAD1L1, CACUL1 and MAP3K9.

Conclusion Considering multiple-related genetic phenotypes improve associated genome signal detection. These results indicate the potential value of data-driven multivariate phenotyping for genetic studies in large population-based cohorts to contribute to knowledge of complex traits.

gwas
cohort
complex traits
multitrait
phenome

This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/.

https://doi.org/10.1136/jmedgenet-2018-105437

Statistics from Altmetric.com

Request Permissions

If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.

Introduction

Common disorders cause 85% of deaths in the European Union (EU).1 The increasing incidence and prevalence of cancer, cardiovascular diseases, chronic respiratory diseases, diabetes and mental illness represent a challenge that leads to extra costs for the healthcare system. Moreover, as European population is getting older, this scenario will be heightened in the next few years. Like complex traits, many common diseases are complex inherited conditions with genetic and environmental determinants. Advancing in their understanding requires the use of multifaceted and long-term prospective approaches. Cohort analyses provide an exceptional tool for dissecting the architecture of complex diseases by contributing knowledge for evidence-based prevention, as exemplified by the Framingham Heart Study2 or the European Prospective Investigation into Cancer and Nutrition cohort study.3

In the last decades, high performance DNA genotyping technology has fuelled genomic research in large cohorts, having been the most promising line in research on the aetiology of most common diseases. Genome-wide association studies (GWAS) have provided valuable information for many single conditions.4 Despite the perception of the limitations of the GWAS analyses, efforts combining massive data deriving from whole-genome sequencing at population scale with novel conceptual and methodological analysis frameworks have been set forth to explore the last frontier of the missing heritability issue,5 driving the field of genomic research on complex diseases to a new age.6Pritchard and colleagues recently proposed the breakthrough idea of the omnigenic character of genetic architecture of diseases and complex traits.7 They suggested that beyond a handful of driver genes (ie, core genes) directly connected to an illness, the missing heritability could be accounted for by multiple genes (ie, peripheral genes) not clustered in functional pathways, but dispersed along the genome, explaining the pleiotropy frequently seen in most complex traits. Core genes have been already outlined by the GWAS approach, but most of the possible contributing genes have been disregarded based on methodological issues such as p value or lower minor allele frequency (MAF). Pathway disturbances have also been a landmark in the search for genetic associations,8 but not always appear to the root of the mechanism of inheritance of complex diseases, at least for peripheral genes.7 With this challenging vision, a multitrait genome association analysis of the whole phenome9 becomes a more appropriate way to detect peripheral gene variation effects and new network disturbances affecting core genes. Multitrait analysis approaches are developed for research of genetically complex conditions using raw or summary-level data statistics from GWAS in order to explain the largest possible amount of the covariation between SNPs and traits.10–15

The contribution of total genetic variation, known as heritability (broad-sense heritability, h ²), is estimated now from genome-wide studies in large cohorts directly from SNP data (known as h²SNP). However, even if most disease conditions have a strong genetic basis, it is well known that our capacity to find genetic effects depends on the overall genetic contribution of the trait. Overall estimations differed depending on the ancestry, sample ascertainment, gender and age of the population under study. Recently, data from the UK Biobank determined genetic contributions with a phenome-based approach16 and identified a shared familial environment as a significant important factor besides genetic heritability values in 12 common diseases analysed.17

In this study, we present new data on phenotype-wide estimation of the heritability of 205 complex traits (including diseases) and new insights into the genetics of anthropometric traits in a Mediterranean Caucasian population using a two-stage meta-analysis approach with multiple-related phenotypes (MRPs).

Materials and methods

Population

The methodology of the GCAT study has been previously described.18 Briefly, the subjects of the present study are part of the GCAT project, a prospective study that includes a cohort of a total of 19 267 participants recruited from the general population of Catalonia, a western Mediterranean region in the Northeast of Spain. Healthy general population volunteers between 40 and 65 years with the sole condition of being users of the Spanish National Health Service were invited to be part of the study mostly through the Blood and Tissue Bank, a public agency of the Catalan Department of Health. All eligible participants signed an informed consent agreement form and answered a comprehensive epidemiological questionnaire. Anthropometric measures and blood samples were also collected at baseline by trained healthcare personnel. The GCAT study was approved by the local ethics committee (Germans Trias University Hospital) in 2013 and started on 2014.

Study participants

This study analyses the GCATcore data, a subset of 5459 participants (3066 women) with genotype data belonging to the interim GCATdataset, August 2017 (see the URLs section). GCATcore participants were randomly selected from whole cohort based on overall demographic distribution (ie, gender, age, residence). In this study, in order to increase the robustness of heritability estimates, only Caucasian participants with a Spanish origin (based on principal component analysis (PCA) analysis, see later in this section) and with available genetic data were finally included: 4988 GCAT participants (2777 women). All samples passed genotyping quality control (QC) (see later in this section).

Phenome

Baseline variables were obtained from a self-reported epidemiological questionnaire and included biological traits, medical diagnoses, drug use, lifestyle habits and sociodemographic and socioeconomic variables.18 Description of GCAT variables dataset is available at GCAT (see the URLs section). To keep as many as possible of the genotyped samples in the study, we imputed anthropometric missing values (<1%) from the overall distribution values using statistical approaches. Missing values (<1%) for biological and anthropometric measures (height, weight, waist and hip circumference, systolic and diastolic blood pressure and heart rate) were imputed by stratifying the whole GCAT cohort by gender and age and using multiple imputation by the fully conditional specification method, implemented in the R mice package.19 For GWAS analysis, we retained all variables with at least five observations (n=205). For heritability estimates, only variables with at least 500 individuals per class were retained (n=96) for robustness. The description of the traits and measures included in this study is summarised in online supplementary table S1.

Supplementary file 1

[SP1.pdf]

Genotyping, relatedness and population structure

Genotyping of the 5459 GCAT participants (GCATcore) was done using the Infinium Expanded Multi-Ethnic Genotyping Array (MEGA^Ex) (ILLUMINA, San Diego, California, USA). A customised cluster file was produced from the entire sample dataset and used for joint calling. We applied PCA to detect any hidden substructure and the method of moments for the estimation of identity by descent probabilities to exclude cases with cryptic relatedness. The extensive QC protocol used for cluster analysis and call filtering is accessible at GCAT (see the URLs section) and presented as supplementary material (online supplementary file S1). Briefly, GCAT participants were excluded from the analysis for different reasons, including poor call rate <0.94 (n=61), gender mismatch (n=19), duplicates (n=8), family relatedness up to second degree (n=88) and excess or loss of heterozygosity (n=52). Non-Caucasian individuals detected as outliers in the PCA plot of the European populations from the 1000 Genomes Project (n=96) and born outside of Spain (n=147) were also excluded from the study. After QC and filtering, 4988 GCAT participants and 1 652 023 genetic variants were included. Genotyping was performed at the PMPPC-IGTP High Content Genomics and Bioinformatics Unit.

Supplementary file 2

[SP2.pdf]

Multipanel imputation

For imputation analysis, 665 592 SNPs were included (40%). Sexual and mitochondrial chromosomes were discarded as well as autosomal chromosome variants with MAF <0.01 and AT-CG sites. We followed a two-stage imputation procedure, which consists of prephasing the genotypes into whole chromosome haplotypes followed by imputation itself.20 The prephasing was performed using SHAPEIT2, and genotype imputation was performed with IMPUTE2. As reference panels for genotype imputation, we used the 1000 Genomes Project phase 3,21 the Genome of the Netherlands,22 UK10K23 and the Haplotype Reference Consortium.24 All variants with IMPUTE2 info <0.7 were removed. After imputing the genotypes using each reference panel separately, we combined the results selecting the variants with a higher info score when they were present in more than one reference panel. The SNP dosage from IMPUTE2 was transformed to binary PLINK format by using the ‘-hard-call-threshold 0.1’ flag from PLINK. The final core set had approximately 15 million variants with MAF>0.001 and 9.5 million variants with MAF>0.01. Imputation was performed at the Barcelona Supercomputing Center.

Heritability

Trait SNP heritability (h² _SNP) was estimated from SNP/INDEL array/imputed data with the GREML-LDMS method implemented in the GCTA software.25 Since this method is relatively unbiased regarding MAF and linkage disequilibrium (LD) parameters, we considered autosomal variants with MAF>0.001 (15 060 719 SNPs) to avoid under/overestimation of heritability due to the relatively small sample analysed in the core study. Cryptic relatedness of distant relatives was also considered, and individuals whose relatedness in the genetic relationship matrix was >0.025 were discarded (n=4717). Population stratification was controlled in the linear mixed model using the first 20 principal components of the PCA derived from population genetic structure analysis of the GCAT. Gender and age were also included as covariates in the model. The h² _SNPCIs were calculated by using FIESTA.26

Single-trait genome-wide association analysis

We performed independent GWAs analyses for 205 selected traits (61 continuous and 144 binary). A total of 9 499 600 SNPs with MAF>0.01 were considered for this purpose. Linear regression models for continuous traits were assessed with PLINK.27 For binary traits, given the unbalanced design of most of the traits considered, we used a scoring test with saddle point approximation included in the SPAtest R package.28 This approach compensates a slight loss of power with the inclusion of uncommon and rare conditions, without affecting robustness. All the models included the first 20 PCAs, age and gender as covariates. A PCA-mixed analysis was applied to approximate the number of independent traits29 (online supplementary figure S1). Based on these figures, Bonferroni correction for multiple traits was defined at p<5×10⁻¹⁰ accounting for 100 independent traits explaining 80% of the phenome variability.

Supplementary file 3

[SP3.pdf]

Multitrait meta-analysis for correlated traits

We applied a multitrait approach for the analysis of anthropometric traits (weight, height, body mass index (BMI) and waist and hip circumference) in a two-stage association study using individuals of British ancestry from the UK Biobank cohort (N=336 107).30 Waist-to-hip ratio was excluded from this analysis due to its unavailability from the UK Biobank resource. UK Biobank summary-level statistics was calculated using linear regression models with the inferred gender and the first 10 PCAs as covariates, similarly to the model applied on GCAT data (see the URLs section). All SNPs with suggestive association p<1x10⁻⁵ for any trait were retained from the GCAT GWAS analysis. Then, only SNPs intersecting with the UK Biobank resource were used for multitrait meta-analysis association testing in both samples, and p<5x10⁻⁹was considered significant. The multitrait association testing was based on the distribution of the sum of squares of the z scores which is insensitive to the direction of the scores.31 Briefly, let Z = ( , , …, ) be the z scores for a given SNP for k phenotypes. The sum of squares of the z scores, , can be approximated by the χ² distribution ( ). Let Σ be the covariance matrix of the genome-wide z scores from the phenotypes under analysis. And let be the eigenvalues of Σ , the distribution of is well approximated by , where a, b and d depend on . Then, we calculated the p value as: . To estimate the covariance matrix of the correlated traits, we selected independent SNPs (LD pruning in PLINK “--indep-pairwise 50 5 0.2”) and filtered out SNPs with |z scores|>1.96 to avoid possible bias in the estimation of Σ because of the difference in sample size and association p values in the GCAT-UK Biobank. A summary flow chart of the methods applied in this study is shown in figure 1.

Figure 1

Flow chart of the methods and criteria used in this study. GCAT, Genomes For Life- Cohort Study of the Genomes of Catalonia; GWAS, genome-wide association studies; MAF, minor allele frequency; QC, quality control.

Polygenic risk score

Genetic architecture was analysed by the polygenic risk score (PRS). Polygenic risk score software (PRSice)32 was used to predict the genetic variability of the identified loci for a given trait. PRSice plots the percentage of variance explained for a trait by using SNPs with different p value thresholds (P_T) (online supplementary figure S2). Here, we considered P_T=0.05.

Supplementary file 4

[SP4.pdf]

URLs

GCAT study, http://genomesforlife.com;

National Human Genome Research Institute GWAS Catalog, http://www.genome.gov/gwastudies/ (gwas_catalog_v1.0-associations_e91_r2018-02-06);

1000 Genomes Project http://www.internationalgenome.org/ (phase 3, v5a.20130502);

Genome of Netherland http://www.nlgenome.nl/ (Release 5.4);

UK10K https://www.uk10k.org/ (Release 2012-06-02, updated on 15 Feb 2016) ;

Haplotype Reference Consortium http://www.haplotype-reference-consortium.org/(Release 1.1);

UKBiobank GWAS Results; https://sites.google.com/broadinstitute.org/ukbbgwasresults/home?authuser=0, (Manifest20170915);

GTExportal, https://www.gtexportal.org/home/. (last data accession, Release V.7, dbGaP accession phs000424. v7. P2);

Results

Heritability estimates

SNP heritability estimation (h² _SNP) in the GCATcore study showed values ranging from 77% to 18%, with height being the trait showing the strongest SNP contribution. The h² _SNP SE for most traits was high (near 10%), with wide CIs, as expected by sample size. However, robustness of the analysis is supported by similar values to those reported elsewhere (see wide summary in Genome-wide complex trait analysis, Wikipedia. The Free Encyclopedia, 2018). Statistically significant h² _SNP estimations for continuous and binary traits (cases >500) are shown in table 1. In particular, values for height: h² _SNP=0.77, 95% CI0.56 to 0.94 and BMI: h² _SNP=0.38, 95% CI0.20 to 0.59 were identical to the maxima achieved in other European populations, using comparable genomic approaches. Besides the anthropometric traits, the Fitzpatrick’s phototype score, a numerical classification schema for human skin colour to measure the response of different types of skin to ultraviolet light, had a high genetic consistency in our sample (h² _SNP=0.63, 95% CI 0.4 to 0.8), and concordantly all related categories (eye colour, hair colour, freckling and skin sensitivity) showed high heritability (h² _SNP>0.3). It is worth noting that skin colour had the lowest value (h² _SNP=0.18, 95% CI 0.02 to 0.38), which is in concordance with the blurred genetic architecture of skin colour.33 Interestingly, other non-biological traits showed relatively high values in our study. Educational level showed the third highest heritability value (h² _SNP=0.54, 95% CI 0.35 to 0.74). Lower estimates have been observed in other Caucasian populations, but this could be explained by the fact that this estimate is for educational level as a categorical variable and not as binary (higher/lower). Self-perceived health was similar to h² _SNP from recent data from a larger UK Biobank study,16 with values around 20% (h² _SNP=0.22, 95% CI 0.04 to 0.43).

View this table:

Table 1

h² _SNP of the analysed traits with h² _SNP>0, SE <0.12, p<0.05 and n_b >500

Phenome analysis

GWAS identified 6820 associations in 1785 SNPs with genome-wide significance threshold p<5×10⁻⁸ and 29 343 associations with a suggestive association p<1×10⁻⁵. Here, we report 26 genome-wide association hits identified in our study which confirm results previously identified in other European ancestry samples (GWAS Catalog database (release V.1.0, e90, 27 September 2017)).4 In table 2, we show the SNP associations with the minimum p value for each locus, the remaining SNPs are shown in online Supplementary file 5. Five genes associated with pigmentary traits were identified in the analysis with highly significant SNP associations: SLC45A2 (rs16891982, β=−0.546, SE=0.021, p=2.2×10⁻¹³⁰), IRF4 (rs12203592, β=1.915, SE=0.118, p=2.8×10⁻⁵⁷), HERC2 (rs1667394, β=−0.608, SE=0.02, p=2.8×10⁻¹⁷⁶), OCA2 (rs11855019, β=−0.548, SE=0.022, p=2.4×10⁻¹²¹) and MC1R (rs1805007, β=3.615, SE=0.326, p=7.7×10⁻²²) (online supplementary figure S3). These genes are involved in the regulation and distribution of melanin pigmentation or enzymes involved in melanogenesis itself within the melanocyte cells present in the skin, hair and eyes in Caucasian populations.33–35 Pigmentary traits (mainly the red hair colour phenotype) are related to the defensive capacity of the skin in response to sun exposure (UV-induced skin tanning or sun burning), and it has been established as a risk factor for sun-induced cancers (both melanoma and non-melanocytic skin cancers).36 Other GWAS hits from the phenome-wide analysis validated previously reported findings in CCDC141-LOC105373766 (rs79146658, β=2.359, SE=0.374, p=3.4×10⁻¹⁰), SMARCA4-LDLR (rs10412048, β=−0.5, SE=0.079, p=3.2×10⁻¹⁰; rs6511720, β=−0.493, SE=0.08, p=9.4×10⁻¹⁰) and LINC01432 (rs1160312, β=0.193, SE=0.03, p=1.9×10⁻⁹) loci, related with cardiovascular risk (heart_rate), hyperlipidaemia (icd9_code3_272) and male pattern baldness (hair_loss_40), respectively (see table 2).

Supplementary file 5

[SP5.pdf]

Supplementary file 6

[SP6.pdf]

View this table:

Table 2

Twenty-six genome-wide associated loci with GCAT traits and reported in the GWAS Catalog

Multitrait meta-analysis of anthropometric traits

Anthropometric traits had a high heritability in our sample (height=77%, BMI=38%, weight=37%, hip circumference=31% and waist circumference=24%), and all were highly correlated (online supplementary figure S1). In the first stage, from single-trait GWAS, we retained 606 SNPs with suggestive association (p<1×10⁻⁵) (see figure 2). None of them reached the genome-wide significance threshold. In the second stage, we analysed those 476 SNPs that intersected with the UK Biobank cohort dataset. Multitrait meta-analysis identified 111 SNPs in 27 independent loci with p<5×10⁻⁹ (online Supplementary file 7). Table 3 shows the SNPs with the highest significance for each independent loci and the univariate summary statistics of the anthropometric traits in both cohorts.

Supplementary file 7

[SP7.pdf]

Figure 2

Manhattan plot of the anthropometric traits (BMI, height, weight and hip and waist circumference) from the GCAT. BMI, body mass index.

View this table:

Table 3

Loci associated with anthropometric traits in GCAT and UK Biobank cohorts

We estimated the covariance matrix (Σ) for each dataset (GCAT, UK Biobank and GCAT +UK Biobank). Then, as described in the Materials and methods section, we selected those independent SNPs with |z scores|<1.96, resulting in 765 646, 630 890 and 535 860 being considered for the Σ estimation. Eigenvalues of Σ showed d=1.36, 1.4 and 2.72 values. Covariance matrices were similar in both GCAT and UK Biobank (online supplementary tables S4 and S5). One degree of freedom (GCAT and UK Biobank) and three (GCAT +UK Biobank) of the ² distribution were considered for multitrait analysis. We identified 27 independent multitrait loci associated in GCAT and UK Biobank (table 3). We intersected these SNPs with the GWAS Catalog, and we found that 5 SNPs had previously been reported in multiple GWAS, 16 loci were reported considering a ±250 000 base pair window from the identified SNP and 6 were new loci involving the following genes/SNPs: MAD1L1 (rs62444886, p=2.3×10⁻¹⁵), PIK3R1 (rs12657050, p=2.8×10⁻¹³; rs695166, p=8.4×10⁻¹⁵), ZRANB2-AS2 (rs11205277, p=1.4×10⁻⁹), EPHA7 (rs143547391, p=6.5×10⁻¹⁰), CACUL1 (rs12414412, p=4×10⁻¹³) and MAP3K9 (rs7151024, p=5.7×10⁻¹⁰). Regarding DPYD, DPYD-IT1 (rs140281723), GABRG3-AS1 and GABRG3 (rs184405367) genes/SNPs, we did not replicate association in UK Biobank samples (UKmulti p=0.035 and 1, respectively). The risk allele, frequency and functional annotation using the Variant Effect Predictor tool37 of identified variants are shown in online Supplementary file 9.

Supplementary file 8

[SP8.pdf]

Supplementary file 9

[SP9.pdf]

Polygenic risk score

The skin phototype association analysis identified five loci accounting for a high predictive value (PRS of 15.6%) suggesting few main genes (oligogenic architecture) contributing to the phenotype (online supplementary figure S2). However, for anthropometric traits, 27 loci were identified in our cohort but with a lower PRS (2.3%) suggesting a polygenic architecture with multiple genes and a high environmental impact. The newly identified loci only increased PRS slightly over the corresponding single-trait analysis (2.2% to 2.5%, 2.3% to 3.3%, 2.2% to 3.5%, 2.5% to 3.7% and 1.5% to 2.6% for height, weight, BMI and hip and waist circumference, respectively) pointing towards the multitrait approach as an effective screening strategy to identify new biomarkers.

Discussion

Dissecting the architecture of common diseases should incorporate multitrait approaches to understand the phenome and its genetic aetiology, including pleiotropy and the co-occurrence of multiple morbidities, correlated traits and the diseasome as targets for genomic analysis.38 In this study, we used the GCAT study, a South-European Mediterranean population prospective cohort to analyse the phenotypic variation attributable to genotype variability for 205 selected human traits (including diseases as well as biological, anthropometric and social features). Our results show that by considering genetic covariance matrices for interrelated traits, we increased the number of detected loci from six new loci for anthropometric traits, pointing to multitrait analysis as an effective strategy to gain statistical power to identify genetic association.

The relative importance of genetic and non-genetic factors varies across populations. Moreover, this is not constant in a population and changes with age.16 Here, we have reported heritability estimates on an adult population based on SNP data. In the present study, h² _SNP values move in a wide range from 18% to 77%, being anthropometric traits (height) and skin colour-related traits (Fitzpatrick’s phototype score) the traits with the highest genetic determination. In our cohort, heritability of anthropometric traits, such as height and BMI, was likely estimated as a maximum, with negligible missed heritability when comparing with other reported estimates in similar populations39 and in the same way being the observed genetic variance only a small part of their complete variance (around 3%). In the case of skin colour-related traits, the portion of the explained variance was larger, in accordance with a less complex polygenic nature of this trait, and fewer genes baring stronger predictive value (IRF4, HERC2, OCA2, MC1R and SLC45A2) (PRS=15.6%). The variants identified in these loci associated with skin colour-related traits are functional and have been reported elsewhere in several studies. These differences in heritability and prediction values indicate a different genomic architecture, suggesting an exposure variation, the exposome,3 as a main actor for many polygenic traits. Higher estimates in self-perceived health heritability, and probably some other reported traits such as ‘smoking_habits’, ‘smoking_packs’, or ‘sadness’ (item from the Mental-Health Inventory 5-item questionnaire), reflect a pleiotropic effect40 with multiple associated loci. In this sense, a recent meta-analysis on subjective well-being revealed new loci accounting for a polygenic model of well-being status.41

Single-trait GWAS analysis identified a number of genetic variants associated with skin colour-related traits (online supplementary figure S3) and other complex traits (heart rate, hyperlipidaemia or male pattern baldness); whereas failed to identify specific variants associated with any single anthropometric trait (at the p<5×10⁻⁸ threshold cut-off). However, we should observe that gender differences were not considered in this analysis even though it has been shown that genetic effects have a gender bias.42 Applying multitrait analyses of anthropometric traits, we identified 27 loci, six of which had not been reported previously; CALCUL1, ZRANB2-AS2, MAD1L1, EPHA7, PIK3R1 and MAP3K9. Owing to LD and the occurrence of all identified variants in non-coding regions (see online Supplementary file 9), we cannot be certain about the genes involved. Two out of six of the identified associated variants, in CALCUL1 and MAP3K9, are putative expression quantitative trait loci (eQTL) (see the URLs section). Three of the variants (ZRANB2-AS2chr1:71702511, EPHA7chr6:94075927 and MAP3K9chr14:71268446) are specific of the GCAT sample (p<5×10⁻⁹) (online Supplementary files 10,11, S,12) probably due to genetic background differences between populations (ie, LD patterns) or as an expression of a particular genetic contribution of the Mediterranean populations to these polygenic traits. Identified variants implicate genes with diverse functions, involved in several pathways and processes. Some of them are involved in growth, developmental or metabolic processes.

MAP3K9, mitogen-activated protein kinase 9, has been associated to some rare cancers (ie, retroperitoneum carcinoma and retroperitoneum neuroblastoma), and GWAS studies have identified variants associated with reasoning ability.43 Based on GTEx database (see URL section) we identified rs7151024 as an eQTL, expressed in subcutaneous adipose tissue (p=1.4×10⁻⁸, eQTL effect size (es)=−0.38) that may affect fat distribution and anthropometric traits. ZRANB2-AS2 is a non-coding RNA, and GWAS studies have identified variants in ZRANB2-AS2 associated with facial morphology,44 and also with general cognitive function,45 traits which are genetically correlated with a wide range of physical variables. EPHA7 belongs to the ephrin receptor subfamily of protein-tyrosine kinase, implicated in mediating developmental events, particularly in the nervous system. EPHA7 has been implicated in neurodevelopment processes46 as well being as a tumour suppressor gene in cancer.47 CACUL1, CDK2-associated cullin domain 1, is a cell cycle-dependent kinase binding protein capable of promoting cell progression. In the GWAS Catalog, any of the anthropometric traits analysed here have been associated with variants in CACUL1 (online Supplementary file 13). However, the associated rs12414412, reported as an eQTL expressed in skeletal muscle (p=1.4×10⁻⁷, eQTL es=−0.31), may affect body constitution. CACUL1 suppresses androgen receptor (AR) transcriptional activity, impairing LSD-mediated activation of the AR,48 whose genetic variation is associated with longitudinal height in young boys.49 MAD1L1, mitotic arrest deficient 1-like protein 1, is a component of the mitotic spindle-assembly checkpoint, and some cancers (prostate and gastric) have been associated to MAD1L1 dysfunction.50 Our study identified BMI, weight and hip and waist circumference single-trait association (p<10⁻⁵) with the intronic variant rs62444886 in the MAD1L1 locus, as well as a significant multitrait association in meta-analysis (table 3, online Supplementary file 14). GWAS analysis identified MAD1L1 as a susceptibility gene for bipolar disorder and schizophrenia, involved in reward system functions in healthy adults,51 but until now, no other study has identified it as a genetic contributor to weight. The higher prevalence of obesity and related disorders such as diabetes in schizophrenia patients could reflect a possible underlying common genetic contribution. In this sense, we observed also GWAS significant signals in INS-IGF2 (GCAT-UKmulti p=1.5×10⁻²¹), an analogue of the INS gene (previously associated with diabetes type I and type II disorders).52 Additionally, epigenome-wide association studies in adults53 and children54 support a role for MAD1L1 in BMI–methylation association, with differentially methylated CpG patterns in CD4+ and CD8+ T cells between obese and non-obese women. PIK3R1, phosphoinositide-3-kinase regulatory subunit 1, plays a role in the metabolic actions of insulin, and a mutation in this gene has been associated with insulin resistance. Moreover, common variants are associated with lower body fat percentage as well as the control of peripheral adipose tissue mobilisation.55 Genetic variation in the GWAS Catalog is also associated with cartilage thickness56 and mineral bone density,57 both related to anthropometric traits. Diseases associated with PIK3R1 include SHORT syndrome,58 characterised by individuals with short stature and a restricted intrauterine growth, in addition to multiple anomalies. Our study identified the intronic variant (rs695166) associated with waist circumference association in single-trait analysis (p<10⁻⁶), but not in the UKdataset, which associates with height (p=2.3×10⁻¹⁴). However, analysis of the UKBiobank data supported a similar peak profile overlapping the gene region (see online Supplementary file 12) and multitrait analysis association (GCAT-UK multi p=8.4×10⁻¹⁵) (table 3).

Multiple approaches for multitrait analysis using GWAS data have been successfully applied in the research of genetically complex conditions using raw data or summary-level data statistics. Using raw data, Ferreira and Purcell11 used a test based on the Wilk’s lambda derived from a canonical correlation analysis. Korte et al 13 implemented a mixed-model approach accounting for correlation structure and the kinship relatedness matrix. O’Reilly et al 14 proposed an inverted regression model for each SNP as the response and all the traits as covariates. Regarding the use of GWAS summary-level data statistics, Cotsapas et al 10 developed a statistic for cross-phenotype analysis based on an asymptotic ² distribution derived from p values of the SNP associations. Zhu et al 15 implemented CPASSOC that accounts for the genetic correlation structure of the traits and the sample size for each cohort. Kim et al 12 proposed an adaptive association test for multiple traits that uses Monte Carlo simulations to approximate its null distribution. Recently, Bayes factor approaches59 have been proposed for studying multitrait genetic associations. Here, for meta-analysis purposes, we chose the multitrait analysis described by Yang and Wang.31 This test, based on the ² distribution with ‘d’ df, depends on the genetic covariance structure of the traits and considers the distribution of the sum square of the z scores which is insensitive to the heterogeneous effect of the SNP. Nevertheless, this approach doesn’t allow allele effect estimation. In this sense, maximum likelihood methods have been recently proposed to deal with this limitation41 by accounting for different measures of the same phenotypic trait with different levels of heritability.

In complex diseases research, MRPs are the common observation in genome-wide association analysis of large cohorts, and over simplification of extreme phenotypes or the use of standardised phenotypes for meta-analysis reduces the power to detect the underlying genetic contribution to complex traits. As an alternative, multitrait analyses help to detect additional loci that are missing by applying a conventional meta-analysis. Our results highlight the potential value of data-driven multivariate phenotyping for genetic studies in large complex cohorts.

Supplementary file 10

[SP10.pdf]

Supplementary file 11

[SP11.pdf]

Supplementary file 15

[SP15.pdf]

Acknowledgments

The authors thank all the GCAT participants and all BST members for generously helping with this research.

References

↵
Eurostat Statistics Explained. Mortality and life expectancy statistics, 2016. http://ec.europa.eu/eurostat/statistics-explained/index.php/Mortality_and_life_expectancy_statistics
↵
2. Dawber TR ,
3. Meadors GF ,
4. Moore FE
. Epidemiological approaches to heart disease: the Framingham Study. Am J Public Health Nations Health 1951;41:279–86.doi:10.2105/AJPH.41.3.279
OpenUrl CrossRef PubMed Web of Science
↵
2. Riboli E ,
3. Hunt KJ ,
4. Slimani N ,
5. Ferrari P ,
6. Norat T ,
7. Fahey M ,
8. Charrondière UR ,
9. Hémon B ,
10. Casagrande C ,
11. Vignat J ,
12. Overvad K ,
13. Tjønneland A ,
14. Clavel-Chapelon F ,
15. Thiébaut A ,
16. Wahrendorf J ,
17. Boeing H ,
18. Trichopoulos D ,
19. Trichopoulou A ,
20. Vineis P ,
21. Palli D ,
22. Bueno-De-Mesquita HB ,
23. Peeters PH ,
24. Lund E ,
25. Engeset D ,
26. González CA ,
27. Barricarte A ,
28. Berglund G ,
29. Hallmans G ,
30. Day NE ,
31. Key TJ ,
32. Kaaks R ,
33. Saracci R
. European Prospective Investigation into Cancer and Nutrition (EPIC): study populations and data collection. Public Health Nutr 2002;5:1113–24.doi:10.1079/PHN2002394
OpenUrl CrossRef PubMed Web of Science
↵
2. Welter D ,
3. MacArthur J ,
4. Morales J ,
5. Burdett T ,
6. Hall P ,
7. Junkins H ,
8. Klemm A ,
9. Flicek P ,
10. Manolio T ,
11. Hindorff L ,
12. Parkinson H
. The NHGRI GWAS Catalog, a curated resource of SNP-trait associations. Nucleic Acids Res 2014;42:D1001–6.doi:10.1093/nar/gkt1229
OpenUrl CrossRef PubMed Web of Science
↵
2. Manolio TA ,
3. Collins FS ,
4. Cox NJ ,
5. Goldstein DB ,
6. Hindorff LA ,
7. Hunter DJ ,
8. McCarthy MI ,
9. Ramos EM ,
10. Cardon LR ,
11. Chakravarti A ,
12. Cho JH ,
13. Guttmacher AE ,
14. Kong A ,
15. Kruglyak L ,
16. Mardis E ,
17. Rotimi CN ,
18. Slatkin M ,
19. Valle D ,
20. Whittemore AS ,
21. Boehnke M ,
22. Clark AG ,
23. Eichler EE ,
24. Gibson G ,
25. Haines JL ,
26. Mackay TF ,
27. McCarroll SA ,
28. Visscher PM
. Finding the missing heritability of complex diseases. Nature 2009;461:747–53.doi:10.1038/nature08494
OpenUrl CrossRef PubMed Web of Science
↵
2. Visscher PM ,
3. Wray NR ,
4. Zhang Q ,
5. Sklar P ,
6. McCarthy MI ,
7. Brown MA ,
8. Yang J
. 10 Years of GWAS Discovery: Biology, Function, and Translation. Am J Hum Genet 2017;101:5–22.doi:10.1016/j.ajhg.2017.06.005
OpenUrl CrossRef PubMed
↵
2. Boyle EA ,
3. Li YI ,
4. Pritchard JK
. An Expanded View of Complex Traits: From Polygenic to Omnigenic. Cell 2017;169:1177–86.doi:10.1016/j.cell.2017.05.038
OpenUrl CrossRef PubMed
↵
2. Chakravarti A ,
3. Turner TN
. Revealing rate-limiting steps in complex disease biology: The crucial importance of studying rare, extreme-phenotype families. Bioessays 2016;38:578–86.doi:10.1002/bies.201500203
OpenUrl
↵
2. Freimer N ,
3. Sabatti C
. The human phenome project. Nat Genet 2003;34:15–21.doi:10.1038/ng0503-15
OpenUrl CrossRef PubMed Web of Science
↵
2. Cotsapas C ,
3. Voight BF ,
4. Rossin E ,
5. Lage K ,
6. Neale BM ,
7. Wallace C ,
8. Abecasis GR ,
9. Barrett JC ,
10. Behrens T ,
11. Cho J ,
12. De Jager PL ,
13. Elder JT ,
14. Graham RR ,
15. Gregersen P ,
16. Klareskog L ,
17. Siminovitch KA ,
18. van Heel DA ,
19. Wijmenga C ,
20. Worthington J ,
21. Todd JA ,
22. Hafler DA ,
23. Rich SS ,
24. Daly MJ
. FOCiS Network of Consortia. Pervasive sharing of genetic effects in autoimmune disease. PLoS Genet 2011;7:e1002254.
↵
2. Ferreira MAR ,
3. Purcell SM
. A multivariate test of association. Bioinformatics 2009;25:132–3.doi:10.1093/bioinformatics/btn563
OpenUrl CrossRef PubMed Web of Science
↵
2. Kim J ,
3. Bai Y ,
4. Pan W
. An Adaptive Association Test for Multiple Phenotypes with GWAS Summary Statistics. Genet Epidemiol 2015;39:651–63.doi:10.1002/gepi.21931
OpenUrl CrossRef PubMed
↵
2. Korte A ,
3. Vilhjálmsson BJ ,
4. Segura V ,
5. Platt A ,
6. Long Q ,
7. Nordborg M
. A mixed-model approach for genome-wide association studies of correlated traits in structured populations. Nat Genet 2012;44:1066–71.doi:10.1038/ng.2376
OpenUrl CrossRef PubMed
↵
2. O’Reilly PF ,
3. Hoggart CJ ,
4. Pomyen Y ,
5. Calboli FC ,
6. Elliott P ,
7. Jarvelin MR ,
8. Coin LJ
. MultiPhen: joint model of multiple phenotypes can increase discovery in GWAS. PLoS One 2012;7:e34861.doi:10.1371/journal.pone.0034861
↵
2. Zhu X ,
3. Feng T ,
4. Tayo BO ,
5. Liang J ,
6. Young JH ,
7. Franceschini N ,
8. Smith JA ,
9. Yanek LR ,
10. Sun YV ,
11. Edwards TL ,
12. Chen W ,
13. Nalls M ,
14. Fox E ,
15. Sale M ,
16. Bottinger E ,
17. Rotimi C ,
18. Liu Y ,
19. McKnight B ,
20. Liu K ,
21. Arnett DK ,
22. Chakravati A ,
23. Cooper RS ,
24. Redline S
; COGENT BP Consortium. Meta-analysis of correlated traits via summary statistics from GWASs with an application in hypertension. Am J Hum Genet 2015;96:21–36.doi:10.1016/j.ajhg.2014.11.011
OpenUrl CrossRef PubMed
↵
2. Ge T ,
3. Chen CY ,
4. Neale BM ,
5. Sabuncu MR ,
6. Smoller JW
. Phenome-wide heritability analysis of the UK Biobank. PLoS Genet 2017;13:e1006711.doi:10.1371/journal.pgen.1006711
OpenUrl
↵
2. Muñoz M ,
3. Pong-Wong R ,
4. Canela-Xandri O ,
5. Rawlik K ,
6. Haley CS ,
7. Tenesa A
. Evaluating the contribution of genetics and familial shared environment to common disease using the UK Biobank. Nat Genet 2016;48:980–3.doi:10.1038/ng.3618
OpenUrl PubMed
↵
2. Obón-Santacana M ,
3. Vilardell M ,
4. Carreras A ,
5. Duran X ,
6. Velasco J ,
7. Galván-Femenía I ,
8. Alonso T ,
9. Puig L ,
10. Sumoy L ,
11. Duell EJ ,
12. Perucho M ,
13. Moreno V ,
14. de Cid R
. GCAT|Genomes for life: a prospective cohort study of the genomes of Catalonia. BMJ Open 2018;8:e018324.doi:10.1136/bmjopen-2017-018324
↵
2. Liu Y ,
3. De A
. Multiple Imputation by Fully Conditional Specification for Dealing with Missing Data in a Large Epidemiologic Study. Int J Stat Med Res 2015;4:287–95.doi:10.6000/1929-6029.2015.04.03.7
OpenUrl
↵
2. Howie B ,
3. Fuchsberger C ,
4. Stephens M ,
5. Marchini J ,
6. Abecasis GR
. Fast and accurate genotype imputation in genome-wide association studies through pre-phasing. Nat Genet 2012;44:955–9.doi:10.1038/ng.2354
OpenUrl CrossRef PubMed
↵
2. Auton A ,
3. Brooks LD ,
4. Durbin RM ,
5. Garrison EP ,
6. Kang HM ,
7. Korbel JO ,
8. Marchini JL ,
9. McCarthy S ,
10. McVean GA ,
11. Abecasis GR
; 1000 Genomes Project Consortium. A global reference for human genetic variation. Nature 2015;526:68–74.doi:10.1038/nature15393
OpenUrl CrossRef PubMed
↵
2. Deelen P ,
3. Menelaou A ,
4. van Leeuwen EM ,
5. Kanterakis A ,
6. van Dijk F ,
7. Medina-Gomez C ,
8. Francioli LC ,
9. Hottenga JJ ,
10. Karssen LC ,
11. Estrada K ,
12. Kreiner-Møller E ,
13. Rivadeneira F ,
14. van Setten J ,
15. Gutierrez-Achury J ,
16. Westra HJ ,
17. Franke L ,
18. van Enckevort D ,
19. Dijkstra M ,
20. Byelas H ,
21. van Duijn CM ,
22. de Bakker PI ,
23. Wijmenga C ,
24. Swertz MA
; Genome of Netherlands Consortium. Improved imputation quality of low-frequency and rare variants in European samples using the ‘Genome of The Netherlands’. Eur J Hum Genet 2014;22:1321–6.doi:10.1038/ejhg.2014.19
OpenUrl CrossRef PubMed
↵
2. Huang J ,
3. Howie B ,
4. McCarthy S ,
5. Memari Y ,
6. Walter K ,
7. Min JL ,
8. Danecek P ,
9. Malerba G ,
10. Trabetti E ,
11. Zheng HF ,
12. Gambaro G ,
13. Richards JB ,
14. Durbin R ,
15. Timpson NJ ,
16. Marchini J ,
17. Soranzo N ,
18. Turki SA ,
19. Amuzu A ,
20. Anderson CA ,
21. Anney R ,
22. Antony D ,
23. Artigas MS ,
24. Ayub M ,
25. Bala S ,
26. Barrett JC ,
27. Barroso I ,
28. Beales P ,
29. Benn M ,
30. Bentham J ,
31. Bhattacharya S ,
32. Birney E ,
33. Blackwood D ,
34. Bobrow M ,
35. Bochukova E ,
36. Bolton PF ,
37. Bounds R ,
38. Boustred C ,
39. Breen G ,
40. Calissano M ,
41. Carss K ,
42. Casas JP ,
43. Chambers JC ,
44. Charlton R ,
45. Chatterjee K ,
46. Chen L ,
47. Ciampi A ,
48. Cirak S ,
49. Clapham P ,
50. Clement G ,
51. Coates G ,
52. Cocca M ,
53. Collier DA ,
54. Cosgrove C ,
55. Cox T ,
56. Craddock N ,
57. Crooks L ,
58. Curran S ,
59. Curtis D ,
60. Daly A ,
61. Inm D ,
62. Day-Williams A ,
63. Dedoussis G ,
64. Down T ,
65. Du Y ,
66. van DCM ,
67. Dunham I ,
68. Edkins S ,
69. Ekong R ,
70. Ellis P ,
71. Evans DM ,
72. Farooqi IS ,
73. Fitzpatrick DR ,
74. Flicek P ,
75. Floyd J ,
76. Foley AR ,
77. Franklin CS ,
78. Futema M ,
79. Gallagher L ,
80. Gasparini P ,
81. Gaunt TR ,
82. Geihs M ,
83. Geschwind D ,
84. Greenwood C ,
85. Griffin H ,
86. Grozeva D ,
87. Guo X ,
88. Guo X ,
89. Gurling H ,
90. Hart D ,
91. Hendricks AE ,
92. Holmans P ,
93. Huang L ,
94. Hubbard T ,
95. Humphries SE ,
96. Hurles ME ,
97. Hysi P ,
98. Iotchkova V ,
99. Isaacs A ,
100. Jackson DK ,
101. Jamshidi Y ,
102. Johnson J ,
103. Joyce C ,
104. Karczewski KJ ,
105. Kaye J ,
106. Keane T ,
107. Kemp JP ,
108. Kennedy K ,
109. Kent A ,
110. Keogh J ,
111. Khawaja F ,
112. Kleber ME ,
113. van KM ,
114. Kolb-Kokocinski A ,
115. Kooner JS ,
116. Lachance G ,
117. Langenberg C ,
118. Langford C ,
119. Lawson D ,
120. Lee I ,
121. van LEM ,
122. Lek M ,
123. Li R ,
124. Li Y ,
125. Liang J ,
126. Lin H ,
127. Liu R ,
128. Lönnqvist J ,
129. Lopes LR ,
130. Lopes M ,
131. Luan J ,
132. MacArthur DG ,
133. Mangino M ,
134. Marenne G ,
135. März W ,
136. Maslen J ,
137. Matchan A ,
138. Mathieson I ,
139. McGuffin P ,
140. McIntosh AM ,
141. McKechanie AG ,
142. McQuillin A ,
143. Metrustry S ,
144. Migone N ,
145. Mitchison HM ,
146. Moayyeri A ,
147. Morris J ,
148. Morris R ,
149. Muddyman D ,
150. Muntoni F ,
151. Nordestgaard BG ,
152. Northstone K ,
153. O’Donovan MC ,
154. O’Rahilly S ,
155. Onoufriadis A ,
156. Oualkacha K ,
157. Owen MJ ,
158. Palotie A ,
159. Panoutsopoulou K ,
160. Parker V ,
161. Parr JR ,
162. Paternoster L ,
163. Paunio T ,
164. Payne F ,
165. Payne SJ ,
166. Perry JRB ,
167. Pietilainen O ,
168. Plagnol V ,
169. Pollitt RC ,
170. Povey S ,
171. Quail MA ,
172. Quaye L ,
173. Raymond L ,
174. Rehnström K ,
175. Ridout CK ,
176. Ring S ,
177. Ritchie GRS ,
178. Roberts N ,
179. Robinson RL ,
180. Savage DB ,
181. Scambler P ,
182. Schiffels S ,
183. Schmidts M ,
184. Schoenmakers N ,
185. Scott RH ,
186. Scott RA ,
187. Semple RK ,
188. Serra E ,
189. Sharp SI ,
190. Shaw A ,
191. Shihab HA ,
192. Shin S-Y ,
193. Skuse D ,
194. Small KS ,
195. Smee C ,
196. Smith GD ,
197. Southam L ,
198. Spasic-Boskovic O ,
199. Spector TD ,
200. Clair DS ,
201. Pourcain BS ,
202. Stalker J ,
203. Stevens E ,
204. Sun J ,
205. Surdulescu G ,
206. Suvisaari J ,
207. Syrris P ,
208. Tachmazidou I ,
209. Taylor R ,
210. Tian J ,
211. Tobin MD ,
212. Toniolo D ,
213. Traglia M ,
214. Tybjaerg-Hansen A ,
215. Valdes AM ,
216. Vandersteen AM ,
217. Varbo A ,
218. Vijayarangakannan P ,
219. Visscher PM ,
220. Wain LV ,
221. Walters JTR ,
222. Wang G ,
223. Wang J ,
224. Wang Y ,
225. Ward K ,
226. Wheeler E ,
227. Whincup P ,
228. Whyte T ,
229. Williams HJ ,
230. Williamson KA ,
231. Wilson C ,
232. Wilson SG ,
233. Wong K ,
234. Xu C ,
235. Yang J ,
236. Zaza G ,
237. Zeggini E ,
238. Zhang F ,
239. Zhang P ,
240. Zhang W
; UK10K Consortium. Improved imputation of low-frequency and rare variants using the UK10K haplotype reference panel. Nat Commun 2015;6:8111.doi:10.1038/ncomms9111
OpenUrl CrossRef PubMed
↵
2. McCarthy S ,
3. Das S ,
4. Kretzschmar W ,
5. Delaneau O ,
6. Wood AR ,
7. Teumer A ,
8. Kang HM ,
9. Fuchsberger C ,
10. Danecek P ,
11. Sharp K ,
12. Luo Y ,
13. Sidore C ,
14. Kwong A ,
15. Timpson N ,
16. Koskinen S ,
17. Vrieze S ,
18. Scott LJ ,
19. Zhang H ,
20. Mahajan A ,
21. Veldink J ,
22. Peters U ,
23. Pato C ,
24. van Duijn CM ,
25. Gillies CE ,
26. Gandin I ,
27. Mezzavilla M ,
28. Gilly A ,
29. Cocca M ,
30. Traglia M ,
31. Angius A ,
32. Barrett JC ,
33. Boomsma D ,
34. Branham K ,
35. Breen G ,
36. Brummett CM ,
37. Busonero F ,
38. Campbell H ,
39. Chan A ,
40. Chen S ,
41. Chew E ,
42. Collins FS ,
43. Corbin LJ ,
44. Smith GD ,
45. Dedoussis G ,
46. Dorr M ,
47. Farmaki AE ,
48. Ferrucci L ,
49. Forer L ,
50. Fraser RM ,
51. Gabriel S ,
52. Levy S ,
53. Groop L ,
54. Harrison T ,
55. Hattersley A ,
56. Holmen OL ,
57. Hveem K ,
58. Kretzler M ,
59. Lee JC ,
60. McGue M ,
61. Meitinger T ,
62. Melzer D ,
63. Min JL ,
64. Mohlke KL ,
65. Vincent JB ,
66. Nauck M ,
67. Nickerson D ,
68. Palotie A ,
69. Pato M ,
70. Pirastu N ,
71. McInnis M ,
72. Richards JB ,
73. Sala C ,
74. Salomaa V ,
75. Schlessinger D ,
76. Schoenherr S ,
77. Slagboom PE ,
78. Small K ,
79. Spector T ,
80. Stambolian D ,
81. Tuke M ,
82. Tuomilehto J ,
83. Van den Berg LH ,
84. Van Rheenen W ,
85. Volker U ,
86. Wijmenga C ,
87. Toniolo D ,
88. Zeggini E ,
89. Gasparini P ,
90. Sampson MG ,
91. Wilson JF ,
92. Frayling T ,
93. de Bakker PI ,
94. Swertz MA ,
95. McCarroll S ,
96. Kooperberg C ,
97. Dekker A ,
98. Altshuler D ,
99. Willer C ,
100. Iacono W ,
101. Ripatti S ,
102. Soranzo N ,
103. Walter K ,
104. Swaroop A ,
105. Cucca F ,
106. Anderson CA ,
107. Myers RM ,
108. Boehnke M ,
109. McCarthy MI ,
110. Durbin R
; Haplotype Reference Consortium. A reference panel of 64,976 haplotypes for genotype imputation. Nat Genet 2016;48:1279–83.doi:10.1038/ng.3643
OpenUrl CrossRef PubMed
↵
2. Yang J ,
3. Lee SH ,
4. Goddard ME ,
5. Visscher PM
. GCTA: a tool for genome-wide complex trait analysis. Am J Hum Genet 2011;88:76–82.doi:10.1016/j.ajhg.2010.11.011
OpenUrl CrossRef PubMed
↵
2. Schweiger R ,
3. Fisher E ,
4. Rahmani E ,
5. Shenhav L ,
6. Rosset S ,
7. Halperin E
. Using Stochastic Approximation Techniques to Efficiently Construct Confidence Intervals for Heritability: In. Research in Computational Molecular Biology. Cham: Springer, 2017:241–56.
↵
2. Chang CC ,
3. Chow CC ,
4. Tellier LC ,
5. Vattikuti S ,
6. Purcell SM ,
7. Lee JJ
. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience 2015;4:7.doi:10.1186/s13742-015-0047-8
OpenUrl CrossRef PubMed
↵
2. Dey R ,
3. Schmidt EM ,
4. Abecasis GR ,
5. Lee S
. A Fast and Accurate Algorithm to Test for Binary Phenotypes and Its Application to PheWAS. Am J Hum Genet 2017;101:37–49.doi:10.1016/j.ajhg.2017.05.014
OpenUrl
↵
2. Chavent M ,
3. Kuentz-Simonet V ,
4. Labenne A ,
5. Saracco J
. Multivariate analysis of mixed type data: The PCAmixdata R package, 2014.
↵
2. Sudlow C ,
3. Gallacher J ,
4. Allen N ,
5. Beral V ,
6. Burton P ,
7. Danesh J ,
8. Downey P ,
9. Elliott P ,
10. Green J ,
11. Landray M ,
12. Liu B ,
13. Matthews P ,
14. Ong G ,
15. Pell J ,
16. Silman A ,
17. Young A ,
18. Sprosen T ,
19. Peakman T ,
20. Collins R
. UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med 2015;12:e1001779.doi:10.1371/journal.pmed.1001779
↵
2. Yang Q ,
3. Wang Y
. Methods for Analyzing Multivariate Phenotypes in Genetic Association Studies. J Probab Stat 2012;2012:1–13.doi:10.1155/2012/652569
OpenUrl
↵
2. Euesden J ,
3. Lewis CM ,
4. O’Reilly PF
. PRSice: Polygenic Risk Score software. Bioinformatics 2015;31:1466–8.doi:10.1093/bioinformatics/btu848
OpenUrl CrossRef PubMed
↵
2. McEvoy B ,
3. Beleza S ,
4. Shriver MD
. The genetic architecture of normal variation in human pigmentation: an evolutionary perspective and model. Hum Mol Genet 2006;15:R176–81.doi:10.1093/hmg/ddl217
OpenUrl CrossRef PubMed Web of Science
↵
2. Liu F ,
3. Visser M ,
4. Duffy DL ,
5. Hysi PG ,
6. Jacobs LC ,
7. Lao O ,
8. Zhong K ,
9. Walsh S ,
10. Chaitanya L ,
11. Wollstein A ,
12. Zhu G ,
13. Montgomery GW ,
14. Henders AK ,
15. Mangino M ,
16. Glass D ,
17. Bataille V ,
18. Sturm RA ,
19. Rivadeneira F ,
20. Hofman A ,
21. van IJcken WF ,
22. Uitterlinden AG ,
23. Palstra RJ ,
24. Spector TD ,
25. Martin NG ,
26. Nijsten TE ,
27. Kayser M
. Genetics of skin color variation in Europeans: genome-wide association studies with functional follow-up. Hum Genet 2015;134:823–35.doi:10.1007/s00439-015-1559-0
OpenUrl CrossRef PubMed
↵
2. Robles-Espinoza CD ,
3. Roberts ND ,
4. Chen S ,
5. Leacy FP ,
6. Alexandrov LB ,
7. Pornputtapong N ,
8. Halaban R ,
9. Krauthammer M ,
10. Cui R ,
11. Timothy Bishop D ,
12. Adams DJ
. Germline MC1R status influences somatic mutation burden in melanoma. Nat Commun 2016;7:12064.doi:10.1038/ncomms12064
OpenUrl
↵
2. Sturm RA
. Skin colour and skin cancer - MC1R, the genetic link. Melanoma Res 2002;12:405–16.doi:10.1097/00008390-200209000-00001
OpenUrl CrossRef PubMed Web of Science
↵
2. McLaren W ,
3. Gil L ,
4. Hunt SE ,
5. Riat HS ,
6. Ritchie GR ,
7. Thormann A ,
8. Flicek P ,
9. Cunningham F
. The Ensembl Variant Effect Predictor. Genome Biol 2016;17:122.doi:10.1186/s13059-016-0974-4
OpenUrl CrossRef PubMed
↵
2. Wysocki K ,
3. Ritter L
. Diseasome: an approach to understanding gene-disease interactions. Annu Rev Nurs Res 2011;29:55–72.doi:10.1891/0739-6686.29.55
OpenUrl Abstract/FREE Full Text
↵
2. Yang J ,
3. Bakshi A ,
4. Zhu Z ,
5. Hemani G ,
6. Vinkhuyzen AA ,
7. Lee SH ,
8. Robinson MR ,
9. Perry JR ,
10. Nolte IM ,
11. van Vliet-Ostaptchouk JV ,
12. Snieder H ,
13. Esko T ,
14. Milani L ,
15. Mägi R ,
16. Metspalu A ,
17. Hamsten A ,
18. Magnusson PK ,
19. Pedersen NL ,
20. Ingelsson E ,
21. Soranzo N ,
22. Keller MC ,
23. Wray NR ,
24. Goddard ME ,
25. Visscher PM
; LifeLines Cohort Study. Genetic variance estimation with imputed variants finds negligible missing heritability for human height and body mass index. Nat Genet 2015;47:1114–20.doi:10.1038/ng.3390
OpenUrl CrossRef PubMed
↵
2. Krapohl E ,
3. Rimfeld K ,
4. Shakeshaft NG ,
5. Trzaskowski M ,
6. McMillan A ,
7. Pingault JB ,
8. Asbury K ,
9. Harlaar N ,
10. Kovas Y ,
11. Dale PS ,
12. Plomin R
. The high heritability of educational achievement reflects many genetically influenced traits, not just intelligence. Proc Natl Acad Sci U S A 2014;111:15273–8.doi:10.1073/pnas.1408777111
OpenUrl Abstract/FREE Full Text
↵
2. Turley P ,
3. Walters RK ,
4. Maghzian O ,
5. Okbay A ,
6. Lee JJ ,
7. Fontana MA ,
8. Nguyen-Viet TA ,
9. Wedow R ,
10. Zacher M ,
11. Furlotte NA ,
12. Magnusson P ,
13. Oskarsson S ,
14. Johannesson M ,
15. Visscher PM ,
16. Laibson D ,
17. Cesarini D ,
18. Neale BM ,
19. Benjamin DJ
; 23andMe Research Team, Social Science Genetic Association Consortium. Multi-trait analysis of genome-wide association summary statistics using MTAG. Nat Genet 2018;50:229–37.doi:10.1038/s41588-017-0009-4
OpenUrl
↵
2. Weedon MN ,
3. Lango H ,
4. Lindgren CM ,
5. Wallace C ,
6. Evans DM ,
7. Mangino M ,
8. Freathy RM ,
9. Perry JR ,
10. Stevens S ,
11. Hall AS ,
12. Samani NJ ,
13. Shields B ,
14. Prokopenko I ,
15. Farrall M ,
16. Dominiczak A ,
17. Johnson T ,
18. Bergmann S ,
19. Beckmann JS ,
20. Vollenweider P ,
21. Waterworth DM ,
22. Mooser V ,
23. Palmer CN ,
24. Morris AD ,
25. Ouwehand WH ,
26. Zhao JH ,
27. Li S ,
28. Loos RJ ,
29. Barroso I ,
30. Deloukas P ,
31. Sandhu MS ,
32. Wheeler E ,
33. Soranzo N ,
34. Inouye M ,
35. Wareham NJ ,
36. Caulfield M ,
37. Munroe PB ,
38. Hattersley AT ,
39. McCarthy MI ,
40. Frayling TM
; Diabetes Genetics Initiative, Wellcome Trust Case Control Consortium, Cambridge GEM Consortium. Genome-wide association analysis identifies 20 loci that influence adult height. Nat Genet 2008;40:575–83.doi:10.1038/ng.121
OpenUrl CrossRef PubMed Web of Science
↵
2. McClay JL ,
3. Adkins DE ,
4. Åberg K ,
5. Bukszár J ,
6. Khachane AN ,
7. Keefe RSE ,
8. Perkins DO ,
9. McEvoy JP ,
10. Stroup TS ,
11. Vann RE ,
12. Beardsley PM ,
13. Lieberman JA ,
14. Sullivan PF ,
15. van den Oord EJCG
. Genome-wide pharmacogenomic study of neurocognition as an indicator of antipsychotic treatment response in schizophrenia. Neuropsychopharmacology 2011;36:616–26.doi:10.1038/npp.2010.193
OpenUrl CrossRef PubMed Web of Science
↵
2. Lee MK ,
3. Shaffer JR ,
4. Leslie EJ ,
5. Orlova E ,
6. Carlson JC ,
7. Feingold E ,
8. Marazita ML ,
9. Weinberg SM
. Genome-wide association study of facial morphology reveals novel associations with FREM1 and PARK2. PLoS One 2017;12:e0176566.doi:10.1371/journal.pone.0176566
↵
2. Hill WD ,
3. Marioni RE ,
4. Maghzian O ,
5. Ritchie SJ ,
6. Hagenaars SP ,
7. McIntosh AM ,
8. Gale CR ,
9. Davies G ,
10. Deary IJ
. A combined analysis of genetically correlated traits identifies 187 loci and a role for neurogenesis and myelination in intelligence. Mol Psychiatry;15.doi:10.1038/s41380-017-0001-5
↵
2. Wang X ,
3. Sun J ,
4. Li C ,
5. Mao B
. EphA7 modulates apical constriction of hindbrain neuroepithelium during neurulation in Xenopus. Biochem Biophys Res Commun 2016;479:759–65.doi:10.1016/j.bbrc.2016.09.138
OpenUrl
↵
2. Prost G ,
3. Braun S ,
4. Hertwig F ,
5. Winkler M ,
6. Jagemann L ,
7. Nolbrant S ,
8. Leefa IV ,
9. Offen N ,
10. Miharada K ,
11. Lang S ,
12. Artner I ,
13. Nuber UA
. The putative tumor suppressor gene EphA7 is a novel BMI-1 target. Oncotarget 2016;7:58203–17.doi:10.18632/oncotarget.11279
OpenUrl
↵
2. Choi H ,
3. Lee SH ,
4. Um SJ ,
5. Kim EJ
. CACUL1 functions as a negative regulator of androgen receptor in prostate cancer cells. Cancer Lett 2016;376:360–6.doi:10.1016/j.canlet.2016.04.019
OpenUrl
↵
2. Voorhoeve PG ,
3. van Mechelen W ,
4. Uitterlinden AG ,
5. Delemarre-van de Waal HA ,
6. Lamberts SW
. Androgen receptor gene CAG repeat polymorphism in longitudinal height and body composition in children and adolescents. Clin Endocrinol 2011;74:732–5.doi:10.1111/j.1365-2265.2011.03986.x
OpenUrl CrossRef PubMed
↵
2. Tsukasaki K ,
3. Miller CW ,
4. Greenspun E ,
5. Eshaghian S ,
6. Kawabata H ,
7. Fujimoto T ,
8. Tomonaga M ,
9. Sawyers C ,
10. Said JW ,
11. Koeffler HP
. Mutations in the mitotic check point gene, MAD1L1, in human cancers. Oncogene 2001;20:3301–5.doi:10.1038/sj.onc.1204421
OpenUrl CrossRef PubMed Web of Science
↵
Schizophrenia Psychiatric Genome-Wide Association Study (GWAS) Consortium. Genome-wide association study identifies five new schizophrenia loci. Nat Genet 2011;43:969–76.
OpenUrl CrossRef PubMed
↵
2. Ng MC ,
3. Shriner D ,
4. Chen BH ,
5. Li J ,
6. Chen WM ,
7. Guo X ,
8. Liu J ,
9. Bielinski SJ ,
10. Yanek LR ,
11. Nalls MA ,
12. Comeau ME ,
13. Rasmussen-Torvik LJ ,
14. Jensen RA ,
15. Evans DS ,
16. Sun YV ,
17. An P ,
18. Patel SR ,
19. Lu Y ,
20. Long J ,
21. Armstrong LL ,
22. Wagenknecht L ,
23. Yang L ,
24. Snively BM ,
25. Palmer ND ,
26. Mudgal P ,
27. Langefeld CD ,
28. Keene KL ,
29. Freedman BI ,
30. Mychaleckyj JC ,
31. Nayak U ,
32. Raffel LJ ,
33. Goodarzi MO ,
34. Chen YD ,
35. Taylor HA ,
36. Correa A ,
37. Sims M ,
38. Couper D ,
39. Pankow JS ,
40. Boerwinkle E ,
41. Adeyemo A ,
42. Doumatey A ,
43. Chen G ,
44. Mathias RA ,
45. Vaidya D ,
46. Singleton AB ,
47. Zonderman AB ,
48. Igo RP ,
49. Sedor JR ,
50. Kabagambe EK ,
51. Siscovick DS ,
52. McKnight B ,
53. Rice K ,
54. Liu Y ,
55. Hsueh WC ,
56. Zhao W ,
57. Bielak LF ,
58. Kraja A ,
59. Province MA ,
60. Bottinger EP ,
61. Gottesman O ,
62. Cai Q ,
63. Zheng W ,
64. Blot WJ ,
65. Lowe WL ,
66. Pacheco JA ,
67. Crawford DC ,
68. Grundberg E ,
69. Rich SS ,
70. Hayes MG ,
71. Shu XO ,
72. Loos RJ ,
73. Borecki IB ,
74. Peyser PA ,
75. Cummings SR ,
76. Psaty BM ,
77. Fornage M ,
78. Iyengar SK ,
79. Evans MK ,
80. Becker DM ,
81. Kao WH ,
82. Wilson JG ,
83. Rotter JI ,
84. Sale MM ,
85. Liu S ,
86. Rotimi CN ,
87. Bowden DW
. FIND Consortium eMERGE Consortium DIAGRAM Consortium MuTHER Consortium MEta-analysis of type 2 DIabetes in African Americans Consortium. Meta-analysis of genome-wide association studies in African Americans provides insights into the genetic architecture of type 2 diabetes. PLoS Genet 2014;10:e1004517.doi:10.1371/journal.pgen.1004517
↵
2. Demerath EW ,
3. Guan W ,
4. Grove ML ,
5. Aslibekyan S ,
6. Mendelson M ,
7. Zhou YH ,
8. Hedman ÅK ,
9. Sandling JK ,
10. Li LA ,
11. Irvin MR ,
12. Zhi D ,
13. Deloukas P ,
14. Liang L ,
15. Liu C ,
16. Bressler J ,
17. Spector TD ,
18. North K ,
19. Li Y ,
20. Absher DM ,
21. Levy D ,
22. Arnett DK ,
23. Fornage M ,
24. Pankow JS ,
25. Boerwinkle E ,
26. ÅK H ,
27. Li L-A IMR
. Epigenome-wide association study (EWAS) of BMI, BMI change and waist circumference in African American adults identifies multiple replicated loci. Hum Mol Genet 2015;24:4464–79.doi:10.1093/hmg/ddv161
OpenUrl CrossRef PubMed
↵
2. Rzehak P ,
3. Covic M ,
4. Saffery R ,
5. Reischl E ,
6. Wahl S ,
7. Grote V ,
8. Weber M ,
9. Xhonneux A ,
10. Langhendries JP ,
11. Ferre N ,
12. Closa-Monasterolo R ,
13. Escribano J ,
14. Verduci E ,
15. Riva E ,
16. Socha P ,
17. Gruszfeld D ,
18. Koletzko B
. DNA-Methylation and Body Composition in Preschool Children: Epigenome-Wide-Analysis in the European Childhood Obesity Project (CHOP)-Study. Sci Rep 2017;7:14349.doi:10.1038/s41598-017-13099-4
OpenUrl
↵
2. Lotta LA ,
3. Gulati P ,
4. Day FR ,
5. Payne F ,
6. Ongen H ,
7. van de Bunt M ,
8. Gaulton KJ ,
9. Eicher JD ,
10. Sharp SJ ,
11. Luan J ,
12. De Lucia Rolfe E ,
13. Stewart ID ,
14. Wheeler E ,
15. Willems SM ,
16. Adams C ,
17. Yaghootkar H ,
18. Forouhi NG ,
19. Khaw KT ,
20. Johnson AD ,
21. Semple RK ,
22. Frayling T ,
23. Perry JR ,
24. Dermitzakis E ,
25. McCarthy MI ,
26. Barroso I ,
27. Wareham NJ ,
28. Savage DB ,
29. Langenberg C ,
30. O’Rahilly S ,
31. Scott RA
; EPIC-InterAct Consortium Cambridge FPLD1 Consortium. Integrative genomic analysis implicates limited peripheral adipose storage capacity in the pathogenesis of human insulin resistance. Nat Genet 2017;49:17–26.doi:10.1038/ng.3714
OpenUrl CrossRef PubMed
↵
2. Castaño-Betancourt MC ,
3. Evans DS ,
4. Ramos YF ,
5. Boer CG ,
6. Metrustry S ,
7. Liu Y ,
8. den Hollander W ,
9. van Rooij J ,
10. Kraus VB ,
11. Yau MS ,
12. Mitchell BD ,
13. Muir K ,
14. Hofman A ,
15. Doherty M ,
16. Doherty S ,
17. Zhang W ,
18. Kraaij R ,
19. Rivadeneira F ,
20. Barrett-Connor E ,
21. Maciewicz RA ,
22. Arden N ,
23. Nelissen RG ,
24. Kloppenburg M ,
25. Jordan JM ,
26. Nevitt MC ,
27. Slagboom EP ,
28. Hart DJ ,
29. Lafeber F ,
30. Styrkarsdottir U ,
31. Zeggini E ,
32. Evangelou E ,
33. Spector TD ,
34. Uitterlinden AG ,
35. Lane NE ,
36. Meulenbelt I ,
37. Valdes AM ,
38. van Meurs JB
. Novel Genetic Variants for Cartilage Thickness and Hip Osteoarthritis. PLoS Genet 2016;12:e1006260.doi:10.1371/journal.pgen.1006260
↵
2. Mullin BH ,
3. Walsh JP ,
4. Zheng HF ,
5. Brown SJ ,
6. Surdulescu GL ,
7. Curtis C ,
8. Breen G ,
9. Dudbridge F ,
10. Richards JB ,
11. Spector TD ,
12. Wilson SG
. Genome-wide association study using family-based cohorts identifies the WLS and CCDC170/ESR1 loci as associated with bone mineral density. BMC Genomics 2016;17:136.doi:10.1186/s12864-016-2481-0
OpenUrl
↵
2. Dyment DA ,
3. Smith AC ,
4. Alcantara D ,
5. Schwartzentruber JA ,
6. Basel-Vanagaite L ,
7. Curry CJ ,
8. Temple IK ,
9. Reardon W ,
10. Mansour S ,
11. Haq MR ,
12. Gilbert R ,
13. Lehmann OJ ,
14. Vanstone MR ,
15. Beaulieu CL ,
16. Majewski J ,
17. Bulman DE ,
18. O’Driscoll M ,
19. Boycott KM ,
20. Innes AM
; FORGE Canada Consortium. Mutations in PIK3R1 cause SHORT syndrome. Am J Hum Genet 2013;93:158–66.doi:10.1016/j.ajhg.2013.06.005
OpenUrl CrossRef PubMed
↵
2. Majumdar A ,
3. Haldar T ,
4. Bhattacharya S ,
5. Witte JS
. An efficient Bayesian meta-analysis approach for studying cross-phenotype genetic associations. PLoS Genet 2018;14:e1007139.doi:10.1371/journal.pgen.1007139

Footnotes

Contributors All authors contributed to the feedback of the manuscript and played an important role in implementing the study. IG-F, MP, VM and RdC conceived the study. IG-F and RdC planned the study. LP coordinated the cohort recruitment. AC, JV and XD prepared the samples. MO-S and XD curated the epidemiological data variables. DP, RP, LR, SA and LS conducted the genotyping. IG-F, DP and LS analysed the clustering analysis. IG-F, MG-M, JMM and DT conducted the imputation analysis. IG-F and RdC conducted and supervised the genetic analysis. IG-F, MO-S and RdC wrote the manuscript. RdC submitted and supervised the study.
Funding This work was supported in part by the Spanish Ministerio de Economía y Competitividad (MINECO) project ADE 10/00026, by the Catalan Departament de Salut and by the Departament d’Empresa i Coneixement de la Generalitat de Catalunya, the Agència de Gestió d’Estudis Universitaris i de Recerca (AGAUR) (SGR 1269, SGR 1589 and SGR 647). RdC is the recipient of a Ramon y Cajal grant (RYC-2011-07822). The Project GCAT is coordinated by the Germans Trias i Pujol Research Institute (IGTP), in collaboration with the Catalan Institute of Oncology (ICO), and in partnership with the Blood and Tissue Bank of Catalonia (BST). IGTP is part of the CERCA Programme/Generalitat de Catalunya.
Competing interests None declared.
Patient consent Obtained.
Ethics approval http://www.ceicgermanstrias.cat/.
Provenance and peer review Not commissioned; externally peer reviewed.
Correction notice This article has been corrected since it was published online first. JMM has been added to the authors list and to the ’Contributors' section.

[1] ↵
Eurostat Statistics Explained. Mortality and life expectancy statistics, 2016. http://ec.europa.eu/eurostat/statistics-explained/index.php/Mortality_and_life_expectancy_statistics

[2] ↵

Dawber TR ,
Meadors GF ,
Moore FE
. Epidemiological approaches to heart disease: the Framingham Study. Am J Public Health Nations Health 1951;41:279–86.doi:10.2105/AJPH.41.3.279
OpenUrl CrossRef PubMed Web of Science

[4] Dawber TR ,

[5] Meadors GF ,

[6] Moore FE

[7] ↵

Riboli E ,
Hunt KJ ,
Slimani N ,
Ferrari P ,
Norat T ,
Fahey M ,
Charrondière UR ,
Hémon B ,
Casagrande C ,
Vignat J ,
Overvad K ,
Tjønneland A ,
Clavel-Chapelon F ,
Thiébaut A ,
Wahrendorf J ,
Boeing H ,
Trichopoulos D ,
Trichopoulou A ,
Vineis P ,
Palli D ,
Bueno-De-Mesquita HB ,
Peeters PH ,
Lund E ,
Engeset D ,
González CA ,
Barricarte A ,
Berglund G ,
Hallmans G ,
Day NE ,
Key TJ ,
Kaaks R ,
Saracci R
. European Prospective Investigation into Cancer and Nutrition (EPIC): study populations and data collection. Public Health Nutr 2002;5:1113–24.doi:10.1079/PHN2002394
OpenUrl CrossRef PubMed Web of Science

[9] Riboli E ,

[10] Hunt KJ ,

[11] Slimani N ,

[12] Ferrari P ,

[13] Norat T ,

[14] Fahey M ,

[15] Charrondière UR ,

[16] Hémon B ,

[17] Casagrande C ,

[18] Vignat J ,

[19] Overvad K ,

[20] Tjønneland A ,

[21] Clavel-Chapelon F ,

[22] Thiébaut A ,

[23] Wahrendorf J ,

[24] Boeing H ,

[25] Trichopoulos D ,

[26] Trichopoulou A ,

[27] Vineis P ,

[28] Palli D ,

[29] Bueno-De-Mesquita HB ,

[30] Peeters PH ,

[31] Lund E ,

[32] Engeset D ,

[33] González CA ,

[34] Barricarte A ,

[35] Berglund G ,

[36] Hallmans G ,

[37] Day NE ,

[38] Key TJ ,

[39] Kaaks R ,

[40] Saracci R

[41] ↵

Welter D ,
MacArthur J ,
Morales J ,
Burdett T ,
Hall P ,
Junkins H ,
Klemm A ,
Flicek P ,
Manolio T ,
Hindorff L ,
Parkinson H
. The NHGRI GWAS Catalog, a curated resource of SNP-trait associations. Nucleic Acids Res 2014;42:D1001–6.doi:10.1093/nar/gkt1229
OpenUrl CrossRef PubMed Web of Science

[43] Welter D ,

[44] MacArthur J ,

[45] Morales J ,

[46] Burdett T ,

[47] Hall P ,

[48] Junkins H ,

[49] Klemm A ,

[50] Flicek P ,

[51] Manolio T ,

[52] Hindorff L ,

[53] Parkinson H

[54] ↵

Manolio TA ,
Collins FS ,
Cox NJ ,
Goldstein DB ,
Hindorff LA ,
Hunter DJ ,
McCarthy MI ,
Ramos EM ,
Cardon LR ,
Chakravarti A ,
Cho JH ,
Guttmacher AE ,
Kong A ,
Kruglyak L ,
Mardis E ,
Rotimi CN ,
Slatkin M ,
Valle D ,
Whittemore AS ,
Boehnke M ,
Clark AG ,
Eichler EE ,
Gibson G ,
Haines JL ,
Mackay TF ,
McCarroll SA ,
Visscher PM
. Finding the missing heritability of complex diseases. Nature 2009;461:747–53.doi:10.1038/nature08494
OpenUrl CrossRef PubMed Web of Science

[56] Manolio TA ,

[57] Collins FS ,

[58] Cox NJ ,

[59] Goldstein DB ,

[60] Hindorff LA ,

[61] Hunter DJ ,

[62] McCarthy MI ,

[63] Ramos EM ,

[64] Cardon LR ,

[65] Chakravarti A ,

[66] Cho JH ,

[67] Guttmacher AE ,

[68] Kong A ,

[69] Kruglyak L ,

[70] Mardis E ,

[71] Rotimi CN ,

[72] Slatkin M ,

[73] Valle D ,

[74] Whittemore AS ,

[75] Boehnke M ,

[76] Clark AG ,

[77] Eichler EE ,

[78] Gibson G ,

[79] Haines JL ,

[80] Mackay TF ,

[81] McCarroll SA ,

[82] Visscher PM

[83] ↵

Visscher PM ,
Wray NR ,
Zhang Q ,
Sklar P ,
McCarthy MI ,
Brown MA ,
Yang J
. 10 Years of GWAS Discovery: Biology, Function, and Translation. Am J Hum Genet 2017;101:5–22.doi:10.1016/j.ajhg.2017.06.005
OpenUrl CrossRef PubMed

[85] Visscher PM ,

[86] Wray NR ,

[87] Zhang Q ,

[88] Sklar P ,

[89] McCarthy MI ,

[90] Brown MA ,

[91] Yang J

[92] ↵

Boyle EA ,
Li YI ,
Pritchard JK
. An Expanded View of Complex Traits: From Polygenic to Omnigenic. Cell 2017;169:1177–86.doi:10.1016/j.cell.2017.05.038
OpenUrl CrossRef PubMed

[94] Boyle EA ,

[95] Li YI ,

[96] Pritchard JK

[97] ↵

Chakravarti A ,
Turner TN
. Revealing rate-limiting steps in complex disease biology: The crucial importance of studying rare, extreme-phenotype families. Bioessays 2016;38:578–86.doi:10.1002/bies.201500203
OpenUrl

[99] Chakravarti A ,

[100] Turner TN

[101] ↵

Freimer N ,
Sabatti C
. The human phenome project. Nat Genet 2003;34:15–21.doi:10.1038/ng0503-15
OpenUrl CrossRef PubMed Web of Science

[103] Freimer N ,

[104] Sabatti C

[105] ↵

Cotsapas C ,
Voight BF ,
Rossin E ,
Lage K ,
Neale BM ,
Wallace C ,
Abecasis GR ,
Barrett JC ,
Behrens T ,
Cho J ,
De Jager PL ,
Elder JT ,
Graham RR ,
Gregersen P ,
Klareskog L ,
Siminovitch KA ,
van Heel DA ,
Wijmenga C ,
Worthington J ,
Todd JA ,
Hafler DA ,
Rich SS ,
Daly MJ
. FOCiS Network of Consortia. Pervasive sharing of genetic effects in autoimmune disease. PLoS Genet 2011;7:e1002254.

[107] Cotsapas C ,

[108] Voight BF ,

[109] Rossin E ,

[110] Lage K ,

[111] Neale BM ,

[112] Wallace C ,

[113] Abecasis GR ,

[114] Barrett JC ,

[115] Behrens T ,

[116] Cho J ,

[117] De Jager PL ,

[118] Elder JT ,

[119] Graham RR ,

[120] Gregersen P ,

[121] Klareskog L ,

[122] Siminovitch KA ,

[123] van Heel DA ,

[124] Wijmenga C ,

[125] Worthington J ,

[126] Todd JA ,

[127] Hafler DA ,

[128] Rich SS ,

[129] Daly MJ

[130] ↵

Ferreira MAR ,
Purcell SM
. A multivariate test of association. Bioinformatics 2009;25:132–3.doi:10.1093/bioinformatics/btn563
OpenUrl CrossRef PubMed Web of Science

[132] Ferreira MAR ,

[133] Purcell SM

[134] ↵

Kim J ,
Bai Y ,
Pan W
. An Adaptive Association Test for Multiple Phenotypes with GWAS Summary Statistics. Genet Epidemiol 2015;39:651–63.doi:10.1002/gepi.21931
OpenUrl CrossRef PubMed

[136] Kim J ,

[137] Bai Y ,

[138] Pan W

[139] ↵

Korte A ,
Vilhjálmsson BJ ,
Segura V ,
Platt A ,
Long Q ,
Nordborg M
. A mixed-model approach for genome-wide association studies of correlated traits in structured populations. Nat Genet 2012;44:1066–71.doi:10.1038/ng.2376
OpenUrl CrossRef PubMed

[141] Korte A ,

[142] Vilhjálmsson BJ ,

[143] Segura V ,

[144] Platt A ,

[145] Long Q ,

[146] Nordborg M

[147] ↵

O’Reilly PF ,
Hoggart CJ ,
Pomyen Y ,
Calboli FC ,
Elliott P ,
Jarvelin MR ,
Coin LJ
. MultiPhen: joint model of multiple phenotypes can increase discovery in GWAS. PLoS One 2012;7:e34861.doi:10.1371/journal.pone.0034861

[149] O’Reilly PF ,

[150] Hoggart CJ ,

[151] Pomyen Y ,

[152] Calboli FC ,

[153] Elliott P ,

[154] Jarvelin MR ,

[155] Coin LJ

[156] ↵

Zhu X ,
Feng T ,
Tayo BO ,
Liang J ,
Young JH ,
Franceschini N ,
Smith JA ,
Yanek LR ,
Sun YV ,
Edwards TL ,
Chen W ,
Nalls M ,
Fox E ,
Sale M ,
Bottinger E ,
Rotimi C ,
Liu Y ,
McKnight B ,
Liu K ,
Arnett DK ,
Chakravati A ,
Cooper RS ,
Redline S
; COGENT BP Consortium. Meta-analysis of correlated traits via summary statistics from GWASs with an application in hypertension. Am J Hum Genet 2015;96:21–36.doi:10.1016/j.ajhg.2014.11.011
OpenUrl CrossRef PubMed

[158] Zhu X ,

[159] Feng T ,

[160] Tayo BO ,

[161] Liang J ,

[162] Young JH ,

[163] Franceschini N ,

[164] Smith JA ,

[165] Yanek LR ,

[166] Sun YV ,

[167] Edwards TL ,

[168] Chen W ,

[169] Nalls M ,

[170] Fox E ,

[171] Sale M ,

[172] Bottinger E ,

[173] Rotimi C ,

[174] Liu Y ,

[175] McKnight B ,

[176] Liu K ,

[177] Arnett DK ,

[178] Chakravati A ,

[179] Cooper RS ,

[180] Redline S

[181] ↵

Ge T ,
Chen CY ,
Neale BM ,
Sabuncu MR ,
Smoller JW
. Phenome-wide heritability analysis of the UK Biobank. PLoS Genet 2017;13:e1006711.doi:10.1371/journal.pgen.1006711
OpenUrl

[183] Ge T ,

[184] Chen CY ,

[185] Neale BM ,

[186] Sabuncu MR ,

[187] Smoller JW

[188] ↵

Muñoz M ,
Pong-Wong R ,
Canela-Xandri O ,
Rawlik K ,
Haley CS ,
Tenesa A
. Evaluating the contribution of genetics and familial shared environment to common disease using the UK Biobank. Nat Genet 2016;48:980–3.doi:10.1038/ng.3618
OpenUrl PubMed

[190] Muñoz M ,

[191] Pong-Wong R ,

[192] Canela-Xandri O ,

[193] Rawlik K ,

[194] Haley CS ,

[195] Tenesa A

[196] ↵

Obón-Santacana M ,
Vilardell M ,
Carreras A ,
Duran X ,
Velasco J ,
Galván-Femenía I ,
Alonso T ,
Puig L ,
Sumoy L ,
Duell EJ ,
Perucho M ,
Moreno V ,
de Cid R
. GCAT|Genomes for life: a prospective cohort study of the genomes of Catalonia. BMJ Open 2018;8:e018324.doi:10.1136/bmjopen-2017-018324

[198] Obón-Santacana M ,

[199] Vilardell M ,

[200] Carreras A ,

[201] Duran X ,

[202] Velasco J ,

[203] Galván-Femenía I ,

[204] Alonso T ,

[205] Puig L ,

[206] Sumoy L ,

[207] Duell EJ ,

[208] Perucho M ,

[209] Moreno V ,

[210] de Cid R

[211] ↵

Liu Y ,
De A
. Multiple Imputation by Fully Conditional Specification for Dealing with Missing Data in a Large Epidemiologic Study. Int J Stat Med Res 2015;4:287–95.doi:10.6000/1929-6029.2015.04.03.7
OpenUrl

[213] Liu Y ,

[214] De A

[215] ↵

Howie B ,
Fuchsberger C ,
Stephens M ,
Marchini J ,
Abecasis GR
. Fast and accurate genotype imputation in genome-wide association studies through pre-phasing. Nat Genet 2012;44:955–9.doi:10.1038/ng.2354
OpenUrl CrossRef PubMed

[217] Howie B ,

[218] Fuchsberger C ,

[219] Stephens M ,

[220] Marchini J ,

[221] Abecasis GR

[222] ↵

Auton A ,
Brooks LD ,
Durbin RM ,
Garrison EP ,
Kang HM ,
Korbel JO ,
Marchini JL ,
McCarthy S ,
McVean GA ,
Abecasis GR
; 1000 Genomes Project Consortium. A global reference for human genetic variation. Nature 2015;526:68–74.doi:10.1038/nature15393
OpenUrl CrossRef PubMed

[224] Auton A ,

[225] Brooks LD ,

[226] Durbin RM ,

[227] Garrison EP ,

[228] Kang HM ,

[229] Korbel JO ,

[230] Marchini JL ,

[231] McCarthy S ,

[232] McVean GA ,

[233] Abecasis GR

[234] ↵

Deelen P ,
Menelaou A ,
van Leeuwen EM ,
Kanterakis A ,
van Dijk F ,
Medina-Gomez C ,
Francioli LC ,
Hottenga JJ ,
Karssen LC ,
Estrada K ,
Kreiner-Møller E ,
Rivadeneira F ,
van Setten J ,
Gutierrez-Achury J ,
Westra HJ ,
Franke L ,
van Enckevort D ,
Dijkstra M ,
Byelas H ,
van Duijn CM ,
de Bakker PI ,
Wijmenga C ,
Swertz MA
; Genome of Netherlands Consortium. Improved imputation quality of low-frequency and rare variants in European samples using the ‘Genome of The Netherlands’. Eur J Hum Genet 2014;22:1321–6.doi:10.1038/ejhg.2014.19
OpenUrl CrossRef PubMed

[236] Deelen P ,

[237] Menelaou A ,

[238] van Leeuwen EM ,

[239] Kanterakis A ,

[240] van Dijk F ,

[241] Medina-Gomez C ,

[242] Francioli LC ,

[243] Hottenga JJ ,

[244] Karssen LC ,

[245] Estrada K ,

[246] Kreiner-Møller E ,

[247] Rivadeneira F ,

[248] van Setten J ,

[249] Gutierrez-Achury J ,

[250] Westra HJ ,

[251] Franke L ,

[252] van Enckevort D ,

[253] Dijkstra M ,

[254] Byelas H ,

[255] van Duijn CM ,

[256] de Bakker PI ,

[257] Wijmenga C ,

[258] Swertz MA

[261] Huang J ,

[262] Howie B ,

[263] McCarthy S ,

[264] Memari Y ,

[265] Walter K ,

[266] Min JL ,

[267] Danecek P ,

[268] Malerba G ,

[269] Trabetti E ,

[270] Zheng HF ,

[271] Gambaro G ,

[272] Richards JB ,

[273] Durbin R ,

[274] Timpson NJ ,

[275] Marchini J ,

[276] Soranzo N ,

[277] Turki SA ,

[278] Amuzu A ,

[279] Anderson CA ,

[280] Anney R ,

[281] Antony D ,

[282] Artigas MS ,

[283] Ayub M ,

[284] Bala S ,

[285] Barrett JC ,

[286] Barroso I ,

[287] Beales P ,

[288] Benn M ,

[289] Bentham J ,

[290] Bhattacharya S ,

[291] Birney E ,

[292] Blackwood D ,

[293] Bobrow M ,

[294] Bochukova E ,

[295] Bolton PF ,

[296] Bounds R ,

[297] Boustred C ,

[298] Breen G ,

[299] Calissano M ,

[300] Carss K ,

[301] Casas JP ,

[302] Chambers JC ,

[303] Charlton R ,

[304] Chatterjee K ,

[305] Chen L ,

[306] Ciampi A ,

[307] Cirak S ,

[308] Clapham P ,

[309] Clement G ,

[310] Coates G ,

[311] Cocca M ,

[312] Collier DA ,

[313] Cosgrove C ,

[314] Cox T ,

[315] Craddock N ,

[316] Crooks L ,

[317] Curran S ,

[318] Curtis D ,

[319] Daly A ,

[320] Inm D ,

[321] Day-Williams A ,

[322] Dedoussis G ,

[323] Down T ,

[324] Du Y ,

[325] van DCM ,

[326] Dunham I ,

[327] Edkins S ,

[328] Ekong R ,

[329] Ellis P ,

[330] Evans DM ,

[331] Farooqi IS ,

[332] Fitzpatrick DR ,

[333] Flicek P ,

[334] Floyd J ,

[335] Foley AR ,

[336] Franklin CS ,

[337] Futema M ,

[338] Gallagher L ,

[339] Gasparini P ,

[340] Gaunt TR ,

[341] Geihs M ,

[342] Geschwind D ,

[343] Greenwood C ,

[344] Griffin H ,

[345] Grozeva D ,

[346] Guo X ,

[347] Guo X ,

[348] Gurling H ,

[349] Hart D ,

[350] Hendricks AE ,

[351] Holmans P ,

[352] Huang L ,

[353] Hubbard T ,

[354] Humphries SE ,

[355] Hurles ME ,

[356] Hysi P ,

[357] Iotchkova V ,

[358] Isaacs A ,

[359] Jackson DK ,

[360] Jamshidi Y ,

[361] Johnson J ,

[362] Joyce C ,

[363] Karczewski KJ ,

[364] Kaye J ,

[365] Keane T ,

[366] Kemp JP ,

[367] Kennedy K ,

[368] Kent A ,

[369] Keogh J ,

[370] Khawaja F ,

[371] Kleber ME ,

[372] van KM ,

[373] Kolb-Kokocinski A ,

[374] Kooner JS ,

[375] Lachance G ,

[376] Langenberg C ,

[377] Langford C ,

[378] Lawson D ,

[379] Lee I ,

[380] van LEM ,

[381] Lek M ,

[382] Li R ,

[383] Li Y ,

[384] Liang J ,

[385] Lin H ,

[386] Liu R ,

[387] Lönnqvist J ,

[388] Lopes LR ,

[389] Lopes M ,

[390] Luan J ,

[391] MacArthur DG ,

[392] Mangino M ,

[393] Marenne G ,

[394] März W ,

[395] Maslen J ,

[396] Matchan A ,

[397] Mathieson I ,

[398] McGuffin P ,

[399] McIntosh AM ,

[400] McKechanie AG ,

[401] McQuillin A ,

[402] Metrustry S ,

[403] Migone N ,

[404] Mitchison HM ,

[405] Moayyeri A ,

[406] Morris J ,

[407] Morris R ,

[408] Muddyman D ,

[409] Muntoni F ,

[410] Nordestgaard BG ,

[411] Northstone K ,

[412] O’Donovan MC ,

[413] O’Rahilly S ,

[414] Onoufriadis A ,

[415] Oualkacha K ,

[416] Owen MJ ,

[417] Palotie A ,

[418] Panoutsopoulou K ,

[419] Parker V ,

[420] Parr JR ,

[421] Paternoster L ,

[422] Paunio T ,

[423] Payne F ,

[424] Payne SJ ,

[425] Perry JRB ,

[426] Pietilainen O ,

[427] Plagnol V ,

[428] Pollitt RC ,

[429] Povey S ,

[430] Quail MA ,

[431] Quaye L ,

[432] Raymond L ,

[433] Rehnström K ,

[434] Ridout CK ,

[435] Ring S ,

[436] Ritchie GRS ,

[437] Roberts N ,

[438] Robinson RL ,

[439] Savage DB ,

[440] Scambler P ,

[441] Schiffels S ,

[442] Schmidts M ,

[443] Schoenmakers N ,

[444] Scott RH ,

[445] Scott RA ,

[446] Semple RK ,

[447] Serra E ,

[448] Sharp SI ,

[449] Shaw A ,

[450] Shihab HA ,

[451] Shin S-Y ,

[452] Skuse D ,

[453] Small KS ,

[454] Smee C ,

[455] Smith GD ,

[456] Southam L ,

[457] Spasic-Boskovic O ,

[458] Spector TD ,

[459] Clair DS ,

[460] Pourcain BS ,

[461] Stalker J ,

[462] Stevens E ,

[463] Sun J ,

[464] Surdulescu G ,

[465] Suvisaari J ,

[466] Syrris P ,

[467] Tachmazidou I ,

[468] Taylor R ,

[469] Tian J ,

[470] Tobin MD ,

[471] Toniolo D ,

[472] Traglia M ,

[473] Tybjaerg-Hansen A ,

[474] Valdes AM ,

[475] Vandersteen AM ,

[476] Varbo A ,

[477] Vijayarangakannan P ,

[478] Visscher PM ,

[479] Wain LV ,

[480] Walters JTR ,

[481] Wang G ,

[482] Wang J ,

[483] Wang Y ,

[484] Ward K ,

[485] Wheeler E ,

[486] Whincup P ,

[487] Whyte T ,

[488] Williams HJ ,

[489] Williamson KA ,

[490] Wilson C ,

[491] Wilson SG ,

[492] Wong K ,

[493] Xu C ,

[494] Yang J ,

[495] Zaza G ,

[496] Zeggini E ,

[497] Zhang F ,

[498] Zhang P ,

[499] Zhang W

[500] ↵

McCarthy S ,
Das S ,
Kretzschmar W ,
Delaneau O ,
Wood AR ,
Teumer A ,
Kang HM ,
Fuchsberger C ,
Danecek P ,
Sharp K ,
Luo Y ,
Sidore C ,
Kwong A ,
Timpson N ,
Koskinen S ,
Vrieze S ,
Scott LJ ,
Zhang H ,
Mahajan A ,
Veldink J ,
Peters U ,
Pato C ,
van Duijn CM ,
Gillies CE ,
Gandin I ,
Mezzavilla M ,
Gilly A ,
Cocca M ,
Traglia M ,
Angius A ,
Barrett JC ,
Boomsma D ,
Branham K ,
Breen G ,
Brummett CM ,
Busonero F ,
Campbell H ,
Chan A ,
Chen S ,
Chew E ,
Collins FS ,
Corbin LJ ,
Smith GD ,
Dedoussis G ,
Dorr M ,
Farmaki AE ,
Ferrucci L ,
Forer L ,
Fraser RM ,
Gabriel S ,
Levy S ,
Groop L ,
Harrison T ,
Hattersley A ,
Holmen OL ,
Hveem K ,
Kretzler M ,
Lee JC ,
McGue M ,
Meitinger T ,
Melzer D ,
Min JL ,
Mohlke KL ,
Vincent JB ,
Nauck M ,
Nickerson D ,
Palotie A ,
Pato M ,
Pirastu N ,
McInnis M ,
Richards JB ,
Sala C ,
Salomaa V ,
Schlessinger D ,
Schoenherr S ,
Slagboom PE ,
Small K ,
Spector T ,
Stambolian D ,
Tuke M ,
Tuomilehto J ,
Van den Berg LH ,
Van Rheenen W ,
Volker U ,
Wijmenga C ,
Toniolo D ,
Zeggini E ,
Gasparini P ,
Sampson MG ,
Wilson JF ,
Frayling T ,
de Bakker PI ,
Swertz MA ,
McCarroll S ,
Kooperberg C ,
Dekker A ,
Altshuler D ,
Willer C ,
Iacono W ,
Ripatti S ,
Soranzo N ,
Walter K ,
Swaroop A ,
Cucca F ,
Anderson CA ,
Myers RM ,
Boehnke M ,
McCarthy MI ,
Durbin R
; Haplotype Reference Consortium. A reference panel of 64,976 haplotypes for genotype imputation. Nat Genet 2016;48:1279–83.doi:10.1038/ng.3643
OpenUrl CrossRef PubMed

Log in using your username and password

Main menu

Log in using your username and password

You are here

Abstract

Statistics from Altmetric.com

Request Permissions

Introduction

Materials and methods

Population

Study participants

Phenome

Supplementary file 1

Genotyping, relatedness and population structure

Supplementary file 2

Multipanel imputation

Heritability

Single-trait genome-wide association analysis

Supplementary file 3

Multitrait meta-analysis for correlated traits

Polygenic risk score

Supplementary file 4

URLs

Results

Heritability estimates

Phenome analysis

Supplementary file 5

Supplementary file 6

Multitrait meta-analysis of anthropometric traits

Supplementary file 7

Supplementary file 8

Supplementary file 9

Polygenic risk score

Discussion

Supplementary file 12

Supplementary file 13

Supplementary file 14

Supplementary file 10

Supplementary file 11

Supplementary file 15

Acknowledgments

References

Footnotes

Read the full text or download the PDF:

Log in using your username and password