Background Parkinson’s disease (PD) is a neurodegenerative disorder with complex genetic architecture. Besides rare mutations in high-risk genes related to monogenic familial forms of PD, multiple variants associated with sporadic PD were discovered via association studies.
Methods We studied the whole-exome sequencing data of 340 PD cases and 146 ethnically matched controls from the Parkinson’s Progression Markers Initiative (PPMI) and performed burden analysis for different rare variant classes. Disease prediction models were built based on clinical, non-clinical and genetic features, including both common and rare variants, and two machine learning methods.
Results We observed a significant exome-wide burden of singleton loss-of-function variants (corrected p=0.037). Overall, no exome-wide burden of rare amino acid changing variants was detected. Finally, we built a disease prediction model combining singleton loss-of-function variants, a polygenic risk score based on common variants, and family history of PD as features and reached an area under the curve of 0.703 (95% CI 0.698 to 0.708). By incorporating a rare variant feature, our model increased the performance of the state-of-the-art classification model for the PPMI dataset, which reached an area under the curve of 0.639 based on common variants alone.
Conclusion The main finding of this study is to highlight the contribution of singleton loss-of-function variants to the complex genetics of PD and that disease risk prediction models combining singleton and common variants can improve models built solely on common variants.
- Parkinson-s disease
- complex traits
This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/.
Statistics from Altmetric.com
DRB and PB are joint first authors.
Contributors DRB and PM conceived the study. DRB analysed the data. PB performed machine learning. All authors interpreted the results and wrote the manuscript.
Funding This work was supported by grants from the Luxembourg National Research Fund (FNR) within the National Centre for Excellence in Research on Parkinson’s disease (NCER-PD to PB, RK, PM) and PEARL program (FNR/P13/6682797 to RK), the EU Joint Program-Neurodegenerative Diseases (JPND; COURAGE-PD to RK, DRB, PM), the German Research Council (DFG; KR2119/8-1 to RK) and by the European Union’s Horizon2020 research and innovation program under grant agreement no. 692320 (WIDESPREAD; CENTRE-PD to RK and PM).
Competing interests RK received speaker’s honoraria and travel grants from Medtronic. DRB works for MeGeno S.A. Luxembourg.
Patient consent for publication Not required.
Provenance and peer review Not commissioned; externally peer reviewed.
Data availability statement Data are available in a public, open access repository. Data used in the preparation of this article were obtained from the Parkinson’s Progression Markers Initiative (PPMI) database (www.ppmi-info.org/data). For up-to-date information on the study, visit the PPMI website (www.ppmi-info.org).
If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.