High-resolution mapping and analysis of copy number variations in the human genome: A data resource for clinical and research applications
- Tamim H. Shaikh1,2,11,
- Xiaowu Gai3,11,
- Juan C. Perin3,
- Joseph T. Glessner4,
- Hongbo Xie3,
- Kevin Murphy5,
- Ryan O'Hara3,
- Tracy Casalunovo4,
- Laura K. Conlin1,
- Monica D'Arcy5,
- Edward C. Frackelton4,
- Elizabeth A. Geiger1,
- Chad Haldeman-Englert1,
- Marcin Imielinski4,
- Cecilia E. Kim4,
- Livija Medne1,
- Kiran Annaiah4,
- Jonathan P. Bradfield4,
- Elvira Dabaghyan4,
- Andrew Eckert4,
- Chioma C. Onyiah4,
- Svetlana Ostapenko3,
- F. George Otieno4,
- Erin Santa4,
- Julie L. Shaner4,
- Robert Skraban4,
- Ryan M. Smith4,
- Josephine Elia6,7,
- Elizabeth Goldmuntz2,9,
- Nancy B. Spinner1,2,
- Elaine H. Zackai1,2,
- Rosetta M. Chiavacci4,
- Robert Grundmeier2,3,8,
- Eric F. Rappaport3,
- Struan F.A. Grant1,2,4,
- Peter S. White2,3,5,12 and
- Hakon Hakonarson1,2,4,10,12
- 1 Division of Genetics, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania 19104, USA;
- 2 Department of Pediatrics, University of Pennsylvania School of Medicine, Philadelphia, Pennsylvania 19104, USA;
- 3 Center for Biomedical Informatics, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania 19104, USA;
- 4 Center for Applied Genomics, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania 19104, USA;
- 5 Division of Oncology, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania 19104, USA;
- 6 Department of Child and Adolescent Psychiatry, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania 19104, USA;
- 7 Department of Psychiatry, University of Pennsylvania School of Medicine, Philadelphia, Pennsylvania 19104, USA;
- 8 Division of General Pediatrics, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania 19104, USA;
- 9 Division of Cardiology, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania 19104, USA;
- 10 Division of Pulmonary Medicine, The Children's Hospital of Philadelphia, Philadelphia, Pennsylvania 19104, USA
-
↵11 These authors contributed equally to this work.
Abstract
We present a database of copy number variations (CNVs) detected in 2026 disease-free individuals, using high-density, SNP-based oligonucleotide microarrays. This large cohort, comprised mainly of Caucasians (65.2%) and African-Americans (34.2%), was analyzed for CNVs in a single study using a uniform array platform and computational process. We have catalogued and characterized 54,462 individual CNVs, 77.8% of which were identified in multiple unrelated individuals. These nonunique CNVs mapped to 3272 distinct regions of genomic variation spanning 5.9% of the genome; 51.5% of these were previously unreported, and >85% are rare. Our annotation and analysis confirmed and extended previously reported correlations between CNVs and several genomic features such as repetitive DNA elements, segmental duplications, and genes. We demonstrate the utility of this data set in distinguishing CNVs with pathologic significance from normal variants. Together, this analysis and annotation provides a useful resource to assist with the assessment of CNVs in the contexts of human variation, disease susceptibility, and clinical molecular diagnostics.
Footnotes
-
↵12 Corresponding authors.
E-mail white{at}genome.chop.edu; fax (215) 590-3020.
E-mail hakonarson{at}chop.edu; fax (267) 426-0363.
-
[Supplemental material is available online at http://www.genome.org. The CNV data reported here are available at http://cnv.chop.edu. These data are also available in the Database of Genomic Variants (DGV) (http://projects.tcag.ca/variation). The individual level intensity data from the Illumina arrays are available in dbGaP (http://www.ncbi.nlm.nih.gov/dbgap) under accession phs000199.v1.p1.]
-
Article published online before print. Article and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.083501.108.
-
- Received December 1, 2008.
- Accepted June 17, 2009.
- Copyright © 2009 by Cold Spring Harbor Laboratory Press