Evaluation of microsatellite variation in the 1000 Genomes Project pilot studies is indicative of the quality and utility of the raw data and alignments

Genomics. 2011 Apr;97(4):193-9. doi: 10.1016/j.ygeno.2011.01.001. Epub 2011 Jan 9.

Abstract

We performed an analysis of global microsatellite variation on the two kindreds sequenced at high depth (~20×-60×) in the 1000 Genomes Project pilot studies because alterations in these highly mutable repetitive sequences have been linked with many phenotypes and disease risks. The standard alignment technique performs poorly in microsatellite regions as a consequence of low effective coverage (~1×-5×) resulting in 79% of the informative loci exhibiting non-Mendelian inheritance patterns. We used a more stringent approach in computing robust allelotypes resulting in 94.4% of the 1095 informative repeats conforming to traditional inheritance. The high-confidence allelotypes were analyzed to obtain an estimate of the minimum polymorphism rate as a function of motif length, motif sequence, and distribution within the genome.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Female
  • Genetic Variation
  • Genome, Human / genetics*
  • Humans
  • Male
  • Microsatellite Repeats / genetics*
  • Pedigree
  • Pilot Projects
  • Sequence Alignment / methods
  • Sequence Analysis, DNA / standards