Complex SNP-related sequence variation in segmental genome duplications

Nat Genet. 2004 Aug;36(8):861-6. doi: 10.1038/ng1401. Epub 2004 Jul 11.

Abstract

There is uncertainty about the true nature of predicted single-nucleotide polymorphisms (SNPs) in segmental duplications (duplicons) and whether these markers genuinely exist at increased density as indicated in public databases. We explored these issues by genotyping 157 predicted SNPs in duplicons and control regions in normal diploid genomes and fully homozygous complete hydatidiform moles. Our data identified many true SNPs in duplicon regions and few paralogous sequence variants. Twenty-eight percent of the polymorphic duplicon sequences we tested involved multisite variation, a new type of polymorphism representing the sum of the signals from many individual duplicon copies that vary in sequence content due to duplication, deletion or gene conversion. Multisite variations can masquerade as normal SNPs when genotyped. Given that duplicons comprise at least 5% of the genome and many are yet to be annotated in the genome draft, effective strategies to identify multisite variation must be established and deployed.
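The reasoning behind using complete hydatidiform moles can be illustrated with a short sketch. Because these genomes are fully homozygous, a single-copy SNP should never give a heterozygous call in them, so an apparent heterozygote points to signal summed over more than one duplicon copy. The function `classify_marker`, its genotype-string input format, and the decision rules below are illustrative assumptions and not the authors' published criteria.

```python
from typing import List


def classify_marker(diploid_calls: List[str], mole_calls: List[str]) -> str:
    """Classify a predicted SNP from two-letter genotype calls such as 'AA' or 'AG'.

    Hypothetical decision rules (assumed for illustration only):
      - one homozygous genotype everywhere          -> "monomorphic"
      - heterozygous in every sample, moles included -> "paralogous sequence variant"
      - heterozygous in some (not all) samples,
        including at least one mole                  -> "multisite variation"
      - variable, but moles always homozygous        -> "SNP"
    """
    if not diploid_calls or not mole_calls:
        raise ValueError("need at least one diploid call and one mole call")

    def normalize(call: str) -> str:
        # Treat 'GA' and 'AG' as the same genotype.
        return "".join(sorted(call.upper()))

    def is_het(call: str) -> bool:
        return call[0] != call[1]

    diploid = [normalize(c) for c in diploid_calls]
    moles = [normalize(c) for c in mole_calls]
    all_calls = diploid + moles
    distinct = set(all_calls)

    if len(distinct) == 1 and not is_het(all_calls[0]):
        return "monomorphic"
    if all(is_het(c) for c in all_calls):
        # A fixed difference between duplicon copies looks heterozygous everywhere.
        return "paralogous sequence variant"
    if any(is_het(c) for c in moles):
        # A homozygous genome cannot be heterozygous at a single-copy site.
        return "multisite variation"
    return "SNP"


if __name__ == "__main__":
    print(classify_marker(["AA", "AG", "GG"], ["AA", "GG"]))
    print(classify_marker(["AG", "AG"], ["AG", "AG"]))
    print(classify_marker(["AA", "AG"], ["AG", "AA"]))
```

Under these assumed rules, a marker genotyped only in diploid samples would be indistinguishable from an ordinary SNP in the third case, which is the masquerading effect the abstract describes.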

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Evolution, Molecular
  • Female
  • Gene Dosage
  • Genetic Markers
  • Genetic Variation
  • Genome, Human
  • Genotype
  • Humans
  • Hydatidiform Mole / genetics
  • Polymorphism, Single Nucleotide*
  • Pregnancy
  • Repetitive Sequences, Nucleic Acid*

Substances

  • Genetic Markers