Genome mapping on nanochannel arrays for structural variation analysis and sequence assembly

Nat Biotechnol. 2012 Aug;30(8):771-6. doi: 10.1038/nbt.2303.

Abstract

We describe genome mapping on nanochannel arrays. In this approach, specific sequence motifs in single DNA molecules are fluorescently labeled, and the DNA molecules are uniformly stretched in thousands of silicon channels on a nanofluidic device. Fluorescence imaging allows the construction of maps of the physical distances between occurrences of the sequence motifs. We demonstrate the analysis, individually and as mixtures, of 95 bacterial artificial chromosome (BAC) clones that cover the 4.7-Mb human major histocompatibility complex region. We obtain accurate, haplotype-resolved, sequence motif maps hundreds of kilobases in length, resulting in a median coverage of 114× for the BACs. The final sequence motif map assembly contains three contigs. With an average distance of 9 kb between labels, we detect 22 haplotype differences. We also use the sequence motif maps to provide scaffolds for de novo assembly of sequencing data. Nanochannel genome mapping should facilitate de novo assembly of sequencing reads from complex regions in diploid organisms, haplotype and structural variation analysis and comparative genomics.
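
The idea of a sequence motif map, the ordered physical distances between successive occurrences of a motif, can be illustrated in silico. The following is a minimal sketch, not the authors' pipeline: the motif `GCTCTTC`, the toy sequence, and the function names `motif_positions` and `label_intervals` are all assumptions chosen for illustration, and only the forward strand is scanned.

```python
# Illustrative sketch only: the motif, the toy sequence and the helper names
# are hypothetical and are not taken from the paper.

MOTIF = "GCTCTTC"  # assumed example motif


def motif_positions(sequence, motif):
    """Return 0-based start positions of every occurrence of motif (forward strand only)."""
    positions = []
    start = sequence.find(motif)
    while start != -1:
        positions.append(start)
        # Resume the search one base further on so overlapping hits are not missed.
        start = sequence.find(motif, start + 1)
    return positions


def label_intervals(positions):
    """Return distances in bases between consecutive label positions."""
    return [b - a for a, b in zip(positions, positions[1:])]


# Toy sequence with three motif occurrences separated by short spacers.
toy_sequence = "AA" + MOTIF + "AAAAA" + MOTIF + "AA" + MOTIF

positions = motif_positions(toy_sequence, MOTIF)
intervals = label_intervals(positions)
print(positions)  # label positions along the molecule
print(intervals)  # the "map": distances between neighbouring labels
```

An experimentally measured map would carry the same kind of interval list, with distances inferred from fluorescence images of stretched molecules rather than from known sequence; comparing such interval lists is what allows maps to be aligned, assembled into contigs, or used as scaffolds.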

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Base Sequence
  • Chromosome Mapping / methods*
  • Chromosomes, Artificial, Bacterial
  • Fluorescent Dyes / chemistry
  • Haplotypes / genetics
  • Humans
  • Major Histocompatibility Complex / genetics
  • Microfluidic Analytical Techniques / instrumentation*
  • Molecular Sequence Data
  • Nanotechnology / instrumentation*
  • Nucleotide Motifs

Substances

  • Fluorescent Dyes