Review
SNP analysis to dissect human traits

https://doi.org/10.1016/S0959-4388(00)00261-0Get rights and content

Abstract

The analysis of complex human diseases has been spurred by the number of published genomic sequence variants — many identified in the course of sequencing the human genome. But, to be useful for genetic analysis, variants have to be mapped accurately, their frequencies in various populations determined, and automated high-throughput assay techniques developed. Recently proposed methods address these issues: the use of ‘reduced representation shotgun’ methods for more efficient detection of single nucleotide polymorphisms (SNPs), the employment of high-throughput genotyping techniques, the development of SNP maps that incorporate information about linkage disequilibrium, and the use of SNPs in identifying susceptibility genes for common illnesses.

Introduction

There is increasing appreciation of the role played by genetic predisposition and susceptibility in such common neurological disorders as Alzheimer's disease (AD), epilepsy, Parkinson's disease, multiple sclerosis and stroke. These disorders show familial aggregation, although the mode of inheritance is not clearly Mendelian in most cases. The role that a common genomic variant might play in susceptibility to disease is best exemplified by the role that the apolipoprotein E (APOE) ε4 allele plays in AD. The ε4 allele is highly associated with the presence of AD and with earlier age of onset of disease. It is a robust association seen in many populations studied [1]. Polymorphic variation has also been implicated in stroke and cardiovascular disease [2] and in multiple sclerosis [3]. It is increasingly clear that the risk of developing many common disorders and the metabolism of medications used to treat these conditions are substantially influenced by underlying genomic variation, although the effects of any one variant might be small.

In the 1980s, restriction enzymes were used to identify single base-pair changes in genomic DNA that result in the gain or loss of a restriction site [4]. These nucleotide variants were called ‘restriction fragment length polymorphisms’ (RFLPs) and were used in early linkage studies. In the early 1990s, RFLPs were replaced by simple tandem repeats or microsatellite markers. These markers show high levels of allelic variation, are distributed throughout the human genome, and can be efficiently amplified using PCR. Microsatellite markers have been used successfully in the positional cloning of many monogenic disease genes by linkage and allelic association [5].

In recent years, there has been a greater interest in studying the genetic basis of more common disorders, in which multiple genes of small effect are involved and for which the modes of inheritance are more complex. Standard linkage analysis using large pedigrees has only limited power to detect such small effects in these disorders [6]. Association studies using unrelated cases and controls, or using smaller family groups such as sibling pairs or ‘two parents and affected child’ trios have been proposed to be more likely to detect these small effects. Quantitative analysis and mathematical modeling have suggested that genome-wide association studies using single nucleotide polymorphisms (SNPs) are more effective than linkage analysis for identifying complex disease genes 6., 7., 8.. Such studies can then take advantage of SNPs, which are easy to type, highly abundant (found on average once per 1.3 kb in the genome) and stable (i.e. not prone to the ‘slippage’ seen with microsatellite repeats).

SNPs that are associated with disease may have a direct effect on the function of the gene in which they are located. A variant may result in an amino acid change or may alter exon–intron splicing, thereby directly modifying the relevant protein, or it may exist in a regulatory region, altering the level of expression or the stability of the mRNA. Alternatively a SNP may be in linkage disequilibrium (LD) with the ‘true’ functional variant. LD, also known as allelic association, exists when alleles at two distinct locations of the genome are more highly associated than expected. To this end, the development of SNP-based LD maps could facilitate whole-genome association studies, leading to more efficient detection of candidate susceptibility genes.

The immense interest in studies with SNPs is illustrated by a recent PubMed review of papers with the keyword ‘SNP’ from June 2000 to present: nearly 500 papers have been published. In this review, we highlight the have been published. In this review, we highlight the results from some of the most significant studies.

Section snippets

Identification and characterization of SNPs

Many different techniques can be used to identify and characterize SNPs, including single-strand conformation polymorphism analysis, heteroduplex analysis by denaturing high-performance liquid chromatography (DHPLC), direct DNA sequencing and computational methods [9•]. Thanks to the wealth of sequence information in public databases, computational tools can be used to identify SNPs in silico by aligning independently submitted sequences for a given gene (either cDNA or genomic sequences).

SNP genotyping methodologies

Association studies with SNPs typically use genomic DNA from hundreds of individuals and numerous SNPs. The development of high-throughput technologies has been vital to the widespread use of SNPs in research and industry. The most common SNP typing methods currently include hybridization, primer extension and cleavage methods. Each of these methods must be connected to an appropriate detection system. Detection technologies include fluorescent polarization [20], luminometric detection of

Linkage disequilibrium and SNP maps

As mentioned above, association studies with SNPs can be performed using SNPs that are predicted to have a direct functional consequence in a gene or by using SNPs selected randomly as a marker for LD. LD is generally defined as a measure of the degree of association between two genetic markers and can be used to identify regions of the genome associated with the disease. The construction of a SNP map for the purposes of LD has been complicated by the marked genomic variability in LD that

SNPs and candidate-gene analysis

Several hundred genes have been analyzed for their SNP content 34., 35., 36., 37., 38., 39., 40.. Although the methods and populations used differ, a consistent observation has been that changes in non-coding sequences and synonymous changes in coding sequence are generally more common than non-synonymous changes, reflecting greater selective pressure on the coding sequence. (A synonymous change refers to one that does not alter an amino acid whereas a non-synonymous change causes an amino acid

SNPs in pharmacogenetics

Pharmacogenetic initiatives try to identify genetic variants that influence a patient's response to a drug (ideally suited or likely to induce side effects) [49]. For example, gene variants in a drug-metabolizing enzyme have been linked to adverse reactions with azathioprine, mercaptopurine and thioguanine [50]. Another common SNP is associated with antibiotic-induced cardiac arrhythmia, which is clinically silent before drug exposure [51].

Patients with a variation in the core promoter of the

Conclusions

Most common diseases are thought to result from a mixture of genetic and environmental risk factors; as a result, the contribution of each gene is likely to be relatively small. Allelic association methods are more powerful in the detection of these genetic risk factors than conventional linkage approaches; however, allelic association methods require genetic markers to be very closely spaced because they rely on linkage disequilibrium between the marker and the disease allele.

Recent

Acknowledgements

This work was supported by NIH grants AG16208 and AA08403 and funding from the Leda J Sears Trust.

References and recommended reading

Papers of particular interest, published within the annual period of review, have been highlighted as:

  • • of special interest

  • •• of outstanding interest

References (53)

  • T. Emahazion et al.

    SNP association studies in Alzheimer's disease highlight problems for complex disease analysis

    Trends Genet

    (2001)
  • E.R. Martin et al.

    SNPing away at complex diseases: analysis of single-nucleotide polymorphisms around APOE in Alzheimer disease

    Am J Hum Genet

    (2000)
  • U.A. Meyer

    Pharmacogenetics and adverse drug reactions

    Lancet

    (2000)
  • D. Botstein et al.

    Construction of a genetic linkage map in man using restriction length polymorphisms

    Am J Hum Genet

    (1980)
  • P. Deloukas et al.

    A physical map of 30,000 human genes

    Science

    (1998)
  • N. Risch et al.

    The future of genetic studies of complex human diseases

    Science

    (1996)
  • E.S. Lander

    The new genomics: global views of biology

    Science

    (1996)
  • L. Kruglyak

    The use of a genetic map of biallelic markers in linkage studies

    Nat Genet

    (1997)
  • M.M. Shi

    Enabling large-scale pharmacogenetic studies by high throughput mutation detection and genotyping technologies

    Clin Chem

    (2001)
  • D.G. Cox et al.

    Data mining: efficiency of using sequence databases for polymorphism discovery

    Hum Mutat

    (2001)
  • K.H. Buetow et al.

    High-throughput development and characterization of a genome-wide collection of gene-based single nucleotide polymorphism markers by chip-based matrix-assisted laser desorption/ionization time-of-flight mass spectrometry

    Proc Natl Acad Sci USA

    (2001)
  • J.H. Wolford et al.

    High-throughput SNP detection by using DNA pooling and denaturing high performance liquid chromatography (DHPLC)

    Hum Genet

    (2000)
  • D. Altschuler et al.

    An SNP map of the human genome generated by reduced representation shotgun sequencing

    Nature

    (2000)
  • G. Marth et al.

    Single nucleotide polymorphisms in the public domain: how useful are they?

    Nat Genet

    (2001)
  • A map of human genome sequence variation containing 1.42 million single nucleotide polymorphisms

    Nature

    (2001)
  • E. Dawson et al.

    A SNP resource for human chromosome 22: extracting dense clusters of SNPs from the genomic sequence

    Genome Res

    (2001)
  • Cited by (40)

    • Cross-disorder analysis of endometriosis and its comorbid diseases reveals shared genes and molecular pathways and proposes putative biomarkers of endometriosis

      2020, Reproductive BioMedicine Online
      Citation Excerpt :

      For each dataset, multiple test correction to the unadjusted P-values was computed using the FDR method, and values of FDR <0.05 and fold change (|FC|) >1.5 were set as the threshold criteria to identify differentially expressed genes. Second, and over recent years, the analysis of gene variation, and more specifically of single-nucleotide polymorphisms (SNP), has become a promising trend in biomedical research as it has been widely proven to help to identify genetic predisposition and susceptibility to a disease (Matalliotakis et al., 2017a; Nowotny et al., 2001). As Phenopedia retrieves lists of genes that have been studied for genetic association with a given MeSH term, the study interrogated for the presence of genetic variation in the ‘endometriosis sibling disorders’ (ESD; see below) genes using DisGeNET (http://disgenet.org/home/), which constitutes a comprehensive platform collecting data on genes and variants associated with human diseases (Piñero et al., 2015; Piñero et al., 2017).

    • Association between vitamin D receptor gene FokI and TaqI variants with autism spectrum disorder predisposition in Iranian population

      2020, Gene
      Citation Excerpt :

      However, the complex pathophysiology of ASD cannot simply be explained by a single genetic variant. In 2001, Nowotny and his colleagues discovered that variants association methods are potent tools for identification of genetic susceptibility in common diseases (Nowotny et al., 2001). Hence, the investigation of single nucleotide polymorphisms in autism may ascertain the probable predisposing genetic factors to develop ASD.

    • Influence of candidate polymorphisms on the dipeptidyl peptidase IV and μ-opioid receptor genes expression in aspect of the β-casomorphin-7 modulation functions in autism

      2015, Peptides
      Citation Excerpt :

      Following autism factor is single nucleotide polymorphism, which can clarify risk factors for disease by their action as genomic markers. Nowotny et al. [32] showed that SNP allelic association methods are powerful tools in detection of genetic factors predisposing to most common diseases. We analyzed the differences between ASD and controls in the distribution of genotypes and resultant influence of the analyzed SNPs on gene expression (Figs. 2, 3, 5 and 6).

    • A novel efficient dynamic programming algorithm for haplotype block partitioning

      2010, Journal of Theoretical Biology
      Citation Excerpt :

      Haplotype can be defined as an asset of SNPs on a single chromosome that are associated and inherited as a unit. Recently, haplotype analysis has been successfully applied to the identification of DNA variations relevant to several common and complex diseases (Bonnen et al., 2002; Indap et al., 2005; Mas et al., 2005; Reif et al., 2006; Gray et al., 2000; Nowotny et al., 2001). Many studies suggest that human genome may be arranged into block structure, in which SNPs are relevant and only a small number of SNPs are sufficient to capture most of haplotype structures, called tag SNP (Daly et al., 2001; Gabriel et al., 2002; Patil et al., 2001; Dawson et al., 2002; Mahdevar et al., 2010; Zhang et al., 2002; Wall and Pritchard, 2003).

    • Genetic variation in human disease and a new role for copy number variants

      2007, Mutation Research - Fundamental and Molecular Mechanisms of Mutagenesis
    View all citing articles on Scopus
    View full text