SNP analysis to dissect human traits

doi:10.1016/S0959-4388(00)00261-0

Current Opinion in Neurobiology

Volume 11, Issue 5, 1 October 2001, Pages 637-641

https://doi.org/10.1016/S0959-4388(00)00261-0

Abstract

The analysis of complex human diseases has been spurred by the number of published genomic sequence variants — many identified in the course of sequencing the human genome. But, to be useful for genetic analysis, variants have to be mapped accurately, their frequencies in various populations determined, and automated high-throughput assay techniques developed. Recently proposed methods address these issues: the use of ‘reduced representation shotgun’ methods for more efficient detection of single nucleotide polymorphisms (SNPs), the employment of high-throughput genotyping techniques, the development of SNP maps that incorporate information about linkage disequilibrium, and the use of SNPs in identifying susceptibility genes for common illnesses.

Introduction

There is increasing appreciation of the role played by genetic predisposition and susceptibility in such common neurological disorders as Alzheimer's disease (AD), epilepsy, Parkinson's disease, multiple sclerosis and stroke. These disorders show familial aggregation, although the mode of inheritance is not clearly Mendelian in most cases. The role that a common genomic variant might play in susceptibility to disease is best exemplified by the role that the apolipoprotein E (APOE) ε4 allele plays in AD. The ε4 allele is highly associated with the presence of AD and with earlier age of onset of disease. It is a robust association seen in many populations studied [1]. Polymorphic variation has also been implicated in stroke and cardiovascular disease [2] and in multiple sclerosis [3]. It is increasingly clear that the risk of developing many common disorders and the metabolism of medications used to treat these conditions are substantially influenced by underlying genomic variation, although the effects of any one variant might be small.

In the 1980s, restriction enzymes were used to identify single base-pair changes in genomic DNA that result in the gain or loss of a restriction site [4]. These nucleotide variants were called ‘restriction fragment length polymorphisms’ (RFLPs) and were used in early linkage studies. In the early 1990s, RFLPs were replaced by simple tandem repeats or microsatellite markers. These markers show high levels of allelic variation, are distributed throughout the human genome, and can be efficiently amplified using PCR. Microsatellite markers have been used successfully in the positional cloning of many monogenic disease genes by linkage and allelic association [5].

In recent years, there has been a greater interest in studying the genetic basis of more common disorders, in which multiple genes of small effect are involved and for which the modes of inheritance are more complex. Standard linkage analysis using large pedigrees has only limited power to detect such small effects in these disorders [6]. Association studies using unrelated cases and controls, or using smaller family groups such as sibling pairs or ‘two parents and affected child’ trios, have been proposed to be more likely to detect these small effects. Quantitative analysis and mathematical modeling have suggested that genome-wide association studies using single nucleotide polymorphisms (SNPs) are more effective than linkage analysis for identifying complex disease genes [6–8]. Such studies can then take advantage of SNPs, which are easy to type, highly abundant (found on average once per 1.3 kb in the genome) and stable (i.e. not prone to the ‘slippage’ seen with microsatellite repeats).

SNPs that are associated with disease may have a direct effect on the function of the gene in which they are located. A variant may result in an amino acid change or may alter exon–intron splicing, thereby directly modifying the relevant protein, or it may lie in a regulatory region, altering the level of expression or the stability of the mRNA. Alternatively, a SNP may be in linkage disequilibrium (LD) with the ‘true’ functional variant. LD, also known as allelic association, exists when alleles at two distinct locations in the genome occur together more often than expected from their individual frequencies. The development of SNP-based LD maps could therefore facilitate whole-genome association studies, leading to more efficient detection of candidate susceptibility genes.
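The definition of LD given above can be made concrete with a small calculation. The following is a minimal sketch, not a method taken from the studies reviewed here: it assumes two biallelic loci and that phased haplotype frequencies are already known. The function name `ld_stats` and its parameters are illustrative choices; D, D′ and r² are the standard pairwise LD measures.

```python
def ld_stats(p_ab, p_a, p_b):
    """Pairwise LD between two biallelic loci (illustrative helper).

    p_ab: frequency of the haplotype carrying allele A at locus 1
          and allele B at locus 2
    p_a:  frequency of allele A at locus 1
    p_b:  frequency of allele B at locus 2
    Returns (D, D_prime, r_squared).
    """
    # D is the excess of the AB haplotype over its expectation
    # if the two alleles were independent.
    d = p_ab - p_a * p_b

    # D' rescales D by the largest magnitude it could take
    # given the allele frequencies.
    if d >= 0:
        d_max = min(p_a * (1 - p_b), (1 - p_a) * p_b)
    else:
        d_max = min(p_a * p_b, (1 - p_a) * (1 - p_b))
    d_prime = d / d_max if d_max > 0 else 0.0

    # r^2 is the squared correlation between the two loci.
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    r_squared = d * d / denom if denom > 0 else 0.0
    return d, d_prime, r_squared


# Alleles always travel together: complete LD.
print(ld_stats(0.5, 0.5, 0.5))
# Haplotype frequency equals the product of allele frequencies: no LD.
print(ld_stats(0.25, 0.5, 0.5))
```

In practice haplotype frequencies usually have to be estimated from unphased genotypes, which this sketch does not attempt.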

The immense interest in studies with SNPs is illustrated by a recent PubMed search for papers with the keyword ‘SNP’ from June 2000 to the present: nearly 500 papers have been published. In this review, we highlight the results from some of the most significant studies.

Identification and characterization of SNPs

Many different techniques can be used to identify and characterize SNPs, including single-strand conformation polymorphism analysis, heteroduplex analysis by denaturing high-performance liquid chromatography (DHPLC), direct DNA sequencing and computational methods [9•]. Thanks to the wealth of sequence information in public databases, computational tools can be used to identify SNPs in silico by aligning independently submitted sequences for a given gene (either cDNA or genomic sequences).
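The in silico approach described above, comparing independently submitted sequences for the same gene, can be sketched as follows. This is a toy illustration under strong assumptions: the sequences are taken to be already aligned to equal length, and the function name `candidate_snps` is hypothetical. It ignores base-quality scores, sequencing errors and paralogous sequences, all of which a real pipeline would need to handle.

```python
def candidate_snps(aligned):
    """Report alignment columns where more than one base is observed.

    aligned: list of equal-length strings over A/C/G/T, with '-' for
             gaps and 'N' for ambiguous calls (both are ignored).
    Returns a list of (zero_based_position, sorted_bases) tuples.
    """
    if not aligned:
        return []
    length = len(aligned[0])
    if any(len(seq) != length for seq in aligned):
        raise ValueError("sequences must be aligned to the same length")

    hits = []
    for pos in range(length):
        # Collect the distinct informative bases seen in this column.
        bases = {seq[pos].upper() for seq in aligned} - {"-", "N"}
        if len(bases) > 1:
            hits.append((pos, sorted(bases)))
    return hits


reads = ["ACGTTA", "ACCTTA", "ACGTTA", "ACGT-A"]
print(candidate_snps(reads))
```

A column flagged here is only a candidate: a single discordant sequence may reflect a sequencing error rather than a true polymorphism, so candidates would still need experimental confirmation.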

SNP genotyping methodologies

Association studies with SNPs typically use genomic DNA from hundreds of individuals and numerous SNPs. The development of high-throughput technologies has been vital to the widespread use of SNPs in research and industry. The most common SNP typing methods currently include hybridization, primer extension and cleavage methods. Each of these methods must be connected to an appropriate detection system. Detection technologies include fluorescent polarization [20], luminometric detection of

Linkage disequilibrium and SNP maps

As mentioned above, association studies with SNPs can be performed using SNPs that are predicted to have a direct functional consequence in a gene or by using SNPs selected randomly as a marker for LD. LD is generally defined as a measure of the degree of association between two genetic markers and can be used to identify regions of the genome associated with the disease. The construction of a SNP map for the purposes of LD has been complicated by the marked genomic variability in LD that

SNPs and candidate-gene analysis

Several hundred genes have been analyzed for their SNP content [34–40]. Although the methods and populations used differ, a consistent observation has been that changes in non-coding sequences and synonymous changes in coding sequence are generally more common than non-synonymous changes, reflecting greater selective pressure on the coding sequence. (A synonymous change refers to one that does not alter an amino acid whereas a non-synonymous change causes an amino acid

SNPs in pharmacogenetics

Pharmacogenetic initiatives try to identify genetic variants that influence a patient's response to a drug, that is, whether the drug is well suited to the patient or likely to induce side effects [49]. For example, gene variants in a drug-metabolizing enzyme have been linked to adverse reactions with azathioprine, mercaptopurine and thioguanine [50]. Another common SNP is associated with antibiotic-induced cardiac arrhythmia, which is clinically silent before drug exposure [51].

Patients with a variation in the core promoter of the

Conclusions

Most common diseases are thought to result from a mixture of genetic and environmental risk factors; as a result, the contribution of each gene is likely to be relatively small. Allelic association methods are more powerful in the detection of these genetic risk factors than conventional linkage approaches; however, allelic association methods require genetic markers to be very closely spaced because they rely on linkage disequilibrium between the marker and the disease allele.
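As an illustration of the allelic association approach mentioned above, the sketch below compares allele counts at a single SNP between unrelated cases and controls using a Pearson chi-square test on a 2×2 table. It is a simplified example, not the procedure of any study cited here: the function name `allelic_chi_square` and the counts are made up, and the test assumes independent chromosomes and ignores population stratification and multiple testing.

```python
import math


def allelic_chi_square(case_a, case_b, ctrl_a, ctrl_b):
    """Pearson chi-square test (1 degree of freedom) on allele counts.

    case_a, case_b: counts of alleles A and B among case chromosomes
    ctrl_a, ctrl_b: counts of alleles A and B among control chromosomes
    Returns (chi_square, p_value).
    """
    n = case_a + case_b + ctrl_a + ctrl_b
    row_case, row_ctrl = case_a + case_b, ctrl_a + ctrl_b
    col_a, col_b = case_a + ctrl_a, case_b + ctrl_b
    if 0 in (row_case, row_ctrl, col_a, col_b):
        raise ValueError("every row and column total must be non-zero")

    # Shortcut formula for the 2x2 Pearson statistic (no continuity correction).
    chi2 = n * (case_a * ctrl_b - case_b * ctrl_a) ** 2 / (
        row_case * row_ctrl * col_a * col_b
    )
    # Upper tail of a chi-square distribution with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p_value


# Hypothetical counts: allele A on 60 of 100 case chromosomes
# and on 40 of 100 control chromosomes.
chi2, p = allelic_chi_square(60, 40, 40, 60)
print(f"chi-square = {chi2:.2f}, p = {p:.4f}")
```

Because a genome-wide study repeats such a test at many closely spaced markers, the significance threshold applied to each individual test would have to be far stricter than the conventional 0.05.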

Recent

Acknowledgements

This work was supported by NIH grants AG16208 and AA08403 and funding from the Leda J Sears Trust.

References and recommended reading

Papers of particular interest, published within the annual period of review, have been highlighted as:

• of special interest
•• of outstanding interest

References (53)

P.H. St George-Hyslop. Molecular genetics of Alzheimer's disease. Biol Psychiatry (2000)
A.H.B. Wu et al. Correlation of polymorphisms to coagulation and biochemical risk factors for cardiovascular diseases. Am J Cardiol (2001)
J.R. Oksenberg et al. Multiple sclerosis: genomic rewards. J Neuroimmunol (2001)
J.L. Escary et al. A first high-density map of 981 biallelic markers on human chromosome 14. Genomics (2000)
A.M. Dunning et al. The extent of linkage disequilibrium in four populations with distinct demographic histories. Am J Hum Genet (2000)
G.R. Abecasis et al. Extent and distribution of linkage disequilibrium in three genomic regions. Am J Hum Genet (2001)
D. Gordon et al. Significant evidence for linkage disequilibrium over a 5-cM region among Afrikaners. Genomics (2000)
F. Cambien et al. Sequence diversity in 36 candidate genes for cardiovascular disorders. Am J Hum Genet (1999)
S.R. Sunyaev et al. SNP frequencies in human genes: an excess of rare alleles and differing modes of selection. Trends Genet (2000)
A.J. Brookes. The essence of SNPs. Gene (1999)
T. Emahazion et al. SNP association studies in Alzheimer's disease highlight problems for complex disease analysis. Trends Genet (2001)
E.R. Martin et al. SNPing away at complex diseases: analysis of single-nucleotide polymorphisms around APOE in Alzheimer disease. Am J Hum Genet (2000)
U.A. Meyer. Pharmacogenetics and adverse drug reactions. Lancet (2000)
D. Botstein et al. Construction of a genetic linkage map in man using restriction fragment length polymorphisms. Am J Hum Genet (1980)
P. Deloukas et al. A physical map of 30,000 human genes. Science (1998)
N. Risch et al. The future of genetic studies of complex human diseases. Science (1996)
E.S. Lander. The new genomics: global views of biology. Science (1996)
L. Kruglyak. The use of a genetic map of biallelic markers in linkage studies. Nat Genet (1997)
M.M. Shi. Enabling large-scale pharmacogenetic studies by high throughput mutation detection and genotyping technologies. Clin Chem (2001)
D.G. Cox et al. Data mining: efficiency of using sequence databases for polymorphism discovery. Hum Mutat (2001)
K.H. Buetow et al. High-throughput development and characterization of a genome-wide collection of gene-based single nucleotide polymorphism markers by chip-based matrix-assisted laser desorption/ionization time-of-flight mass spectrometry. Proc Natl Acad Sci USA (2001)
J.H. Wolford et al. High-throughput SNP detection by using DNA pooling and denaturing high performance liquid chromatography (DHPLC). Hum Genet (2000)
D. Altshuler et al. An SNP map of the human genome generated by reduced representation shotgun sequencing. Nature (2000)
G. Marth et al. Single nucleotide polymorphisms in the public domain: how useful are they? Nat Genet (2001)
A map of human genome sequence variation containing 1.42 million single nucleotide polymorphisms. Nature (2001)
E. Dawson et al. A SNP resource for human chromosome 22: extracting dense clusters of SNPs from the genomic sequence. Genome Res (2001)

