TY - JOUR T1 - WGSA: an annotation pipeline for human genome sequencing studies JF - Journal of Medical Genetics JO - J Med Genet SP - 111 LP - 112 DO - 10.1136/jmedgenet-2015-103423 VL - 53 IS - 2 AU - Xiaoming Liu AU - Simon White AU - Bo Peng AU - Andrew D Johnson AU - Jennifer A Brody AU - Alexander H Li AU - Zhuoyi Huang AU - Andrew Carroll AU - Peng Wei AU - Richard Gibbs AU - Robert J Klein AU - Eric Boerwinkle Y1 - 2016/02/01 UR - http://jmg.bmj.com/content/53/2/111.abstract N2 - DNA sequencing technologies continue to make progress in increased throughput and quality, and decreased cost. As we transition from whole exome capture sequencing to whole genome sequencing (WGS), our ability to convert machine-generated variant calls, including single nucleotide variant (SNV) and insertion-deletion variants (indels), into human-interpretable knowledge has lagged far behind the ability to obtain enormous amounts of variants. To help narrow this gap, here we present WGSA (WGS annotator), a functional annotation pipeline for human genome sequencing studies, which is runnable out of the box on the Amazon Compute Cloud and is freely downloadable at (https://sites.google.com/site/jpopgen/wgsa/).Functional annotation is a key step in WGS analysis. In one way, annotation helps the analyst filter to a subset of elements of particular interest (eg, cell type specific enhancers), in another way annotation helps the investigators to increase the power of identifying phenotype-associated loci (eg, association test using functional prediction score as a weight) and interpret potentially interesting findings. Currently, there are several popular gene model based annotation tools, including ANNOVAR,1 SnpEff2 and the Ensembl Variant Effect Predictor (VEP).3 These can annotate a variety of protein coding and non-coding gene models from a range of species. It is well known among practitioners that different databases (eg, RefSeq4 and Ensembl5) use different models for … ER -