Fusion transcripts and transcribed retrotransposed loci discovered through comprehensive transcriptome analysis using Paired-End diTags (PETs)

  1. Yijun Ruan1,6,
  2. Hong Sain Ooi2,
  3. Siew Woh Choo2,
  4. Kuo Ping Chiu2,
  5. Xiao Dong Zhao1,
  6. K.G. Srinivasan1,
  7. Fei Yao1,
  8. Chiou Yu Choo1,
  9. Jun Liu1,
  10. Pramila Ariyaratne2,
  11. Wilson G.W. Bin2,
  12. Vladimir A. Kuznetsov2,
  13. Atif Shahab3,
  14. Wing-Kin Sung2,4,
  15. Guillaume Bourque2,
  16. Nallasivam Palanisamy5, and
  17. Chia-Lin Wei1,6
  1. 1 Genome Technology and Biology Group, Genome Institute of Singapore, Singapore 138672, Singapore;
  2. 2 Information and Mathematical Science Group, Genome Institute of Singapore, Singapore 138672, Singapore;
  3. 3 Bioinformatics Institute, Singapore 138671, Singapore;
  4. 4 School of Computing, National University of Singapore, Singapore 117543, Singapore;
  5. 5 Cancer Biology Group, Genome Institute of Singapore, Singapore 138672, Singapore

Abstract

Identification of unconventional functional features such as fusion transcripts is a challenging task in the effort to annotate all functional DNA elements in the human genome. Paired-End diTag (PET) analysis possesses a unique capability to accurately and efficiently characterize the two ends of DNA fragments, which may have either normal or unusual compositions. This unique nature of PET analysis makes it an ideal tool for uncovering unconventional features residing in the human genome. Using the PET approach for comprehensive transcriptome analysis, we were able to identify fusion transcripts derived from genome rearrangements and actively expressed retrotransposed pseudogenes, which would be difficult to capture by other means. Here, we demonstrate this unique capability through the analysis of 865,000 individual transcripts in two types of cancer cells. In addition to the characterization of a large number of differentially expressed alternative 5′ and 3′ transcript variants and novel transcriptional units, we identified 70 fusion transcript candidates in this study. One was validated as the product of a fusion gene between BCAS4 and BCAS3 resulting from an amplification followed by a translocation event between the two loci, chr20q13 and chr17q23. Through an examination of PETs that mapped to multiple genomic locations, we identified 4055 retrotransposed loci in the human genome, of which at least three were found to be transcriptionally active. The PET mapping strategy presented here promises to be a useful tool in annotating the human genome, especially aberrations in human cancer genomes.

Footnotes

  • 6 Corresponding authors.

    6 E-mail weicl{at}gis.a-star.edu.sg; fax 65-64789059.

    6 E-mail ruanyj{at}gis.a-star.edu.sg; fax 65-64789059.

  • [Supplemental material is available online at www.genome.org. DNA sequences reported in this study are available to public through UCSC genome browser at http://genome.ucsc.edu track name GIS-PET RNA and http://t2g.bii.a-star.edu.sg.]

  • Article is online at http://www.genome.org/cgi/doi/10.1101/gr.6018607

    • Received October 27, 2006.
    • Accepted March 12, 2007.
  • Freely available online through the Genome Research Open Access option.

| Table of Contents
OPEN ACCESS ARTICLE

Preprint Server