SOAPdenovo-Trans: <i>de novo</i> transcriptome assembly with short RNA-Seq reads

Yinlong Xie(University of Copenhagen), Gengxiong Wu(University of Copenhagen), Jingbo Tang(University of Copenhagen), Ruibang Luo(University of Copenhagen), Jordan Patterson(University of Copenhagen), Shanlin Liu(University of Copenhagen), Weihua Huang(University of Copenhagen), Guangzhu He(University of Copenhagen), Shengchang Gu(University of Copenhagen), Shengkang Li(University of Copenhagen), Xin Zhou(University of Copenhagen), Tak‐Wah Lam(University of Copenhagen), Yingrui Li(University of Copenhagen), Xun Xu(University of Copenhagen), Gane Ka‐Shu Wong(University of Copenhagen), Jun Wang(University of Copenhagen)
Bioinformatics
February 13, 2014
Cited by 997Open Access
Full Text

Abstract

MOTIVATION: Transcriptome sequencing has long been the favored method for quickly and inexpensively obtaining a large number of gene sequences from an organism with no reference genome. Owing to the rapid increase in throughputs and decrease in costs of next- generation sequencing, RNA-Seq in particular has become the method of choice. However, the very short reads (e.g. 2 × 90 bp paired ends) from next generation sequencing makes de novo assembly to recover complete or full-length transcript sequences an algorithmic challenge. RESULTS: Here, we present SOAPdenovo-Trans, a de novo transcriptome assembler designed specifically for RNA-Seq. We evaluated its performance on transcriptome datasets from rice and mouse. Using as our benchmarks the known transcripts from these well-annotated genomes (sequenced a decade ago), we assessed how SOAPdenovo-Trans and two other popular transcriptome assemblers handled such practical issues as alternative splicing and variable expression levels. Our conclusion is that SOAPdenovo-Trans provides higher contiguity, lower redundancy and faster execution. AVAILABILITY AND IMPLEMENTATION: Source code and user manual are available at http://sourceforge.net/projects/soapdenovotrans/.


Related Papers

No related papers found

Powered by citation graph analysis