Oases: robust de novo RNA-seq assembly across the dynamic range of expression levels

MOTIVATION: High-throughput sequencing has made the analysis of new model organisms more affordable. Although assembling a new genome can still be costly and difficult, it is possible to use RNA-seq to sequence mRNA. In the absence of a known genome, it is necessary to assemble these sequences de no...

Full description

Saved in:
Bibliographic Details
Published inBioinformatics (Oxford, England) Vol. 28; no. 8; pp. 1086 - 1092
Main Authors Schulz, Marcel H, Zerbino, Daniel R, Vingron, Martin, Birney, Ewan
Format Journal Article
LanguageEnglish
Published Oxford Oxford University Press 15.04.2012
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:MOTIVATION: High-throughput sequencing has made the analysis of new model organisms more affordable. Although assembling a new genome can still be costly and difficult, it is possible to use RNA-seq to sequence mRNA. In the absence of a known genome, it is necessary to assemble these sequences de novo, taking into account possible alternative isoforms and the dynamic range of expression values. RESULTS: We present a software package named Oases designed to heuristically assemble RNA-seq reads in the absence of a reference genome, across a broad spectrum of expression values and in presence of alternative isoforms. It achieves this by using an array of hash lengths, a dynamic filtering of noise, a robust resolution of alternative splicing events and the efficient merging of multiple assemblies. It was tested on human and mouse RNA-seq data and is shown to improve significantly on the transABySS and Trinity de novo transcriptome assemblers. Availability and implementation: Oases is freely available under the GPL license at www.ebi.ac.uk/~zerbino/oases/ CONTACT: dzerbino@ucsc.edu Supplementary information: Supplementary data are available at Bioinformatics online.
Bibliography:http://dx.doi.org/10.1093/bioinformatics/bts094
ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
Associate Editor: Ivo Hofacker
ISSN:1367-4803
1367-4811
1367-4811
DOI:10.1093/bioinformatics/bts094