CASPER: context-aware scheme for paired-end reads from high-throughput amplicon sequencing

Merging the forward and reverse reads from paired-end sequencing is a critical task that can significantly improve the performance of downstream tasks, such as genome assembly and mapping, by providing them with virtually elongated reads. However, due to the inherent limitations of most paired-end s...

Full description

Saved in:
Bibliographic Details
Published inBMC bioinformatics Vol. 15 Suppl 9; no. Suppl 9; p. S10
Main Authors Kwon, Sunyoung, Lee, Byunghan, Yoon, Sungroh
Format Journal Article
LanguageEnglish
Published England BioMed Central 10.09.2014
BioMed Central Ltd
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Merging the forward and reverse reads from paired-end sequencing is a critical task that can significantly improve the performance of downstream tasks, such as genome assembly and mapping, by providing them with virtually elongated reads. However, due to the inherent limitations of most paired-end sequencers, the chance of observing erroneous bases grows rapidly as the end of a read is approached, which becomes a critical hurdle for accurately merging paired-end reads. Although there exist several sophisticated approaches to this problem, their performance in terms of quality of merging often remains unsatisfactory. To address this issue, here we present a context-aware scheme for paired-end reads (CASPER): a computational method to rapidly and robustly merge overlapping paired-end reads. Being particularly well suited to amplicon sequencing applications, CASPER is thoroughly tested with both simulated and real high-throughput amplicon sequencing data. According to our experimental results, CASPER significantly outperforms existing state-of-the art paired-end merging tools in terms of accuracy and robustness. CASPER also exploits the parallelism in the task of paired-end merging and effectively speeds up by multithreading. CASPER is freely available for academic use at http://best.snu.ac.kr/casper.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ObjectType-Article-2
ObjectType-Feature-1
ObjectType-Conference-3
SourceType-Conference Papers & Proceedings-2
ObjectType-Conference-1
ObjectType-Feature-3
ISSN:1471-2105
1471-2105
DOI:10.1186/1471-2105-15-s9-s10