Slider—maximum use of probability information for alignment of short sequence reads and SNP detection

Motivation: A plethora of alignment tools have been created that are designed to best fit different types of alignment conditions. While some of these are made for aligning Illumina Sequence Analyzer reads, none of these are fully utilizing its probability (prb) output. In this article, we will intr...

Full description

Saved in:
Bibliographic Details
Published inBioinformatics Vol. 25; no. 1; pp. 6 - 13
Main Authors Malhis, Nawar, Butterfield, Yaron S. N., Ester, Martin, Jones, Steven J. M.
Format Journal Article
LanguageEnglish
Published Oxford Oxford University Press 01.01.2009
Oxford Publishing Limited (England)
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Motivation: A plethora of alignment tools have been created that are designed to best fit different types of alignment conditions. While some of these are made for aligning Illumina Sequence Analyzer reads, none of these are fully utilizing its probability (prb) output. In this article, we will introduce a new alignment approach (Slider) that reduces the alignment problem space by utilizing each read base's probabilities given in the prb files. Results: Compared with other aligners, Slider has higher alignment accuracy and efficiency. In addition, given that Slider matches bases with probabilities other than the most probable, it significantly reduces the percentage of base mismatches. The result is that its SNP predictions are more accurate than other SNP prediction approaches used today that start from the most probable sequence, including those using base quality. Contact: nmalhis@bcgsc.ca Supplementary information and availability: http://www.bcgsc.ca/platform/bioinfo/software/slider
Bibliography:ArticleID:btn565
ark:/67375/HXZ-JD4MN6QR-L
Associate Editor: Dmitrij Frishman
To whom correspondence should be addressed.
istex:08228231FD6CE9854A7AD2A2D6D4306073962827
ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:1367-4803
1460-2059
1367-4811
DOI:10.1093/bioinformatics/btn565