Improving the quality of automatic DNA sequence assembly using fluorescent trace-data classifications

Virtually all large-scale sequencing projects use automatic sequence-assembly programs to aid in the determination of DNA sequences. The computer-generated assemblies required substantial hand-editing to transform them into submissions for GenBank. As the size of sequencing projects increases, it be...

Full description

Saved in:
Bibliographic Details
Published inProceedings. International Conference on Intelligent Systems for Molecular Biology Vol. 4; p. 3
Main Authors Allex, C F, Baldwin, S F, Shavlik, J W, Blattner, F R
Format Journal Article
LanguageEnglish
Published United States 1996
Subjects
Online AccessGet more information

Cover

Loading…
More Information
Summary:Virtually all large-scale sequencing projects use automatic sequence-assembly programs to aid in the determination of DNA sequences. The computer-generated assemblies required substantial hand-editing to transform them into submissions for GenBank. As the size of sequencing projects increases, it becomes essential to improve the quality of the automated assemblies so that this time consuming hand-editing may be reduced. Current ABI sequencing technology uses base calls made from fluorescently-labeled DNA fragments run on gels. We present a new representation for the fluorescent trace data associated with individual base calls. This representation can be used before, during, and after fragment assembly to improve the quality of assemblies. We demonstrate one such use-end-trimming of sub-optimal data-that results in a significant improvement in the quality of subsequent assemblies.
ISSN:1553-0833