A generalized global alignment algorithm

Motivation: Homologous sequences are sometimes similar over some regions but different over other regions. Homologous sequences have a much lower global similarity if the different regions are much longer than the similar regions. Results: We present a generalized global alignment algorithm for comp...

Full description

Saved in:
Bibliographic Details
Published inBioinformatics Vol. 19; no. 2; pp. 228 - 233
Main Authors Huang, Xiaoqiu, Chao, Kun-Mao
Format Journal Article
LanguageEnglish
Published Oxford Oxford University Press 22.01.2003
Oxford Publishing Limited (England)
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Motivation: Homologous sequences are sometimes similar over some regions but different over other regions. Homologous sequences have a much lower global similarity if the different regions are much longer than the similar regions. Results: We present a generalized global alignment algorithm for comparing sequences with intermittent similarities, an ordered list of similar regions separated by different regions. A generalized global alignment model is defined to handle sequences with intermittent similarities. A dynamic programming algorithm is designed to compute an optimal general alignment in time proportional to the product of sequence lengths and in space proportional to the sum of sequence lengths. The algorithm is implemented as a computer program named GAP3 (Global Alignment Program Version 3). The generalized global alignment model is validated by experimental results produced with GAP3 on both DNA and protein sequences. The GAP3 program extends the ability of standard global alignment programs to recognize homologous sequences of lower similarity. Availability: The GAP3 program is freely available for academic use at http://bioinformatics.iastate.edu/aat/align/align.html. Contact: xqhuang@cs.iastate.edu; kmchao@csie.ntu.edu.tw * To whom correspondence should be addressed.
Bibliography:local:190228
istex:461EE26EFAE81640C7B9B0A74112A1777BA42989
ark:/67375/HXZ-G7TGZ9VW-G
PII:1460-2059
ObjectType-Article-2
SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 23
ObjectType-Article-1
ObjectType-Feature-2
ISSN:1367-4803
1460-2059
1367-4811
DOI:10.1093/bioinformatics/19.2.228