A generalized global alignment algorithm
Motivation: Homologous sequences are sometimes similar over some regions but different over other regions. Homologous sequences have a much lower global similarity if the different regions are much longer than the similar regions. Results: We present a generalized global alignment algorithm for comp...
Saved in:
Published in | Bioinformatics Vol. 19; no. 2; pp. 228 - 233 |
---|---|
Main Authors | , |
Format | Journal Article |
Language | English |
Published |
Oxford
Oxford University Press
22.01.2003
Oxford Publishing Limited (England) |
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | Motivation: Homologous sequences are sometimes similar over some regions but different over other regions. Homologous sequences have a much lower global similarity if the different regions are much longer than the similar regions. Results: We present a generalized global alignment algorithm for comparing sequences with intermittent similarities, an ordered list of similar regions separated by different regions. A generalized global alignment model is defined to handle sequences with intermittent similarities. A dynamic programming algorithm is designed to compute an optimal general alignment in time proportional to the product of sequence lengths and in space proportional to the sum of sequence lengths. The algorithm is implemented as a computer program named GAP3 (Global Alignment Program Version 3). The generalized global alignment model is validated by experimental results produced with GAP3 on both DNA and protein sequences. The GAP3 program extends the ability of standard global alignment programs to recognize homologous sequences of lower similarity. Availability: The GAP3 program is freely available for academic use at http://bioinformatics.iastate.edu/aat/align/align.html. Contact: xqhuang@cs.iastate.edu; kmchao@csie.ntu.edu.tw * To whom correspondence should be addressed. |
---|---|
Bibliography: | local:190228 istex:461EE26EFAE81640C7B9B0A74112A1777BA42989 ark:/67375/HXZ-G7TGZ9VW-G PII:1460-2059 ObjectType-Article-2 SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 23 ObjectType-Article-1 ObjectType-Feature-2 |
ISSN: | 1367-4803 1460-2059 1367-4811 |
DOI: | 10.1093/bioinformatics/19.2.228 |