Efficient constrained multiple sequence alignment with performance guarantee

Bibliographic Details
Published in 2nd IEEE Computer Society Bioinformatics Conference (CSB 2003), Vol. 2, pp. 337-346
Main Authors Chin, Francis Y.L.; Ho, N.L.; Lam, T.W.; Wong, Prudence W.H.; Chan, M.Y.
Format Conference Proceeding; Journal Article
Language English
Published United States: IEEE, 2003
ISBN 0769520006, 9780769520001
ISSN 1555-3930
DOI 10.1109/CSB.2003.1227334

Abstract The constrained multiple sequence alignment problem is to align a set of sequences subject to a given constrained sequence, which arises from some knowledge of the structure of the sequences. This paper presents new algorithms for this problem that are more efficient in time and space (memory) than the previous algorithms and that carry a worst-case guarantee on the quality of the alignment. Reducing the space requirement by a quadratic factor is particularly significant because the previous O(n^4)-space algorithm has limited application due to its huge memory requirement. Experiments on real data sets confirm that the new algorithms improve both alignment quality and resource requirements.
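
To make the problem statement in the abstract concrete, the following is a minimal sketch of the pairwise case only. It is not the paper's algorithm. It assumes the common formulation in which every character of the constrained sequence must appear, in order, in a column where both aligned sequences carry that same character. The function name `constrained_pair_score` and the scoring values (match +1, mismatch -1, gap -1) are hypothetical choices made for illustration.

```python
from typing import Optional

NEG_INF = float("-inf")


def constrained_pair_score(a: str, b: str, c: str,
                           match: int = 1, mismatch: int = -1,
                           gap: int = -1) -> Optional[int]:
    """Best global alignment score of a and b in which each character of c,
    in order, occupies a column where a and b both have that character.
    Returns None when no alignment can satisfy the constraint."""
    n, m, k = len(a), len(b), len(c)
    # d[t][i][j]: best score for a[:i] vs b[:j] with the first t
    # constraint characters already placed in matching columns.
    d = [[[NEG_INF] * (m + 1) for _ in range(n + 1)] for _ in range(k + 1)]
    d[0][0][0] = 0
    for t in range(k + 1):
        for i in range(n + 1):
            for j in range(m + 1):
                if i == 0 and j == 0:
                    continue  # d[0][0][0] = 0; d[t][0][0] stays infeasible
                best = NEG_INF
                if i > 0:  # a[i-1] aligned to a gap
                    best = max(best, d[t][i - 1][j] + gap)
                if j > 0:  # b[j-1] aligned to a gap
                    best = max(best, d[t][i][j - 1] + gap)
                if i > 0 and j > 0:
                    s = match if a[i - 1] == b[j - 1] else mismatch
                    # ordinary column, constraint progress unchanged
                    best = max(best, d[t][i - 1][j - 1] + s)
                    # column that satisfies constraint character c[t-1]
                    if t > 0 and a[i - 1] == b[j - 1] == c[t - 1]:
                        best = max(best, d[t - 1][i - 1][j - 1] + s)
                d[t][i][j] = best
    result = d[k][n][m]
    return None if result == NEG_INF else result


if __name__ == "__main__":
    print(constrained_pair_score("AGT", "GTA", ""))   # unconstrained
    print(constrained_pair_score("AGT", "GTA", "A"))  # forced A/A column
```

With an empty constrained sequence the function reduces to ordinary global alignment; adding a constraint can only lower the score or make the alignment infeasible. This sketch stores a full three-dimensional table and makes no attempt at the space savings the abstract describes. Read literally, a quadratic-factor saving over O(n^4) space would mean O(n^2) space, which for n = 1000 is roughly 10^6 table cells instead of 10^12.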
Author Affiliation Dept. of Comput. Sci. & Inf. Syst., Hong Kong Univ., China (all five authors)
Discipline Biology
Computer Science
Genre Research Support, Non-U.S. Gov't
Journal Article
PMID 16452809
PageCount 10
Subjects Algorithms
Amino Acid Sequence
Base Sequence
Biological control systems
Biology computing
Computational biology
Computer science
Costs
Degradation
Heuristic algorithms
Information systems
Molecular Sequence Data
Pattern Recognition, Automated - methods
RNA
Sequence Alignment - methods
Sequence Analysis - methods
Sequences
Software
URI https://ieeexplore.ieee.org/document/1227334
https://www.ncbi.nlm.nih.gov/pubmed/16452809