Cache-Oblivious Dynamic Programming for Bioinformatics

We present efficient cache-oblivious algorithms for some well-studied string problems in bioinformatics including the longest common subsequence, global pairwise sequence alignment and three-way sequence alignment (or median), both with affine gap costs, and RNA secondary structure prediction with s...

Full description

Saved in:
Bibliographic Details
Published inIEEE/ACM transactions on computational biology and bioinformatics Vol. 7; no. 3; pp. 495 - 510
Main Authors Chowdhury, Rezaul Alan, Hai-Son Le, Ramachandran, Vijaya
Format Journal Article
LanguageEnglish
Published United States IEEE 01.07.2010
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects
Online AccessGet full text

Cover

Loading…
Abstract We present efficient cache-oblivious algorithms for some well-studied string problems in bioinformatics including the longest common subsequence, global pairwise sequence alignment and three-way sequence alignment (or median), both with affine gap costs, and RNA secondary structure prediction with simple pseudoknots. For each of these problems, we present cache-oblivious algorithms that match the best-known time complexity, match or improve the best-known space complexity, and improve significantly over the cache-efficiency of earlier algorithms. We present experimental results which show that our cache-oblivious algorithms run faster than software and implementations based on previous best algorithms for these problems.
AbstractList We present efficient cache-oblivious algorithms for some well-studied string problems in bioinformatics including the longest common subsequence, global pairwise sequence alignment and three-way sequence alignment (or median), both with affine gap costs, and RNA secondary structure prediction with simple pseudoknots. For each of these problems, we present cache-oblivious algorithms that match the best-known time complexity, match or improve the best-known space complexity, and improve significantly over the cache-efficiency of earlier algorithms. We present experimental results which show that our cache-oblivious algorithms run faster than software and implementations based on previous best algorithms for these problems.
We present efficient cache-oblivious algorithms for some well-studied string problems in bioinformatics including the longest common subsequence, global pairwise sequence alignment and three-way sequence alignment (or median), both with affine gap costs, and RNA secondary structure prediction with simple pseudoknots. For each of these problems, we present cache-oblivious algorithms that match the best-known time complexity, match or improve the best-known space complexity, and improve significantly over the cache-efficiency of earlier algorithms. We present experimental results which show that our cache-oblivious algorithms run faster than software and implementations based on previous best algorithms for these problems.We present efficient cache-oblivious algorithms for some well-studied string problems in bioinformatics including the longest common subsequence, global pairwise sequence alignment and three-way sequence alignment (or median), both with affine gap costs, and RNA secondary structure prediction with simple pseudoknots. For each of these problems, we present cache-oblivious algorithms that match the best-known time complexity, match or improve the best-known space complexity, and improve significantly over the cache-efficiency of earlier algorithms. We present experimental results which show that our cache-oblivious algorithms run faster than software and implementations based on previous best algorithms for these problems.
Author Ramachandran, Vijaya
Hai-Son Le
Chowdhury, Rezaul Alan
Author_xml – sequence: 1
  givenname: Rezaul Alan
  surname: Chowdhury
  fullname: Chowdhury, Rezaul Alan
  email: shaikat@cs.utexas.edu
  organization: Center for Comput. Visualization, Univ. of Texas at Austin, Austin, TX, USA
– sequence: 2
  surname: Hai-Son Le
  fullname: Hai-Son Le
  email: hple+@cs.cmu.edu
  organization: Dept. of Machine Learning, Carnegie Mellon Univ., Pittsburgh, PA, USA
– sequence: 3
  givenname: Vijaya
  surname: Ramachandran
  fullname: Ramachandran, Vijaya
  email: vlr@cs.utexas.edu
  organization: Dept. of Comput. Sci., Univ. of Texas at Austin, Austin, TX, USA
BackLink https://www.ncbi.nlm.nih.gov/pubmed/20671320$$D View this record in MEDLINE/PubMed
BookMark eNqF0UtPGzEQAGCrAhUSeuNWqYrEgQsbZmyvvT424SkhwQHOltfrDUa7a2pvKvHvcRSaA1LFyXP4ZjyPCdkbwuAIOUaYI4I6f1wuFnMKUM0V_0YOsSxloZTge5uYl0WpBDsgk5ReAChXwL-TAwpCIqNwSMTS2GdX3Ned_-vDOs0u3gbTezt7iGEVTd_7YTVrQ5wtfPBDDnozepuOyH5ruuR-fLxT8nR1-bi8Ke7ur2-Xv-8KyySMBc-ft5RZwVDIyoGwElHUirnKtBW2DqvGNHXdogNbtxxlw4QsRe4SLWLDpuR0W_c1hj9rl0bd-2Rd15nB5W61LBVDhgBfS14pRqWgWZ58ki9hHYc8hkaoKJWSocrq14da171r9Gv0vYlv-t_qMjjbAhtDStG1O4KgN5fRm8vozWW04pnTT9z6Me8yDGM0vvtf0s9tknfO7epzAYpJwd4B1BaWgA
CODEN ITCBCY
CitedBy_id crossref_primary_10_1145_2858788_2688514
crossref_primary_10_1016_j_tcs_2011_07_024
crossref_primary_10_1109_TPDS_2011_218
crossref_primary_10_1145_3022671_2983993
crossref_primary_10_1007_s00453_015_0073_z
Cites_doi 10.1145/1378533.1378574
10.1145/1248377.1248392
10.1145/322063.322075
10.1109/SFFCS.1999.814600
10.1038/nature01858
10.1016/0022-0000(80)90002-1
10.1016/0020-0190(78)90037-6
10.1017/CBO9780511790492
10.1016/0022-2836(82)90398-9
10.1201/9780203750131
10.1016/0304-3975(92)90132-Y
10.1093/bioinformatics/13.1.45
10.1007/978-3-540-39763-2_31
10.1007/BF02462326
10.1145/1088149.1088197
10.1137/S0097539702402007
10.1017/CBO9780511574931
10.1093/bioinformatics/4.1.11
10.1006/jtbi.2000.2177
10.1145/1109557.1109622
10.1089/106652700750050862
10.1016/S0166-218X(00)00186-4
10.1109/IPDPS.2010.5470354
10.1073/pnas.85.8.2444
10.1007/BF00265993
10.1145/48529.48535
10.1145/360825.360861
10.1109/SPIRE.2000.878178
10.1006/jmbi.1998.2436
10.1145/321921.321922
ContentType Journal Article
Copyright Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2010
Copyright_xml – notice: Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2010
DBID 97E
RIA
RIE
AAYXX
CITATION
CGR
CUY
CVF
ECM
EIF
NPM
7QF
7QO
7QQ
7SC
7SE
7SP
7SR
7TA
7TB
7U5
8BQ
8FD
F28
FR3
H8D
JG9
JQ2
KR7
L7M
L~C
L~D
P64
7X8
DOI 10.1109/TCBB.2008.94
DatabaseName IEEE All-Society Periodicals Package (ASPP) 2005–Present
IEEE All-Society Periodicals Package (ASPP) 1998–Present
IEEE Electronic Library (IEL)
CrossRef
Medline
MEDLINE
MEDLINE (Ovid)
MEDLINE
MEDLINE
PubMed
Aluminium Industry Abstracts
Biotechnology Research Abstracts
Ceramic Abstracts
Computer and Information Systems Abstracts
Corrosion Abstracts
Electronics & Communications Abstracts
Engineered Materials Abstracts
Materials Business File
Mechanical & Transportation Engineering Abstracts
Solid State and Superconductivity Abstracts
METADEX
Technology Research Database
ANTE: Abstracts in New Technology & Engineering
Engineering Research Database
Aerospace Database
Materials Research Database
ProQuest Computer Science Collection
Civil Engineering Abstracts
Advanced Technologies Database with Aerospace
Computer and Information Systems Abstracts – Academic
Computer and Information Systems Abstracts Professional
Biotechnology and BioEngineering Abstracts
MEDLINE - Academic
DatabaseTitle CrossRef
MEDLINE
Medline Complete
MEDLINE with Full Text
PubMed
MEDLINE (Ovid)
Materials Research Database
Civil Engineering Abstracts
Aluminium Industry Abstracts
Technology Research Database
Computer and Information Systems Abstracts – Academic
Mechanical & Transportation Engineering Abstracts
Electronics & Communications Abstracts
ProQuest Computer Science Collection
Computer and Information Systems Abstracts
Ceramic Abstracts
Materials Business File
METADEX
Biotechnology and BioEngineering Abstracts
Computer and Information Systems Abstracts Professional
Aerospace Database
Engineered Materials Abstracts
Biotechnology Research Abstracts
Solid State and Superconductivity Abstracts
Engineering Research Database
Corrosion Abstracts
Advanced Technologies Database with Aerospace
ANTE: Abstracts in New Technology & Engineering
MEDLINE - Academic
DatabaseTitleList
Materials Research Database
MEDLINE - Academic
Engineering Research Database
MEDLINE
Database_xml – sequence: 1
  dbid: NPM
  name: PubMed
  url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 2
  dbid: EIF
  name: MEDLINE
  url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search
  sourceTypes: Index Database
– sequence: 3
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Biology
EISSN 1557-9964
EndPage 510
ExternalDocumentID 2778383831
20671320
10_1109_TCBB_2008_94
4609376
Genre orig-research
Research Support, U.S. Gov't, Non-P.H.S
Journal Article
GroupedDBID 0R~
29I
4.4
53G
5GY
5VS
6IK
8US
97E
AAJGR
AAKMM
AALFJ
AARMG
AASAJ
AAWTH
AAWTV
ABAZT
ABQJQ
ABVLG
ACGFO
ACGFS
ACIWK
ACM
ACPRK
ADBCU
ADL
AEBYY
AEFXT
AEJOY
AENEX
AENSD
AETIX
AFRAH
AFWIH
AFWXC
AGQYO
AGSQL
AHBIQ
AIBXA
AIKLT
AKJIK
AKQYR
AKRVB
ALMA_UNASSIGNED_HOLDINGS
ASPBG
ATWAV
AVWKF
BDXCO
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CCLIF
CS3
DU5
EBS
EJD
FEDTE
GUFHI
HGAVV
HZ~
I07
IEDLZ
IFIPE
IPLJI
JAVBF
LAI
LHSKQ
M43
O9-
OCL
P1C
P2P
PQQKQ
RIA
RIE
RNI
RNS
ROL
RZB
TN5
XOL
AAYOK
AAYXX
CITATION
RIG
CGR
CUY
CVF
ECM
EIF
NPM
7QF
7QO
7QQ
7SC
7SE
7SP
7SR
7TA
7TB
7U5
8BQ
8FD
F28
FR3
H8D
JG9
JQ2
KR7
L7M
L~C
L~D
P64
7X8
ID FETCH-LOGICAL-c370t-4596f23c631678e06c7116b93e8af81fe18dadbbf1e0cbf417d367569041c11d3
IEDL.DBID RIE
ISSN 1545-5963
1557-9964
IngestDate Fri Jul 11 08:40:16 EDT 2025
Fri Jul 11 03:09:34 EDT 2025
Mon Jun 30 07:17:54 EDT 2025
Mon Jul 21 06:05:52 EDT 2025
Sun Jul 06 05:02:55 EDT 2025
Thu Apr 24 23:06:32 EDT 2025
Tue Aug 26 17:13:04 EDT 2025
IsPeerReviewed true
IsScholarly true
Issue 3
Keywords Combinatorial algorithms
Theory of Computation
Biology and genetics
Tradeoffs between Complexity Measures
Analysis of Algorithms and Problem Complexity
Language English
License https://ieeexplore.ieee.org/Xplorehelp/downloads/license-information/IEEE.html
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c370t-4596f23c631678e06c7116b93e8af81fe18dadbbf1e0cbf417d367569041c11d3
Notes ObjectType-Article-2
SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 14
ObjectType-Article-1
ObjectType-Feature-2
content type line 23
PMID 20671320
PQID 1082277319
PQPubID 85499
PageCount 16
ParticipantIDs proquest_journals_1082277319
pubmed_primary_20671320
ieee_primary_4609376
proquest_miscellaneous_759313100
proquest_miscellaneous_748932762
crossref_primary_10_1109_TCBB_2008_94
crossref_citationtrail_10_1109_TCBB_2008_94
PublicationCentury 2000
PublicationDate 2010-07-01
PublicationDateYYYYMMDD 2010-07-01
PublicationDate_xml – month: 07
  year: 2010
  text: 2010-07-01
  day: 01
PublicationDecade 2010
PublicationPlace United States
PublicationPlace_xml – name: United States
– name: New York
PublicationTitle IEEE/ACM transactions on computational biology and bioinformatics
PublicationTitleAbbrev TCBB
PublicationTitleAlternate IEEE/ACM Trans Comput Biol Bioinform
PublicationYear 2010
Publisher IEEE
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Publisher_xml – name: IEEE
– name: The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
References ref13
ref35
ref12
ref34
Cannone (ref7) 2002; 3
ref15
ref31
ref11
ref33
ref10
ref32
Cormen (ref14) 2001
ref2
ref1
Chowdhury (ref8) 2007
ref17
Pedersen (ref36) 1999
ref39
ref38
ref19
ref18
Kleinberg (ref26) 2005
ref24
ref23
ref20
ref42
ref41
ref22
Hong (ref25)
Knudsen (ref27) 2008
Seward (ref40) 2008
ref21
Chowdhury (ref9) 2007
ref28
ref29
DeSantis (ref16) 2003; 19
ref4
ref3
ref6
ref5
Le (ref30) 2006
Powell (ref37) 2008
References_xml – volume: 3
  issue: 2
  volume-title: BMC Bioinformatics
  year: 2002
  ident: ref7
  article-title: The Comparative RNA Web (CRW) Site: An Online Database of Comparative Sequence and Structure Information for Ribosomal, Intron, and Other RNAs
– ident: ref12
  doi: 10.1145/1378533.1378574
– ident: ref11
  doi: 10.1145/1248377.1248392
– year: 2007
  ident: ref8
  article-title: Algorithms and Data Structures for Cache-Efficient Computation: Theory and Experimental Evaluation
– year: 2008
  ident: ref27
  publication-title: Multiple Parsimony Alignment with “affalign”, software package multalign.tar
– ident: ref32
  doi: 10.1145/322063.322075
– ident: ref18
  doi: 10.1109/SFFCS.1999.814600
– ident: ref41
  doi: 10.1038/nature01858
– ident: ref33
  doi: 10.1016/0022-0000(80)90002-1
– ident: ref24
  doi: 10.1016/0020-0190(78)90037-6
– ident: ref17
  doi: 10.1017/CBO9780511790492
– ident: ref20
  doi: 10.1016/0022-2836(82)90398-9
– ident: ref42
  doi: 10.1201/9780203750131
– ident: ref5
  doi: 10.1016/0304-3975(92)90132-Y
– ident: ref21
  doi: 10.1093/bioinformatics/13.1.45
– ident: ref28
  doi: 10.1007/978-3-540-39763-2_31
– ident: ref4
  doi: 10.1007/BF02462326
– start-page: 326
  volume-title: Proc. 13th Ann. ACM Symp. Theory of Computation (STOC ’81)
  ident: ref25
  article-title: I/O Complexity: The Red-Blue Pebble Game
– ident: ref19
  doi: 10.1145/1088149.1088197
– year: 2007
  ident: ref9
  article-title: Efficient Cache-Oblivious String Algorithms for Bioinformatics
– ident: ref15
  doi: 10.1137/S0097539702402007
– ident: ref22
  doi: 10.1017/CBO9780511574931
– ident: ref34
  doi: 10.1093/bioinformatics/4.1.11
– volume-title: Valgrind (Debugging and Profiling Tool for x86-Linux Programs)
  year: 2008
  ident: ref40
– volume-title: Introduction to Algorithms
  year: 2001
  ident: ref14
– year: 1999
  ident: ref36
  article-title: Algorithms in Computational Biology
– ident: ref38
  doi: 10.1006/jtbi.2000.2177
– ident: ref10
  doi: 10.1145/1109557.1109622
– volume-title: undergraduate honors thesis
  year: 2006
  ident: ref30
  article-title: Algorithms for Identification of Patterns in Biogeography and Median Alignment of Three Sequences in Bioinformatics
– ident: ref31
  doi: 10.1089/106652700750050862
– ident: ref3
  doi: 10.1016/S0166-218X(00)00186-4
– ident: ref13
  doi: 10.1109/IPDPS.2010.5470354
– ident: ref35
  doi: 10.1073/pnas.85.8.2444
– ident: ref29
  doi: 10.1007/BF00265993
– ident: ref1
  doi: 10.1145/48529.48535
– volume: 19
  start-page: 1461
  volume-title: Bioinformatics
  year: 2003
  ident: ref16
  article-title: Comprehensive Aligned Sequence Construction for Automated Design of Effective Probes (Cascade-P) Using 16S rDNA
– volume-title: Algorithm Design
  year: 2005
  ident: ref26
– ident: ref23
  doi: 10.1145/360825.360861
– volume-title: Software Package
  year: 2008
  ident: ref37
– ident: ref6
  doi: 10.1109/SPIRE.2000.878178
– ident: ref39
  doi: 10.1006/jmbi.1998.2436
– ident: ref2
  doi: 10.1145/321921.321922
SSID ssj0024904
Score 2.0516574
Snippet We present efficient cache-oblivious algorithms for some well-studied string problems in bioinformatics including the longest common subsequence, global...
SourceID proquest
pubmed
crossref
ieee
SourceType Aggregation Database
Index Database
Enrichment Source
Publisher
StartPage 495
SubjectTerms Algorithm design and analysis
Algorithms
Base Pairing
Base Sequence
Bioinformatics
Biology computing
cache-efficient
cache-oblivious
Computational Biology - methods
Costs
Dynamic programming
Heuristic algorithms
median
Nucleic Acid Conformation
Partitioning algorithms
Polynomials
RNA
RNA - chemistry
RNA secondary structure prediction
Sequence alignment
Sequence Alignment - methods
Software
Software algorithms
Title Cache-Oblivious Dynamic Programming for Bioinformatics
URI https://ieeexplore.ieee.org/document/4609376
https://www.ncbi.nlm.nih.gov/pubmed/20671320
https://www.proquest.com/docview/1082277319
https://www.proquest.com/docview/748932762
https://www.proquest.com/docview/759313100
Volume 7
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1NT9wwEB0BEhKXQvkoaWmVQzlBljhxnOQIS1crJAoHkLhFsT2WELAg2K3U_vrO2NlUVIC4RYqjOONJZiZ-8x7Ad0oxtFSuSHz6JtHqRBtnE5q0a5VBhcajfH-q8aU8uSquFmC_74VBRA8-wwEf-r18e29m_KvsQCqqv0u1CItUuIVerX-8erWXCuSMICnIq3qQe31wMTw6CqjJmmV4mLCcG4efRSIvrfJ6lumjzWgVTufzDCCTm8Fsqgfmz38Uju99kDX40KWd8WHwk4-wgJN1WA5ClL83QA2Z1zk507fXvxgUGx8Hofr4PMC37ijAxZTexnRFR7XK9M6bcDn6cTEcJ52iQmLyMp0mkizjstwo7n-vMFWmFELpOseqdZVwKCrbWq2dwNRoJ0Vpc6ooqIKWwghh8y1YmtxPcBviwrnCURVbytZSypXXlVMOM4M6rTCzJoK9uXUb09GNs-rFbePLjrRueFmCCmYtI9jtRz8Emo1Xxm2wNfsxnSEj2JkvXNO9g09MfZplZUnfmAji_jS9Pbwl0k6QrNl47p2MAsIbQ4o6F7wNEsGn4BL93eee9PnlWX2BlYA1YHDvDixNH2f4lVKYqf7mffcvc-Hrhw
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1RT9RAEJ4gxOiLCogWTukDPmmPbrvdto9ycjkV0Icj4a3p7s4mRjiI3pnIr2dmt1ciEeNbk27T7ey0M9P95vsA9ijF0FK5IvHpm0SrE22cTWjSrlUGFRqP8j1Rk1P56aw4W4F3fS8MInrwGQ750O_l20uz4F9l-1JR_V2qB7BGcb8QoVvrllmv9mKBnBMkBflVD3Ov96ejg4OAm6xZiIcpy7l1-I9Y5MVV7s8zfbwZP4Xj5UwDzOT7cDHXQ3N9h8Txfx_lGTzpEs_4ffCUdVjB2QY8DFKUvzdBjZjZOfmiz7_9Ylhs_CFI1cdfA4DrgkJcTAluTFd0ZKtM8PwcTseH09Ek6TQVEpOX6TyRZBmX5UZxB3yFqTKlEErXOVatq4RDUdnWau0EpkY7KUqbU01BNbQURgibb8Hq7HKGLyEunCsc1bGlbC0lXXldOeUwM6jTCjNrIni7tG5jOsJx1r04b3zhkdYNL0vQwaxlBG_60VeBaOOecZtszX5MZ8gIBsuFa7q38CeTn2ZZWdJXJoK4P03vD2-KtDMkazaefSejkPCPIUWdC94IieBFcIn-7ktP2v77rHbh0WR6fNQcfTz5vAOPA_KAob4DWJ3_WOArSmjm-rX34xvmee7Q
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Cache-Oblivious+Dynamic+Programming+for+Bioinformatics&rft.jtitle=IEEE%2FACM+transactions+on+computational+biology+and+bioinformatics&rft.au=Chowdhury%2C+Rezaul+Alan&rft.au=Hai-Son+Le&rft.au=Ramachandran%2C+Vijaya&rft.date=2010-07-01&rft.issn=1545-5963&rft.volume=7&rft.issue=3&rft.spage=495&rft.epage=510&rft_id=info:doi/10.1109%2FTCBB.2008.94&rft.externalDBID=n%2Fa&rft.externalDocID=10_1109_TCBB_2008_94
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1545-5963&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1545-5963&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1545-5963&client=summon