Cache-Oblivious Dynamic Programming for Bioinformatics
We present efficient cache-oblivious algorithms for some well-studied string problems in bioinformatics including the longest common subsequence, global pairwise sequence alignment and three-way sequence alignment (or median), both with affine gap costs, and RNA secondary structure prediction with s...
Saved in:
Published in | IEEE/ACM transactions on computational biology and bioinformatics Vol. 7; no. 3; pp. 495 - 510 |
---|---|
Main Authors | , , |
Format | Journal Article |
Language | English |
Published |
United States
IEEE
01.07.2010
The Institute of Electrical and Electronics Engineers, Inc. (IEEE) |
Subjects | |
Online Access | Get full text |
Cover
Loading…
Abstract | We present efficient cache-oblivious algorithms for some well-studied string problems in bioinformatics including the longest common subsequence, global pairwise sequence alignment and three-way sequence alignment (or median), both with affine gap costs, and RNA secondary structure prediction with simple pseudoknots. For each of these problems, we present cache-oblivious algorithms that match the best-known time complexity, match or improve the best-known space complexity, and improve significantly over the cache-efficiency of earlier algorithms. We present experimental results which show that our cache-oblivious algorithms run faster than software and implementations based on previous best algorithms for these problems. |
---|---|
AbstractList | We present efficient cache-oblivious algorithms for some well-studied string problems in bioinformatics including the longest common subsequence, global pairwise sequence alignment and three-way sequence alignment (or median), both with affine gap costs, and RNA secondary structure prediction with simple pseudoknots. For each of these problems, we present cache-oblivious algorithms that match the best-known time complexity, match or improve the best-known space complexity, and improve significantly over the cache-efficiency of earlier algorithms. We present experimental results which show that our cache-oblivious algorithms run faster than software and implementations based on previous best algorithms for these problems. We present efficient cache-oblivious algorithms for some well-studied string problems in bioinformatics including the longest common subsequence, global pairwise sequence alignment and three-way sequence alignment (or median), both with affine gap costs, and RNA secondary structure prediction with simple pseudoknots. For each of these problems, we present cache-oblivious algorithms that match the best-known time complexity, match or improve the best-known space complexity, and improve significantly over the cache-efficiency of earlier algorithms. We present experimental results which show that our cache-oblivious algorithms run faster than software and implementations based on previous best algorithms for these problems.We present efficient cache-oblivious algorithms for some well-studied string problems in bioinformatics including the longest common subsequence, global pairwise sequence alignment and three-way sequence alignment (or median), both with affine gap costs, and RNA secondary structure prediction with simple pseudoknots. For each of these problems, we present cache-oblivious algorithms that match the best-known time complexity, match or improve the best-known space complexity, and improve significantly over the cache-efficiency of earlier algorithms. We present experimental results which show that our cache-oblivious algorithms run faster than software and implementations based on previous best algorithms for these problems. |
Author | Ramachandran, Vijaya Hai-Son Le Chowdhury, Rezaul Alan |
Author_xml | – sequence: 1 givenname: Rezaul Alan surname: Chowdhury fullname: Chowdhury, Rezaul Alan email: shaikat@cs.utexas.edu organization: Center for Comput. Visualization, Univ. of Texas at Austin, Austin, TX, USA – sequence: 2 surname: Hai-Son Le fullname: Hai-Son Le email: hple+@cs.cmu.edu organization: Dept. of Machine Learning, Carnegie Mellon Univ., Pittsburgh, PA, USA – sequence: 3 givenname: Vijaya surname: Ramachandran fullname: Ramachandran, Vijaya email: vlr@cs.utexas.edu organization: Dept. of Comput. Sci., Univ. of Texas at Austin, Austin, TX, USA |
BackLink | https://www.ncbi.nlm.nih.gov/pubmed/20671320$$D View this record in MEDLINE/PubMed |
BookMark | eNqF0UtPGzEQAGCrAhUSeuNWqYrEgQsbZmyvvT424SkhwQHOltfrDUa7a2pvKvHvcRSaA1LFyXP4ZjyPCdkbwuAIOUaYI4I6f1wuFnMKUM0V_0YOsSxloZTge5uYl0WpBDsgk5ReAChXwL-TAwpCIqNwSMTS2GdX3Ned_-vDOs0u3gbTezt7iGEVTd_7YTVrQ5wtfPBDDnozepuOyH5ruuR-fLxT8nR1-bi8Ke7ur2-Xv-8KyySMBc-ft5RZwVDIyoGwElHUirnKtBW2DqvGNHXdogNbtxxlw4QsRe4SLWLDpuR0W_c1hj9rl0bd-2Rd15nB5W61LBVDhgBfS14pRqWgWZ58ki9hHYc8hkaoKJWSocrq14da171r9Gv0vYlv-t_qMjjbAhtDStG1O4KgN5fRm8vozWW04pnTT9z6Me8yDGM0vvtf0s9tknfO7epzAYpJwd4B1BaWgA |
CODEN | ITCBCY |
CitedBy_id | crossref_primary_10_1145_2858788_2688514 crossref_primary_10_1016_j_tcs_2011_07_024 crossref_primary_10_1109_TPDS_2011_218 crossref_primary_10_1145_3022671_2983993 crossref_primary_10_1007_s00453_015_0073_z |
Cites_doi | 10.1145/1378533.1378574 10.1145/1248377.1248392 10.1145/322063.322075 10.1109/SFFCS.1999.814600 10.1038/nature01858 10.1016/0022-0000(80)90002-1 10.1016/0020-0190(78)90037-6 10.1017/CBO9780511790492 10.1016/0022-2836(82)90398-9 10.1201/9780203750131 10.1016/0304-3975(92)90132-Y 10.1093/bioinformatics/13.1.45 10.1007/978-3-540-39763-2_31 10.1007/BF02462326 10.1145/1088149.1088197 10.1137/S0097539702402007 10.1017/CBO9780511574931 10.1093/bioinformatics/4.1.11 10.1006/jtbi.2000.2177 10.1145/1109557.1109622 10.1089/106652700750050862 10.1016/S0166-218X(00)00186-4 10.1109/IPDPS.2010.5470354 10.1073/pnas.85.8.2444 10.1007/BF00265993 10.1145/48529.48535 10.1145/360825.360861 10.1109/SPIRE.2000.878178 10.1006/jmbi.1998.2436 10.1145/321921.321922 |
ContentType | Journal Article |
Copyright | Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2010 |
Copyright_xml | – notice: Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2010 |
DBID | 97E RIA RIE AAYXX CITATION CGR CUY CVF ECM EIF NPM 7QF 7QO 7QQ 7SC 7SE 7SP 7SR 7TA 7TB 7U5 8BQ 8FD F28 FR3 H8D JG9 JQ2 KR7 L7M L~C L~D P64 7X8 |
DOI | 10.1109/TCBB.2008.94 |
DatabaseName | IEEE All-Society Periodicals Package (ASPP) 2005–Present IEEE All-Society Periodicals Package (ASPP) 1998–Present IEEE Electronic Library (IEL) CrossRef Medline MEDLINE MEDLINE (Ovid) MEDLINE MEDLINE PubMed Aluminium Industry Abstracts Biotechnology Research Abstracts Ceramic Abstracts Computer and Information Systems Abstracts Corrosion Abstracts Electronics & Communications Abstracts Engineered Materials Abstracts Materials Business File Mechanical & Transportation Engineering Abstracts Solid State and Superconductivity Abstracts METADEX Technology Research Database ANTE: Abstracts in New Technology & Engineering Engineering Research Database Aerospace Database Materials Research Database ProQuest Computer Science Collection Civil Engineering Abstracts Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts Academic Computer and Information Systems Abstracts Professional Biotechnology and BioEngineering Abstracts MEDLINE - Academic |
DatabaseTitle | CrossRef MEDLINE Medline Complete MEDLINE with Full Text PubMed MEDLINE (Ovid) Materials Research Database Civil Engineering Abstracts Aluminium Industry Abstracts Technology Research Database Computer and Information Systems Abstracts – Academic Mechanical & Transportation Engineering Abstracts Electronics & Communications Abstracts ProQuest Computer Science Collection Computer and Information Systems Abstracts Ceramic Abstracts Materials Business File METADEX Biotechnology and BioEngineering Abstracts Computer and Information Systems Abstracts Professional Aerospace Database Engineered Materials Abstracts Biotechnology Research Abstracts Solid State and Superconductivity Abstracts Engineering Research Database Corrosion Abstracts Advanced Technologies Database with Aerospace ANTE: Abstracts in New Technology & Engineering MEDLINE - Academic |
DatabaseTitleList | Materials Research Database MEDLINE - Academic Engineering Research Database MEDLINE |
Database_xml | – sequence: 1 dbid: NPM name: PubMed url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed sourceTypes: Index Database – sequence: 2 dbid: EIF name: MEDLINE url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search sourceTypes: Index Database – sequence: 3 dbid: RIE name: IEEE Electronic Library (IEL) url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Biology |
EISSN | 1557-9964 |
EndPage | 510 |
ExternalDocumentID | 2778383831 20671320 10_1109_TCBB_2008_94 4609376 |
Genre | orig-research Research Support, U.S. Gov't, Non-P.H.S Journal Article |
GroupedDBID | 0R~ 29I 4.4 53G 5GY 5VS 6IK 8US 97E AAJGR AAKMM AALFJ AARMG AASAJ AAWTH AAWTV ABAZT ABQJQ ABVLG ACGFO ACGFS ACIWK ACM ACPRK ADBCU ADL AEBYY AEFXT AEJOY AENEX AENSD AETIX AFRAH AFWIH AFWXC AGQYO AGSQL AHBIQ AIBXA AIKLT AKJIK AKQYR AKRVB ALMA_UNASSIGNED_HOLDINGS ASPBG ATWAV AVWKF BDXCO BEFXN BFFAM BGNUA BKEBE BPEOZ CCLIF CS3 DU5 EBS EJD FEDTE GUFHI HGAVV HZ~ I07 IEDLZ IFIPE IPLJI JAVBF LAI LHSKQ M43 O9- OCL P1C P2P PQQKQ RIA RIE RNI RNS ROL RZB TN5 XOL AAYOK AAYXX CITATION RIG CGR CUY CVF ECM EIF NPM 7QF 7QO 7QQ 7SC 7SE 7SP 7SR 7TA 7TB 7U5 8BQ 8FD F28 FR3 H8D JG9 JQ2 KR7 L7M L~C L~D P64 7X8 |
ID | FETCH-LOGICAL-c370t-4596f23c631678e06c7116b93e8af81fe18dadbbf1e0cbf417d367569041c11d3 |
IEDL.DBID | RIE |
ISSN | 1545-5963 1557-9964 |
IngestDate | Fri Jul 11 08:40:16 EDT 2025 Fri Jul 11 03:09:34 EDT 2025 Mon Jun 30 07:17:54 EDT 2025 Mon Jul 21 06:05:52 EDT 2025 Sun Jul 06 05:02:55 EDT 2025 Thu Apr 24 23:06:32 EDT 2025 Tue Aug 26 17:13:04 EDT 2025 |
IsPeerReviewed | true |
IsScholarly | true |
Issue | 3 |
Keywords | Combinatorial algorithms Theory of Computation Biology and genetics Tradeoffs between Complexity Measures Analysis of Algorithms and Problem Complexity |
Language | English |
License | https://ieeexplore.ieee.org/Xplorehelp/downloads/license-information/IEEE.html |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-c370t-4596f23c631678e06c7116b93e8af81fe18dadbbf1e0cbf417d367569041c11d3 |
Notes | ObjectType-Article-2 SourceType-Scholarly Journals-1 ObjectType-Feature-1 content type line 14 ObjectType-Article-1 ObjectType-Feature-2 content type line 23 |
PMID | 20671320 |
PQID | 1082277319 |
PQPubID | 85499 |
PageCount | 16 |
ParticipantIDs | proquest_journals_1082277319 pubmed_primary_20671320 ieee_primary_4609376 proquest_miscellaneous_759313100 proquest_miscellaneous_748932762 crossref_primary_10_1109_TCBB_2008_94 crossref_citationtrail_10_1109_TCBB_2008_94 |
PublicationCentury | 2000 |
PublicationDate | 2010-07-01 |
PublicationDateYYYYMMDD | 2010-07-01 |
PublicationDate_xml | – month: 07 year: 2010 text: 2010-07-01 day: 01 |
PublicationDecade | 2010 |
PublicationPlace | United States |
PublicationPlace_xml | – name: United States – name: New York |
PublicationTitle | IEEE/ACM transactions on computational biology and bioinformatics |
PublicationTitleAbbrev | TCBB |
PublicationTitleAlternate | IEEE/ACM Trans Comput Biol Bioinform |
PublicationYear | 2010 |
Publisher | IEEE The Institute of Electrical and Electronics Engineers, Inc. (IEEE) |
Publisher_xml | – name: IEEE – name: The Institute of Electrical and Electronics Engineers, Inc. (IEEE) |
References | ref13 ref35 ref12 ref34 Cannone (ref7) 2002; 3 ref15 ref31 ref11 ref33 ref10 ref32 Cormen (ref14) 2001 ref2 ref1 Chowdhury (ref8) 2007 ref17 Pedersen (ref36) 1999 ref39 ref38 ref19 ref18 Kleinberg (ref26) 2005 ref24 ref23 ref20 ref42 ref41 ref22 Hong (ref25) Knudsen (ref27) 2008 Seward (ref40) 2008 ref21 Chowdhury (ref9) 2007 ref28 ref29 DeSantis (ref16) 2003; 19 ref4 ref3 ref6 ref5 Le (ref30) 2006 Powell (ref37) 2008 |
References_xml | – volume: 3 issue: 2 volume-title: BMC Bioinformatics year: 2002 ident: ref7 article-title: The Comparative RNA Web (CRW) Site: An Online Database of Comparative Sequence and Structure Information for Ribosomal, Intron, and Other RNAs – ident: ref12 doi: 10.1145/1378533.1378574 – ident: ref11 doi: 10.1145/1248377.1248392 – year: 2007 ident: ref8 article-title: Algorithms and Data Structures for Cache-Efficient Computation: Theory and Experimental Evaluation – year: 2008 ident: ref27 publication-title: Multiple Parsimony Alignment with “affalign”, software package multalign.tar – ident: ref32 doi: 10.1145/322063.322075 – ident: ref18 doi: 10.1109/SFFCS.1999.814600 – ident: ref41 doi: 10.1038/nature01858 – ident: ref33 doi: 10.1016/0022-0000(80)90002-1 – ident: ref24 doi: 10.1016/0020-0190(78)90037-6 – ident: ref17 doi: 10.1017/CBO9780511790492 – ident: ref20 doi: 10.1016/0022-2836(82)90398-9 – ident: ref42 doi: 10.1201/9780203750131 – ident: ref5 doi: 10.1016/0304-3975(92)90132-Y – ident: ref21 doi: 10.1093/bioinformatics/13.1.45 – ident: ref28 doi: 10.1007/978-3-540-39763-2_31 – ident: ref4 doi: 10.1007/BF02462326 – start-page: 326 volume-title: Proc. 13th Ann. ACM Symp. Theory of Computation (STOC ’81) ident: ref25 article-title: I/O Complexity: The Red-Blue Pebble Game – ident: ref19 doi: 10.1145/1088149.1088197 – year: 2007 ident: ref9 article-title: Efficient Cache-Oblivious String Algorithms for Bioinformatics – ident: ref15 doi: 10.1137/S0097539702402007 – ident: ref22 doi: 10.1017/CBO9780511574931 – ident: ref34 doi: 10.1093/bioinformatics/4.1.11 – volume-title: Valgrind (Debugging and Profiling Tool for x86-Linux Programs) year: 2008 ident: ref40 – volume-title: Introduction to Algorithms year: 2001 ident: ref14 – year: 1999 ident: ref36 article-title: Algorithms in Computational Biology – ident: ref38 doi: 10.1006/jtbi.2000.2177 – ident: ref10 doi: 10.1145/1109557.1109622 – volume-title: undergraduate honors thesis year: 2006 ident: ref30 article-title: Algorithms for Identification of Patterns in Biogeography and Median Alignment of Three Sequences in Bioinformatics – ident: ref31 doi: 10.1089/106652700750050862 – ident: ref3 doi: 10.1016/S0166-218X(00)00186-4 – ident: ref13 doi: 10.1109/IPDPS.2010.5470354 – ident: ref35 doi: 10.1073/pnas.85.8.2444 – ident: ref29 doi: 10.1007/BF00265993 – ident: ref1 doi: 10.1145/48529.48535 – volume: 19 start-page: 1461 volume-title: Bioinformatics year: 2003 ident: ref16 article-title: Comprehensive Aligned Sequence Construction for Automated Design of Effective Probes (Cascade-P) Using 16S rDNA – volume-title: Algorithm Design year: 2005 ident: ref26 – ident: ref23 doi: 10.1145/360825.360861 – volume-title: Software Package year: 2008 ident: ref37 – ident: ref6 doi: 10.1109/SPIRE.2000.878178 – ident: ref39 doi: 10.1006/jmbi.1998.2436 – ident: ref2 doi: 10.1145/321921.321922 |
SSID | ssj0024904 |
Score | 2.0516574 |
Snippet | We present efficient cache-oblivious algorithms for some well-studied string problems in bioinformatics including the longest common subsequence, global... |
SourceID | proquest pubmed crossref ieee |
SourceType | Aggregation Database Index Database Enrichment Source Publisher |
StartPage | 495 |
SubjectTerms | Algorithm design and analysis Algorithms Base Pairing Base Sequence Bioinformatics Biology computing cache-efficient cache-oblivious Computational Biology - methods Costs Dynamic programming Heuristic algorithms median Nucleic Acid Conformation Partitioning algorithms Polynomials RNA RNA - chemistry RNA secondary structure prediction Sequence alignment Sequence Alignment - methods Software Software algorithms |
Title | Cache-Oblivious Dynamic Programming for Bioinformatics |
URI | https://ieeexplore.ieee.org/document/4609376 https://www.ncbi.nlm.nih.gov/pubmed/20671320 https://www.proquest.com/docview/1082277319 https://www.proquest.com/docview/748932762 https://www.proquest.com/docview/759313100 |
Volume | 7 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1NT9wwEB0BEhKXQvkoaWmVQzlBljhxnOQIS1crJAoHkLhFsT2WELAg2K3U_vrO2NlUVIC4RYqjOONJZiZ-8x7Ad0oxtFSuSHz6JtHqRBtnE5q0a5VBhcajfH-q8aU8uSquFmC_74VBRA8-wwEf-r18e29m_KvsQCqqv0u1CItUuIVerX-8erWXCuSMICnIq3qQe31wMTw6CqjJmmV4mLCcG4efRSIvrfJ6lumjzWgVTufzDCCTm8Fsqgfmz38Uju99kDX40KWd8WHwk4-wgJN1WA5ClL83QA2Z1zk507fXvxgUGx8Hofr4PMC37ijAxZTexnRFR7XK9M6bcDn6cTEcJ52iQmLyMp0mkizjstwo7n-vMFWmFELpOseqdZVwKCrbWq2dwNRoJ0Vpc6ooqIKWwghh8y1YmtxPcBviwrnCURVbytZSypXXlVMOM4M6rTCzJoK9uXUb09GNs-rFbePLjrRueFmCCmYtI9jtRz8Emo1Xxm2wNfsxnSEj2JkvXNO9g09MfZplZUnfmAji_jS9Pbwl0k6QrNl47p2MAsIbQ4o6F7wNEsGn4BL93eee9PnlWX2BlYA1YHDvDixNH2f4lVKYqf7mffcvc-Hrhw |
linkProvider | IEEE |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1RT9RAEJ4gxOiLCogWTukDPmmPbrvdto9ycjkV0Icj4a3p7s4mRjiI3pnIr2dmt1ciEeNbk27T7ey0M9P95vsA9ijF0FK5IvHpm0SrE22cTWjSrlUGFRqP8j1Rk1P56aw4W4F3fS8MInrwGQ750O_l20uz4F9l-1JR_V2qB7BGcb8QoVvrllmv9mKBnBMkBflVD3Ov96ejg4OAm6xZiIcpy7l1-I9Y5MVV7s8zfbwZP4Xj5UwDzOT7cDHXQ3N9h8Txfx_lGTzpEs_4ffCUdVjB2QY8DFKUvzdBjZjZOfmiz7_9Ylhs_CFI1cdfA4DrgkJcTAluTFd0ZKtM8PwcTseH09Ek6TQVEpOX6TyRZBmX5UZxB3yFqTKlEErXOVatq4RDUdnWau0EpkY7KUqbU01BNbQURgibb8Hq7HKGLyEunCsc1bGlbC0lXXldOeUwM6jTCjNrIni7tG5jOsJx1r04b3zhkdYNL0vQwaxlBG_60VeBaOOecZtszX5MZ8gIBsuFa7q38CeTn2ZZWdJXJoK4P03vD2-KtDMkazaefSejkPCPIUWdC94IieBFcIn-7ktP2v77rHbh0WR6fNQcfTz5vAOPA_KAob4DWJ3_WOArSmjm-rX34xvmee7Q |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Cache-Oblivious+Dynamic+Programming+for+Bioinformatics&rft.jtitle=IEEE%2FACM+transactions+on+computational+biology+and+bioinformatics&rft.au=Chowdhury%2C+Rezaul+Alan&rft.au=Hai-Son+Le&rft.au=Ramachandran%2C+Vijaya&rft.date=2010-07-01&rft.issn=1545-5963&rft.volume=7&rft.issue=3&rft.spage=495&rft.epage=510&rft_id=info:doi/10.1109%2FTCBB.2008.94&rft.externalDBID=n%2Fa&rft.externalDocID=10_1109_TCBB_2008_94 |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1545-5963&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1545-5963&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1545-5963&client=summon |