A family of tree-based generators for bubbles in directed graphs
Bubbles are pairs of internally vertex-disjoint $(s, t)$-paths in a directed graph. In de Bruijn graphs built from reads of RNA and DNA data, bubbles represent interesting biological events, such as alternative splicing (AS) and allelic differences (SNPs and indels). However, the set of all bubbles...
Saved in:
Published in | Journal of graph algorithms and applications Vol. 25; no. 1; pp. 563 - 580 |
---|---|
Main Authors | , , , , , |
Format | Journal Article |
Language | English |
Published |
Brown University
01.01.2021
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Abstract | Bubbles are pairs of internally vertex-disjoint $(s, t)$-paths in a directed graph. In de Bruijn graphs built from reads of RNA and DNA data, bubbles represent interesting biological events, such as alternative splicing (AS) and allelic differences (SNPs and indels). However, the set of all bubbles in a de Bruijn graph built from real data is usually too large to be efficiently enumerated and analysed in practice. In particular, despite significant research done in this area, listing bubbles still remains the main bottleneck for tools that detect AS events in a reference-free context. Recently, in [Acuña et al., Algorithmica 2020] the concept of a bubble generator was introduced as a way for obtaining a compact representation of the bubble space of a graph. Although this bubble generator was quite effective in finding AS events, preliminary experiments showed that it is about 5 times slower than state-of-art methods. In this paper we propose a new family of bubble generators which improve substantially on previous work: bubble generators in this new family are about two orders of magnitude faster and are still able to achieve similar precision in identifying AS events. To highlight the practical value of our new bubble generators, we also report some experimental results on real datasets. Keywords: bubble generator, directed graphs, alternative splicing |
---|---|
AbstractList | Bubbles are pairs of internally vertex-disjoint $(s, t)$-paths in a directed graph. In de Bruijn graphs built from reads of RNA and DNA data, bubbles represent interesting biological events, such as alternative splicing (AS) and allelic differences (SNPs and indels). However, the set of all bubbles in a de Bruijn graph built from real data is usually too large to be efficiently enumerated and analysed in practice. In particular, despite significant research done in this area, listing bubbles still remains the main bottleneck for tools that detect AS events in a reference-free context. Recently, in [Acuña et al., Algorithmica 2020] the concept of a bubble generator was introduced as a way for obtaining a compact representation of the bubble space of a graph. Although this bubble generator was quite effective in finding AS events, preliminary experiments showed that it is about 5 times slower than state-of-art methods. In this paper we propose a new family of bubble generators which improve substantially on previous work: bubble generators in this new family are about two orders of magnitude faster and are still able to achieve similar precision in identifying AS events. To highlight the practical value of our new bubble generators, we also report some experimental results on real datasets. Keywords: bubble generator, directed graphs, alternative splicing Bubbles are pairs of internally vertex-disjoint (s, t)-paths in a directed graph. In de Bruijn graphs built from reads of RNA and DNA data, bubbles represent interesting biological events, such as alternative splicing (AS) and allelic differences (SNPs and indels). However, the set of all bubbles in a de Bruijn graph built from real data is usually too large to be efficiently enumerated and analysed in practice. In particular, despite significant research done in this area, listing bubbles still remains the main bottleneck for tools that detect AS events in a reference-free context. Recently, in [1] the concept of a bubble generator was introduced as a way for obtaining a compact representation of the bubble space of a graph. Although this bubble generator was quite effective in finding AS events, preliminary experiments showed that it is about 5 times slower than state-of-art methods. In this paper we propose a new family of bubble generators which improve substantially on previous work: bubble generators in this new family are about two orders of magnitude faster and are still able to achieve similar precision in identifying AS events. To highlight the practical value of our new bubble generators, we also report some experimental results on real datasets. |
Author | Pepè Sciarria, Luca Sinaimeri, Blerina Sagot, Marie-France Italiano, Giuseppe Acuña, Vicente Soares de Lima, Leandro Ishi |
Author_xml | – sequence: 1 givenname: Vicente surname: Acuña fullname: Acuña, Vicente – sequence: 2 givenname: Leandro Ishi surname: Soares de Lima fullname: Soares de Lima, Leandro Ishi – sequence: 3 givenname: Giuseppe surname: Italiano fullname: Italiano, Giuseppe – sequence: 4 givenname: Luca surname: Pepè Sciarria fullname: Pepè Sciarria, Luca – sequence: 5 givenname: Marie-France surname: Sagot fullname: Sagot, Marie-France – sequence: 6 givenname: Blerina surname: Sinaimeri fullname: Sinaimeri, Blerina |
BackLink | https://inria.hal.science/hal-03504540$$DView record in HAL |
BookMark | eNpNkEFLAzEQhYNUsK1e_AW5KmydyW426c1SrBUKXvQcJtmk3bLdLckq9N_rWhFP8xi-9w7fhI3arvWM3SLMFEr5sN8SzQCkEhdsjFKUGSqcj_7lKzZJaQ8gcqH0mD0ueKBD3Zx4F3gfvc8sJV_xrW99pL6LiYcucvthbeMTr1te1dG7fkAiHXfpml0GapK_-b1T9r56eluus83r88tysckcKiky64CkEK6iUkk_LySWCgmxQACYa62d0wGUhoBVWVmrrXcUBNpc2bwUVT5ld-fdHTXmGOsDxZPpqDbrxcYMP8glFLKAT_xm78-si11K0Ye_AoIZPJnBk_nxlH8BPH5baA |
ContentType | Journal Article |
Copyright | Distributed under a Creative Commons Attribution 4.0 International License |
Copyright_xml | – notice: Distributed under a Creative Commons Attribution 4.0 International License |
DBID | AAYXX CITATION 1XC VOOES |
DOI | 10.7155/jgaa.00572 |
DatabaseName | CrossRef Hyper Article en Ligne (HAL) Hyper Article en Ligne (HAL) (Open Access) |
DatabaseTitle | CrossRef |
DatabaseTitleList | CrossRef |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Mathematics Computer Science |
EISSN | 1526-1719 |
EndPage | 580 |
ExternalDocumentID | oai_HAL_hal_03504540v1 10_7155_jgaa_00572 |
GroupedDBID | -~9 29K 2WC 5GY AAFWJ AAKPC AAYXX ACGFO ACIPV AFPKN ALMA_UNASSIGNED_HOLDINGS CITATION D-I E3Z EBS EJD F5P GROUPED_DOAJ M~E OK1 P2P REM RNS TR2 XSB 1XC C1A LO0 VOOES |
ID | FETCH-LOGICAL-c1752-bc0a522cda675e9451671a11410009888cc8f0780f1d6dbb8becaf21b37b362d3 |
ISSN | 1526-1719 |
IngestDate | Wed Nov 06 06:42:38 EST 2024 Wed Oct 02 14:43:44 EDT 2024 |
IsDoiOpenAccess | true |
IsOpenAccess | true |
IsPeerReviewed | true |
IsScholarly | true |
Issue | 1 |
Keywords | alternative splicing bubble generator directed graphs |
Language | English |
License | Distributed under a Creative Commons Attribution 4.0 International License: http://creativecommons.org/licenses/by/4.0 |
LinkModel | OpenURL |
MergedId | FETCHMERGED-LOGICAL-c1752-bc0a522cda675e9451671a11410009888cc8f0780f1d6dbb8becaf21b37b362d3 |
ORCID | 0000-0002-9797-7592 |
OpenAccessLink | http://dx.doi.org/10.7155/jgaa.00572 |
PageCount | 18 |
ParticipantIDs | hal_primary_oai_HAL_hal_03504540v1 crossref_primary_10_7155_jgaa_00572 |
PublicationCentury | 2000 |
PublicationDate | 2021-01-01 2021 |
PublicationDateYYYYMMDD | 2021-01-01 |
PublicationDate_xml | – month: 01 year: 2021 text: 2021-01-01 day: 01 |
PublicationDecade | 2020 |
PublicationTitle | Journal of graph algorithms and applications |
PublicationYear | 2021 |
Publisher | Brown University |
Publisher_xml | – name: Brown University |
SSID | ssj0023278 |
Score | 2.2500832 |
Snippet | Bubbles are pairs of internally vertex-disjoint $(s, t)$-paths in a directed graph. In de Bruijn graphs built from reads of RNA and DNA data, bubbles represent... Bubbles are pairs of internally vertex-disjoint (s, t)-paths in a directed graph. In de Bruijn graphs built from reads of RNA and DNA data, bubbles represent... |
SourceID | hal crossref |
SourceType | Open Access Repository Aggregation Database |
StartPage | 563 |
SubjectTerms | Computer Science Genetics Life Sciences |
Title | A family of tree-based generators for bubbles in directed graphs |
URI | https://inria.hal.science/hal-03504540 |
Volume | 25 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3Nb9MwFLfKuMCBwQAxBpMF3KqMOnGX5EaAjQ7aadI2tFtkO84aabRVPjhw4A_jr-M9O19FRRpcosixosTvp-ef3ychb7geJa5izJFY_pFz7TkiYaGjuRfIIGGJ0pjvPDs9nFzyz1fjq8HgVy9qqSrlgfqxMa_kf6QKYyBXzJL9B8m2L4UBuAf5whUkDNdbyThq7BPo6M-1dnBPSrArsjbe88IGY1ZS3mgb92oUHE7BOtXFX5ipeTgUN9fLPCvn32wV576nu0WJqtDR_p4ZBvo1w0jPFijnS0xtGiYak6jMhKkWWB5heFLMsxaSpTG0GIPtp6wq9Gqle9HGgJw8t_G800qJ4ZleGdd-0LdWuJ2dwlgVetEmfZ3rHjrMrzWn3jBWK2qbIb0GSKt1xzzsbeBjq97_3Bt8YE64N1wLcYApuG63AzZe_0l0Hp99PI6nJ6df1p-2lbgn0TSeA1DQIYvFC7_DyfuuC0oOw0lnP4_aw77nGhbQ_ogtjYsf8bb7hDUydGfe2PINt7l4SB7UoqeRRdgjMtCLHbLdNPygtf7fIfdnbZHf4jF5F1ELP7pMaQc_2sGPAvxoDT-aLWgDP2rh94RcHh9dfJg4dUMORwHLdB2pRgL4ukoEHDN1iD2efSYYhgpjXdogUCpIgXOOUoZ9ymQACkKkLpOeL4EoJd5TsrVYLvQzQn3FxZhzJbgMuXDTQDHlw9nX1aGfpmG4S143KxOvbN2VGM6ruH4xrl9s1m-XvEJhNBM2C-j5bSbtkXsIV2tce0G2yrzSL4FulnLfmGn2jXR_A0XUg80 |
link.rule.ids | 230,315,783,787,867,888,4031,27935,27936,27937 |
linkProvider | ISSN International Centre |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=A+family+of+tree-based+generators+for+bubbles+in+directed+graphs&rft.jtitle=Journal+of+graph+algorithms+and+applications&rft.au=Acu%C3%B1a%2C+Vicente&rft.au=Soares+de+Lima%2C+Leandro+Ishi&rft.au=Italiano%2C+Giuseppe+F&rft.au=Sciarria%2C+Luca+Pep%C3%A8&rft.date=2021&rft.pub=Brown+University&rft.issn=1526-1719&rft.eissn=1526-1719&rft.volume=25&rft.issue=1&rft.spage=549&rft.epage=562&rft_id=info:doi/10.7155%2Fjgaa.00572&rft.externalDBID=HAS_PDF_LINK&rft.externalDocID=oai_HAL_hal_03504540v1 |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1526-1719&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1526-1719&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1526-1719&client=summon |