A family of tree-based generators for bubbles in directed graphs

Bibliographic Details
Published in Journal of graph algorithms and applications, Vol. 25, no. 1, pp. 563-580
Main Authors Acuña, Vicente; Soares de Lima, Leandro Ishi; Italiano, Giuseppe; Pepè Sciarria, Luca; Sagot, Marie-France; Sinaimeri, Blerina
Format Journal Article
Language English
Published Brown University 01.01.2021
Abstract Bubbles are pairs of internally vertex-disjoint $(s, t)$-paths in a directed graph. In de Bruijn graphs built from reads of RNA and DNA data, bubbles represent interesting biological events, such as alternative splicing (AS) and allelic differences (SNPs and indels). However, the set of all bubbles in a de Bruijn graph built from real data is usually too large to be efficiently enumerated and analysed in practice. In particular, despite significant research in this area, listing bubbles remains the main bottleneck for tools that detect AS events in a reference-free context. Recently, in [Acuña et al., Algorithmica 2020], the concept of a bubble generator was introduced as a way to obtain a compact representation of the bubble space of a graph. Although this bubble generator was quite effective in finding AS events, preliminary experiments showed that it is about 5 times slower than state-of-the-art methods. In this paper we propose a new family of bubble generators which improve substantially on previous work: bubble generators in this new family are about two orders of magnitude faster and still achieve similar precision in identifying AS events. To highlight the practical value of our new bubble generators, we also report some experimental results on real datasets. Keywords: bubble generator, directed graphs, alternative splicing
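The bubble definition in the abstract can be made concrete with a small sketch. The Python below is an illustration only, not the paper's generator algorithm: the function names `de_bruijn_edges`, `kmer_path` and `is_bubble` are hypothetical, the graph is assumed to be a plain set of arcs between k-mers, and the two endpoints s and t are assumed to be distinct. It builds a toy de Bruijn graph from two reads that differ by a single nucleotide and checks that their k-mer paths form an (s, t)-bubble.

```python
def kmer_path(read, k):
    """Return the sequence of k-mers spelled by a read, in order."""
    return [read[i:i + k] for i in range(len(read) - k + 1)]


def de_bruijn_edges(reads, k):
    """Arcs of a toy de Bruijn graph: consecutive k-mers of each read."""
    edges = set()
    for read in reads:
        path = kmer_path(read, k)
        edges.update(zip(path, path[1:]))
    return edges


def is_bubble(edges, p, q):
    """Check that p and q are two distinct, internally vertex-disjoint
    (s, t)-paths in the directed graph given by `edges` (assumes s != t)."""
    if len(p) < 2 or len(q) < 2 or p == q:
        return False
    if p[0] != q[0] or p[-1] != q[-1] or p[0] == p[-1]:
        return False
    for path in (p, q):
        # Each path must be simple and must use only arcs of the graph.
        if len(set(path)) != len(path):
            return False
        if any((u, v) not in edges for u, v in zip(path, path[1:])):
            return False
    # Internal vertices (everything except s and t) must not be shared.
    return not (set(p[1:-1]) & set(q[1:-1]))


# Two made-up reads differing at one position (a SNP-like variation).
reads = ["ACTGGCA", "ACTCGCA"]
k = 3
edges = de_bruijn_edges(reads, k)
p, q = kmer_path(reads[0], k), kmer_path(reads[1], k)
snp_is_bubble = is_bubble(edges, p, q)

# Counter-example: both paths pass through the internal vertex "a".
shared_edges = {("s", "a"), ("a", "t"), ("s", "b"), ("b", "a")}
shared_is_bubble = is_bubble(shared_edges, ["s", "a", "t"], ["s", "b", "a", "t"])
```

In this toy input the two reads share their first and last k-mer and nothing in between, which is the pattern the abstract associates with allelic differences; real read sets produce far more such pairs, which is why a compact generator is of interest.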
Author Pepè Sciarria, Luca
Sinaimeri, Blerina
Sagot, Marie-France
Italiano, Giuseppe
Acuña, Vicente
Soares de Lima, Leandro Ishi
Author_xml – sequence: 1
  givenname: Vicente
  surname: Acuña
  fullname: Acuña, Vicente
– sequence: 2
  givenname: Leandro Ishi
  surname: Soares de Lima
  fullname: Soares de Lima, Leandro Ishi
– sequence: 3
  givenname: Giuseppe
  surname: Italiano
  fullname: Italiano, Giuseppe
– sequence: 4
  givenname: Luca
  surname: Pepè Sciarria
  fullname: Pepè Sciarria, Luca
– sequence: 5
  givenname: Marie-France
  surname: Sagot
  fullname: Sagot, Marie-France
– sequence: 6
  givenname: Blerina
  surname: Sinaimeri
  fullname: Sinaimeri, Blerina
BackLink https://inria.hal.science/hal-03504540 (View record in HAL)
ContentType Journal Article
Copyright Distributed under a Creative Commons Attribution 4.0 International License
Copyright_xml – notice: Distributed under a Creative Commons Attribution 4.0 International License
DBID AAYXX
CITATION
1XC
VOOES
DOI 10.7155/jgaa.00572
DatabaseName CrossRef
Hyper Article en Ligne (HAL)
Hyper Article en Ligne (HAL) (Open Access)
DatabaseTitle CrossRef
DatabaseTitleList CrossRef

DeliveryMethod fulltext_linktorsrc
Discipline Mathematics
Computer Science
EISSN 1526-1719
EndPage 580
ExternalDocumentID oai_HAL_hal_03504540v1
10_7155_jgaa_00572
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 1
Keywords alternative splicing
bubble generator
directed graphs
Language English
License Distributed under a Creative Commons Attribution 4.0 International License: http://creativecommons.org/licenses/by/4.0
ORCID 0000-0002-9797-7592
OpenAccessLink http://dx.doi.org/10.7155/jgaa.00572
PageCount 18
ParticipantIDs hal_primary_oai_HAL_hal_03504540v1
crossref_primary_10_7155_jgaa_00572
PublicationCentury 2000
PublicationDate 2021-01-01
2021
PublicationDateYYYYMMDD 2021-01-01
PublicationDate_xml – month: 01
  year: 2021
  text: 2021-01-01
  day: 01
PublicationDecade 2020
PublicationTitle Journal of graph algorithms and applications
PublicationYear 2021
Publisher Brown University
Publisher_xml – name: Brown University
SourceID hal
crossref
SourceType Open Access Repository
Aggregation Database
StartPage 563
SubjectTerms Computer Science
Genetics
Life Sciences
Title A family of tree-based generators for bubbles in directed graphs
URI https://inria.hal.science/hal-03504540
Volume 25