Mars: simplifying bioinformatics workflows through a containerized approach to tool integration and management
Bioinformatics is a rapidly evolving field with numerous specialized tools developed for essential genomic analysis tasks, such as read simulation, mapping, and variant calling. However, managing these tools presents significant challenges due to varied dependencies, execution steps, and output form...
Saved in:
Published in | Bioinformatics advances Vol. 5; no. 1; p. vbaf074 |
---|---|
Main Authors | , |
Format | Journal Article |
Language | English |
Published |
England
Oxford University Press
01.01.2025
|
Subjects | |
Online Access | Get full text |
ISSN | 2635-0041 2635-0041 |
DOI | 10.1093/bioadv/vbaf074 |
Cover
Abstract | Bioinformatics is a rapidly evolving field with numerous specialized tools developed for essential genomic analysis tasks, such as read simulation, mapping, and variant calling. However, managing these tools presents significant challenges due to varied dependencies, execution steps, and output formats, complicating the installation and configuration processes. To address these issues, we introduce "Mars" a bioinformatics solution encapsulated within a singularity container that preloads a comprehensive suite of widely used genomic tools. Mars not only simplifies the installation of these tools but also automates critical workflow functions, including sequence sample preparation, read simulation, read mapping, variant calling, and result comparison. By streamlining the execution of these workflows, Mars enables users to easily manage input-output formats and compare results across different tools, thereby enhancing reproducibility and efficiency. Furthermore, by providing a cohesive environment that integrates tool management with a flexible workflow interface, Mars empowers researchers to focus on their analyses rather than the complexities of tool configuration. This integrated solution facilitates the testing of various combinations of tools and algorithms, enabling users to evaluate performance based on different metrics and identify the optimal tools for their specific genomic analysis needs. Through Mars, we aim to enhance the accessibility and usability of bioinformatics tools, ultimately advancing research in genomic analysis.
Mars is freely available at https://github.com/GenomicAI/mars. It is implemented within a Singularity container environment and supports modular extension for additional genomic tools and custom workflows. |
---|---|
AbstractList | Bioinformatics is a rapidly evolving field with numerous specialized tools developed for essential genomic analysis tasks, such as read simulation, mapping, and variant calling. However, managing these tools presents significant challenges due to varied dependencies, execution steps, and output formats, complicating the installation and configuration processes. To address these issues, we introduce "Mars" a bioinformatics solution encapsulated within a singularity container that preloads a comprehensive suite of widely used genomic tools. Mars not only simplifies the installation of these tools but also automates critical workflow functions, including sequence sample preparation, read simulation, read mapping, variant calling, and result comparison. By streamlining the execution of these workflows, Mars enables users to easily manage input-output formats and compare results across different tools, thereby enhancing reproducibility and efficiency. Furthermore, by providing a cohesive environment that integrates tool management with a flexible workflow interface, Mars empowers researchers to focus on their analyses rather than the complexities of tool configuration. This integrated solution facilitates the testing of various combinations of tools and algorithms, enabling users to evaluate performance based on different metrics and identify the optimal tools for their specific genomic analysis needs. Through Mars, we aim to enhance the accessibility and usability of bioinformatics tools, ultimately advancing research in genomic analysis.SummaryBioinformatics is a rapidly evolving field with numerous specialized tools developed for essential genomic analysis tasks, such as read simulation, mapping, and variant calling. However, managing these tools presents significant challenges due to varied dependencies, execution steps, and output formats, complicating the installation and configuration processes. To address these issues, we introduce "Mars" a bioinformatics solution encapsulated within a singularity container that preloads a comprehensive suite of widely used genomic tools. Mars not only simplifies the installation of these tools but also automates critical workflow functions, including sequence sample preparation, read simulation, read mapping, variant calling, and result comparison. By streamlining the execution of these workflows, Mars enables users to easily manage input-output formats and compare results across different tools, thereby enhancing reproducibility and efficiency. Furthermore, by providing a cohesive environment that integrates tool management with a flexible workflow interface, Mars empowers researchers to focus on their analyses rather than the complexities of tool configuration. This integrated solution facilitates the testing of various combinations of tools and algorithms, enabling users to evaluate performance based on different metrics and identify the optimal tools for their specific genomic analysis needs. Through Mars, we aim to enhance the accessibility and usability of bioinformatics tools, ultimately advancing research in genomic analysis.Mars is freely available at https://github.com/GenomicAI/mars. It is implemented within a Singularity container environment and supports modular extension for additional genomic tools and custom workflows.Availability and implementationMars is freely available at https://github.com/GenomicAI/mars. It is implemented within a Singularity container environment and supports modular extension for additional genomic tools and custom workflows. Bioinformatics is a rapidly evolving field with numerous specialized tools developed for essential genomic analysis tasks, such as read simulation, mapping, and variant calling. However, managing these tools presents significant challenges due to varied dependencies, execution steps, and output formats, complicating the installation and configuration processes. To address these issues, we introduce "Mars" a bioinformatics solution encapsulated within a singularity container that preloads a comprehensive suite of widely used genomic tools. Mars not only simplifies the installation of these tools but also automates critical workflow functions, including sequence sample preparation, read simulation, read mapping, variant calling, and result comparison. By streamlining the execution of these workflows, Mars enables users to easily manage input-output formats and compare results across different tools, thereby enhancing reproducibility and efficiency. Furthermore, by providing a cohesive environment that integrates tool management with a flexible workflow interface, Mars empowers researchers to focus on their analyses rather than the complexities of tool configuration. This integrated solution facilitates the testing of various combinations of tools and algorithms, enabling users to evaluate performance based on different metrics and identify the optimal tools for their specific genomic analysis needs. Through Mars, we aim to enhance the accessibility and usability of bioinformatics tools, ultimately advancing research in genomic analysis. Mars is freely available at https://github.com/GenomicAI/mars. It is implemented within a Singularity container environment and supports modular extension for additional genomic tools and custom workflows. |
Author | Amarasoma, Shanika Ismail, Fathima Nuzla |
Author_xml | – sequence: 1 givenname: Fathima Nuzla orcidid: 0000-0002-0716-1478 surname: Ismail fullname: Ismail, Fathima Nuzla – sequence: 2 givenname: Shanika orcidid: 0000-0001-9509-0069 surname: Amarasoma fullname: Amarasoma, Shanika |
BackLink | https://www.ncbi.nlm.nih.gov/pubmed/40406670$$D View this record in MEDLINE/PubMed |
BookMark | eNpVkctr3DAQxkVJadI01x6LjrlsMnpEtnopYUkfkNJL7mKsh1eNLbmSd0Py19dlNyEFwQjmm983zPeeHKWcPCEfGVww0OKyixnd7nLXYYBGviEnXImrFYBkR6_-x-Ss1t8AwJtGMSnekWMJEpRq4ISkn1jqZ1rjOA0xPMbU04UaU8hlxDnaSh9yuQ9Dfqh03pS87TcUqc1pxph8iU_eUZymktFu6JyXlwca0-z7soznRDE5OmLC3o8-zR_I24BD9WeHekruvt7crb-vbn99-7G-vl1ZIdt5FUBo7lWHwjfOad0qrlvpnGtawNZ7C1y0TEouWFC2BaaDAs3RdQ1Yq8Qp-bLHTttu9M4uzgUHM5U4Ynk0GaP5v5PixvR5ZxgHfcUEWwjnB0LJf7a-zmaM1fphwOTzthrBQS0rtY1epJ9em724PB95EVzsBbbkWosPLxIG5l-QZh-kOQQp_gJ4JpYy |
Cites_doi | 10.1093/bioinformatics/btac743 10.48550/arXiv.1207.3907, 10.1093/bioinformatics/btad074 10.1371/journal.pone.0177459 10.1093/bioinformatics/btab705 10.1038/nbt.3820 10.1038/s41592-021-01101-x 10.1093/bioinformatics/btac308 10.1093/bioinformatics/btp324 10.1038/s41592-024-02430-3 10.1093/bioinformatics/btq033 10.1093/gigascience/giab007 10.1186/s13059-020-1941-7 10.1093/gigascience/giab008 10.21105/joss.01316 10.1038/s41596-024-00986-0 10.1093/nargab/lqac092 10.1093/bioinformatics/btad041 10.13140/RG.2.2.19084.53123 10.1093/bioinformatics/bts378 |
ContentType | Journal Article |
Copyright | The Author(s) 2025. Published by Oxford University Press. The Author(s) 2025. Published by Oxford University Press. 2025 |
Copyright_xml | – notice: The Author(s) 2025. Published by Oxford University Press. – notice: The Author(s) 2025. Published by Oxford University Press. 2025 |
DBID | AAYXX CITATION NPM 7X8 5PM |
DOI | 10.1093/bioadv/vbaf074 |
DatabaseName | CrossRef PubMed MEDLINE - Academic PubMed Central (Full Participant titles) |
DatabaseTitle | CrossRef PubMed MEDLINE - Academic |
DatabaseTitleList | MEDLINE - Academic PubMed |
Database_xml | – sequence: 1 dbid: NPM name: PubMed url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed sourceTypes: Index Database |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Biology |
EISSN | 2635-0041 |
ExternalDocumentID | PMC12095131 40406670 10_1093_bioadv_vbaf074 |
Genre | Journal Article |
GroupedDBID | 0R~ AAYXX ABEJV ABGNP ABXVV AFKRA ALMA_UNASSIGNED_HOLDINGS AMNDL BBNVY BENPR BHPHI CCPQU CITATION GROUPED_DOAJ HCIFZ M7P M~E OK1 PHGZM PHGZT PIMPY RPM TOX ZCN ABDBF NPM 7X8 PQGLB 5PM |
ID | FETCH-LOGICAL-c348t-f0392e6ba3e7dd99862984ddd780a8eec0238144231f6c8019f6092adb70cc63 |
ISSN | 2635-0041 |
IngestDate | Thu Aug 21 18:30:17 EDT 2025 Fri Sep 05 16:05:04 EDT 2025 Mon May 26 01:59:42 EDT 2025 Thu Jul 03 08:36:55 EDT 2025 |
IsDoiOpenAccess | true |
IsOpenAccess | true |
IsPeerReviewed | true |
IsScholarly | true |
Issue | 1 |
Language | English |
License | https://creativecommons.org/licenses/by/4.0 The Author(s) 2025. Published by Oxford University Press. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
LinkModel | OpenURL |
MergedId | FETCHMERGED-LOGICAL-c348t-f0392e6ba3e7dd99862984ddd780a8eec0238144231f6c8019f6092adb70cc63 |
Notes | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 |
ORCID | 0000-0001-9509-0069 0000-0002-0716-1478 |
OpenAccessLink | http://dx.doi.org/10.1093/bioadv/vbaf074 |
PMID | 40406670 |
PQID | 3206984879 |
PQPubID | 23479 |
ParticipantIDs | pubmedcentral_primary_oai_pubmedcentral_nih_gov_12095131 proquest_miscellaneous_3206984879 pubmed_primary_40406670 crossref_primary_10_1093_bioadv_vbaf074 |
PublicationCentury | 2000 |
PublicationDate | 2025-01-01 |
PublicationDateYYYYMMDD | 2025-01-01 |
PublicationDate_xml | – month: 01 year: 2025 text: 2025-01-01 day: 01 |
PublicationDecade | 2020 |
PublicationPlace | England |
PublicationPlace_xml | – name: England |
PublicationTitle | Bioinformatics advances |
PublicationTitleAlternate | Bioinform Adv |
PublicationYear | 2025 |
Publisher | Oxford University Press |
Publisher_xml | – name: Oxford University Press |
References | Bonfield (2025061901540398800_vbaf074-B3) 2021; 10 2025061901540398800_vbaf074-B17 Hickey (2025061901540398800_vbaf074-B12) 2020; 21 Amarasoma (2025061901540398800_vbaf074-B2) 2024 Marco-Sola (2025061901540398800_vbaf074-B16) 2023; 39 Garrison (2025061901540398800_vbaf074-B7) 2023; 39 Garrison (2025061901540398800_vbaf074-B8) 2024; 21 Henriksen (2025061901540398800_vbaf074-B11) 2023; 39 Garrison (2025061901540398800_vbaf074-B9) 2012 Danecek (2025061901540398800_vbaf074-B5) 2021; 10 Li (2025061901540398800_vbaf074-B15) 2009; 25 Poplin (2025061901540398800_vbaf074-B19) 2018 Ono (2025061901540398800_vbaf074-B18) 2022; 4 Kurtzer (2025061901540398800_vbaf074-B13) 2017; 12 Li (2025061901540398800_vbaf074-B14) 2021; 37 Wick (2025061901540398800_vbaf074-B22) 2019; 4 Buchfink (2025061901540398800_vbaf074-B4) 2021; 18 Quinlan (2025061901540398800_vbaf074-B20) 2010; 26 Rausch (2025061901540398800_vbaf074-B21) 2012; 28 Guarracino (2025061901540398800_vbaf074-B10) 2022; 38 Alser (2025061901540398800_vbaf074-B1) 2024; 19 Di Tommaso (2025061901540398800_vbaf074-B6) 2017; 35 |
References_xml | – volume: 39 start-page: btac743 year: 2023 ident: 2025061901540398800_vbaf074-B7 article-title: Unbiased pangenome graphs publication-title: Bioinformatics doi: 10.1093/bioinformatics/btac743 – year: 2012 ident: 2025061901540398800_vbaf074-B9 doi: 10.48550/arXiv.1207.3907, – volume: 39 start-page: btad074 year: 2023 ident: 2025061901540398800_vbaf074-B16 article-title: Optimal gap-affine alignment in O(s) space publication-title: Bioinformatics doi: 10.1093/bioinformatics/btad074 – volume: 12 start-page: e0177459 year: 2017 ident: 2025061901540398800_vbaf074-B13 article-title: Singularity: scientific containers for mobility of compute publication-title: PLoS One doi: 10.1371/journal.pone.0177459 – volume: 37 start-page: 4572 year: 2021 ident: 2025061901540398800_vbaf074-B14 article-title: New strategies to improve minimap2 alignment accuracy publication-title: Bioinformatics doi: 10.1093/bioinformatics/btab705 – volume: 35 start-page: 316 year: 2017 ident: 2025061901540398800_vbaf074-B6 article-title: Nextflow enables reproducible computational workflows publication-title: Nat Biotechnol doi: 10.1038/nbt.3820 – volume: 18 start-page: 366 year: 2021 ident: 2025061901540398800_vbaf074-B4 article-title: Sensitive protein alignments at tree-of-life scale using DIAMOND publication-title: Nat Methods doi: 10.1038/s41592-021-01101-x – volume: 38 start-page: 3319 year: 2022 ident: 2025061901540398800_vbaf074-B10 article-title: ODGI: understanding pangenome graphs publication-title: Bioinformatics doi: 10.1093/bioinformatics/btac308 – volume: 25 start-page: 1754 year: 2009 ident: 2025061901540398800_vbaf074-B15 article-title: Fast and accurate short read alignment with Burrows–Wheeler transform publication-title: Bioinformatics doi: 10.1093/bioinformatics/btp324 – volume: 21 start-page: 2008 year: 2024 ident: 2025061901540398800_vbaf074-B8 article-title: Building pangenome graphs publication-title: Nat Methods doi: 10.1038/s41592-024-02430-3 – volume: 26 start-page: 841 year: 2010 ident: 2025061901540398800_vbaf074-B20 article-title: BEDTools: a flexible suite of utilities for comparing genomic features publication-title: Bioinformatics doi: 10.1093/bioinformatics/btq033 – volume: 10 start-page: giab007 year: 2021 ident: 2025061901540398800_vbaf074-B3 article-title: HTSlib: C library for reading/writing high-throughput sequencing data publication-title: Gigascience doi: 10.1093/gigascience/giab007 – volume: 21 start-page: 35 year: 2020 ident: 2025061901540398800_vbaf074-B12 article-title: Genotyping structural variants in pangenome graphs using the vg toolkit publication-title: Genome Biol doi: 10.1186/s13059-020-1941-7 – year: 2018 ident: 2025061901540398800_vbaf074-B19 – volume: 10 start-page: giab008 year: 2021 ident: 2025061901540398800_vbaf074-B5 article-title: Twelve years of SAMtools and BCFtools publication-title: Gigascience doi: 10.1093/gigascience/giab008 – ident: 2025061901540398800_vbaf074-B17 – volume: 4 start-page: 1316 year: 2019 ident: 2025061901540398800_vbaf074-B22 article-title: Badread: simulation of error-prone long reads publication-title: JOSS doi: 10.21105/joss.01316 – volume: 19 start-page: 2529 year: 2024 ident: 2025061901540398800_vbaf074-B1 article-title: Packaging and containerization of computational methods publication-title: Nat Protoc doi: 10.1038/s41596-024-00986-0 – volume: 4 start-page: lqac092 year: 2022 ident: 2025061901540398800_vbaf074-B18 article-title: PBSIM3: a simulator for all types of PacBio and ONT long reads publication-title: NAR Genom Bioinform doi: 10.1093/nargab/lqac092 – volume: 39 start-page: btad041 year: 2023 ident: 2025061901540398800_vbaf074-B11 article-title: NGSNGS: next generation simulator for next generation sequencing data publication-title: Bioinformatics doi: 10.1093/bioinformatics/btad041 – year: 2024 ident: 2025061901540398800_vbaf074-B2 doi: 10.13140/RG.2.2.19084.53123 – volume: 28 start-page: i333 year: 2012 ident: 2025061901540398800_vbaf074-B21 article-title: DELLY: structural variant discovery by integrated paired-end and split-read analysis publication-title: Bioinformatics doi: 10.1093/bioinformatics/bts378 |
SSID | ssj0002776143 |
Score | 2.2780318 |
Snippet | Bioinformatics is a rapidly evolving field with numerous specialized tools developed for essential genomic analysis tasks, such as read simulation, mapping,... |
SourceID | pubmedcentral proquest pubmed crossref |
SourceType | Open Access Repository Aggregation Database Index Database |
StartPage | vbaf074 |
SubjectTerms | Original |
Title | Mars: simplifying bioinformatics workflows through a containerized approach to tool integration and management |
URI | https://www.ncbi.nlm.nih.gov/pubmed/40406670 https://www.proquest.com/docview/3206984879 https://pubmed.ncbi.nlm.nih.gov/PMC12095131 |
Volume | 5 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1db9MwFLXKJiReEN8rH5WRkHhAYanj2glvA7WaUFcQZFLfIjtOtMCSoCUF0V_PtZ2kCd3D4CWK3ChO7z2y770-PkboFeM0UITFDhGucCjMmI6gnDuQhcVEUulLU4c8W7HTc_pxPVuPRssea2lTy7fx9tp9Jf_jVWgDv-pdsv_g2e6l0AD34F-4gofheiMfn0FWqlP6KtO8cLtjSWZlI4ZqBJg17yq9LH9V3Yk8wtDThd70l20h3GxVxU0UqhU5WwWJlqicDxky7QrwsJ-GS1DtoJYLW15eaH5jLt6sNtvLbg44ycWVqMrchK5fL0SRfRf9AgSZ9QoQZpzScjaO1u2yU8o1bc1AO9vDkx00f0qRuvasnr0B3Ypdgengb8BN79GhdvbqU7Q4Xy6jcL4Ob6FDwrlZtG9rN9_MEiuHWERTDrqv61Q8vWPbxXHTwTBK2Us9_mbQ9kKS8B662-QS-MQC4z4aJcUDdNueLvr7ISo0PN7hHjjwEBy4AwduwIEFHoADt-DAdYk1OHAPHBjAgXfgeITCxTz8cOo0x2s4sUf92kldiI0TJoWXcKUg7WYk8KlSivuu8JMkNuEchXh7mrIYIpkgZW5AhJLcjWPmPUYHRVkkRwhzTeXwpjEnVFABITdNE8W5CoRMFeNkjF631ox-WBGVyJIfvMjaPWrsPkYvW2NHMM7pxStRJOWmijziMvg8nwdj9MQav3sXhZmIMe6OkT9wS_eA1lAf_lJkF0ZLXW8dn0296dMbdPwM3dnh_zk6qK82yQsISWs5QYfv56vPXyampDMxuPsDk-madw |
linkProvider | National Library of Medicine |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Mars%3A+simplifying+bioinformatics+workflows+through+a+containerized+approach+to+tool+integration+and+management&rft.jtitle=Bioinformatics+advances&rft.au=Ismail%2C+Fathima+Nuzla&rft.au=Amarasoma%2C+Shanika&rft.date=2025-01-01&rft.issn=2635-0041&rft.eissn=2635-0041&rft.volume=5&rft.issue=1&rft.spage=vbaf074&rft_id=info:doi/10.1093%2Fbioadv%2Fvbaf074&rft.externalDBID=NO_FULL_TEXT |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2635-0041&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2635-0041&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2635-0041&client=summon |