Mars: simplifying bioinformatics workflows through a containerized approach to tool integration and management

Bioinformatics is a rapidly evolving field with numerous specialized tools developed for essential genomic analysis tasks, such as read simulation, mapping, and variant calling. However, managing these tools presents significant challenges due to varied dependencies, execution steps, and output form...

Full description

Saved in:
Bibliographic Details
Published inBioinformatics advances Vol. 5; no. 1; p. vbaf074
Main Authors Ismail, Fathima Nuzla, Amarasoma, Shanika
Format Journal Article
LanguageEnglish
Published England Oxford University Press 01.01.2025
Subjects
Online AccessGet full text
ISSN2635-0041
2635-0041
DOI10.1093/bioadv/vbaf074

Cover

Abstract Bioinformatics is a rapidly evolving field with numerous specialized tools developed for essential genomic analysis tasks, such as read simulation, mapping, and variant calling. However, managing these tools presents significant challenges due to varied dependencies, execution steps, and output formats, complicating the installation and configuration processes. To address these issues, we introduce "Mars" a bioinformatics solution encapsulated within a singularity container that preloads a comprehensive suite of widely used genomic tools. Mars not only simplifies the installation of these tools but also automates critical workflow functions, including sequence sample preparation, read simulation, read mapping, variant calling, and result comparison. By streamlining the execution of these workflows, Mars enables users to easily manage input-output formats and compare results across different tools, thereby enhancing reproducibility and efficiency. Furthermore, by providing a cohesive environment that integrates tool management with a flexible workflow interface, Mars empowers researchers to focus on their analyses rather than the complexities of tool configuration. This integrated solution facilitates the testing of various combinations of tools and algorithms, enabling users to evaluate performance based on different metrics and identify the optimal tools for their specific genomic analysis needs. Through Mars, we aim to enhance the accessibility and usability of bioinformatics tools, ultimately advancing research in genomic analysis. Mars is freely available at https://github.com/GenomicAI/mars. It is implemented within a Singularity container environment and supports modular extension for additional genomic tools and custom workflows.
AbstractList Bioinformatics is a rapidly evolving field with numerous specialized tools developed for essential genomic analysis tasks, such as read simulation, mapping, and variant calling. However, managing these tools presents significant challenges due to varied dependencies, execution steps, and output formats, complicating the installation and configuration processes. To address these issues, we introduce "Mars" a bioinformatics solution encapsulated within a singularity container that preloads a comprehensive suite of widely used genomic tools. Mars not only simplifies the installation of these tools but also automates critical workflow functions, including sequence sample preparation, read simulation, read mapping, variant calling, and result comparison. By streamlining the execution of these workflows, Mars enables users to easily manage input-output formats and compare results across different tools, thereby enhancing reproducibility and efficiency. Furthermore, by providing a cohesive environment that integrates tool management with a flexible workflow interface, Mars empowers researchers to focus on their analyses rather than the complexities of tool configuration. This integrated solution facilitates the testing of various combinations of tools and algorithms, enabling users to evaluate performance based on different metrics and identify the optimal tools for their specific genomic analysis needs. Through Mars, we aim to enhance the accessibility and usability of bioinformatics tools, ultimately advancing research in genomic analysis.SummaryBioinformatics is a rapidly evolving field with numerous specialized tools developed for essential genomic analysis tasks, such as read simulation, mapping, and variant calling. However, managing these tools presents significant challenges due to varied dependencies, execution steps, and output formats, complicating the installation and configuration processes. To address these issues, we introduce "Mars" a bioinformatics solution encapsulated within a singularity container that preloads a comprehensive suite of widely used genomic tools. Mars not only simplifies the installation of these tools but also automates critical workflow functions, including sequence sample preparation, read simulation, read mapping, variant calling, and result comparison. By streamlining the execution of these workflows, Mars enables users to easily manage input-output formats and compare results across different tools, thereby enhancing reproducibility and efficiency. Furthermore, by providing a cohesive environment that integrates tool management with a flexible workflow interface, Mars empowers researchers to focus on their analyses rather than the complexities of tool configuration. This integrated solution facilitates the testing of various combinations of tools and algorithms, enabling users to evaluate performance based on different metrics and identify the optimal tools for their specific genomic analysis needs. Through Mars, we aim to enhance the accessibility and usability of bioinformatics tools, ultimately advancing research in genomic analysis.Mars is freely available at https://github.com/GenomicAI/mars. It is implemented within a Singularity container environment and supports modular extension for additional genomic tools and custom workflows.Availability and implementationMars is freely available at https://github.com/GenomicAI/mars. It is implemented within a Singularity container environment and supports modular extension for additional genomic tools and custom workflows.
Bioinformatics is a rapidly evolving field with numerous specialized tools developed for essential genomic analysis tasks, such as read simulation, mapping, and variant calling. However, managing these tools presents significant challenges due to varied dependencies, execution steps, and output formats, complicating the installation and configuration processes. To address these issues, we introduce "Mars" a bioinformatics solution encapsulated within a singularity container that preloads a comprehensive suite of widely used genomic tools. Mars not only simplifies the installation of these tools but also automates critical workflow functions, including sequence sample preparation, read simulation, read mapping, variant calling, and result comparison. By streamlining the execution of these workflows, Mars enables users to easily manage input-output formats and compare results across different tools, thereby enhancing reproducibility and efficiency. Furthermore, by providing a cohesive environment that integrates tool management with a flexible workflow interface, Mars empowers researchers to focus on their analyses rather than the complexities of tool configuration. This integrated solution facilitates the testing of various combinations of tools and algorithms, enabling users to evaluate performance based on different metrics and identify the optimal tools for their specific genomic analysis needs. Through Mars, we aim to enhance the accessibility and usability of bioinformatics tools, ultimately advancing research in genomic analysis. Mars is freely available at https://github.com/GenomicAI/mars. It is implemented within a Singularity container environment and supports modular extension for additional genomic tools and custom workflows.
Author Amarasoma, Shanika
Ismail, Fathima Nuzla
Author_xml – sequence: 1
  givenname: Fathima Nuzla
  orcidid: 0000-0002-0716-1478
  surname: Ismail
  fullname: Ismail, Fathima Nuzla
– sequence: 2
  givenname: Shanika
  orcidid: 0000-0001-9509-0069
  surname: Amarasoma
  fullname: Amarasoma, Shanika
BackLink https://www.ncbi.nlm.nih.gov/pubmed/40406670$$D View this record in MEDLINE/PubMed
BookMark eNpVkctr3DAQxkVJadI01x6LjrlsMnpEtnopYUkfkNJL7mKsh1eNLbmSd0Py19dlNyEFwQjmm983zPeeHKWcPCEfGVww0OKyixnd7nLXYYBGviEnXImrFYBkR6_-x-Ss1t8AwJtGMSnekWMJEpRq4ISkn1jqZ1rjOA0xPMbU04UaU8hlxDnaSh9yuQ9Dfqh03pS87TcUqc1pxph8iU_eUZymktFu6JyXlwca0-z7soznRDE5OmLC3o8-zR_I24BD9WeHekruvt7crb-vbn99-7G-vl1ZIdt5FUBo7lWHwjfOad0qrlvpnGtawNZ7C1y0TEouWFC2BaaDAs3RdQ1Yq8Qp-bLHTttu9M4uzgUHM5U4Ynk0GaP5v5PixvR5ZxgHfcUEWwjnB0LJf7a-zmaM1fphwOTzthrBQS0rtY1epJ9em724PB95EVzsBbbkWosPLxIG5l-QZh-kOQQp_gJ4JpYy
Cites_doi 10.1093/bioinformatics/btac743
10.48550/arXiv.1207.3907,
10.1093/bioinformatics/btad074
10.1371/journal.pone.0177459
10.1093/bioinformatics/btab705
10.1038/nbt.3820
10.1038/s41592-021-01101-x
10.1093/bioinformatics/btac308
10.1093/bioinformatics/btp324
10.1038/s41592-024-02430-3
10.1093/bioinformatics/btq033
10.1093/gigascience/giab007
10.1186/s13059-020-1941-7
10.1093/gigascience/giab008
10.21105/joss.01316
10.1038/s41596-024-00986-0
10.1093/nargab/lqac092
10.1093/bioinformatics/btad041
10.13140/RG.2.2.19084.53123
10.1093/bioinformatics/bts378
ContentType Journal Article
Copyright The Author(s) 2025. Published by Oxford University Press.
The Author(s) 2025. Published by Oxford University Press. 2025
Copyright_xml – notice: The Author(s) 2025. Published by Oxford University Press.
– notice: The Author(s) 2025. Published by Oxford University Press. 2025
DBID AAYXX
CITATION
NPM
7X8
5PM
DOI 10.1093/bioadv/vbaf074
DatabaseName CrossRef
PubMed
MEDLINE - Academic
PubMed Central (Full Participant titles)
DatabaseTitle CrossRef
PubMed
MEDLINE - Academic
DatabaseTitleList MEDLINE - Academic
PubMed
Database_xml – sequence: 1
  dbid: NPM
  name: PubMed
  url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
DeliveryMethod fulltext_linktorsrc
Discipline Biology
EISSN 2635-0041
ExternalDocumentID PMC12095131
40406670
10_1093_bioadv_vbaf074
Genre Journal Article
GroupedDBID 0R~
AAYXX
ABEJV
ABGNP
ABXVV
AFKRA
ALMA_UNASSIGNED_HOLDINGS
AMNDL
BBNVY
BENPR
BHPHI
CCPQU
CITATION
GROUPED_DOAJ
HCIFZ
M7P
M~E
OK1
PHGZM
PHGZT
PIMPY
RPM
TOX
ZCN
ABDBF
NPM
7X8
PQGLB
5PM
ID FETCH-LOGICAL-c348t-f0392e6ba3e7dd99862984ddd780a8eec0238144231f6c8019f6092adb70cc63
ISSN 2635-0041
IngestDate Thu Aug 21 18:30:17 EDT 2025
Fri Sep 05 16:05:04 EDT 2025
Mon May 26 01:59:42 EDT 2025
Thu Jul 03 08:36:55 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 1
Language English
License https://creativecommons.org/licenses/by/4.0
The Author(s) 2025. Published by Oxford University Press.
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-c348t-f0392e6ba3e7dd99862984ddd780a8eec0238144231f6c8019f6092adb70cc63
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ORCID 0000-0001-9509-0069
0000-0002-0716-1478
OpenAccessLink http://dx.doi.org/10.1093/bioadv/vbaf074
PMID 40406670
PQID 3206984879
PQPubID 23479
ParticipantIDs pubmedcentral_primary_oai_pubmedcentral_nih_gov_12095131
proquest_miscellaneous_3206984879
pubmed_primary_40406670
crossref_primary_10_1093_bioadv_vbaf074
PublicationCentury 2000
PublicationDate 2025-01-01
PublicationDateYYYYMMDD 2025-01-01
PublicationDate_xml – month: 01
  year: 2025
  text: 2025-01-01
  day: 01
PublicationDecade 2020
PublicationPlace England
PublicationPlace_xml – name: England
PublicationTitle Bioinformatics advances
PublicationTitleAlternate Bioinform Adv
PublicationYear 2025
Publisher Oxford University Press
Publisher_xml – name: Oxford University Press
References Bonfield (2025061901540398800_vbaf074-B3) 2021; 10
2025061901540398800_vbaf074-B17
Hickey (2025061901540398800_vbaf074-B12) 2020; 21
Amarasoma (2025061901540398800_vbaf074-B2) 2024
Marco-Sola (2025061901540398800_vbaf074-B16) 2023; 39
Garrison (2025061901540398800_vbaf074-B7) 2023; 39
Garrison (2025061901540398800_vbaf074-B8) 2024; 21
Henriksen (2025061901540398800_vbaf074-B11) 2023; 39
Garrison (2025061901540398800_vbaf074-B9) 2012
Danecek (2025061901540398800_vbaf074-B5) 2021; 10
Li (2025061901540398800_vbaf074-B15) 2009; 25
Poplin (2025061901540398800_vbaf074-B19) 2018
Ono (2025061901540398800_vbaf074-B18) 2022; 4
Kurtzer (2025061901540398800_vbaf074-B13) 2017; 12
Li (2025061901540398800_vbaf074-B14) 2021; 37
Wick (2025061901540398800_vbaf074-B22) 2019; 4
Buchfink (2025061901540398800_vbaf074-B4) 2021; 18
Quinlan (2025061901540398800_vbaf074-B20) 2010; 26
Rausch (2025061901540398800_vbaf074-B21) 2012; 28
Guarracino (2025061901540398800_vbaf074-B10) 2022; 38
Alser (2025061901540398800_vbaf074-B1) 2024; 19
Di Tommaso (2025061901540398800_vbaf074-B6) 2017; 35
References_xml – volume: 39
  start-page: btac743
  year: 2023
  ident: 2025061901540398800_vbaf074-B7
  article-title: Unbiased pangenome graphs
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btac743
– year: 2012
  ident: 2025061901540398800_vbaf074-B9
  doi: 10.48550/arXiv.1207.3907,
– volume: 39
  start-page: btad074
  year: 2023
  ident: 2025061901540398800_vbaf074-B16
  article-title: Optimal gap-affine alignment in O(s) space
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btad074
– volume: 12
  start-page: e0177459
  year: 2017
  ident: 2025061901540398800_vbaf074-B13
  article-title: Singularity: scientific containers for mobility of compute
  publication-title: PLoS One
  doi: 10.1371/journal.pone.0177459
– volume: 37
  start-page: 4572
  year: 2021
  ident: 2025061901540398800_vbaf074-B14
  article-title: New strategies to improve minimap2 alignment accuracy
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btab705
– volume: 35
  start-page: 316
  year: 2017
  ident: 2025061901540398800_vbaf074-B6
  article-title: Nextflow enables reproducible computational workflows
  publication-title: Nat Biotechnol
  doi: 10.1038/nbt.3820
– volume: 18
  start-page: 366
  year: 2021
  ident: 2025061901540398800_vbaf074-B4
  article-title: Sensitive protein alignments at tree-of-life scale using DIAMOND
  publication-title: Nat Methods
  doi: 10.1038/s41592-021-01101-x
– volume: 38
  start-page: 3319
  year: 2022
  ident: 2025061901540398800_vbaf074-B10
  article-title: ODGI: understanding pangenome graphs
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btac308
– volume: 25
  start-page: 1754
  year: 2009
  ident: 2025061901540398800_vbaf074-B15
  article-title: Fast and accurate short read alignment with Burrows–Wheeler transform
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btp324
– volume: 21
  start-page: 2008
  year: 2024
  ident: 2025061901540398800_vbaf074-B8
  article-title: Building pangenome graphs
  publication-title: Nat Methods
  doi: 10.1038/s41592-024-02430-3
– volume: 26
  start-page: 841
  year: 2010
  ident: 2025061901540398800_vbaf074-B20
  article-title: BEDTools: a flexible suite of utilities for comparing genomic features
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btq033
– volume: 10
  start-page: giab007
  year: 2021
  ident: 2025061901540398800_vbaf074-B3
  article-title: HTSlib: C library for reading/writing high-throughput sequencing data
  publication-title: Gigascience
  doi: 10.1093/gigascience/giab007
– volume: 21
  start-page: 35
  year: 2020
  ident: 2025061901540398800_vbaf074-B12
  article-title: Genotyping structural variants in pangenome graphs using the vg toolkit
  publication-title: Genome Biol
  doi: 10.1186/s13059-020-1941-7
– year: 2018
  ident: 2025061901540398800_vbaf074-B19
– volume: 10
  start-page: giab008
  year: 2021
  ident: 2025061901540398800_vbaf074-B5
  article-title: Twelve years of SAMtools and BCFtools
  publication-title: Gigascience
  doi: 10.1093/gigascience/giab008
– ident: 2025061901540398800_vbaf074-B17
– volume: 4
  start-page: 1316
  year: 2019
  ident: 2025061901540398800_vbaf074-B22
  article-title: Badread: simulation of error-prone long reads
  publication-title: JOSS
  doi: 10.21105/joss.01316
– volume: 19
  start-page: 2529
  year: 2024
  ident: 2025061901540398800_vbaf074-B1
  article-title: Packaging and containerization of computational methods
  publication-title: Nat Protoc
  doi: 10.1038/s41596-024-00986-0
– volume: 4
  start-page: lqac092
  year: 2022
  ident: 2025061901540398800_vbaf074-B18
  article-title: PBSIM3: a simulator for all types of PacBio and ONT long reads
  publication-title: NAR Genom Bioinform
  doi: 10.1093/nargab/lqac092
– volume: 39
  start-page: btad041
  year: 2023
  ident: 2025061901540398800_vbaf074-B11
  article-title: NGSNGS: next generation simulator for next generation sequencing data
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/btad041
– year: 2024
  ident: 2025061901540398800_vbaf074-B2
  doi: 10.13140/RG.2.2.19084.53123
– volume: 28
  start-page: i333
  year: 2012
  ident: 2025061901540398800_vbaf074-B21
  article-title: DELLY: structural variant discovery by integrated paired-end and split-read analysis
  publication-title: Bioinformatics
  doi: 10.1093/bioinformatics/bts378
SSID ssj0002776143
Score 2.2780318
Snippet Bioinformatics is a rapidly evolving field with numerous specialized tools developed for essential genomic analysis tasks, such as read simulation, mapping,...
SourceID pubmedcentral
proquest
pubmed
crossref
SourceType Open Access Repository
Aggregation Database
Index Database
StartPage vbaf074
SubjectTerms Original
Title Mars: simplifying bioinformatics workflows through a containerized approach to tool integration and management
URI https://www.ncbi.nlm.nih.gov/pubmed/40406670
https://www.proquest.com/docview/3206984879
https://pubmed.ncbi.nlm.nih.gov/PMC12095131
Volume 5
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1db9MwFLXKJiReEN8rH5WRkHhAYanj2glvA7WaUFcQZFLfIjtOtMCSoCUF0V_PtZ2kCd3D4CWK3ChO7z2y770-PkboFeM0UITFDhGucCjMmI6gnDuQhcVEUulLU4c8W7HTc_pxPVuPRssea2lTy7fx9tp9Jf_jVWgDv-pdsv_g2e6l0AD34F-4gofheiMfn0FWqlP6KtO8cLtjSWZlI4ZqBJg17yq9LH9V3Yk8wtDThd70l20h3GxVxU0UqhU5WwWJlqicDxky7QrwsJ-GS1DtoJYLW15eaH5jLt6sNtvLbg44ycWVqMrchK5fL0SRfRf9AgSZ9QoQZpzScjaO1u2yU8o1bc1AO9vDkx00f0qRuvasnr0B3Ypdgengb8BN79GhdvbqU7Q4Xy6jcL4Ob6FDwrlZtG9rN9_MEiuHWERTDrqv61Q8vWPbxXHTwTBK2Us9_mbQ9kKS8B662-QS-MQC4z4aJcUDdNueLvr7ISo0PN7hHjjwEBy4AwduwIEFHoADt-DAdYk1OHAPHBjAgXfgeITCxTz8cOo0x2s4sUf92kldiI0TJoWXcKUg7WYk8KlSivuu8JMkNuEchXh7mrIYIpkgZW5AhJLcjWPmPUYHRVkkRwhzTeXwpjEnVFABITdNE8W5CoRMFeNkjF631ox-WBGVyJIfvMjaPWrsPkYvW2NHMM7pxStRJOWmijziMvg8nwdj9MQav3sXhZmIMe6OkT9wS_eA1lAf_lJkF0ZLXW8dn0296dMbdPwM3dnh_zk6qK82yQsISWs5QYfv56vPXyampDMxuPsDk-madw
linkProvider National Library of Medicine
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Mars%3A+simplifying+bioinformatics+workflows+through+a+containerized+approach+to+tool+integration+and+management&rft.jtitle=Bioinformatics+advances&rft.au=Ismail%2C+Fathima+Nuzla&rft.au=Amarasoma%2C+Shanika&rft.date=2025-01-01&rft.issn=2635-0041&rft.eissn=2635-0041&rft.volume=5&rft.issue=1&rft.spage=vbaf074&rft_id=info:doi/10.1093%2Fbioadv%2Fvbaf074&rft.externalDBID=NO_FULL_TEXT
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2635-0041&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2635-0041&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2635-0041&client=summon