BAD2matrix: Phylogenomic matrix concatenation, indel coding, and more
Premise Common steps in phylogenomic matrix production include biological sequence concatenation, morphological data concatenation, insertion/deletion (indel) coding, gene content (presence/absence) coding, removing uninformative characters for parsimony analysis, recording with reduced amino acid a...
Saved in:
Published in | Applications in plant sciences Vol. 12; no. 6; pp. e11604 - n/a |
---|---|
Main Authors | , , , , , |
Format | Journal Article |
Language | English |
Published |
United States
John Wiley & Sons, Inc
01.11.2024
Botanical Society of America John Wiley and Sons Inc Wiley |
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | Premise
Common steps in phylogenomic matrix production include biological sequence concatenation, morphological data concatenation, insertion/deletion (indel) coding, gene content (presence/absence) coding, removing uninformative characters for parsimony analysis, recording with reduced amino acid alphabets, and occupancy filtering. Existing software does not accomplish these tasks on a phylogenomic scale using a single program.
Methods and Results
BAD2matrix is a Python script that performs the above‐mentioned steps in phylogenomic matrix construction for DNA or amino acid sequences as well as morphological data. The script works in UNIX‐like environments (e.g., LINUX, MacOS, Windows Subsystem for LINUX).
Conclusions
BAD2matrix helps simplify phylogenomic pipelines and can be downloaded from https://github.com/dpl10/BAD2matrix/tree/master under a GNU General Public License v2. |
---|---|
Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 ObjectType-Commentary-3 content type line 23 U.S. National Science Foundation (NSF) SC0014377 None USDOE Office of Science (SC), Biological and Environmental Research (BER) |
ISSN: | 2168-0450 2168-0450 |
DOI: | 10.1002/aps3.11604 |