Fizzy: feature subset selection for metagenomics

Some of the current software tools for comparative metagenomics provide ecologists with the ability to investigate and explore bacterial communities using α- & β-diversity. Feature subset selection--a sub-field of machine learning--can also provide a unique insight into the differences between m...

Full description

Saved in:
Bibliographic Details
Published inBMC bioinformatics Vol. 16; no. 358; p. 358
Main Authors Ditzler, Gregory, Morrison, J Calvin, Lan, Yemin, Rosen, Gail L
Format Journal Article
LanguageEnglish
Published England BioMed Central Ltd 04.11.2015
BioMed Central
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Some of the current software tools for comparative metagenomics provide ecologists with the ability to investigate and explore bacterial communities using α- & β-diversity. Feature subset selection--a sub-field of machine learning--can also provide a unique insight into the differences between metagenomic or 16S phenotypes. In particular, feature subset selection methods can obtain the operational taxonomic units (OTUs), or functional features, that have a high-level of influence on the condition being studied. For example, in a previous study we have used information-theoretic feature selection to understand the differences between protein family abundances that best discriminate between age groups in the human gut microbiome. We have developed a new Python command line tool, which is compatible with the widely adopted BIOM format, for microbial ecologists that implements information-theoretic subset selection methods for biological data formats. We demonstrate the software tools capabilities on publicly available datasets. We have made the software implementation of Fizzy available to the public under the GNU GPL license. The standalone implementation can be found at http://github.com/EESI/Fizzy.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
USDOE Office of Science (SC)
SC0004335
ISSN:1471-2105
1471-2105
DOI:10.1186/s12859-015-0793-8