Reference genome-independent taxonomic profiling of microbiomes with mOTUs3

Background: Taxonomic profiling is a fundamental task in microbiome research that aims to detect and quantify the relative abundance of microorganisms in biological samples. Available methods using shotgun metagenomic data generally depend on the availability of sequenced and taxonomically annotated...

Full description

Saved in:
Bibliographic Details
Published inbioRxiv
Main Authors Ruscheweyh, Hans-Joachim, Milanese, Alessio, Paoli, Lucas, Karcher, Nicolai, Clayssen, Quentin, Metzger, Marisa Isabell, Wirbel, Jakob, Bork, Peer, Mende, Daniel, Zeller, Georg, Sunagawa, Shinichi
Format Paper
LanguageEnglish
Published Cold Spring Harbor Cold Spring Harbor Laboratory Press 08.04.2022
Cold Spring Harbor Laboratory
Edition1.2
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Background: Taxonomic profiling is a fundamental task in microbiome research that aims to detect and quantify the relative abundance of microorganisms in biological samples. Available methods using shotgun metagenomic data generally depend on the availability of sequenced and taxonomically annotated reference genomes. However, the majority of microorganisms have not been cultured yet and lack such reference genomes. Thus, a substantial fraction of microbial community members remains unaccounted for during taxonomic profiling of metagenomes, particularly in samples from underexplored environments. To address this issue, we have developed the mOTU profiler, a tool that enables reference genome-independent species-level profiling of metagenomes. As such, it supports the identification and quantification of both "known" and "unknown" species based on a set of select marker genes. Results: Here, we present mOTUs3, a command line tool that enables the profiling of metagenomes for >33,000 species-level operational taxonomic units. To achieve this, we leveraged the reconstruction and analysis of >600,000 draft genomes, most of which are metagenome assembled genomes (MAGs), from diverse microbiomes, including soil, freshwater systems, and the gastrointestinal tract of ruminants and other animals, which we found to be greatly underrepresented by reference genomes. Overall, two-thirds of all species-level taxa lacked a reference genome. The cumulative relative abundance of these newly included taxa was low in well-studied microbiomes, such as the human body sites (6-11%). By contrast, they accounted for substantial proportions (ocean, freshwater, soil: 43-63%) or even the vast majority (pig, fish, cattle: 60-80%) of the relative abundance across diverse non-human-associated microbiomes. Using community-developed benchmarks and datasets, we found mOTUs3 to be more accurate than other methods and to be more congruent with 16S rRNA gene-based methods for taxonomic profiling. Furthermore, we demonstrate that mOTUs3 greatly increases the resolution of well-known microbial groups into species-level taxa and helps identify new differentially abundant taxa in comparative metagenomic studies. Conclusions: We developed mOTUs3 to enable accurate species-level profiling of metagenomes. Compared to other methods, it provides a more comprehensive view of prokaryotic community diversity, in particular for currently underexplored microbiomes. To facilitate comparative analyses by the research community, it is released with >11,000 precomputed profiles for publicly available metagenomes and is freely available at: https://github.com/motu-tool/mOTUs. Competing Interest Statement The authors have declared no competing interest. Footnotes * This version of the manuscript provides substantial modifications, including additional analyses based on the updated mOTUs database and profiler. * https://motu-tool.org/
Bibliography:SourceType-Working Papers-1
ObjectType-Working Paper/Pre-Print-1
content type line 50
Competing Interest Statement: The authors have declared no competing interest.
ISSN:2692-8205
2692-8205
DOI:10.1101/2021.04.20.440600