A complete domain-to-species taxonomy for Bacteria and Archaea

The Genome Taxonomy Database is a phylogenetically consistent, genome-based taxonomy that provides rank-normalized classifications for ~150,000 bacterial and archaeal genomes from domain to genus. However, almost 40% of the genomes in the Genome Taxonomy Database lack a species name. We address this...

Full description

Saved in:
Bibliographic Details
Published inNature biotechnology Vol. 38; no. 9; pp. 1079 - 1086
Main Authors Parks, Donovan H., Chuvochina, Maria, Chaumeil, Pierre-Alain, Rinke, Christian, Mussig, Aaron J., Hugenholtz, Philip
Format Journal Article
LanguageEnglish
Published New York Nature Publishing Group US 01.09.2020
Nature Publishing Group
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:The Genome Taxonomy Database is a phylogenetically consistent, genome-based taxonomy that provides rank-normalized classifications for ~150,000 bacterial and archaeal genomes from domain to genus. However, almost 40% of the genomes in the Genome Taxonomy Database lack a species name. We address this limitation by using commonly accepted average nucleotide identity criteria to set bounds on species and propose species clusters that encompass all publicly available bacterial and archaeal genomes. Unlike previous average nucleotide identity studies, we chose a single representative genome to serve as the effective nomenclatural ‘type’ defining each species. Of the 24,706 proposed species clusters, 8,792 are based on published names. We assigned placeholder names to the remaining 15,914 species clusters to provide names to the growing number of genomes from uncultivated species. This resource provides a complete domain-to-species taxonomic framework for bacterial and archaeal genomes, which will facilitate research on uncultivated species and improve communication of scientific results. A full species classification is built for all publicly available bacterial and archaeal genomes.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:1087-0156
1546-1696
DOI:10.1038/s41587-020-0501-8