The use of taxon-specific reference databases compromises metagenomic classification

A recent article in BMC Genomics describes a new bioinformatics tool, HumanMycobiomeScan, to classify fungal taxa in metagenomic samples. This tool was used to characterize the gut mycobiome of hunter-gatherers and Western populations, resulting in the identification of a range of fungal species in...

Full description

Saved in:
Bibliographic Details
Published inBMC genomics Vol. 21; no. 1; pp. 184 - 5
Main Authors R. Marcelino, Vanessa, Holmes, Edward C., Sorrell, Tania C.
Format Journal Article
LanguageEnglish
Published London BioMed Central 27.02.2020
BioMed Central Ltd
BMC
Subjects
Online AccessGet full text
ISSN1471-2164
1471-2164
DOI10.1186/s12864-020-6592-2

Cover

Abstract A recent article in BMC Genomics describes a new bioinformatics tool, HumanMycobiomeScan, to classify fungal taxa in metagenomic samples. This tool was used to characterize the gut mycobiome of hunter-gatherers and Western populations, resulting in the identification of a range of fungal species in the vast majority of samples. In the HumanMycobiomeScan pipeline, sequence reads are mapped against a reference database containing fungal genome sequences only. We argue that using reference databases comprised of a single taxonomic group leads to an unacceptably high number of false-positives due to: (i) mapping to conserved genetic regions in reference genomes, and (ii) sequence contamination in the assembled reference genomes. To demonstrate this, we replaced the HumanMycobiomeScan’s fungal reference database with one containing genome sequences of amphibians and reptiles and re-analysed their case study. The classification pipeline recovered all species present in the reference database, revealing turtles (Geoemydidae), bull frogs (Pyxicephalidae) and snakes (Colubridae) as the most abundant herpetological taxa in the human gut. We also re-analysed their case study using a kingdom-agnostic pipeline. This revealed that while the gut of hunter-gatherers and Western subjects may be colonized by a range of microbial eukaryotes, only three fungal families were retrieved. These results highlight the pitfalls of using taxon-specific reference databases for metagenome classification, even when they are comprised of curated whole genome data. We propose that databases containing all domains of life provide the most suitable option for metagenomic species profiling, especially when targeting microbial eukaryotes.
AbstractList A recent article in BMC Genomics describes a new bioinformatics tool, HumanMycobiomeScan, to classify fungal taxa in metagenomic samples. This tool was used to characterize the gut mycobiome of hunter-gatherers and Western populations, resulting in the identification of a range of fungal species in the vast majority of samples. In the HumanMycobiomeScan pipeline, sequence reads are mapped against a reference database containing fungal genome sequences only. We argue that using reference databases comprised of a single taxonomic group leads to an unacceptably high number of false-positives due to: (i) mapping to conserved genetic regions in reference genomes, and (ii) sequence contamination in the assembled reference genomes. To demonstrate this, we replaced the HumanMycobiomeScan’s fungal reference database with one containing genome sequences of amphibians and reptiles and re-analysed their case study. The classification pipeline recovered all species present in the reference database, revealing turtles (Geoemydidae), bull frogs (Pyxicephalidae) and snakes (Colubridae) as the most abundant herpetological taxa in the human gut. We also re-analysed their case study using a kingdom-agnostic pipeline. This revealed that while the gut of hunter-gatherers and Western subjects may be colonized by a range of microbial eukaryotes, only three fungal families were retrieved. These results highlight the pitfalls of using taxon-specific reference databases for metagenome classification, even when they are comprised of curated whole genome data. We propose that databases containing all domains of life provide the most suitable option for metagenomic species profiling, especially when targeting microbial eukaryotes.
A recent article in BMC Genomics describes a new bioinformatics tool, HumanMycobiomeScan, to classify fungal taxa in metagenomic samples. This tool was used to characterize the gut mycobiome of hunter-gatherers and Western populations, resulting in the identification of a range of fungal species in the vast majority of samples. In the HumanMycobiomeScan pipeline, sequence reads are mapped against a reference database containing fungal genome sequences only. We argue that using reference databases comprised of a single taxonomic group leads to an unacceptably high number of false-positives due to: (i) mapping to conserved genetic regions in reference genomes, and (ii) sequence contamination in the assembled reference genomes. To demonstrate this, we replaced the HumanMycobiomeScan's fungal reference database with one containing genome sequences of amphibians and reptiles and re-analysed their case study. The classification pipeline recovered all species present in the reference database, revealing turtles (Geoemydidae), bull frogs (Pyxicephalidae) and snakes (Colubridae) as the most abundant herpetological taxa in the human gut. We also re-analysed their case study using a kingdom-agnostic pipeline. This revealed that while the gut of hunter-gatherers and Western subjects may be colonized by a range of microbial eukaryotes, only three fungal families were retrieved. These results highlight the pitfalls of using taxon-specific reference databases for metagenome classification, even when they are comprised of curated whole genome data. We propose that databases containing all domains of life provide the most suitable option for metagenomic species profiling, especially when targeting microbial eukaryotes. Keywords: Reference database, Fungi, Microbiome, Metagenomic classifier, Misclassification, Assembly errors
A recent article in BMC Genomics describes a new bioinformatics tool, HumanMycobiomeScan, to classify fungal taxa in metagenomic samples. This tool was used to characterize the gut mycobiome of hunter-gatherers and Western populations, resulting in the identification of a range of fungal species in the vast majority of samples. In the HumanMycobiomeScan pipeline, sequence reads are mapped against a reference database containing fungal genome sequences only. We argue that using reference databases comprised of a single taxonomic group leads to an unacceptably high number of false-positives due to: (i) mapping to conserved genetic regions in reference genomes, and (ii) sequence contamination in the assembled reference genomes. To demonstrate this, we replaced the HumanMycobiomeScan’s fungal reference database with one containing genome sequences of amphibians and reptiles and re-analysed their case study. The classification pipeline recovered all species present in the reference database, revealing turtles (Geoemydidae), bull frogs (Pyxicephalidae) and snakes (Colubridae) as the most abundant herpetological taxa in the human gut. We also re-analysed their case study using a kingdom-agnostic pipeline. This revealed that while the gut of hunter-gatherers and Western subjects may be colonized by a range of microbial eukaryotes, only three fungal families were retrieved. These results highlight the pitfalls of using taxon-specific reference databases for metagenome classification, even when they are comprised of curated whole genome data. We propose that databases containing all domains of life provide the most suitable option for metagenomic species profiling, especially when targeting microbial eukaryotes.
A recent article in BMC Genomics describes a new bioinformatics tool, HumanMycobiomeScan, to classify fungal taxa in metagenomic samples. This tool was used to characterize the gut mycobiome of hunter-gatherers and Western populations, resulting in the identification of a range of fungal species in the vast majority of samples. In the HumanMycobiomeScan pipeline, sequence reads are mapped against a reference database containing fungal genome sequences only. We argue that using reference databases comprised of a single taxonomic group leads to an unacceptably high number of false-positives due to: (i) mapping to conserved genetic regions in reference genomes, and (ii) sequence contamination in the assembled reference genomes. To demonstrate this, we replaced the HumanMycobiomeScan's fungal reference database with one containing genome sequences of amphibians and reptiles and re-analysed their case study. The classification pipeline recovered all species present in the reference database, revealing turtles (Geoemydidae), bull frogs (Pyxicephalidae) and snakes (Colubridae) as the most abundant herpetological taxa in the human gut. We also re-analysed their case study using a kingdom-agnostic pipeline. This revealed that while the gut of hunter-gatherers and Western subjects may be colonized by a range of microbial eukaryotes, only three fungal families were retrieved. These results highlight the pitfalls of using taxon-specific reference databases for metagenome classification, even when they are comprised of curated whole genome data. We propose that databases containing all domains of life provide the most suitable option for metagenomic species profiling, especially when targeting microbial eukaryotes.A recent article in BMC Genomics describes a new bioinformatics tool, HumanMycobiomeScan, to classify fungal taxa in metagenomic samples. This tool was used to characterize the gut mycobiome of hunter-gatherers and Western populations, resulting in the identification of a range of fungal species in the vast majority of samples. In the HumanMycobiomeScan pipeline, sequence reads are mapped against a reference database containing fungal genome sequences only. We argue that using reference databases comprised of a single taxonomic group leads to an unacceptably high number of false-positives due to: (i) mapping to conserved genetic regions in reference genomes, and (ii) sequence contamination in the assembled reference genomes. To demonstrate this, we replaced the HumanMycobiomeScan's fungal reference database with one containing genome sequences of amphibians and reptiles and re-analysed their case study. The classification pipeline recovered all species present in the reference database, revealing turtles (Geoemydidae), bull frogs (Pyxicephalidae) and snakes (Colubridae) as the most abundant herpetological taxa in the human gut. We also re-analysed their case study using a kingdom-agnostic pipeline. This revealed that while the gut of hunter-gatherers and Western subjects may be colonized by a range of microbial eukaryotes, only three fungal families were retrieved. These results highlight the pitfalls of using taxon-specific reference databases for metagenome classification, even when they are comprised of curated whole genome data. We propose that databases containing all domains of life provide the most suitable option for metagenomic species profiling, especially when targeting microbial eukaryotes.
Abstract A recent article in BMC Genomics describes a new bioinformatics tool, HumanMycobiomeScan, to classify fungal taxa in metagenomic samples. This tool was used to characterize the gut mycobiome of hunter-gatherers and Western populations, resulting in the identification of a range of fungal species in the vast majority of samples. In the HumanMycobiomeScan pipeline, sequence reads are mapped against a reference database containing fungal genome sequences only. We argue that using reference databases comprised of a single taxonomic group leads to an unacceptably high number of false-positives due to: (i) mapping to conserved genetic regions in reference genomes, and (ii) sequence contamination in the assembled reference genomes. To demonstrate this, we replaced the HumanMycobiomeScan’s fungal reference database with one containing genome sequences of amphibians and reptiles and re-analysed their case study. The classification pipeline recovered all species present in the reference database, revealing turtles (Geoemydidae), bull frogs (Pyxicephalidae) and snakes (Colubridae) as the most abundant herpetological taxa in the human gut. We also re-analysed their case study using a kingdom-agnostic pipeline. This revealed that while the gut of hunter-gatherers and Western subjects may be colonized by a range of microbial eukaryotes, only three fungal families were retrieved. These results highlight the pitfalls of using taxon-specific reference databases for metagenome classification, even when they are comprised of curated whole genome data. We propose that databases containing all domains of life provide the most suitable option for metagenomic species profiling, especially when targeting microbial eukaryotes.
ArticleNumber 184
Audience Academic
Author R. Marcelino, Vanessa
Sorrell, Tania C.
Holmes, Edward C.
Author_xml – sequence: 1
  givenname: Vanessa
  orcidid: 0000-0003-1755-0597
  surname: R. Marcelino
  fullname: R. Marcelino, Vanessa
  email: vrmarcelino@gmail.com
  organization: Marie Bashir Institute for Infectious Diseases and Biosecurity and Faculty of Medicine and Health, Sydney Medical School, Westmead Clinical School, The University of Sydney, Centre for Infectious Diseases and Microbiology, Westmead Institute for Medical Research, School of Life & Environmental Sciences, Charles Perkins Centre, The University of Sydney
– sequence: 2
  givenname: Edward C.
  surname: Holmes
  fullname: Holmes, Edward C.
  organization: Marie Bashir Institute for Infectious Diseases and Biosecurity and Faculty of Medicine and Health, Sydney Medical School, Westmead Clinical School, The University of Sydney, School of Life & Environmental Sciences, Charles Perkins Centre, The University of Sydney
– sequence: 3
  givenname: Tania C.
  surname: Sorrell
  fullname: Sorrell, Tania C.
  organization: Marie Bashir Institute for Infectious Diseases and Biosecurity and Faculty of Medicine and Health, Sydney Medical School, Westmead Clinical School, The University of Sydney, Centre for Infectious Diseases and Microbiology, Westmead Institute for Medical Research
BackLink https://www.ncbi.nlm.nih.gov/pubmed/32106809$$D View this record in MEDLINE/PubMed
BookMark eNp9kluL1DAcxYusuBf9AL5IwRd96JqkubQvwrJ4GVgQdHwOafpPN0MnGZNU1m9vOrPrbheVUpqmv3PSnJzT4sh5B0XxEqNzjBv-LmLScFohgirOWlKRJ8UJpgJXBHN69GB8XJzGuEEIi4awZ8VxTTDiDWpPivX6GsopQulNmdSNd1XcgbbG6jKAgQBOQ9mrpDoVIZbab3fBb-083kJSA7j8pks9qhhnlUrWu-fFU6PGCC9un2fF948f1pefq6svn1aXF1eV5rVIFeOYcagpQ5oYyrVmdYeork2HNcOComZ_13lrpO4bJFinTM1UBrnqDanPitXBt_dqI3fBblX4Jb2ycj_hwyBVSFaPIGlH1LxraEhPVdt0RvUciIBWtBQhmr3eH7x2U7eFXoNLQY0L0-UXZ6_l4H9KgShjmGeDN7cGwf-YICaZY9IwjsqBn6IkNW8pIZy0GX39CN34KbgcVaZE01COG3ZPDSpvwDrj87p6NpUXPB8qEVTMGZz_hcpXD_lgcl-MzfMLwduFIDMJbtKgphjl6tvXJfvqYSh_0rjrTwbEAdDBx5gbI7VN-w7kv7CjxEjOTZWHpsrcVDk3Vc7W-JHyzvx_GnLQxMy6AcJ9bv8W_Qae8fd8
CitedBy_id crossref_primary_10_1002_ecy_3607
crossref_primary_10_7717_peerj_17893
crossref_primary_10_1099_mgen_0_000654
crossref_primary_10_3390_ijms23179834
crossref_primary_10_1016_j_fsigen_2021_102627
crossref_primary_10_3389_fmicb_2024_1336532
crossref_primary_10_1177_17562848211047130
crossref_primary_10_1038_s41525_025_00482_8
crossref_primary_10_3389_fmicb_2021_685935
crossref_primary_10_1128_mbio_02444_21
crossref_primary_10_1002_edn3_597
crossref_primary_10_1016_j_csbj_2020_11_049
crossref_primary_10_1099_mgen_0_000547
crossref_primary_10_1371_journal_pone_0292077
crossref_primary_10_1002_gea_22030
crossref_primary_10_1093_bioadv_vbae016
crossref_primary_10_1016_j_forsciint_2022_111462
crossref_primary_10_1186_s13059_020_02014_2
crossref_primary_10_1186_s42523_022_00207_7
crossref_primary_10_3389_fmicb_2022_854423
crossref_primary_10_1038_s41597_025_04469_z
crossref_primary_10_3389_fbinf_2024_1278228
crossref_primary_10_1186_s12864_023_09338_w
crossref_primary_10_1007_s00248_023_02230_w
crossref_primary_10_1371_journal_pcbi_1009581
crossref_primary_10_1016_j_csbj_2022_09_024
crossref_primary_10_1099_mgen_0_000949
crossref_primary_10_1186_s40168_023_01505_1
Cites_doi 10.1186/s12864-019-5883-y
10.1016/j.cub.2015.04.055
10.1038/nature20167
10.1101/641332
10.1371/journal.pone.0016410
10.1101/gr.245373.118
10.1186/s12859-018-2336-6
10.1038/s41564-017-0012-7
10.1101/712166
10.1093/bib/bbx120
10.1126/science.319.5870.1616a
10.1038/nmeth.1923
10.1186/gb-2014-15-3-r46
10.1093/nar/25.17.3389
10.1186/s40168-019-0638-1
10.1038/s41564-018-0202-y
10.1371/journal.pone.0192898
10.7717/peerj.675
ContentType Journal Article
Copyright The Author(s). 2020
COPYRIGHT 2020 BioMed Central Ltd.
2020. This work is licensed under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
Copyright_xml – notice: The Author(s). 2020
– notice: COPYRIGHT 2020 BioMed Central Ltd.
– notice: 2020. This work is licensed under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
DBID C6C
AAYXX
CITATION
CGR
CUY
CVF
ECM
EIF
NPM
ISR
3V.
7QP
7QR
7SS
7TK
7U7
7X7
7XB
88E
8AO
8FD
8FE
8FH
8FI
8FJ
8FK
ABUWG
AEUYN
AFKRA
AZQEC
BBNVY
BENPR
BHPHI
C1K
CCPQU
DWQXO
FR3
FYUFA
GHDGH
GNUQQ
HCIFZ
K9.
LK8
M0S
M1P
M7P
P64
PHGZM
PHGZT
PIMPY
PJZUB
PKEHL
PPXIY
PQEST
PQGLB
PQQKQ
PQUKI
PRINS
RC3
7X8
5PM
DOA
DOI 10.1186/s12864-020-6592-2
DatabaseName Springer Nature OA Free Journals
CrossRef
Medline
MEDLINE
MEDLINE (Ovid)
MEDLINE
MEDLINE
PubMed
Gale In Context: Science
ProQuest Central (Corporate)
Calcium & Calcified Tissue Abstracts
Chemoreception Abstracts
Entomology Abstracts (Full archive)
Neurosciences Abstracts
Toxicology Abstracts
Health & Medical Collection
ProQuest Central (purchase pre-March 2016)
Medical Database (Alumni Edition)
ProQuest Pharma Collection
Technology Research Database
ProQuest SciTech Collection
ProQuest Natural Science Collection
Hospital Premium Collection
Hospital Premium Collection (Alumni Edition)
ProQuest Central (Alumni) (purchase pre-March 2016)
ProQuest Central (Alumni)
ProQuest One Sustainability
ProQuest Central UK/Ireland
ProQuest Central Essentials
Biological Science Collection
ProQuest Central
Natural Science Collection
Environmental Sciences and Pollution Management
ProQuest One
ProQuest Central Korea
Engineering Research Database
Health Research Premium Collection
Health Research Premium Collection (Alumni)
ProQuest Central Student
SciTech Premium Collection
ProQuest Health & Medical Complete (Alumni)
Biological Sciences
Health & Medical Collection (Alumni)
Medical Database
Biological Science Database
Biotechnology and BioEngineering Abstracts
ProQuest Central Premium
ProQuest One Academic (New)
Publicly Available Content Database
ProQuest Health & Medical Research Collection
ProQuest One Academic Middle East (New)
ProQuest One Health & Nursing
ProQuest One Academic Eastern Edition (DO NOT USE)
ProQuest One Applied & Life Sciences
ProQuest One Academic
ProQuest One Academic UKI Edition
ProQuest Central China
Genetics Abstracts
MEDLINE - Academic
PubMed Central (Full Participant titles)
DOAJ Directory of Open Access Journals
DatabaseTitle CrossRef
MEDLINE
Medline Complete
MEDLINE with Full Text
PubMed
MEDLINE (Ovid)
Publicly Available Content Database
ProQuest Central Student
Technology Research Database
ProQuest One Academic Middle East (New)
ProQuest Central Essentials
ProQuest Health & Medical Complete (Alumni)
ProQuest Central (Alumni Edition)
SciTech Premium Collection
ProQuest One Community College
ProQuest One Health & Nursing
ProQuest Natural Science Collection
ProQuest Pharma Collection
ProQuest Central China
Environmental Sciences and Pollution Management
ProQuest Central
ProQuest One Applied & Life Sciences
ProQuest One Sustainability
ProQuest Health & Medical Research Collection
Genetics Abstracts
Health Research Premium Collection
Health and Medicine Complete (Alumni Edition)
Natural Science Collection
ProQuest Central Korea
Health & Medical Research Collection
Biological Science Collection
Chemoreception Abstracts
ProQuest Central (New)
ProQuest Medical Library (Alumni)
ProQuest Biological Science Collection
Toxicology Abstracts
ProQuest One Academic Eastern Edition
ProQuest Hospital Collection
Health Research Premium Collection (Alumni)
Biological Science Database
ProQuest SciTech Collection
Neurosciences Abstracts
ProQuest Hospital Collection (Alumni)
Biotechnology and BioEngineering Abstracts
Entomology Abstracts
ProQuest Health & Medical Complete
ProQuest Medical Library
ProQuest One Academic UKI Edition
Engineering Research Database
ProQuest One Academic
Calcium & Calcified Tissue Abstracts
ProQuest One Academic (New)
ProQuest Central (Alumni)
MEDLINE - Academic
DatabaseTitleList


Publicly Available Content Database
MEDLINE

MEDLINE - Academic


CrossRef
Database_xml – sequence: 1
  dbid: C6C
  name: Springer Nature OA Free Journals
  url: http://www.springeropen.com/
  sourceTypes: Publisher
– sequence: 2
  dbid: DOA
  name: Directory of Open Access Journals
  url: https://www.doaj.org/
  sourceTypes: Open Website
– sequence: 3
  dbid: NPM
  name: PubMed
  url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 4
  dbid: EIF
  name: MEDLINE
  url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search
  sourceTypes: Index Database
– sequence: 5
  dbid: BENPR
  name: ProQuest Central
  url: https://www.proquest.com/central
  sourceTypes: Aggregation Database
DeliveryMethod fulltext_linktorsrc
Discipline Biology
EISSN 1471-2164
EndPage 5
ExternalDocumentID oai_doaj_org_article_4b2a3210e82d4a98bfad6e27e9794004
PMC7045516
A616427472
32106809
10_1186_s12864_020_6592_2
Genre Journal Article
GeographicLocations Italy
Tanzania
Australia
GeographicLocations_xml – name: Tanzania
– name: Italy
– name: Australia
GroupedDBID ---
0R~
23N
2WC
2XV
53G
5VS
6J9
7X7
88E
8AO
8FE
8FH
8FI
8FJ
AAFWJ
AAHBH
AAJSJ
AASML
ABDBF
ABUWG
ACGFO
ACGFS
ACIHN
ACIWK
ACPRK
ACUHS
ADBBV
ADUKV
AEAQA
AENEX
AEUYN
AFKRA
AFPKN
AFRAH
AHBYD
AHMBA
AHYZX
ALMA_UNASSIGNED_HOLDINGS
AMKLP
AMTXH
AOIJS
BAPOH
BAWUL
BBNVY
BCNDV
BENPR
BFQNJ
BHPHI
BMC
BPHCQ
BVXVI
C6C
CCPQU
CS3
DIK
DU5
E3Z
EAD
EAP
EAS
EBD
EBLON
EBS
EMB
EMK
EMOBN
ESX
F5P
FYUFA
GROUPED_DOAJ
GX1
HCIFZ
HMCUK
HYE
IAO
IGS
IHR
INH
INR
ISR
ITC
KQ8
LK8
M1P
M48
M7P
M~E
O5R
O5S
OK1
OVT
P2P
PGMZT
PHGZM
PHGZT
PIMPY
PJZUB
PPXIY
PQGLB
PQQKQ
PROAC
PSQYO
PUEGO
RBZ
RNS
ROL
RPM
RSV
SBL
SOJ
SV3
TR2
TUS
U2A
UKHRP
W2D
WOQ
WOW
XSB
AAYXX
ALIPV
CITATION
-A0
3V.
ACRMQ
ADINQ
AIXEN
C24
CGR
CUY
CVF
ECM
EIF
NPM
PMFND
7QP
7QR
7SS
7TK
7U7
7XB
8FD
8FK
AZQEC
C1K
DWQXO
FR3
GNUQQ
K9.
P64
PKEHL
PQEST
PQUKI
PRINS
RC3
7X8
5PM
ID FETCH-LOGICAL-c637t-56156e3450c2f46cc53b04c3fb1c51740874087359223d8075baf35acc56adf23
IEDL.DBID U2A
ISSN 1471-2164
IngestDate Wed Aug 27 01:24:57 EDT 2025
Thu Aug 21 13:48:38 EDT 2025
Sun Aug 24 03:11:46 EDT 2025
Fri Jul 25 10:32:33 EDT 2025
Tue Jun 17 21:50:50 EDT 2025
Tue Jun 10 20:49:04 EDT 2025
Fri Jun 27 03:58:49 EDT 2025
Thu Jan 02 22:37:41 EST 2025
Tue Jul 01 00:39:07 EDT 2025
Thu Apr 24 22:54:17 EDT 2025
Sat Sep 06 07:21:42 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 1
Keywords Fungi
Metagenomic classifier
Reference database
Misclassification
Assembly errors
Microbiome
Language English
License Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c637t-56156e3450c2f46cc53b04c3fb1c51740874087359223d8075baf35acc56adf23
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ORCID 0000-0003-1755-0597
OpenAccessLink https://link.springer.com/10.1186/s12864-020-6592-2
PMID 32106809
PQID 2378846185
PQPubID 44682
PageCount 5
ParticipantIDs doaj_primary_oai_doaj_org_article_4b2a3210e82d4a98bfad6e27e9794004
pubmedcentral_primary_oai_pubmedcentral_nih_gov_7045516
proquest_miscellaneous_2369422629
proquest_journals_2378846185
gale_infotracmisc_A616427472
gale_infotracacademiconefile_A616427472
gale_incontextgauss_ISR_A616427472
pubmed_primary_32106809
crossref_citationtrail_10_1186_s12864_020_6592_2
crossref_primary_10_1186_s12864_020_6592_2
springer_journals_10_1186_s12864_020_6592_2
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 2020-02-27
PublicationDateYYYYMMDD 2020-02-27
PublicationDate_xml – month: 02
  year: 2020
  text: 2020-02-27
  day: 27
PublicationDecade 2020
PublicationPlace London
PublicationPlace_xml – name: London
– name: England
PublicationTitle BMC genomics
PublicationTitleAbbrev BMC Genomics
PublicationTitleAlternate BMC Genomics
PublicationYear 2020
Publisher BioMed Central
BioMed Central Ltd
BMC
Publisher_xml – name: BioMed Central
– name: BioMed Central Ltd
– name: BMC
References DE Wood (6592_CR9) 2016; 15
6592_CR10
P Clausen (6592_CR13) 2018; 19
6592_CR12
DH Parks (6592_CR1) 2017; 2
M Soverini (6592_CR5) 2019; 20
MC de Goffau (6592_CR20) 2018; 3
B Langmead (6592_CR6) 2012; 9
FP Breitwieser (6592_CR4) 2017; 20
S Rampelli (6592_CR11) 2015; 25
FP Breitwieser (6592_CR18) 2019; 29
SF Altschul (6592_CR15) 1997; 25
S Merchant (6592_CR19) 2014; 2
PD Donovan (6592_CR8) 2018; 13
D Ainsworth (6592_CR16) 2017; 45
MR Olm (6592_CR3) 2019; 7
MS Longo (6592_CR17) 2011; 6
MI Bidartondo (6592_CR14) 2008; 319
6592_CR7
M Shi (6592_CR2) 2016; 540
References_xml – volume: 20
  start-page: 496
  year: 2019
  ident: 6592_CR5
  publication-title: BMC Genomics
  doi: 10.1186/s12864-019-5883-y
– volume: 25
  start-page: 1682
  year: 2015
  ident: 6592_CR11
  publication-title: Curr Biol
  doi: 10.1016/j.cub.2015.04.055
– volume: 540
  start-page: 539
  year: 2016
  ident: 6592_CR2
  publication-title: Nature
  doi: 10.1038/nature20167
– ident: 6592_CR12
  doi: 10.1101/641332
– volume: 6
  start-page: e16410
  year: 2011
  ident: 6592_CR17
  publication-title: PLoS One
  doi: 10.1371/journal.pone.0016410
– volume: 29
  start-page: 954
  year: 2019
  ident: 6592_CR18
  publication-title: Genome Res
  doi: 10.1101/gr.245373.118
– volume: 19
  start-page: 307
  year: 2018
  ident: 6592_CR13
  publication-title: BMC Bioinformatics
  doi: 10.1186/s12859-018-2336-6
– volume: 2
  start-page: 1533
  year: 2017
  ident: 6592_CR1
  publication-title: Nat Microbiol
  doi: 10.1038/s41564-017-0012-7
– ident: 6592_CR10
  doi: 10.1101/712166
– volume: 20
  start-page: 1125
  year: 2017
  ident: 6592_CR4
  publication-title: Brief Bioinform
  doi: 10.1093/bib/bbx120
– volume: 319
  start-page: 1616
  year: 2008
  ident: 6592_CR14
  publication-title: Science
  doi: 10.1126/science.319.5870.1616a
– volume: 9
  start-page: 357
  year: 2012
  ident: 6592_CR6
  publication-title: Nat Methods
  doi: 10.1038/nmeth.1923
– volume: 15
  start-page: R46
  year: 2016
  ident: 6592_CR9
  publication-title: Genome Biol
  doi: 10.1186/gb-2014-15-3-r46
– volume: 25
  start-page: 3389
  year: 1997
  ident: 6592_CR15
  publication-title: Nucleic Acids Res
  doi: 10.1093/nar/25.17.3389
– ident: 6592_CR7
– volume: 45
  start-page: 1649
  year: 2017
  ident: 6592_CR16
  publication-title: Nucleic Acids Res
– volume: 7
  start-page: 26
  year: 2019
  ident: 6592_CR3
  publication-title: Microbiome
  doi: 10.1186/s40168-019-0638-1
– volume: 3
  start-page: 851
  year: 2018
  ident: 6592_CR20
  publication-title: Nat Microbiol
  doi: 10.1038/s41564-018-0202-y
– volume: 13
  start-page: e0192898
  year: 2018
  ident: 6592_CR8
  publication-title: PLoS One
  doi: 10.1371/journal.pone.0192898
– volume: 2
  start-page: e675
  year: 2014
  ident: 6592_CR19
  publication-title: PeerJ
  doi: 10.7717/peerj.675
SSID ssj0017825
Score 2.4535964
Snippet A recent article in BMC Genomics describes a new bioinformatics tool, HumanMycobiomeScan, to classify fungal taxa in metagenomic samples. This tool was used to...
A recent article in BMC Genomics describes a new bioinformatics tool, HumanMycobiomeScan, to classify fungal taxa in metagenomic samples. This tool was used to...
Abstract A recent article in BMC Genomics describes a new bioinformatics tool, HumanMycobiomeScan, to classify fungal taxa in metagenomic samples. This tool...
SourceID doaj
pubmedcentral
proquest
gale
pubmed
crossref
springer
SourceType Open Website
Open Access Repository
Aggregation Database
Index Database
Enrichment Source
Publisher
StartPage 184
SubjectTerms Amphibians
Amphibians - classification
Amphibians - genetics
Animal Genetics and Genomics
Animals
Assembly errors
Bacteria
Bacteria - classification
Bacteria - genetics
Bioinformatics
Biomedical and Life Sciences
Case studies
Classification
Colubridae
Computational biology
Computational Biology - methods
Conserved sequence
Correspondence
Criminal investigation
Data Curation
Deoxyribonucleic acid
Diet
DNA
DNA Barcoding, Taxonomic - methods
Eukaryote microbial genomics
Evaluation
Feces - chemistry
Feces - microbiology
Fungi
Fungi - classification
Fungi - genetics
Gene mapping
Genetic aspects
Genomes
Genomics
Geoemydidae
Humans
Hunter-gatherers
Hunting and gathering societies
Identification
Identification and classification
Italy
Life Sciences
Metagenomic classifier
Metagenomics
Metagenomics - methods
Microarrays
Microbial Genetics and Genomics
Microbiome
Microorganisms
Misclassification
Nucleotide sequence
Organisms
Pipelines
Plant Genetics and Genomics
Proteomics
Pyxicephalus
Reference database
Reptiles
Reptiles - classification
Reptiles - genetics
Snakes
Species
Tanzania
Taxa
Taxonomy
Turtles
SummonAdditionalLinks – databaseName: DOAJ Directory of Open Access Journals
  dbid: DOA
  link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV3Ni9UwEA-yIHgR18_qKlUEQSnb5qvpcRWX1YMH3YW9hXyq4Lay7YP1v3cmzavbFfXi4UF5mdBmfpNkhkx-Q8hzYTvmlEI-PE8rbmEqWtf5SrSe-Ub4Rs5Zvh_k0Ql_fypOL5X6wpywmR54Vtw-t9TgPZOgqOemUzYaLwNtQ9diTe_EBFp39TaYyucHsO-JfIbZKLk_wioseZVCJdHRiq52oUTW__uSfGlPupoveeXQNO1Fh7fIzexElgfzx--Sa6G_Ta7PZSV_3CHHgH25GUM5xHIyF0Nf4X1KzAkql6oiJaaG4hY2lphVfj4A3vB8FiaDtK1nIOvQscZeCby75OTw7fGboypXT6icZO1UgWMkZGBc1I5GLp0TzNbcsWgbh_TUtUo_BvqgzCMnsTWRCQOC0vhI2T2y0w99eEBKsDYmYwydsJ5bHlXbsmiUsIB0Q0MsSL3VpnaZWhwrXHzTKcRQUs8AaABAIwCaFuTl0uX7zKvxN-HXCNEiiJTY6Q8wFJ0NRf_LUAryDAHWSHrRY1bNZ7MZR_3u00d9ICFoxPAc3vQiC8UBRuBMvqQAekCerJXk3koSUHLr5q0d6bwqjJoieT-X4CIV5OnSjD0x060PwwZlZIe3m2lXkPuz2S3jxuFJVUNLuzLIlWLWLf3XL4kzvAXXXTSyIK-2pvvrs_6o94f_Q--PyA2KEw9pANo9sjOdb8JjcOQm-yTN2Z-J60HR
  priority: 102
  providerName: Directory of Open Access Journals
– databaseName: Health & Medical Collection
  dbid: 7X7
  link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV3da9RAEF-0IvgifhutEkUQlNBkv5I8SRVL9cEHbeHelv1sBZvUSw70v3dms5eain04OG5nuezM7M5MduY3hLwSpmW2aRAPz9GCG9iKxrauELVjrhKuklOW7xd5eMw_r8QqvXAbUlrl9kyMB7XrLb4j36MIfM4lmJd35z8L7BqFt6uphcZ1ciNCl4E-16s54KrA-ol0k1k1cm-As1jyIgZMoqUFXdiiCNn_78H8l2W6nDV56eo0WqSDO-R2ciXz_Un2d8k1390jN6fmkr_vkyPQgHwz-LwP-ah_9V2BVZWYGZTPvUVyTBBFQzbkmFu-7kHq8P3MjxrBW8-A1qJ7jbOiCB-Q44OPRx8Oi9RDobCS1WMB7pGQnnFRWhq4tFYwU3LLgqksglSXTfww4AdlDpGJjQ5MaCCU2gXKHpKdru_8Y5KDzjEZgm-Fcdzw0NQ1C7oRBuRdUR8yUm65qWwCGMc-Fz9UDDQaqSYBKBCAQgEompE385TzCV3jKuL3KKKZEIGx4w_9-kSlfaa4oRrLknxDHddtY4J20tPatzW2gOcZeYkCVgh90WFuzYneDIP69O2r2pcQOmKQDv_0OhGFHlZgdSpVAD4gWtaCcndBCVKyy-GtHql0NgzqQpMz8mIexpmY79b5foM0ssUaZ9pm5NGkdvO6cXmyKWGkXijkgjHLke77aUQOr8GBF5XMyNut6l481n_5_uTqRTwltyhuKSzzr3fJzrje-GfgqI3medyNfwDn9Tmz
  priority: 102
  providerName: ProQuest
– databaseName: Scholars Portal Journals: Open Access
  dbid: M48
  link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV3da9UwFA9jIvgi87s6pYogKNU2X00fRKY4pqAPugt7C0mabMLWztte2P57z0l7Ozun4MOFcnNCm_ORcw45-R1CngtbMacU4uHVNOMWTNG6qs5EWbO6EHUhhyrfr3JvwT8fiIMNsm5vNTKwuzK1w35Si-Xx67Of5-_A4N9Gg1fyTQd7rORZTIRERTPYka-BY5KYi33hF4cK4AzFeLB55bSZa4oI_n_u0785qstFlJdOUqOD2t0iN8fIMt0ZVOEW2fDNbXJ96DV5fofsg0Kkq86nbUh7c9Y2GV6yxEKhdGo1kmK9KPq1LsVS82ULSgDPJ743iOV6ArQOo22cFSV6lyx2P-5_2MvGlgqZk6zsM4iWhPSMi9zRwKVzgtmcOxZs4RCzOlfxx4AflNUIVGxNYMIAoTR1oOwe2Wzaxj8gKaggkyH4StiaWx5UWbJglLAg_oL6kJB8zU3tRrxxbHtxrGPeoaQeBKBBABoFoGlCXk5TTgewjX8Rv0cRTYSIkx3_aJeHejQ7zS01eEvJK1pzUykbTC09LX1VYkd4npBnKGCNSBgNltocmlXX6U_fv-kdCZkk5uzwphcjUWhhBc6MNxeADwieNaPcnlGClNx8eK1Heq3pmiKiP5cQNyXk6TSMM7H8rfHtCmlkhVeeaZWQ-4PaTevG5UmVw0g5U8gZY-YjzY-jCCReQjwvCpmQV2vVvfisv_L94X9RPyI3KFoYggCU22SzX678YwjjevskGucv7mBAqg
  priority: 102
  providerName: Scholars Portal
Title The use of taxon-specific reference databases compromises metagenomic classification
URI https://link.springer.com/article/10.1186/s12864-020-6592-2
https://www.ncbi.nlm.nih.gov/pubmed/32106809
https://www.proquest.com/docview/2378846185
https://www.proquest.com/docview/2369422629
https://pubmed.ncbi.nlm.nih.gov/PMC7045516
https://doaj.org/article/4b2a3210e82d4a98bfad6e27e9794004
Volume 21
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV3dixMxEA96h-CL-O3qWVYRBGWxzfc-9sodp-Ah5xX6FpJscgrernRb0P_emex2dc8P8KGlbSa0yW8mmTQzvyHkhXAl81ojH15FC-7AFJ0vq0KoilUzUc1kF-V7Kk-W_N1KrPo87nYX7b67kkwrdTJrLd-0sJJKXqTjjihpAevuvoCjO1rjks6HqwPY8kR_ffnHbqMNKPH0_74a_7IdXQ2VvHJfmrah49vkVu8_5vMO8DvkWqjvkhtdRcnv98g5wJ5v25A3Md_Yb01dYColhgPlQ0GRHKNCcfdqcwwoXzcANby-DBuLjK2XIOvRp8ZeCbf7ZHl8dL44KfrCCYWXTG0K8ImEDIyLqaeRS-8Fc1PuWXQzj8zUU50eDOaDsgrpiJ2NTFgQlLaKlD0ge3VTh0ckB0VjMsZQCldxx6NWikWrhQOQZzTEjEx3s2l8zyqOxS2-mHS60NJ0ABgAwCAAhmbk1dDla0ep8S_hQ4RoEEQ27PRBs74wvXEZ7qjFXKSgacVtqV20lQxUhVJh3XeekecIsEG-ixoDai7stm3N249nZi7hvIgnc_iml71QbGAE3vb5CTAPSJE1kjwYSQJKfty80yPTLwitocjbzyV4Rxl5NjRjTwxyq0OzRRlZYmIzLTPysFO7Ydw4PKmn0KJGCjmamHFL_flTogtX4LWLmczI653q_vxZf533x_8l_YTcpGhhmOqvDsjeZr0NT8FZ27gJua5WakL2D49OP5zBu4VcTNIfH_D8nutJMt4fmmY7Yw
linkProvider Springer Nature
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV3db9MwELemTQheEN8EBgQEQgJFaxzbSR4Q2mBTy0aFRiftzfhzILFkNK1g_xR_I3f5Ghlib3uoVNXntvb9fGfHd78j5DnXeWKyDPnwLI2YhqWoTW4jntrExtzGoonynYrxAftwyA9XyO8uFwbDKjubWBtqWxp8Rr5BkficCXAvb09-RFg1Cm9XuxIaDSx23elPOLJVbybvQb8vKN3Znr0bR21VgciIJF1EsGHgwiWMjwz1TBjDEz1iJvE6NkjbPMrqV8Jz8JwWuXq18glXICiU9Uh0ACZ_jWFG6ypZ29qeftrv7y3A3_L27jTOxEYF1l-wqD6iwfdFdOD96iIB_7qCv3zh-TjNc5e1tQ_cuUGut5vXcLNB202y4opb5EpTzvL0NpkB5sJl5cLShwv1qywizOPEWKSwr2YSYkgqus4qxGj2eQk4g_fHbqGQLvYYZA1u6LFXDZo75OBS5vcuWS3Kwt0nIaA8Ed67nGvLNPNZmiZeZVwDwmLqfEBG3WxK01KaY2WN77I-2mRCNgqQoACJCpA0IK_6LicNn8dFwluool4QqbjrD8r5kWxXtmSaKkyEchm1TOWZ9soKR1OXp1h0ngXkGSpYItlGgdE8R2pZVXLyeV9uCjis4mMB-KWXrZAvYQRGtckRMA_IzzWQXB9IgpbMsLnDkWytUSXP1k5AnvbN2BMj7ApXLlFG5JhVTfOA3Gtg148bhyeyEbSkA0AOJmbYUnz7WnOVp3Bk4LEIyOsOumd_67_z_uDiQTwhV8ezj3tybzLdfUiuUVxeSDKQrpPVxXzpHsE2caEft2szJF8u2xz8Ad_8djI
linkToPdf http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1ba9VAEF6kovgi3pu2ahRBUEKTvWXzWKuHVqWIttC3Za9VsEk5yQH9984kOdHUC_hwIJydJdmZ2Z1ZZuYbQp4JWzGnFOLheZpxC1vRuspnovTMF8IXcsjyPZIHJ_ztqTgd-5y262z3dUhyqGlAlKa6273wcdjiSu62cKpKnvVXH1HRDM7gqxwtH0Zr5f4URgDzJ8ZQ5h-nzYxRj9n_-8n8i2m6nDZ5KXbam6TFLXJz9CXTvUH4t8mVUN8h14bukt_vkmNQgXTVhrSJaWe-NXWGZZWYGpROzUVSzBBFS9ammFy-bEDs8HweOoPoredA69C_xlm9DO-Rk8Wb4_2DbGyikDnJyi4D_0jIwLjIHY1cOieYzblj0RYOUapz1f8Y8IMyj9DE1kQmDBBK4yNl98lG3dRhk6SgdEzGGCphPbc8qrJk0ShhQeAFDTEh-Zqb2o0I49jo4qvubxpK6kEAGgSgUQCaJuTFNOVigNf4F_ErFNFEiMjY_R_N8kyPG01zSw3WJQVFPTeVstF4GWgZqhJ7wPOEPEUBa8S-qDG55sys2lYffvqo9yTcHfGWDm96PhLFBlbgzFirAHxAuKwZ5c6MEqTk5sNrPdLj4dBqihj-XIKnlJAn0zDOxIS3OjQrpJEVFjnTKiEPBrWb1o3LkyqHkXKmkDPGzEfqL5976PASPHhRyIS8XKvuz8_6K9-3_ov6Mbn-4fVCvz88erdNblDcbIgAUO6QjW65Cg_Bh-vso36f_gCSiz1P
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=The+use+of+taxon-specific+reference+databases+compromises+metagenomic+classification&rft.jtitle=BMC+genomics&rft.au=R.+Marcelino%2C+Vanessa&rft.au=Holmes%2C+Edward+C.&rft.au=Sorrell%2C+Tania+C.&rft.date=2020-02-27&rft.pub=BioMed+Central&rft.eissn=1471-2164&rft.volume=21&rft.issue=1&rft_id=info:doi/10.1186%2Fs12864-020-6592-2&rft.externalDocID=10_1186_s12864_020_6592_2
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1471-2164&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1471-2164&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1471-2164&client=summon