Plant DNA barcoding necessitates marker-specific efforts to establish more comprehensive reference databases1

The problem of low species-level identification rates in plants by DNA barcoding is exacerbated by the fact that reference databases are far from being comprehensive. We investigate the impact of increased sampling depth on identification success by analyzing the efficacy of established plant barcod...

Full description

Saved in:
Bibliographic Details
Published inGenome Vol. 64; no. 3; pp. 265 - 298
Main Authors Kolter, Andreas, Gemeinholzer, Birgit
Format Journal Article
LanguageEnglish
Published 1840 Woodward Drive, Suite 1, Ottawa, ON K2C 0P7 NRC Research Press 2021
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:The problem of low species-level identification rates in plants by DNA barcoding is exacerbated by the fact that reference databases are far from being comprehensive. We investigate the impact of increased sampling depth on identification success by analyzing the efficacy of established plant barcode marker sequences (rbcL, matK, trnL-trnF, psbA-trnH, ITS). Adding sequences of the same species to the reference database led to an increase in correct species assignment of +10.9% for rbcL and +19.0% for ITS. Simultaneously, erroneous identification dropped from ∼40% to ∼12.5%. Despite its evolutionary constraints, ITS showed the highest identification rate and identification gain by increased sampling effort, which makes it a very suitable marker in the planning phase of a barcode study. The limited sequence availability of trnL-trnF is problematic for an otherwise very promising plastid plant barcoding marker. Future developments in machine learning algorithms have the potential to give new impetus to plant barcoding, but are dependent on extensive reference databases. We expect that our results will be incorporated into future plans for the development of DNA barcoding reference databases and will lead to these being developed with greater depth and taxonomic coverage.
ISSN:0831-2796
1480-3321
DOI:10.1139/gen-2019-0198