Supervised machine learning outperforms taxonomy‐based environmental DNA metabarcoding applied to biomonitoring

Biodiversity monitoring is the standard for environmental impact assessment of anthropogenic activities. Several recent studies showed that high‐throughput amplicon sequencing of environmental DNA (eDNA metabarcoding) could overcome many limitations of the traditional morphotaxonomy‐based bioassessm...

Full description

Saved in:
Bibliographic Details
Published inMolecular ecology resources Vol. 18; no. 6; pp. 1381 - 1391
Main Authors Cordier, Tristan, Forster, Dominik, Dufresne, Yoann, Martins, Catarina I. M., Stoeck, Thorsten, Pawlowski, Jan
Format Journal Article
LanguageEnglish
Published England Wiley Subscription Services, Inc 01.11.2018
Wiley/Blackwell
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Biodiversity monitoring is the standard for environmental impact assessment of anthropogenic activities. Several recent studies showed that high‐throughput amplicon sequencing of environmental DNA (eDNA metabarcoding) could overcome many limitations of the traditional morphotaxonomy‐based bioassessment. Recently, we demonstrated that supervised machine learning (SML) can be used to predict accurate biotic indices values from eDNA metabarcoding data, regardless of the taxonomic affiliation of the sequences. However, it is unknown to which extent the accuracy of such models depends on taxonomic resolution of molecular markers or how SML compares with metabarcoding approaches targeting well‐established bioindicator species. In this study, we address these issues by training predictive models upon five different ribosomal bacterial and eukaryotic markers and measuring their performance to assess the environmental impact of marine aquaculture on independent data sets. Our results show that all tested markers are yielding accurate predictive models and that they all outperform the assessment relying solely on taxonomically assigned sequences. Remarkably, we did not find any significant difference in the performance of the models built using universal eukaryotic or prokaryotic markers. Using any molecular marker with a taxonomic range broad enough to comprise different potential bioindicator taxa, SML approach can overcome the limits of taxonomy‐based eDNA bioassessment.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:1755-098X
1755-0998
DOI:10.1111/1755-0998.12926