A multivariate analysis to explain residue errors in pathogen concentration in wastewater-based epidemiology

With the beginning of the COVID-19 pandemic, wastewater-based epidemiology (WBE), which according to Larsen et al. (2021), describes the science of linking pathogens and chemicals found in wastewater to population-level health, received an enormous boost worldwide. The basic procedure in WBE is to a...

Full description

Saved in:
Bibliographic Details
Published inThe Science of the total environment Vol. 959; p. 178149
Main Authors Wallner, Markus, Müller, Omar V., Goméz, Andrea A., Joost, Ingeborg, Düker, Urda, Klawonn, Frank, Nogueira, Regina
Format Journal Article
LanguageEnglish
Published Netherlands Elsevier B.V 10.01.2025
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:With the beginning of the COVID-19 pandemic, wastewater-based epidemiology (WBE), which according to Larsen et al. (2021), describes the science of linking pathogens and chemicals found in wastewater to population-level health, received an enormous boost worldwide. The basic procedure in WBE is to analyse pathogen concentrations and to relate these measurements to cases from clinical data. This prediction of cases is subject to large errors, due to various factors such as dilution effects, decay or wastewater matrix and inhibitors. In this study we used different models to identify the most important, what we call, wastewater-based epidemiologically relevant parameters (WBERP) to describe these errors. We used linear regression and random forest regression as base models for predicting cases and random forest regression also to analyse the importance of different WBERP. Two catchments, one with a large proportion of combined sewers and one with separate sewers, served as study areas. Our results show that the most important information to be included in any model are the variants of concern (VOCs), a time-variable parameter. The performance for both catchments is improved by ~30 % in terms of root mean square error when the VOCs are used as additional information. For practical applications, this is a real drawback as it means that every time a new pathogen variant becomes dominant, we need to know the specific behaviour of the variant in the wastewater and its detection in order to interpret the WBE data correctly. This limits the predictive capabilities of such systems, perhaps not in terms of dynamics but for quantitative statements. The addition of other physicochemical parameters and faecal markers only marginally improved the results. Furthermore, there were differences in the importance of the parameters between the catchments, which limits the generalisability of the conclusions. The results show that more complex wastewater matrices (high proportion of combined sewer system) influence the relationship between pathogen concentration and medical cases more than those of less complex wastewater matrices (separate sewer system). [Display omitted] •Using RNA load instead of concentration only marginally improves predictions.•Variants of Concern (VOCs) were the most important secondary information.•Linear models are more suitable for separate sewers than for combined sewers.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:0048-9697
1879-1026
1879-1026
DOI:10.1016/j.scitotenv.2024.178149