Public health and pipe breaks in water distribution systems: Analysis with internet search volume as a proxy

Drinking water distribution infrastructure has been identified as a factor in waterborne disease outbreaks and improved understanding of the public health risks associated with distribution system failures has been identified as a priority area for research. Pipe breaks may pose a risk, as their occ...

Full description

Saved in:
Bibliographic Details
Published inWater research (Oxford) Vol. 53; pp. 26 - 34
Main Authors Shortridge, Julie E., Guikema, Seth D.
Format Journal Article
LanguageEnglish
Published England Elsevier Ltd 15.04.2014
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Drinking water distribution infrastructure has been identified as a factor in waterborne disease outbreaks and improved understanding of the public health risks associated with distribution system failures has been identified as a priority area for research. Pipe breaks may pose a risk, as their occurrence and repair can result in low or negative pressure, potentially allowing contamination of drinking water from adjacent soils. However, measuring this phenomenon is challenging because the most likely health impact is mild gastrointestinal (GI) illness, which is unlikely to result in a doctor or hospital visit. Here we present a novel method that uses data mining techniques and internet search volume to assess the relationship between pipe breaks and symptoms of GI illness in two U.S. cities. Weekly search volume for the terms diarrhea and vomiting was used as the response variable with the number of pipe breaks in each city as a covariate as well as additional covariates to control for seasonal patterns, search volume persistence, and other sources of GI illness. The fit and predictive accuracy of multiple regression and data mining techniques were compared, with the best performance obtained using random forest and bagged regression tree models. Pipe breaks were found to be an important and positively correlated predictor of internet search volume in multiple models in both cities, supporting previous investigations that indicated an increased risk of GI illness from distribution system disturbances. [Display omitted] •We regress internet searches for GI illness against pipe breaks and seasonal factors.•Regression tree models are able to accurately fit and predict search volume.•We observe a positive relationship between pipe breaks and searches in two cities.•Search volume persistence and seasonal trends were also observed in both cities.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:0043-1354
1879-2448
DOI:10.1016/j.watres.2014.01.013