Spatial Modeling of Daily PM2.5, NO2, and CO Concentrations Measured by a Low-Cost Sensor Network: Comparison of Linear, Machine Learning, and Hybrid Land Use Models

Previous studies have characterized spatial patterns of pollution with land use regression (LUR) models from distributed passive or filter samplers at low temporal resolution. Large-scale deployment of low-cost sensors (LCS), which typically sample in real time, may enable time-resolved or real-time...

Full description

Saved in:
Bibliographic Details
Published inEnvironmental science & technology Vol. 55; no. 13; pp. 8631 - 8641
Main Authors Jain, Sakshi, Presto, Albert A., Zimmerman, Naomi
Format Journal Article
LanguageEnglish
Published Easton American Chemical Society 06.07.2021
Subjects
Online AccessGet full text
ISSN0013-936X
1520-5851
1520-5851
DOI10.1021/acs.est.1c02653

Cover

Loading…
More Information
Summary:Previous studies have characterized spatial patterns of pollution with land use regression (LUR) models from distributed passive or filter samplers at low temporal resolution. Large-scale deployment of low-cost sensors (LCS), which typically sample in real time, may enable time-resolved or real-time modeling of concentration surfaces. The aim of this study was to develop spatiotemporal models of PM2.5, NO2, and CO using an LCS network in Pittsburgh, Pennsylvania. We modeled daily average concentrations in August 2016–December 2017 across 50 sites. Land use variables included 13 time-independent (e.g., elevation) and time-dependent (e.g., temperature) predictors. We examined two models: LUR and a machine-learning-enabled land use model (land use random forest, LURF). The LURF models outperformed LUR models, with increase in the average externally cross-validated R 2 of 0.10–0.19. Using wavelet decomposition to separate short-lived events from the regional background, we also created time-decomposed LUR and LURF models. Compared to the standard model, this resulted in improvement in R 2 of up to 0.14. The time-decomposed models were more influenced by spatial parameters. Mapping our models across Allegheny County, we observed that time-decomposed LURF models created robust PM2.5 predictions, suggesting that this approach may improve our ability to map air pollutants at high spatiotemporal resolution.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ISSN:0013-936X
1520-5851
1520-5851
DOI:10.1021/acs.est.1c02653