The positive effects of population-based preferential sampling in environmental epidemiology

In environmental epidemiology, exposures are not always available at subject locations and must be predicted using monitoring data. The monitor locations are often outside the control of researchers, and previous studies have shown that “preferential sampling” of monitoring locations can adversely a...

Full description

Saved in:
Bibliographic Details
Published inBiostatistics (Oxford, England) Vol. 17; no. 4; pp. 764 - 778
Main Authors Antonelli, Joseph, Cefalu, Matthew, Bornn, Luke
Format Journal Article
LanguageEnglish
Published England Oxford University Press 01.10.2016
Subjects
Online AccessGet full text
ISSN1465-4644
1468-4357
DOI10.1093/biostatistics/kxw026

Cover

Loading…
More Information
Summary:In environmental epidemiology, exposures are not always available at subject locations and must be predicted using monitoring data. The monitor locations are often outside the control of researchers, and previous studies have shown that “preferential sampling” of monitoring locations can adversely affect exposure prediction and subsequent health effect estimation. We adopt a slightly different definition of preferential sampling than is typically seen in the literature, which we call population-based preferential sampling. Population-based preferential sampling occurs when the location of the monitors is dependent on the subject locations. We show the impact that population-based preferential sampling has on exposure prediction and health effect estimation using analytic results and a simulation study. A simple, one-parameter model is proposed to measure the degree to which monitors are preferentially sampled with respect to population density. We then discuss these concepts in the context of PM2.5 and the EPA Air Quality System monitoring sites, which are generally placed in areas of higher population density to capture the population's exposure.
ISSN:1465-4644
1468-4357
DOI:10.1093/biostatistics/kxw026