The positive effects of population-based preferential sampling in environmental epidemiology

In environmental epidemiology, exposures are not always available at subject locations and must be predicted using monitoring data. The monitor locations are often outside the control of researchers, and previous studies have shown that “preferential sampling” of monitoring locations can adversely a...

Full description

Saved in:
Bibliographic Details
Published inBiostatistics (Oxford, England) Vol. 17; no. 4; pp. 764 - 778
Main Authors Antonelli, Joseph, Cefalu, Matthew, Bornn, Luke
Format Journal Article
LanguageEnglish
Published England Oxford University Press 01.10.2016
Subjects
Online AccessGet full text
ISSN1465-4644
1468-4357
DOI10.1093/biostatistics/kxw026

Cover

Loading…
Abstract In environmental epidemiology, exposures are not always available at subject locations and must be predicted using monitoring data. The monitor locations are often outside the control of researchers, and previous studies have shown that “preferential sampling” of monitoring locations can adversely affect exposure prediction and subsequent health effect estimation. We adopt a slightly different definition of preferential sampling than is typically seen in the literature, which we call population-based preferential sampling. Population-based preferential sampling occurs when the location of the monitors is dependent on the subject locations. We show the impact that population-based preferential sampling has on exposure prediction and health effect estimation using analytic results and a simulation study. A simple, one-parameter model is proposed to measure the degree to which monitors are preferentially sampled with respect to population density. We then discuss these concepts in the context of PM2.5 and the EPA Air Quality System monitoring sites, which are generally placed in areas of higher population density to capture the population's exposure.
AbstractList In environmental epidemiology, exposures are not always available at subject locations and must be predicted using monitoring data. The monitor locations are often outside the control of researchers, and previous studies have shown that “preferential sampling” of monitoring locations can adversely affect exposure prediction and subsequent health effect estimation. We adopt a slightly different definition of preferential sampling than is typically seen in the literature, which we call population-based preferential sampling. Population-based preferential sampling occurs when the location of the monitors is dependent on the subject locations. We show the impact that population-based preferential sampling has on exposure prediction and health effect estimation using analytic results and a simulation study. A simple, one-parameter model is proposed to measure the degree to which monitors are preferentially sampled with respect to population density. We then discuss these concepts in the context of PM2.5 and the EPA Air Quality System monitoring sites, which are generally placed in areas of higher population density to capture the population's exposure.
In environmental epidemiology, exposures are not always available at subject locations and must be predicted using monitoring data. The monitor locations are often outside the control of researchers, and previous studies have shown that “preferential sampling” of monitoring locations can adversely affect exposure prediction and subsequent health effect estimation. We adopt a slightly different definition of preferential sampling than is typically seen in the literature, which we call population-based preferential sampling. Population-based preferential sampling occurs when the location of the monitors is dependent on the subject locations. We show the impact that population-based preferential sampling has on exposure prediction and health effect estimation using analytic results and a simulation study. A simple, one-parameter model is proposed to measure the degree to which monitors are preferentially sampled with respect to population density. We then discuss these concepts in the context of PM 2.5 and the EPA Air Quality System monitoring sites, which are generally placed in areas of higher population density to capture the population's exposure.
Author Antonelli, Joseph
Cefalu, Matthew
Bornn, Luke
AuthorAffiliation 3 Department of Statistics and Actuarial Science, Simon Fraser University , 8888 University Drive, Burnaby, BC, Canada
1 Department of Biostatistics, Harvard University , 655 Huntington Avenue, Boston, MA 02115, USA
2 RAND Corporation , 1776 Main Street, Santa Monica, CA 90401, USA
AuthorAffiliation_xml – name: 2 RAND Corporation , 1776 Main Street, Santa Monica, CA 90401, USA
– name: 3 Department of Statistics and Actuarial Science, Simon Fraser University , 8888 University Drive, Burnaby, BC, Canada
– name: 1 Department of Biostatistics, Harvard University , 655 Huntington Avenue, Boston, MA 02115, USA
Author_xml – sequence: 1
  givenname: Joseph
  surname: Antonelli
  fullname: Antonelli, Joseph
  organization: Department of Biostatistics, Harvard University, 655 Huntington Avenue, Boston, MA 02115, USA
– sequence: 2
  givenname: Matthew
  surname: Cefalu
  fullname: Cefalu, Matthew
  organization: RAND Corporation, 1776 Main Street, Santa Monica, CA 90401, USA
– sequence: 3
  givenname: Luke
  surname: Bornn
  fullname: Bornn, Luke
  organization: Department of Statistics and Actuarial Science, Simon Fraser University, 8888 University Drive, Burnaby, BC, Canada
BackLink https://www.ncbi.nlm.nih.gov/pubmed/27324413$$D View this record in MEDLINE/PubMed
BookMark eNp9kc1OAyEUhYmp8f8NjJkXGIWB0hkXJqbxL2nipu5MCDCXis7AZKDVvr20VWNduIJw7vkuOecQDZx3gNApwecEV_RCWR-ijDZEq8PF28c7LvgOOiCMlzmjw9FgfR_mjDO2jw5DeMW4KCine2i_GNGCMUIP0PP0BbLOBxvtAjIwBnQMmTfprZs3Ce9drmSAOut6MNCDi1Y2WZBt11g3y6zLwC1s712bpKRAZ2torW_8bHmMdo1sApx8nUfo6fZmOr7PJ493D-PrSa7ZiMW8qDlLm5nhnLMS84rT2igoFCcjGGJDiMYVV8pIgxXWUJKaKFzVNZO6omVJj9DVhtvNVQu1Tj_pZSO63rayXwovrdhWnH0RM78QnK1iYAlw9hvw4_zOKQ1cbgZ070NISQht4zqdxLONIFisShFbpYhNKcnM_pi_-f_aPgHDGpzv
CitedBy_id crossref_primary_10_1007_s42081_022_00178_8
crossref_primary_10_1002_env_2573
crossref_primary_10_1016_j_jenvman_2022_117194
crossref_primary_10_1164_rccm_201706_1267OC
Cites_doi 10.1056/NEJM199312093292401
10.1097/EDE.0b013e31819e4331
10.1002/env.2334
10.1080/08958370701492961
10.1111/j.1751-5823.2003.tb00195.x
10.1080/02693799008941549
10.1016/S0045-6535(02)00239-4
10.1186/1476-069X-11-40
10.1111/j.1467-9876.2009.00701.x
10.1001/jama.295.10.1127
10.1093/biostatistics/kxn033
10.1007/s11869-012-0181-8
10.1002/env.2233
10.1002/env.2169
10.1038/jes.2012.126
10.1056/NEJM200012143432401
10.1002/env.1039
10.1093/biostatistics/kxq083
10.1111/j.1467-9876.2008.00618.x
ContentType Journal Article
Copyright The Author 2016. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
The Author 2016. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com. 2016
Copyright_xml – notice: The Author 2016. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
– notice: The Author 2016. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com. 2016
DBID AAYXX
CITATION
CGR
CUY
CVF
ECM
EIF
NPM
5PM
DOI 10.1093/biostatistics/kxw026
DatabaseName CrossRef
Medline
MEDLINE
MEDLINE (Ovid)
MEDLINE
MEDLINE
PubMed
PubMed Central (Full Participant titles)
DatabaseTitle CrossRef
MEDLINE
Medline Complete
MEDLINE with Full Text
PubMed
MEDLINE (Ovid)
DatabaseTitleList CrossRef

MEDLINE
Database_xml – sequence: 1
  dbid: NPM
  name: PubMed
  url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 2
  dbid: EIF
  name: MEDLINE
  url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search
  sourceTypes: Index Database
DeliveryMethod fulltext_linktorsrc
Discipline Biology
EISSN 1468-4357
EndPage 778
ExternalDocumentID PMC6424414
27324413
10_1093_biostatistics_kxw026
Genre Research Support, U.S. Gov't, Non-P.H.S
Journal Article
Research Support, N.I.H., Extramural
GrantInformation_xml – fundername: NIEHS NIH HHS
  grantid: T32 ES007142
GroupedDBID ---
-E4
.2P
.I3
0R~
1TH
23N
2WC
4.4
48X
53G
5GY
5VS
5WA
6PF
70D
AAIJN
AAJKP
AAJQQ
AAMVS
AAOGV
AAPQZ
AAPXW
AARHZ
AAUAY
AAUQX
AAVAP
AAWTL
AAYXX
ABDFA
ABDTM
ABEJV
ABEUO
ABGNP
ABIXL
ABJNI
ABLJU
ABNKS
ABPQP
ABPTD
ABQLI
ABVGC
ABWST
ABXVV
ABZBJ
ACGFS
ACIWK
ACPRK
ACUFI
ACUXJ
ACYTK
ADBBV
ADEYI
ADEZT
ADGZP
ADHKW
ADHZD
ADIPN
ADNBA
ADOCK
ADQBN
ADRDM
ADRTK
ADVEK
ADYJX
ADYVW
ADZXQ
AECKG
AEGPL
AEJOX
AEKKA
AEKSI
AEMDU
AENEX
AENZO
AEPUE
AETBJ
AEWNT
AFFZL
AFIYH
AFOFC
AFRAH
AGINJ
AGKEF
AGORE
AGQXC
AGSYK
AHMBA
AHXPO
AIJHB
AJBYB
AJEEA
AJEUX
AJNCP
ALMA_UNASSIGNED_HOLDINGS
ALTZX
ALUQC
ALXQX
ANAKG
APIBT
APWMN
ATGXG
AXUDD
AZVOD
BAWUL
BAYMD
BCRHZ
BEYMZ
BHONS
BQUQU
BTQHN
C45
CDBKE
CITATION
CS3
CZ4
DAKXR
DIK
DILTD
DU5
D~K
E3Z
EBD
EBS
EE~
EJD
EMOBN
F5P
F9B
FLIZI
FLUFQ
FOEOM
FQBLK
GAUVT
GJXCC
H13
H5~
HAR
HW0
HZ~
IOX
J21
JXSIZ
KBUDW
KOP
KQ8
KSI
KSN
M-Z
N9A
NGC
NMDNZ
NOMLY
NU-
O9-
ODMLO
OJQWA
OJZSN
OK1
OVD
P2P
PAFKI
PEELM
PQQKQ
Q1.
Q5Y
RD5
ROL
ROX
RUSNO
RW1
RXO
SV3
TEORI
TJP
TN5
TR2
W8F
WOQ
X7H
YAYTL
YKOAZ
YXANX
ZKX
~91
ABQTQ
ACIPB
ADRIX
AFXEN
C1A
CAG
CGR
COF
CUY
CVF
ECM
EIF
M49
NPM
NTWIH
O0~
RHF
RIG
RNI
RZO
5PM
ID FETCH-LOGICAL-c474t-2d64eff4f6664806963dfbe2b617e50f11c096bbfaf0b0ce81d1b09dd4ac93883
ISSN 1465-4644
IngestDate Thu Aug 21 14:11:39 EDT 2025
Wed Feb 19 02:32:01 EST 2025
Tue Jul 01 03:45:54 EDT 2025
Thu Apr 24 23:04:56 EDT 2025
IsDoiOpenAccess false
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 4
Keywords Air pollution epidemiology
Preferential sampling
exposure estimation
Language English
License The Author 2016. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
LinkModel OpenURL
MergedId FETCHMERGED-LOGICAL-c474t-2d64eff4f6664806963dfbe2b617e50f11c096bbfaf0b0ce81d1b09dd4ac93883
OpenAccessLink https://academic.oup.com/biostatistics/article-pdf/17/4/764/30142190/kxw026.pdf
PMID 27324413
PageCount 15
ParticipantIDs pubmedcentral_primary_oai_pubmedcentral_nih_gov_6424414
pubmed_primary_27324413
crossref_citationtrail_10_1093_biostatistics_kxw026
crossref_primary_10_1093_biostatistics_kxw026
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 2016-10-01
PublicationDateYYYYMMDD 2016-10-01
PublicationDate_xml – month: 10
  year: 2016
  text: 2016-10-01
  day: 01
PublicationDecade 2010
PublicationPlace England
PublicationPlace_xml – name: England
PublicationTitle Biostatistics (Oxford, England)
PublicationTitleAlternate Biostatistics
PublicationYear 2016
Publisher Oxford University Press
Publisher_xml – name: Oxford University Press
References Gelfand (2019101405384850800_B8) 2012; 23
Gryparis (2019101405384850800_B9) 2009; 10
Nikolov (2019101405384850800_B14) 2011; 22
Pope (2019101405384850800_B17) 2007; 19
Samet (2019101405384850800_B18) 2000; 343
Kim (2019101405384850800_B10) 2009; 20
Breysse (2019101405384850800_B2) 2013; 6
Dominici (2019101405384850800_B6) 2006; 295
Matte (2019101405384850800_B13) 2013; 23
Dominici (2019101405384850800_B7) 2003; 71
Szpiro (2019101405384850800_B20) 2011; 12
Szpiro (2019101405384850800_B19) 2013; 24
Nikolov (2019101405384850800_B15) 2008; 57
Oliver (2019101405384850800_B16) 1990; 4
Kloog (2019101405384850800_B11) 2012; 11
2019101405384850800_B1
Lee (2019101405384850800_B12) 2015; 26
Dockery (2019101405384850800_B5) 1993; 329
Chow (2019101405384850800_B3) 2002; 49
Diggle (2019101405384850800_B4) 2010; 59
References_xml – volume: 329
  start-page: 1753
  year: 1993
  ident: 2019101405384850800_B5
  article-title: An association between air pollution and mortality in six US cities
  publication-title: New England Journal of Medicine
  doi: 10.1056/NEJM199312093292401
– volume: 20
  start-page: 442
  year: 2009
  ident: 2019101405384850800_B10
  article-title: Health effects of long-term air pollution: influence of exposure prediction methods
  publication-title: Epidemiology
  doi: 10.1097/EDE.0b013e31819e4331
– volume: 26
  start-page: 255
  issue: 4
  year: 2015
  ident: 2019101405384850800_B12
  article-title: Impact of preferential sampling on exposure prediction and health effect inference in the context of air pollution epidemiology
  publication-title: Environmetrics
  doi: 10.1002/env.2334
– volume: 19
  start-page: 33
  year: 2007
  ident: 2019101405384850800_B17
  article-title: Mortality effects of longer term exposures to fine particulate air pollution: review of recent epidemiological evidence
  publication-title: Inhalation Toxicology
  doi: 10.1080/08958370701492961
– ident: 2019101405384850800_B1
– volume: 71
  start-page: 243
  year: 2003
  ident: 2019101405384850800_B7
  article-title: Health effects of air pollution: a statistical review
  publication-title: International Statistical Review
  doi: 10.1111/j.1751-5823.2003.tb00195.x
– volume: 4
  start-page: 313
  year: 1990
  ident: 2019101405384850800_B16
  article-title: Kriging: a method of interpolation for geographical information systems
  publication-title: International Journal of Geographical Information System
  doi: 10.1080/02693799008941549
– volume: 49
  start-page: 961
  year: 2002
  ident: 2019101405384850800_B3
  article-title: Designing monitoring networks to represent outdoor human exposure
  publication-title: Chemosphere
  doi: 10.1016/S0045-6535(02)00239-4
– volume: 11
  start-page: 1
  year: 2012
  ident: 2019101405384850800_B11
  article-title: Using new satellite based exposure methods to study the association between pregnancy pm2. 5 exposure, premature birth and birth weight in Massachusetts
  publication-title: Environmental Health
  doi: 10.1186/1476-069X-11-40
– volume: 59
  start-page: 191
  year: 2010
  ident: 2019101405384850800_B4
  article-title: Geostatistical inference under preferential sampling
  publication-title: Journal of the Royal Statistical Society: Series C (Applied Statistics)
  doi: 10.1111/j.1467-9876.2009.00701.x
– volume: 295
  start-page: 1127
  year: 2006
  ident: 2019101405384850800_B6
  article-title: Fine particulate air pollution and hospital admission for cardiovascular and respiratory diseases
  publication-title: JAMA
  doi: 10.1001/jama.295.10.1127
– volume: 10
  start-page: 258
  year: 2009
  ident: 2019101405384850800_B9
  article-title: Measurement error caused by spatial misalignment in environmental epidemiology
  publication-title: Biostatistics
  doi: 10.1093/biostatistics/kxn033
– volume: 6
  start-page: 333
  issue: 2
  year: 2013
  ident: 2019101405384850800_B2
  article-title: US EPA particulate matter research centers: summary of research results for 2005–2011
  publication-title: Air Quality, Atmosphere & Health
  doi: 10.1007/s11869-012-0181-8
– volume: 24
  start-page: 501
  year: 2013
  ident: 2019101405384850800_B19
  article-title: Measurement error in two-stage analyses, with application to air pollution epidemiology
  publication-title: Environmetrics
  doi: 10.1002/env.2233
– volume: 23
  start-page: 565
  year: 2012
  ident: 2019101405384850800_B8
  article-title: On the effect of preferential sampling in spatial prediction
  publication-title: Environmetrics
  doi: 10.1002/env.2169
– volume: 23
  start-page: 223
  year: 2013
  ident: 2019101405384850800_B13
  article-title: Monitoring intraurban spatial patterns of multiple combustion air pollutants in New York City: design and implementation
  publication-title: Journal of Exposure Science and Environmental Epidemiology
  doi: 10.1038/jes.2012.126
– volume: 343
  start-page: 1742
  issue: 24
  year: 2000
  ident: 2019101405384850800_B18
  article-title: Fine particulate air pollution and mortality in 20 US cities, 1987–1994
  publication-title: New England Journal of Medicine
  doi: 10.1056/NEJM200012143432401
– volume: 22
  start-page: 165
  year: 2011
  ident: 2019101405384850800_B14
  article-title: Multiplicative factor analysis with a latent mixed model structure for air pollution exposure assessment
  publication-title: Environmetrics
  doi: 10.1002/env.1039
– volume: 12
  start-page: 610
  issue: 4
  year: 2011
  ident: 2019101405384850800_B20
  article-title: Efficient measurement error correction with spatially misaligned data
  publication-title: Biostatistics
  doi: 10.1093/biostatistics/kxq083
– volume: 57
  start-page: 357
  year: 2008
  ident: 2019101405384850800_B15
  article-title: Statistical methods to evaluate health effects associated with major sources of air pollution: a case-study of breathing patterns during exposure to concentrated Boston air particles
  publication-title: Journal of the Royal Statistical Society: Series C (Applied Statistics)
  doi: 10.1111/j.1467-9876.2008.00618.x
SSID ssj0022363
Score 2.139843
Snippet In environmental epidemiology, exposures are not always available at subject locations and must be predicted using monitoring data. The monitor locations are...
In environmental epidemiology, exposures are not always available at subject locations and must be predicted using monitoring data. The monitor locations are...
SourceID pubmedcentral
pubmed
crossref
SourceType Open Access Repository
Index Database
Enrichment Source
StartPage 764
SubjectTerms Environmental Exposure
Environmental Monitoring - statistics & numerical data
Epidemiologic Methods
Humans
Models, Theoretical
Research Design
Title The positive effects of population-based preferential sampling in environmental epidemiology
URI https://www.ncbi.nlm.nih.gov/pubmed/27324413
https://pubmed.ncbi.nlm.nih.gov/PMC6424414
Volume 17
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1bT9swFLY6ENJe0MZuHWzyw17d5uLYySNCQ2gbSEgg8TCpim9aRdWiKh3d_sP-M8dxYmyGxuAlap3UaXq-2ufY5zsfQp-ETnRVy5JIyZVduhGkVrkmXIF_oIqiZC2L__iEHZ3TLxfFxWDwJ8haWjViJH_fyyt5ilWhDexqWbKPsKzvFBrgNdgXjmBhOP63jV3W1c8oM-PKq3IRO0u1lQBaXl_T0kNqm0XuqCwBzw3O6Fu52Hi3d7qwxKOuprMtULruc-I7EZBoQcHW95452nW3wdBvdGhTz1aOItTKjPvFgMXS-dHfVpc6XIhImU9p68dOygpCmSvnONJ9W0nAI-PRgMsDYNFg9OSMBhMxd9o-f43xrv6VCJ8c3l-ur5PsnqLadyY7n4LoNt_zSdTPxPXyDG1mEHVYQYyvp35TChypVpjPP2fPxKzycdTL2PUSeTrevYlTbwNf5uwF2u6CELzvEPUSDfR8B205WdJfr9B3wBXucYU7XOGFwXdxhUNc4R5XeDrHEa5wiKvX6Pzw89nBEelUOIiknDYkU4zCnaiBQJeWCYMRWxmhMwG-ry4Sk6YSwmAhTG0SkUgNAVAqkkopWssqL8v8DdqYA-7eIVwayTMjcwZhA61oUVtBHiMLzWquTKaHKO9_sYnsStRbpZTZ5F_WGiLiP3XlSrQ8cP1bZwN_Nbjw4OWm-RDxyDr-AluDPT4zn_5oa7EzSxRN6ftHfodd9Pz2L7SHNprlSn8A77YRH1vM3QANArbY
linkProvider Colorado Alliance of Research Libraries
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=The+positive+effects+of+population-based+preferential+sampling+in+environmental+epidemiology&rft.jtitle=Biostatistics+%28Oxford%2C+England%29&rft.au=Antonelli%2C+Joseph&rft.au=Cefalu%2C+Matthew&rft.au=Bornn%2C+Luke&rft.date=2016-10-01&rft.issn=1465-4644&rft.eissn=1468-4357&rft.volume=17&rft.issue=4&rft.spage=764&rft.epage=778&rft_id=info:doi/10.1093%2Fbiostatistics%2Fkxw026&rft.externalDBID=n%2Fa&rft.externalDocID=10_1093_biostatistics_kxw026
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1465-4644&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1465-4644&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1465-4644&client=summon