The positive effects of population-based preferential sampling in environmental epidemiology
In environmental epidemiology, exposures are not always available at subject locations and must be predicted using monitoring data. The monitor locations are often outside the control of researchers, and previous studies have shown that “preferential sampling” of monitoring locations can adversely a...
Saved in:
Published in | Biostatistics (Oxford, England) Vol. 17; no. 4; pp. 764 - 778 |
---|---|
Main Authors | , , |
Format | Journal Article |
Language | English |
Published |
England
Oxford University Press
01.10.2016
|
Subjects | |
Online Access | Get full text |
ISSN | 1465-4644 1468-4357 |
DOI | 10.1093/biostatistics/kxw026 |
Cover
Loading…
Abstract | In environmental epidemiology, exposures are not always available at subject locations and must be predicted using monitoring data. The monitor locations are often outside the control of researchers, and previous studies have shown that “preferential sampling” of monitoring locations can adversely affect exposure prediction and subsequent health effect estimation. We adopt a slightly different definition of preferential sampling than is typically seen in the literature, which we call population-based preferential sampling. Population-based preferential sampling occurs when the location of the monitors is dependent on the subject locations. We show the impact that population-based preferential sampling has on exposure prediction and health effect estimation using analytic results and a simulation study. A simple, one-parameter model is proposed to measure the degree to which monitors are preferentially sampled with respect to population density. We then discuss these concepts in the context of PM2.5 and the EPA Air Quality System monitoring sites, which are generally placed in areas of higher population density to capture the population's exposure. |
---|---|
AbstractList | In environmental epidemiology, exposures are not always available at subject locations and must be predicted using monitoring data. The monitor locations are often outside the control of researchers, and previous studies have shown that “preferential sampling” of monitoring locations can adversely affect exposure prediction and subsequent health effect estimation. We adopt a slightly different definition of preferential sampling than is typically seen in the literature, which we call population-based preferential sampling. Population-based preferential sampling occurs when the location of the monitors is dependent on the subject locations. We show the impact that population-based preferential sampling has on exposure prediction and health effect estimation using analytic results and a simulation study. A simple, one-parameter model is proposed to measure the degree to which monitors are preferentially sampled with respect to population density. We then discuss these concepts in the context of PM2.5 and the EPA Air Quality System monitoring sites, which are generally placed in areas of higher population density to capture the population's exposure. In environmental epidemiology, exposures are not always available at subject locations and must be predicted using monitoring data. The monitor locations are often outside the control of researchers, and previous studies have shown that “preferential sampling” of monitoring locations can adversely affect exposure prediction and subsequent health effect estimation. We adopt a slightly different definition of preferential sampling than is typically seen in the literature, which we call population-based preferential sampling. Population-based preferential sampling occurs when the location of the monitors is dependent on the subject locations. We show the impact that population-based preferential sampling has on exposure prediction and health effect estimation using analytic results and a simulation study. A simple, one-parameter model is proposed to measure the degree to which monitors are preferentially sampled with respect to population density. We then discuss these concepts in the context of PM 2.5 and the EPA Air Quality System monitoring sites, which are generally placed in areas of higher population density to capture the population's exposure. |
Author | Antonelli, Joseph Cefalu, Matthew Bornn, Luke |
AuthorAffiliation | 3 Department of Statistics and Actuarial Science, Simon Fraser University , 8888 University Drive, Burnaby, BC, Canada 1 Department of Biostatistics, Harvard University , 655 Huntington Avenue, Boston, MA 02115, USA 2 RAND Corporation , 1776 Main Street, Santa Monica, CA 90401, USA |
AuthorAffiliation_xml | – name: 2 RAND Corporation , 1776 Main Street, Santa Monica, CA 90401, USA – name: 3 Department of Statistics and Actuarial Science, Simon Fraser University , 8888 University Drive, Burnaby, BC, Canada – name: 1 Department of Biostatistics, Harvard University , 655 Huntington Avenue, Boston, MA 02115, USA |
Author_xml | – sequence: 1 givenname: Joseph surname: Antonelli fullname: Antonelli, Joseph organization: Department of Biostatistics, Harvard University, 655 Huntington Avenue, Boston, MA 02115, USA – sequence: 2 givenname: Matthew surname: Cefalu fullname: Cefalu, Matthew organization: RAND Corporation, 1776 Main Street, Santa Monica, CA 90401, USA – sequence: 3 givenname: Luke surname: Bornn fullname: Bornn, Luke organization: Department of Statistics and Actuarial Science, Simon Fraser University, 8888 University Drive, Burnaby, BC, Canada |
BackLink | https://www.ncbi.nlm.nih.gov/pubmed/27324413$$D View this record in MEDLINE/PubMed |
BookMark | eNp9kc1OAyEUhYmp8f8NjJkXGIWB0hkXJqbxL2nipu5MCDCXis7AZKDVvr20VWNduIJw7vkuOecQDZx3gNApwecEV_RCWR-ijDZEq8PF28c7LvgOOiCMlzmjw9FgfR_mjDO2jw5DeMW4KCine2i_GNGCMUIP0PP0BbLOBxvtAjIwBnQMmTfprZs3Ce9drmSAOut6MNCDi1Y2WZBt11g3y6zLwC1s712bpKRAZ2torW_8bHmMdo1sApx8nUfo6fZmOr7PJ493D-PrSa7ZiMW8qDlLm5nhnLMS84rT2igoFCcjGGJDiMYVV8pIgxXWUJKaKFzVNZO6omVJj9DVhtvNVQu1Tj_pZSO63rayXwovrdhWnH0RM78QnK1iYAlw9hvw4_zOKQ1cbgZ070NISQht4zqdxLONIFisShFbpYhNKcnM_pi_-f_aPgHDGpzv |
CitedBy_id | crossref_primary_10_1007_s42081_022_00178_8 crossref_primary_10_1002_env_2573 crossref_primary_10_1016_j_jenvman_2022_117194 crossref_primary_10_1164_rccm_201706_1267OC |
Cites_doi | 10.1056/NEJM199312093292401 10.1097/EDE.0b013e31819e4331 10.1002/env.2334 10.1080/08958370701492961 10.1111/j.1751-5823.2003.tb00195.x 10.1080/02693799008941549 10.1016/S0045-6535(02)00239-4 10.1186/1476-069X-11-40 10.1111/j.1467-9876.2009.00701.x 10.1001/jama.295.10.1127 10.1093/biostatistics/kxn033 10.1007/s11869-012-0181-8 10.1002/env.2233 10.1002/env.2169 10.1038/jes.2012.126 10.1056/NEJM200012143432401 10.1002/env.1039 10.1093/biostatistics/kxq083 10.1111/j.1467-9876.2008.00618.x |
ContentType | Journal Article |
Copyright | The Author 2016. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com. The Author 2016. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com. 2016 |
Copyright_xml | – notice: The Author 2016. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com. – notice: The Author 2016. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com. 2016 |
DBID | AAYXX CITATION CGR CUY CVF ECM EIF NPM 5PM |
DOI | 10.1093/biostatistics/kxw026 |
DatabaseName | CrossRef Medline MEDLINE MEDLINE (Ovid) MEDLINE MEDLINE PubMed PubMed Central (Full Participant titles) |
DatabaseTitle | CrossRef MEDLINE Medline Complete MEDLINE with Full Text PubMed MEDLINE (Ovid) |
DatabaseTitleList | CrossRef MEDLINE |
Database_xml | – sequence: 1 dbid: NPM name: PubMed url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed sourceTypes: Index Database – sequence: 2 dbid: EIF name: MEDLINE url: https://proxy.k.utb.cz/login?url=https://www.webofscience.com/wos/medline/basic-search sourceTypes: Index Database |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Biology |
EISSN | 1468-4357 |
EndPage | 778 |
ExternalDocumentID | PMC6424414 27324413 10_1093_biostatistics_kxw026 |
Genre | Research Support, U.S. Gov't, Non-P.H.S Journal Article Research Support, N.I.H., Extramural |
GrantInformation_xml | – fundername: NIEHS NIH HHS grantid: T32 ES007142 |
GroupedDBID | --- -E4 .2P .I3 0R~ 1TH 23N 2WC 4.4 48X 53G 5GY 5VS 5WA 6PF 70D AAIJN AAJKP AAJQQ AAMVS AAOGV AAPQZ AAPXW AARHZ AAUAY AAUQX AAVAP AAWTL AAYXX ABDFA ABDTM ABEJV ABEUO ABGNP ABIXL ABJNI ABLJU ABNKS ABPQP ABPTD ABQLI ABVGC ABWST ABXVV ABZBJ ACGFS ACIWK ACPRK ACUFI ACUXJ ACYTK ADBBV ADEYI ADEZT ADGZP ADHKW ADHZD ADIPN ADNBA ADOCK ADQBN ADRDM ADRTK ADVEK ADYJX ADYVW ADZXQ AECKG AEGPL AEJOX AEKKA AEKSI AEMDU AENEX AENZO AEPUE AETBJ AEWNT AFFZL AFIYH AFOFC AFRAH AGINJ AGKEF AGORE AGQXC AGSYK AHMBA AHXPO AIJHB AJBYB AJEEA AJEUX AJNCP ALMA_UNASSIGNED_HOLDINGS ALTZX ALUQC ALXQX ANAKG APIBT APWMN ATGXG AXUDD AZVOD BAWUL BAYMD BCRHZ BEYMZ BHONS BQUQU BTQHN C45 CDBKE CITATION CS3 CZ4 DAKXR DIK DILTD DU5 D~K E3Z EBD EBS EE~ EJD EMOBN F5P F9B FLIZI FLUFQ FOEOM FQBLK GAUVT GJXCC H13 H5~ HAR HW0 HZ~ IOX J21 JXSIZ KBUDW KOP KQ8 KSI KSN M-Z N9A NGC NMDNZ NOMLY NU- O9- ODMLO OJQWA OJZSN OK1 OVD P2P PAFKI PEELM PQQKQ Q1. Q5Y RD5 ROL ROX RUSNO RW1 RXO SV3 TEORI TJP TN5 TR2 W8F WOQ X7H YAYTL YKOAZ YXANX ZKX ~91 ABQTQ ACIPB ADRIX AFXEN C1A CAG CGR COF CUY CVF ECM EIF M49 NPM NTWIH O0~ RHF RIG RNI RZO 5PM |
ID | FETCH-LOGICAL-c474t-2d64eff4f6664806963dfbe2b617e50f11c096bbfaf0b0ce81d1b09dd4ac93883 |
ISSN | 1465-4644 |
IngestDate | Thu Aug 21 14:11:39 EDT 2025 Wed Feb 19 02:32:01 EST 2025 Tue Jul 01 03:45:54 EDT 2025 Thu Apr 24 23:04:56 EDT 2025 |
IsDoiOpenAccess | false |
IsOpenAccess | true |
IsPeerReviewed | true |
IsScholarly | true |
Issue | 4 |
Keywords | Air pollution epidemiology Preferential sampling exposure estimation |
Language | English |
License | The Author 2016. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com. |
LinkModel | OpenURL |
MergedId | FETCHMERGED-LOGICAL-c474t-2d64eff4f6664806963dfbe2b617e50f11c096bbfaf0b0ce81d1b09dd4ac93883 |
OpenAccessLink | https://academic.oup.com/biostatistics/article-pdf/17/4/764/30142190/kxw026.pdf |
PMID | 27324413 |
PageCount | 15 |
ParticipantIDs | pubmedcentral_primary_oai_pubmedcentral_nih_gov_6424414 pubmed_primary_27324413 crossref_citationtrail_10_1093_biostatistics_kxw026 crossref_primary_10_1093_biostatistics_kxw026 |
ProviderPackageCode | CITATION AAYXX |
PublicationCentury | 2000 |
PublicationDate | 2016-10-01 |
PublicationDateYYYYMMDD | 2016-10-01 |
PublicationDate_xml | – month: 10 year: 2016 text: 2016-10-01 day: 01 |
PublicationDecade | 2010 |
PublicationPlace | England |
PublicationPlace_xml | – name: England |
PublicationTitle | Biostatistics (Oxford, England) |
PublicationTitleAlternate | Biostatistics |
PublicationYear | 2016 |
Publisher | Oxford University Press |
Publisher_xml | – name: Oxford University Press |
References | Gelfand (2019101405384850800_B8) 2012; 23 Gryparis (2019101405384850800_B9) 2009; 10 Nikolov (2019101405384850800_B14) 2011; 22 Pope (2019101405384850800_B17) 2007; 19 Samet (2019101405384850800_B18) 2000; 343 Kim (2019101405384850800_B10) 2009; 20 Breysse (2019101405384850800_B2) 2013; 6 Dominici (2019101405384850800_B6) 2006; 295 Matte (2019101405384850800_B13) 2013; 23 Dominici (2019101405384850800_B7) 2003; 71 Szpiro (2019101405384850800_B20) 2011; 12 Szpiro (2019101405384850800_B19) 2013; 24 Nikolov (2019101405384850800_B15) 2008; 57 Oliver (2019101405384850800_B16) 1990; 4 Kloog (2019101405384850800_B11) 2012; 11 2019101405384850800_B1 Lee (2019101405384850800_B12) 2015; 26 Dockery (2019101405384850800_B5) 1993; 329 Chow (2019101405384850800_B3) 2002; 49 Diggle (2019101405384850800_B4) 2010; 59 |
References_xml | – volume: 329 start-page: 1753 year: 1993 ident: 2019101405384850800_B5 article-title: An association between air pollution and mortality in six US cities publication-title: New England Journal of Medicine doi: 10.1056/NEJM199312093292401 – volume: 20 start-page: 442 year: 2009 ident: 2019101405384850800_B10 article-title: Health effects of long-term air pollution: influence of exposure prediction methods publication-title: Epidemiology doi: 10.1097/EDE.0b013e31819e4331 – volume: 26 start-page: 255 issue: 4 year: 2015 ident: 2019101405384850800_B12 article-title: Impact of preferential sampling on exposure prediction and health effect inference in the context of air pollution epidemiology publication-title: Environmetrics doi: 10.1002/env.2334 – volume: 19 start-page: 33 year: 2007 ident: 2019101405384850800_B17 article-title: Mortality effects of longer term exposures to fine particulate air pollution: review of recent epidemiological evidence publication-title: Inhalation Toxicology doi: 10.1080/08958370701492961 – ident: 2019101405384850800_B1 – volume: 71 start-page: 243 year: 2003 ident: 2019101405384850800_B7 article-title: Health effects of air pollution: a statistical review publication-title: International Statistical Review doi: 10.1111/j.1751-5823.2003.tb00195.x – volume: 4 start-page: 313 year: 1990 ident: 2019101405384850800_B16 article-title: Kriging: a method of interpolation for geographical information systems publication-title: International Journal of Geographical Information System doi: 10.1080/02693799008941549 – volume: 49 start-page: 961 year: 2002 ident: 2019101405384850800_B3 article-title: Designing monitoring networks to represent outdoor human exposure publication-title: Chemosphere doi: 10.1016/S0045-6535(02)00239-4 – volume: 11 start-page: 1 year: 2012 ident: 2019101405384850800_B11 article-title: Using new satellite based exposure methods to study the association between pregnancy pm2. 5 exposure, premature birth and birth weight in Massachusetts publication-title: Environmental Health doi: 10.1186/1476-069X-11-40 – volume: 59 start-page: 191 year: 2010 ident: 2019101405384850800_B4 article-title: Geostatistical inference under preferential sampling publication-title: Journal of the Royal Statistical Society: Series C (Applied Statistics) doi: 10.1111/j.1467-9876.2009.00701.x – volume: 295 start-page: 1127 year: 2006 ident: 2019101405384850800_B6 article-title: Fine particulate air pollution and hospital admission for cardiovascular and respiratory diseases publication-title: JAMA doi: 10.1001/jama.295.10.1127 – volume: 10 start-page: 258 year: 2009 ident: 2019101405384850800_B9 article-title: Measurement error caused by spatial misalignment in environmental epidemiology publication-title: Biostatistics doi: 10.1093/biostatistics/kxn033 – volume: 6 start-page: 333 issue: 2 year: 2013 ident: 2019101405384850800_B2 article-title: US EPA particulate matter research centers: summary of research results for 2005–2011 publication-title: Air Quality, Atmosphere & Health doi: 10.1007/s11869-012-0181-8 – volume: 24 start-page: 501 year: 2013 ident: 2019101405384850800_B19 article-title: Measurement error in two-stage analyses, with application to air pollution epidemiology publication-title: Environmetrics doi: 10.1002/env.2233 – volume: 23 start-page: 565 year: 2012 ident: 2019101405384850800_B8 article-title: On the effect of preferential sampling in spatial prediction publication-title: Environmetrics doi: 10.1002/env.2169 – volume: 23 start-page: 223 year: 2013 ident: 2019101405384850800_B13 article-title: Monitoring intraurban spatial patterns of multiple combustion air pollutants in New York City: design and implementation publication-title: Journal of Exposure Science and Environmental Epidemiology doi: 10.1038/jes.2012.126 – volume: 343 start-page: 1742 issue: 24 year: 2000 ident: 2019101405384850800_B18 article-title: Fine particulate air pollution and mortality in 20 US cities, 1987–1994 publication-title: New England Journal of Medicine doi: 10.1056/NEJM200012143432401 – volume: 22 start-page: 165 year: 2011 ident: 2019101405384850800_B14 article-title: Multiplicative factor analysis with a latent mixed model structure for air pollution exposure assessment publication-title: Environmetrics doi: 10.1002/env.1039 – volume: 12 start-page: 610 issue: 4 year: 2011 ident: 2019101405384850800_B20 article-title: Efficient measurement error correction with spatially misaligned data publication-title: Biostatistics doi: 10.1093/biostatistics/kxq083 – volume: 57 start-page: 357 year: 2008 ident: 2019101405384850800_B15 article-title: Statistical methods to evaluate health effects associated with major sources of air pollution: a case-study of breathing patterns during exposure to concentrated Boston air particles publication-title: Journal of the Royal Statistical Society: Series C (Applied Statistics) doi: 10.1111/j.1467-9876.2008.00618.x |
SSID | ssj0022363 |
Score | 2.139843 |
Snippet | In environmental epidemiology, exposures are not always available at subject locations and must be predicted using monitoring data. The monitor locations are... In environmental epidemiology, exposures are not always available at subject locations and must be predicted using monitoring data. The monitor locations are... |
SourceID | pubmedcentral pubmed crossref |
SourceType | Open Access Repository Index Database Enrichment Source |
StartPage | 764 |
SubjectTerms | Environmental Exposure Environmental Monitoring - statistics & numerical data Epidemiologic Methods Humans Models, Theoretical Research Design |
Title | The positive effects of population-based preferential sampling in environmental epidemiology |
URI | https://www.ncbi.nlm.nih.gov/pubmed/27324413 https://pubmed.ncbi.nlm.nih.gov/PMC6424414 |
Volume | 17 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1bT9swFLY6ENJe0MZuHWzyw17d5uLYySNCQ2gbSEgg8TCpim9aRdWiKh3d_sP-M8dxYmyGxuAlap3UaXq-2ufY5zsfQp-ETnRVy5JIyZVduhGkVrkmXIF_oIqiZC2L__iEHZ3TLxfFxWDwJ8haWjViJH_fyyt5ilWhDexqWbKPsKzvFBrgNdgXjmBhOP63jV3W1c8oM-PKq3IRO0u1lQBaXl_T0kNqm0XuqCwBzw3O6Fu52Hi3d7qwxKOuprMtULruc-I7EZBoQcHW95452nW3wdBvdGhTz1aOItTKjPvFgMXS-dHfVpc6XIhImU9p68dOygpCmSvnONJ9W0nAI-PRgMsDYNFg9OSMBhMxd9o-f43xrv6VCJ8c3l-ur5PsnqLadyY7n4LoNt_zSdTPxPXyDG1mEHVYQYyvp35TChypVpjPP2fPxKzycdTL2PUSeTrevYlTbwNf5uwF2u6CELzvEPUSDfR8B205WdJfr9B3wBXucYU7XOGFwXdxhUNc4R5XeDrHEa5wiKvX6Pzw89nBEelUOIiknDYkU4zCnaiBQJeWCYMRWxmhMwG-ry4Sk6YSwmAhTG0SkUgNAVAqkkopWssqL8v8DdqYA-7eIVwayTMjcwZhA61oUVtBHiMLzWquTKaHKO9_sYnsStRbpZTZ5F_WGiLiP3XlSrQ8cP1bZwN_Nbjw4OWm-RDxyDr-AluDPT4zn_5oa7EzSxRN6ftHfodd9Pz2L7SHNprlSn8A77YRH1vM3QANArbY |
linkProvider | Colorado Alliance of Research Libraries |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=The+positive+effects+of+population-based+preferential+sampling+in+environmental+epidemiology&rft.jtitle=Biostatistics+%28Oxford%2C+England%29&rft.au=Antonelli%2C+Joseph&rft.au=Cefalu%2C+Matthew&rft.au=Bornn%2C+Luke&rft.date=2016-10-01&rft.issn=1465-4644&rft.eissn=1468-4357&rft.volume=17&rft.issue=4&rft.spage=764&rft.epage=778&rft_id=info:doi/10.1093%2Fbiostatistics%2Fkxw026&rft.externalDBID=n%2Fa&rft.externalDocID=10_1093_biostatistics_kxw026 |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1465-4644&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1465-4644&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1465-4644&client=summon |