Prevalence estimation by joint use of big data and health survey: a demonstration study using electronic health records in New York city

Electronic Health Records (EHR) has been increasingly used as a tool to monitor population health. However, subject-level errors in the records can yield biased estimates of health indicators. There is an urgent need for methods to estimate the prevalence of health indicators using large and real-ti...

Full description

Saved in:
Bibliographic Details
Published inBMC medical research methodology Vol. 20; no. 1; pp. 77 - 10
Main Authors Kim, Ryung S., Shankar, Viswanathan
Format Journal Article
LanguageEnglish
Published England BioMed Central Ltd 06.04.2020
BioMed Central
BMC
Subjects
Online AccessGet full text

Cover

Loading…
Abstract Electronic Health Records (EHR) has been increasingly used as a tool to monitor population health. However, subject-level errors in the records can yield biased estimates of health indicators. There is an urgent need for methods to estimate the prevalence of health indicators using large and real-time EHR while correcting the potential bias. We demonstrate joint analyses of EHR and a smaller gold-standard health survey. We first adopted Mosteller's method that pools two estimators, among which one is potentially biased. It only requires knowing the prevalence estimates from two data sources and their standard errors. Then, we adopted the method of Schenker et al., which uses multiple imputations of subject-level health outcomes that are missing for the subjects in EHR. This procedure requires information to link some subjects between two sources and modeling the mechanism of misclassification in EHR as well as modeling inclusion probabilities to both sources. In a simulation study, both estimators yielded negligible bias even when EHR was biased. They performed as well as health survey estimator when EHR bias was large and better than health survey estimator when EHR bias was moderate. It may be challenging to model the misclassification mechanism in real data for the subject-level imputation estimator. We illustrated the methods analyzing six health indicators from 2013 to 14 NYC HANES and the 2013 NYC Macroscope, and a study that linked some subjects in both data sources. When a small gold-standard health survey exists, it can serve as a safeguard against potential bias in EHR through the joint analysis of the two sources.
AbstractList Electronic Health Records (EHR) has been increasingly used as a tool to monitor population health. However, subject-level errors in the records can yield biased estimates of health indicators. There is an urgent need for methods to estimate the prevalence of health indicators using large and real-time EHR while correcting the potential bias. We demonstrate joint analyses of EHR and a smaller gold-standard health survey. We first adopted Mosteller's method that pools two estimators, among which one is potentially biased. It only requires knowing the prevalence estimates from two data sources and their standard errors. Then, we adopted the method of Schenker et al., which uses multiple imputations of subject-level health outcomes that are missing for the subjects in EHR. This procedure requires information to link some subjects between two sources and modeling the mechanism of misclassification in EHR as well as modeling inclusion probabilities to both sources. In a simulation study, both estimators yielded negligible bias even when EHR was biased. They performed as well as health survey estimator when EHR bias was large and better than health survey estimator when EHR bias was moderate. It may be challenging to model the misclassification mechanism in real data for the subject-level imputation estimator. We illustrated the methods analyzing six health indicators from 2013 to 14 NYC HANES and the 2013 NYC Macroscope, and a study that linked some subjects in both data sources. When a small gold-standard health survey exists, it can serve as a safeguard against potential bias in EHR through the joint analysis of the two sources.
Background Electronic Health Records (EHR) has been increasingly used as a tool to monitor population health. However, subject-level errors in the records can yield biased estimates of health indicators. There is an urgent need for methods to estimate the prevalence of health indicators using large and real-time EHR while correcting the potential bias. Methods We demonstrate joint analyses of EHR and a smaller gold-standard health survey. We first adopted Mosteller’s method that pools two estimators, among which one is potentially biased. It only requires knowing the prevalence estimates from two data sources and their standard errors. Then, we adopted the method of Schenker et al., which uses multiple imputations of subject-level health outcomes that are missing for the subjects in EHR. This procedure requires information to link some subjects between two sources and modeling the mechanism of misclassification in EHR as well as modeling inclusion probabilities to both sources. Results In a simulation study, both estimators yielded negligible bias even when EHR was biased. They performed as well as health survey estimator when EHR bias was large and better than health survey estimator when EHR bias was moderate. It may be challenging to model the misclassification mechanism in real data for the subject-level imputation estimator. We illustrated the methods analyzing six health indicators from 2013 to 14 NYC HANES and the 2013 NYC Macroscope, and a study that linked some subjects in both data sources. Conclusions When a small gold-standard health survey exists, it can serve as a safeguard against potential bias in EHR through the joint analysis of the two sources.
Electronic Health Records (EHR) has been increasingly used as a tool to monitor population health. However, subject-level errors in the records can yield biased estimates of health indicators. There is an urgent need for methods to estimate the prevalence of health indicators using large and real-time EHR while correcting the potential bias.BACKGROUNDElectronic Health Records (EHR) has been increasingly used as a tool to monitor population health. However, subject-level errors in the records can yield biased estimates of health indicators. There is an urgent need for methods to estimate the prevalence of health indicators using large and real-time EHR while correcting the potential bias.We demonstrate joint analyses of EHR and a smaller gold-standard health survey. We first adopted Mosteller's method that pools two estimators, among which one is potentially biased. It only requires knowing the prevalence estimates from two data sources and their standard errors. Then, we adopted the method of Schenker et al., which uses multiple imputations of subject-level health outcomes that are missing for the subjects in EHR. This procedure requires information to link some subjects between two sources and modeling the mechanism of misclassification in EHR as well as modeling inclusion probabilities to both sources.METHODSWe demonstrate joint analyses of EHR and a smaller gold-standard health survey. We first adopted Mosteller's method that pools two estimators, among which one is potentially biased. It only requires knowing the prevalence estimates from two data sources and their standard errors. Then, we adopted the method of Schenker et al., which uses multiple imputations of subject-level health outcomes that are missing for the subjects in EHR. This procedure requires information to link some subjects between two sources and modeling the mechanism of misclassification in EHR as well as modeling inclusion probabilities to both sources.In a simulation study, both estimators yielded negligible bias even when EHR was biased. They performed as well as health survey estimator when EHR bias was large and better than health survey estimator when EHR bias was moderate. It may be challenging to model the misclassification mechanism in real data for the subject-level imputation estimator. We illustrated the methods analyzing six health indicators from 2013 to 14 NYC HANES and the 2013 NYC Macroscope, and a study that linked some subjects in both data sources.RESULTSIn a simulation study, both estimators yielded negligible bias even when EHR was biased. They performed as well as health survey estimator when EHR bias was large and better than health survey estimator when EHR bias was moderate. It may be challenging to model the misclassification mechanism in real data for the subject-level imputation estimator. We illustrated the methods analyzing six health indicators from 2013 to 14 NYC HANES and the 2013 NYC Macroscope, and a study that linked some subjects in both data sources.When a small gold-standard health survey exists, it can serve as a safeguard against potential bias in EHR through the joint analysis of the two sources.CONCLUSIONSWhen a small gold-standard health survey exists, it can serve as a safeguard against potential bias in EHR through the joint analysis of the two sources.
Abstract Background Electronic Health Records (EHR) has been increasingly used as a tool to monitor population health. However, subject-level errors in the records can yield biased estimates of health indicators. There is an urgent need for methods to estimate the prevalence of health indicators using large and real-time EHR while correcting the potential bias. Methods We demonstrate joint analyses of EHR and a smaller gold-standard health survey. We first adopted Mosteller’s method that pools two estimators, among which one is potentially biased. It only requires knowing the prevalence estimates from two data sources and their standard errors. Then, we adopted the method of Schenker et al., which uses multiple imputations of subject-level health outcomes that are missing for the subjects in EHR. This procedure requires information to link some subjects between two sources and modeling the mechanism of misclassification in EHR as well as modeling inclusion probabilities to both sources. Results In a simulation study, both estimators yielded negligible bias even when EHR was biased. They performed as well as health survey estimator when EHR bias was large and better than health survey estimator when EHR bias was moderate. It may be challenging to model the misclassification mechanism in real data for the subject-level imputation estimator. We illustrated the methods analyzing six health indicators from 2013 to 14 NYC HANES and the 2013 NYC Macroscope, and a study that linked some subjects in both data sources. Conclusions When a small gold-standard health survey exists, it can serve as a safeguard against potential bias in EHR through the joint analysis of the two sources.
Background Electronic Health Records (EHR) has been increasingly used as a tool to monitor population health. However, subject-level errors in the records can yield biased estimates of health indicators. There is an urgent need for methods to estimate the prevalence of health indicators using large and real-time EHR while correcting the potential bias. Methods We demonstrate joint analyses of EHR and a smaller gold-standard health survey. We first adopted Mosteller's method that pools two estimators, among which one is potentially biased. It only requires knowing the prevalence estimates from two data sources and their standard errors. Then, we adopted the method of Schenker et al., which uses multiple imputations of subject-level health outcomes that are missing for the subjects in EHR. This procedure requires information to link some subjects between two sources and modeling the mechanism of misclassification in EHR as well as modeling inclusion probabilities to both sources. Results In a simulation study, both estimators yielded negligible bias even when EHR was biased. They performed as well as health survey estimator when EHR bias was large and better than health survey estimator when EHR bias was moderate. It may be challenging to model the misclassification mechanism in real data for the subject-level imputation estimator. We illustrated the methods analyzing six health indicators from 2013 to 14 NYC HANES and the 2013 NYC Macroscope, and a study that linked some subjects in both data sources. Conclusions When a small gold-standard health survey exists, it can serve as a safeguard against potential bias in EHR through the joint analysis of the two sources. Keywords: Big data, Electronic health records, Multiple imputations, Measurement error, Selection bias, Population health surveillance
Electronic Health Records (EHR) has been increasingly used as a tool to monitor population health. However, subject-level errors in the records can yield biased estimates of health indicators. There is an urgent need for methods to estimate the prevalence of health indicators using large and real-time EHR while correcting the potential bias. We demonstrate joint analyses of EHR and a smaller gold-standard health survey. We first adopted Mosteller's method that pools two estimators, among which one is potentially biased. It only requires knowing the prevalence estimates from two data sources and their standard errors. Then, we adopted the method of Schenker et al., which uses multiple imputations of subject-level health outcomes that are missing for the subjects in EHR. This procedure requires information to link some subjects between two sources and modeling the mechanism of misclassification in EHR as well as modeling inclusion probabilities to both sources. In a simulation study, both estimators yielded negligible bias even when EHR was biased. They performed as well as health survey estimator when EHR bias was large and better than health survey estimator when EHR bias was moderate. It may be challenging to model the misclassification mechanism in real data for the subject-level imputation estimator. We illustrated the methods analyzing six health indicators from 2013 to 14 NYC HANES and the 2013 NYC Macroscope, and a study that linked some subjects in both data sources. When a small gold-standard health survey exists, it can serve as a safeguard against potential bias in EHR through the joint analysis of the two sources.
ArticleNumber 77
Audience Academic
Author Kim, Ryung S.
Shankar, Viswanathan
Author_xml – sequence: 1
  givenname: Ryung S.
  surname: Kim
  fullname: Kim, Ryung S.
– sequence: 2
  givenname: Viswanathan
  surname: Shankar
  fullname: Shankar, Viswanathan
BackLink https://www.ncbi.nlm.nih.gov/pubmed/32252642$$D View this record in MEDLINE/PubMed
BookMark eNp9Uk1v1DAUjFARbRf-AAdkiQuXlPg5sR0OSFXFR6UKOMCBk-XYL7tesnaxk0X7D_jZeHfb0q0Q8sHW88zY896cFkc-eCyK57Q6o1Ty14mCFHVZQVVWVdvwkj8qTmgtaAkg5dG983FxmtKyqqiQjD8pjhlAA7yGk-L3l4hrPaA3SDCNbqVHFzzpNmQZnB_JlJCEnnRuTqweNdHekgXqYVyQNMU1bt4QTSyugk9j3HPTONlNJjo_JzigGWPwztyyIpoQbSLOk0_4i3wP8Qcxbtw8LR73ekj47GafFd_ev_t68bG8-vzh8uL8qjQNZ2OpO9sI2YMxomFtdkOZkYCy1RYktqJvuTUWaN3yXnPEnjWdtFWDYBrT8Y7Nisu9rg16qa5jdhw3KmindoUQ50rH0ZkBVaNrYfum4Tq3GSh2ALSDts9HDjWyrPV2r3U9dSu0Bn3uwXAgenjj3ULNw1oJygSjPAu8uhGI4eeU-69WLhkcBu0xTEkBkwJ4xfPYZsXLB9BlmKLPrcqoljIqKcBf1DzPVDnfh_yu2Yqqcw6CgWypyKizf6DyyoN0Joesd7l-QHhx3-idw9scZQDsASaGlCL2dxBaqW1Y1T6sKodV7cKqtu7lA1IOwi5D-Ttu-B_1D8FB7oQ
CitedBy_id crossref_primary_10_1093_ajh_hpad081
crossref_primary_10_1186_s12889_022_13809_2
Cites_doi 10.1093/biomet/asr063
10.1002/sim.3809
10.1093/biomet/asx073
10.1093/biomet/asn048
10.1111/j.1467-985X.2010.00648.x
10.18637/jss.v009.i08
10.1007/s40300-017-0124-0
10.1017/CBO9780511790942
10.1198/016214506000001293
10.1002/cjs.11153
10.5888/pcd15.180371
10.1080/01621459.1948.10483259
10.1093/biomet/86.4.948
10.18637/jss.v045.i03
10.1080/01621459.1993.10594298
10.1089/pop.2014.0093
10.1016/j.pmedr.2015.06.019
10.1002/sim.6173
ContentType Journal Article
Copyright COPYRIGHT 2020 BioMed Central Ltd.
2020. This work is licensed under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
The Author(s) 2020
Copyright_xml – notice: COPYRIGHT 2020 BioMed Central Ltd.
– notice: 2020. This work is licensed under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
– notice: The Author(s) 2020
DBID AAYXX
CITATION
NPM
3V.
7X7
7XB
88E
8FI
8FJ
8FK
ABUWG
AFKRA
AZQEC
BENPR
CCPQU
DWQXO
FYUFA
GHDGH
K9.
M0S
M1P
PHGZM
PHGZT
PIMPY
PJZUB
PKEHL
PPXIY
PQEST
PQQKQ
PQUKI
PRINS
7X8
5PM
DOA
DOI 10.1186/s12874-020-00956-6
DatabaseName CrossRef
PubMed
ProQuest Central (Corporate)
Health & Medical Collection
ProQuest Central (purchase pre-March 2016)
Medical Database (Alumni Edition)
Hospital Premium Collection
Hospital Premium Collection (Alumni Edition)
ProQuest Central (Alumni) (purchase pre-March 2016)
ProQuest Central (Alumni)
ProQuest Central UK/Ireland
ProQuest Central Essentials
ProQuest Central
ProQuest One
ProQuest Central Korea
Health Research Premium Collection
Health Research Premium Collection (Alumni)
ProQuest Health & Medical Complete (Alumni)
ProQuest Health & Medical Collection
Medical Database
ProQuest Central Premium
ProQuest One Academic (New)
Publicly Available Content Database
ProQuest Health & Medical Research Collection
ProQuest One Academic Middle East (New)
ProQuest One Health & Nursing
ProQuest One Academic Eastern Edition (DO NOT USE)
ProQuest One Academic
ProQuest One Academic UKI Edition
ProQuest Central China
MEDLINE - Academic
PubMed Central (Full Participant titles)
DOAJ Directory of Open Access Journals
DatabaseTitle CrossRef
PubMed
Publicly Available Content Database
ProQuest One Academic Middle East (New)
ProQuest Central Essentials
ProQuest Health & Medical Complete (Alumni)
ProQuest Central (Alumni Edition)
ProQuest One Community College
ProQuest One Health & Nursing
ProQuest Central China
ProQuest Central
Health Research Premium Collection
Health and Medicine Complete (Alumni Edition)
ProQuest Central Korea
Health & Medical Research Collection
ProQuest Central (New)
ProQuest Medical Library (Alumni)
ProQuest One Academic Eastern Edition
ProQuest Hospital Collection
Health Research Premium Collection (Alumni)
ProQuest Hospital Collection (Alumni)
ProQuest Health & Medical Complete
ProQuest Medical Library
ProQuest One Academic UKI Edition
ProQuest One Academic
ProQuest One Academic (New)
ProQuest Central (Alumni)
MEDLINE - Academic
DatabaseTitleList
Publicly Available Content Database
MEDLINE - Academic


PubMed
Database_xml – sequence: 1
  dbid: DOA
  name: DOAJ Directory of Open Access Journals
  url: https://www.doaj.org/
  sourceTypes: Open Website
– sequence: 2
  dbid: NPM
  name: PubMed
  url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
– sequence: 3
  dbid: BENPR
  name: ProQuest Central
  url: https://www.proquest.com/central
  sourceTypes: Aggregation Database
DeliveryMethod fulltext_linktorsrc
Discipline Medicine
Public Health
EISSN 1471-2288
EndPage 10
ExternalDocumentID oai_doaj_org_article_5a47df556a87421eb221b29f21e624e3
PMC7137316
A627328917
32252642
10_1186_s12874_020_00956_6
Genre Journal Article
GeographicLocations United States
GeographicLocations_xml – name: United States
GrantInformation_xml – fundername: New York City Department of Health and Mental Hygiene (US)
  grantid: The study began as the authors worked as paid statistical consultants for NYC DOHMH to analyze NYC Macroscope jointly with NYC HANES. When the work ended, the authors continued further simulation studies to write the manuscript.
– fundername: New York City Department of Health and Mental Hygine
  grantid: The study began as the authors worked as paid statistical consultants for NYC DOHMH to analyze NYC Macroscope jointly with NYC HANES. When the work ended, the authors continued further simulation studies to write the manuscript.
– fundername: ;
  grantid: The study began as the authors worked as paid statistical consultants for NYC DOHMH to analyze NYC Macroscope jointly with NYC HANES. When the work ended, the authors continued further simulation studies to write the manuscript.
GroupedDBID ---
0R~
23N
2WC
53G
5VS
6J9
6PF
7X7
88E
8FI
8FJ
AAFWJ
AAJSJ
AASML
AAWTL
AAYXX
ABDBF
ABUWG
ACGFO
ACGFS
ACIHN
ACUHS
ADBBV
ADRAZ
ADUKV
AEAQA
AENEX
AFKRA
AFPKN
AHBYD
AHMBA
AHYZX
ALIPV
ALMA_UNASSIGNED_HOLDINGS
AMKLP
AMTXH
AOIJS
BAPOH
BAWUL
BCNDV
BENPR
BFQNJ
BMC
BPHCQ
BVXVI
C6C
CCPQU
CITATION
CS3
DIK
DU5
E3Z
EAD
EAP
EAS
EBD
EBLON
EBS
EMB
EMK
EMOBN
ESX
F5P
FYUFA
GROUPED_DOAJ
GX1
HMCUK
HYE
IAO
IHR
INH
INR
ITC
KQ8
M1P
M48
MK0
M~E
O5R
O5S
OK1
OVT
P2P
PGMZT
PHGZM
PHGZT
PIMPY
PQQKQ
PROAC
PSQYO
RBZ
RNS
ROL
RPM
RSV
SMD
SOJ
SV3
TR2
TUS
UKHRP
W2D
WOQ
WOW
XSB
-A0
3V.
ACRMQ
ADINQ
C24
NPM
PMFND
7XB
8FK
AZQEC
DWQXO
K9.
PJZUB
PKEHL
PPXIY
PQEST
PQUKI
PRINS
7X8
5PM
PUEGO
ID FETCH-LOGICAL-c563t-abd578f2cc753983613c82e89ad28e97f96dcd21496fa6eef35b8d05e2c5cb6b3
IEDL.DBID M48
ISSN 1471-2288
IngestDate Wed Aug 27 01:31:31 EDT 2025
Thu Aug 21 18:24:47 EDT 2025
Fri Jul 11 00:35:49 EDT 2025
Fri Jul 25 07:21:27 EDT 2025
Tue Jun 17 21:25:49 EDT 2025
Tue Jun 10 20:39:10 EDT 2025
Thu Jan 02 22:59:11 EST 2025
Tue Jul 01 04:30:56 EDT 2025
Thu Apr 24 22:58:54 EDT 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 1
Keywords Selection bias
Population health surveillance
Big data
Multiple imputations
Electronic health records
Measurement error
Language English
License Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c563t-abd578f2cc753983613c82e89ad28e97f96dcd21496fa6eef35b8d05e2c5cb6b3
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
OpenAccessLink http://journals.scholarsportal.info/openUrl.xqy?doi=10.1186/s12874-020-00956-6
PMID 32252642
PQID 2391318122
PQPubID 42579
PageCount 10
ParticipantIDs doaj_primary_oai_doaj_org_article_5a47df556a87421eb221b29f21e624e3
pubmedcentral_primary_oai_pubmedcentral_nih_gov_7137316
proquest_miscellaneous_2387260678
proquest_journals_2391318122
gale_infotracmisc_A627328917
gale_infotracacademiconefile_A627328917
pubmed_primary_32252642
crossref_primary_10_1186_s12874_020_00956_6
crossref_citationtrail_10_1186_s12874_020_00956_6
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 2020-04-06
PublicationDateYYYYMMDD 2020-04-06
PublicationDate_xml – month: 04
  year: 2020
  text: 2020-04-06
  day: 06
PublicationDecade 2020
PublicationPlace England
PublicationPlace_xml – name: England
– name: London
PublicationTitle BMC medical research methodology
PublicationTitleAlternate BMC Med Res Methodol
PublicationYear 2020
Publisher BioMed Central Ltd
BioMed Central
BMC
Publisher_xml – name: BioMed Central Ltd
– name: BioMed Central
– name: BMC
References LE Thorpe (956_CR24) 2015; 2
R Newton-Dame (956_CR2) 2016; 4
TE Raghunathan (956_CR10) 2007; 102
A Gelman (956_CR17) 2011
MM Paul (956_CR1) 2015; 18
LE Thorpe (956_CR3) 2016; 4
S Park (956_CR13) 2017; 75
ME Thompson (956_CR6) 2008; 34
Z Wang (956_CR19) 2017; 105
Y He (956_CR16) 2014; 20
R Valliant (956_CR26) 1993; 88
G Manzi (956_CR8) 2011; 174
A Gelman (956_CR20) 2006
T Lumley (956_CR25) 2004; 9
LMR Ybarra (956_CR11) 2008; 95
KH McVeigh (956_CR5) 2017; 5
N Schenker (956_CR14) 2010; 29
J Barnard (956_CR22) 1999; 86
PY Chan (956_CR27) 2018; 15
KH McVeigh (956_CR4) 2016; 4
TE Raghunathan (956_CR28) 2006; 90
F Mosteller (956_CR9) 1948; 43
J Kim (956_CR12) 2012; 99
R Core Team (956_CR18) 2016
A Gelman (956_CR15) 1998; 93
S van Buuren (956_CR23) 2011; 45
SL Lohr (956_CR7) 2012; 40
956_CR21
References_xml – volume: 99
  start-page: 85
  issue: 1
  year: 2012
  ident: 956_CR12
  publication-title: Biometrika
  doi: 10.1093/biomet/asr063
– volume: 4
  start-page: 1267
  issue: 1
  year: 2016
  ident: 956_CR4
  publication-title: EGEMS (Washington, DC)
– volume: 29
  start-page: 533
  issue: 5
  year: 2010
  ident: 956_CR14
  publication-title: Stat Med
  doi: 10.1002/sim.3809
– volume: 105
  start-page: 91
  issue: 1
  year: 2017
  ident: 956_CR19
  publication-title: Biometrika
  doi: 10.1093/biomet/asx073
– volume: 95
  start-page: 919
  issue: 4
  year: 2008
  ident: 956_CR11
  publication-title: Biometrika
  doi: 10.1093/biomet/asn048
– volume: 4
  start-page: 1265
  issue: 1
  year: 2016
  ident: 956_CR2
  publication-title: EGEMS (Washington, DC)
– volume: 174
  start-page: 31
  year: 2011
  ident: 956_CR8
  publication-title: J Royal Stat Soc Ser A
  doi: 10.1111/j.1467-985X.2010.00648.x
– volume: 93
  start-page: 869
  issue: 443
  year: 1998
  ident: 956_CR15
  publication-title: J Am Stat Assoc
– volume: 9
  start-page: 19
  issue: 8
  year: 2004
  ident: 956_CR25
  publication-title: J Stat Softw
  doi: 10.18637/jss.v009.i08
– volume: 75
  start-page: 345
  year: 2017
  ident: 956_CR13
  publication-title: Metron
  doi: 10.1007/s40300-017-0124-0
– volume-title: Arm : data analysis using regression and multilevel/hierarchical models
  year: 2011
  ident: 956_CR17
– volume-title: Data analysis using regression and multilevel/hierarchical models
  year: 2006
  ident: 956_CR20
  doi: 10.1017/CBO9780511790942
– volume: 102
  start-page: 474
  issue: 478
  year: 2007
  ident: 956_CR10
  publication-title: J Am Stat Assoc
  doi: 10.1198/016214506000001293
– volume: 40
  start-page: 679
  issue: 4
  year: 2012
  ident: 956_CR7
  publication-title: Can J Stat
  doi: 10.1002/cjs.11153
– volume: 15
  year: 2018
  ident: 956_CR27
  publication-title: Prev Chronic Dis
  doi: 10.5888/pcd15.180371
– volume: 43
  start-page: 231
  issue: 242
  year: 1948
  ident: 956_CR9
  publication-title: J Am Stat Assoc
  doi: 10.1080/01621459.1948.10483259
– volume: 5
  start-page: 25
  issue: 1
  year: 2017
  ident: 956_CR5
  publication-title: EGEMS (Washington, DC)
– ident: 956_CR21
– volume: 4
  start-page: 1266
  issue: 1
  year: 2016
  ident: 956_CR3
  publication-title: EGEMS (Washington, DC)
– volume: 86
  start-page: 948
  issue: 4
  year: 1999
  ident: 956_CR22
  publication-title: Biometrika
  doi: 10.1093/biomet/86.4.948
– volume: 45
  start-page: 67
  issue: 3
  year: 2011
  ident: 956_CR23
  publication-title: J Stat Softw
  doi: 10.18637/jss.v045.i03
– volume: 34
  start-page: 131
  issue: 2
  year: 2008
  ident: 956_CR6
  publication-title: Surv Methodol
– volume: 88
  start-page: 89
  issue: 421
  year: 1993
  ident: 956_CR26
  publication-title: J Am Stat Assoc
  doi: 10.1080/01621459.1993.10594298
– volume-title: R: a language and environment for statistical computing
  year: 2016
  ident: 956_CR18
– volume: 18
  start-page: 209
  issue: 3
  year: 2015
  ident: 956_CR1
  publication-title: Popul Health Manag
  doi: 10.1089/pop.2014.0093
– volume: 2
  start-page: 580
  year: 2015
  ident: 956_CR24
  publication-title: Prev Med Rep
  doi: 10.1016/j.pmedr.2015.06.019
– volume: 90
  start-page: 515
  year: 2006
  ident: 956_CR28
  publication-title: Allg Stat Arch
– volume: 20
  start-page: 3710
  issue: 33
  year: 2014
  ident: 956_CR16
  publication-title: Stat Med
  doi: 10.1002/sim.6173
SSID ssj0017836
Score 2.2815576
SecondaryResourceType review_article
Snippet Electronic Health Records (EHR) has been increasingly used as a tool to monitor population health. However, subject-level errors in the records can yield...
Background Electronic Health Records (EHR) has been increasingly used as a tool to monitor population health. However, subject-level errors in the records can...
Abstract Background Electronic Health Records (EHR) has been increasingly used as a tool to monitor population health. However, subject-level errors in the...
SourceID doaj
pubmedcentral
proquest
gale
pubmed
crossref
SourceType Open Website
Open Access Repository
Aggregation Database
Index Database
Enrichment Source
StartPage 77
SubjectTerms Bias
Big Data
Electronic health records
Electronic records
Estimates
Health surveillance
Health surveys
Immunization
Influenza
Measurement error
Medical records
Medical research
Multiple imputations
Population health surveillance
Primary care
Public health
Research methodology
Sample variance
Selection bias
Software
Victimization
SummonAdditionalLinks – databaseName: DOAJ Directory of Open Access Journals
  dbid: DOA
  link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV3fi9QwEA5yD-KL-NvqKSMIPki5TdpMW99O8TiE88mDewtJmpwr2pXrrnD_gX-2M0l33SLoi2_LJoEmM5OZr535RoiXHh3HEchtUquy9oglOcG6XKCSVVOFWmuuHT77iKfn9YcLfbHX6otzwjI9cD64I23rpo9ao20JxUkCgko61UX6iaoOieeTfN4WTE3fD7g2YVsi0-LRKJnWvWSotEjMezhzQ4mt_887ec8pzRMm9zzQyR1xewod4Tg_8l1xIwz3xM2z6eP4ffGT6ZhsKiIC5s7IRYngruHLajmsYTMGWEVwy0vgvFCwQw-5DBLGzdWPcP0GLPThG0eMWS8gkc8C58Zfwu-GOdtV-QXPCMsB6K4EZn8CT1H9A3F-8v7Tu9NyarRQeo3VurSuJ8ONynsCLx0dnax8q0Lb2V61oWtih73vFYEpjBZDiJV2bb_QQXntHbrqoTgYVkN4LIBEZWUkITeWoKP0NnpnI6E61aJVviuE3J678RMLOTfD-GoSGmnRZFkZkpVJsjJYiNe7Nd8zB8dfZ79lce5mMn92-oO0ykxaZf6lVYV4xcpg2Mrp8bydihVok8yXZY5RMcsRYd1CHM5mknX6-fBWncx0O4xGVZ2sOLRShXixG-aVnPE2hNWG57QNYU2KJQrxKGvfbkt8CVMgS6ubmV7O9jwfGZafE3d4wxYo8cn_OKSn4pZKJsWmfSgO1leb8IxCtLV7nqzxF4JFN40
  priority: 102
  providerName: Directory of Open Access Journals
– databaseName: Health & Medical Collection
  dbid: 7X7
  link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV3di9QwEA96gggien5VTxlB8EHKbZI2bX2RUzwO4XzyYN9CkqbrHtqe213h_gP_bGfStHdFuLfSZCDpfGQmnfkNY2-dsuRHKGqTKtPMKZXiIZilCyW4LKTP8pxqh0-_qZOz7OsyX8YLtz6mVY42MRjqunN0R34oZMUlHUfi48XvlLpG0d_V2ELjNrtD0GWU0lUsp4CLU4XCWChTqsOeE7h7SgHTIuDvqdlhFDD7_7fM146medrktXPo-CF7EB1IOBo4_ojd8u0-u3saf5Hvs_vDRRwM9UWP2V8CaTKhtAgIUWMoVQR7Cefdut3CrvfQNWDXK6BsUTBtDUNxJPS7zR9_-QEM1P4X-ZGDtECApAXKmF_BVRudkWq49ulh3QJaUCBMKHDo6z9hZ8dfvn8-SWP7hdTlSm5TY2tU50Y4hyFNhZ-SS1cKX1amFqWviqZStasFhliqMcr7Rua2rBe5Fy53Vln5lO21XeufM6ib3PAGWV8YDCi5M42zpsFYT5TKCFcljI980C5ik1OLjJ86xCil0gPvNPJOB95plbD3E83FgMxx4-xPxN5pJqFqhxfdZqWjkurcZAUuNVcGqQX3VghuRdXgoxKZlwl7R8KhSfdxec7EEgbcJKFo6SMlCPsII-CEHcxmos66-fAoXjrajF5fSXjC3kzDREl5cK3vdjSnLDACRQ8jYc8GaZy2RKYZ3VukLmZyOtvzfKRd_wiI4gXpJVcvbl7WS3ZPBOUhVT5ge9vNzr9Cl2xrXwe9-wfVUzSj
  priority: 102
  providerName: ProQuest
Title Prevalence estimation by joint use of big data and health survey: a demonstration study using electronic health records in New York city
URI https://www.ncbi.nlm.nih.gov/pubmed/32252642
https://www.proquest.com/docview/2391318122
https://www.proquest.com/docview/2387260678
https://pubmed.ncbi.nlm.nih.gov/PMC7137316
https://doaj.org/article/5a47df556a87421eb221b29f21e624e3
Volume 20
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1bi9QwFD7sBZZ9Ee9W1yGC4INUJ2mbtoLIjuyyCLPI4sDgS0jSdBxZW52LOP_An-05aTu7xcUHX4YyyYEm-b7mnDbnOwDPrTTkR0gqkxqFsZUyxE0wDodS8CiNXJwklDs8Ppdnk_jDNJnuQFfuqJ3A5Y2hHdWTmiwuX_36sXmHhH_rCZ_J10tOou0hBUJDr6snd2Efd6aUiDqOr74qUMZClzhzo90hHBDA0UkQvX3Ky_n__dC-tmv1T1Re26JOb8Ot1rdkxw0Y7sCOq-7Cwbj9en4PfpNek_ZZRozENZqsRWY27Gs9r1ZsvXSsLpmZzxgdHGW6KliTJ8mW68VPt3nDNCvcN3IpG-Awr07L6PD8jF1V1OmsmjdASzavGD5MGclDMYtu_32YnJ58en8WtpUYQpvIaBVqUyCzS2EtRjc5ziKPbCZclutCZC5Py1wWthAYbclSS-fKKDFZMUycsIk10kQPYK-qK_cIWFEmmpeIglRjbMmtLq3RJYZ9IpNa2DwA3s27sq1MOVXLuFQ-XMmkapZN4bIpv2xKBvBya_O9Een4Z-8RLee2Jwls-z_qxUy1fFWJjlO81URqtBbcGSG4EXmJl1LELgrgBYFBETDx9qxusxlwkCSopY6lIBkkDIYDOOr1RPrafnMHJ9WhX4ko5xH5XiKAZ9tmsqQjcZWr19QnSzEYRWcjgIcN-rZD6kAcQNrDZW_M_ZZq_sWLi6dEUS4f_7flEzgUnlJE-CPYWy3W7ik6biszgN10mg5gf3Ry_vFi4F9_DDxD8fdi9PkPPHNEuw
linkProvider Scholars Portal
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1ta9RAEB7qFVQQ0foWrTqC4gcJvWySTSKItNpytb1DpIV-2242m_NEk3ovyv0Df42_0Zm8XBuEfuu3I7tz7DKz87I78wzASyNT9iMkt0n13cBI6ZIRDNy-FJ4f-TYIQ64dHo7k4Dj4dBKerMHfthaG0ypbnVgp6qw0fEe-JfzE89kcifdnP13uGsWvq20LjVosDuzyN4Vss3f7H4m_r4TY2z36MHCbrgKuCaU_d3WakZTmwhjy1JPYJ3tmYmHjRGcitkmUJzIzmaDIQeZaWpv7YRpn_dAKE5pUpj797zVYD3wKZXqwvrM7-vxl9W7BNRFtaU4st2Yew8m7HKL1K8Q_2TF_VZeA_23BBWPYTdS8YPn27sDtxmXF7VrG7sKaLTbg-rB5lN-AW_XVH9YVTffgD8NC6aqYCRnDoy6OxHSJ38pJMcfFzGKZYzoZI-enoi4yrMsxcbaY_rLLt6gxsz_Yc63lEysQXOQc_TGeN-5pqeqLphlOCiSdjYxChYaii_twfCWseQC9oizsI8AsD7WXk7BFmkJYz-jcpDqn6FLEUguTOOC1fFCmQUPnphzfVRUVxVLVvFPEO1XxTkkH3qxozmoskEtn7zB7VzMZx7v6UE7HqlELKtRBREsNpSZq4dlUCC8VSU4_pQis78BrFg7F2oaWZ3RTNEGbZNwutS0Foy1RzO3AZmcmaQnTHW7FSzVaaqbOz5QDL1bDTMmZd4UtFzwnjijmJZ_GgYe1NK62xMaAHGqijjpy2tlzd6SYfK0wzCPWBJ58fPmynsONwdHwUB3ujw6ewE1RHSRWJJvQm08X9ik5hPP0WXMKEU6v-uD_A41ldBA
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Prevalence+estimation+by+joint+use+of+big+data+and+health+survey%3A+a+demonstration+study+using+electronic+health+records+in+New+York+city&rft.jtitle=BMC+medical+research+methodology&rft.au=Kim%2C+Ryung+S.&rft.au=Shankar%2C+Viswanathan&rft.date=2020-04-06&rft.pub=BioMed+Central&rft.eissn=1471-2288&rft.volume=20&rft_id=info:doi/10.1186%2Fs12874-020-00956-6&rft_id=info%3Apmid%2F32252642&rft.externalDocID=PMC7137316
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1471-2288&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1471-2288&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1471-2288&client=summon