Prevalence estimation by joint use of big data and health survey: a demonstration study using electronic health records in New York city

Electronic Health Records (EHR) has been increasingly used as a tool to monitor population health. However, subject-level errors in the records can yield biased estimates of health indicators. There is an urgent need for methods to estimate the prevalence of health indicators using large and real-ti...

Full description

Saved in:

Bibliographic Details
Published in	BMC medical research methodology Vol. 20; no. 1; pp. 77 - 10
Main Authors	Kim, Ryung S., Shankar, Viswanathan
Format	Journal Article
Language	English
Published	England BioMed Central Ltd 06.04.2020 BioMed Central BMC
Subjects	Bias Big Data Electronic health records Electronic records Estimates Health surveillance Health surveys Immunization Influenza Measurement error Medical records Medical research Multiple imputations Population health surveillance Primary care Public health Research methodology Sample variance Selection bias Software Victimization United States Selection bias Population health surveillance Big data Multiple imputations Electronic health records Measurement error
Online Access	Get full text

Cover

Loading…

Abstract	Electronic Health Records (EHR) has been increasingly used as a tool to monitor population health. However, subject-level errors in the records can yield biased estimates of health indicators. There is an urgent need for methods to estimate the prevalence of health indicators using large and real-time EHR while correcting the potential bias. We demonstrate joint analyses of EHR and a smaller gold-standard health survey. We first adopted Mosteller's method that pools two estimators, among which one is potentially biased. It only requires knowing the prevalence estimates from two data sources and their standard errors. Then, we adopted the method of Schenker et al., which uses multiple imputations of subject-level health outcomes that are missing for the subjects in EHR. This procedure requires information to link some subjects between two sources and modeling the mechanism of misclassification in EHR as well as modeling inclusion probabilities to both sources. In a simulation study, both estimators yielded negligible bias even when EHR was biased. They performed as well as health survey estimator when EHR bias was large and better than health survey estimator when EHR bias was moderate. It may be challenging to model the misclassification mechanism in real data for the subject-level imputation estimator. We illustrated the methods analyzing six health indicators from 2013 to 14 NYC HANES and the 2013 NYC Macroscope, and a study that linked some subjects in both data sources. When a small gold-standard health survey exists, it can serve as a safeguard against potential bias in EHR through the joint analysis of the two sources.
AbstractList	Electronic Health Records (EHR) has been increasingly used as a tool to monitor population health. However, subject-level errors in the records can yield biased estimates of health indicators. There is an urgent need for methods to estimate the prevalence of health indicators using large and real-time EHR while correcting the potential bias. We demonstrate joint analyses of EHR and a smaller gold-standard health survey. We first adopted Mosteller's method that pools two estimators, among which one is potentially biased. It only requires knowing the prevalence estimates from two data sources and their standard errors. Then, we adopted the method of Schenker et al., which uses multiple imputations of subject-level health outcomes that are missing for the subjects in EHR. This procedure requires information to link some subjects between two sources and modeling the mechanism of misclassification in EHR as well as modeling inclusion probabilities to both sources. In a simulation study, both estimators yielded negligible bias even when EHR was biased. They performed as well as health survey estimator when EHR bias was large and better than health survey estimator when EHR bias was moderate. It may be challenging to model the misclassification mechanism in real data for the subject-level imputation estimator. We illustrated the methods analyzing six health indicators from 2013 to 14 NYC HANES and the 2013 NYC Macroscope, and a study that linked some subjects in both data sources. When a small gold-standard health survey exists, it can serve as a safeguard against potential bias in EHR through the joint analysis of the two sources. Background Electronic Health Records (EHR) has been increasingly used as a tool to monitor population health. However, subject-level errors in the records can yield biased estimates of health indicators. There is an urgent need for methods to estimate the prevalence of health indicators using large and real-time EHR while correcting the potential bias. Methods We demonstrate joint analyses of EHR and a smaller gold-standard health survey. We first adopted Mosteller’s method that pools two estimators, among which one is potentially biased. It only requires knowing the prevalence estimates from two data sources and their standard errors. Then, we adopted the method of Schenker et al., which uses multiple imputations of subject-level health outcomes that are missing for the subjects in EHR. This procedure requires information to link some subjects between two sources and modeling the mechanism of misclassification in EHR as well as modeling inclusion probabilities to both sources. Results In a simulation study, both estimators yielded negligible bias even when EHR was biased. They performed as well as health survey estimator when EHR bias was large and better than health survey estimator when EHR bias was moderate. It may be challenging to model the misclassification mechanism in real data for the subject-level imputation estimator. We illustrated the methods analyzing six health indicators from 2013 to 14 NYC HANES and the 2013 NYC Macroscope, and a study that linked some subjects in both data sources. Conclusions When a small gold-standard health survey exists, it can serve as a safeguard against potential bias in EHR through the joint analysis of the two sources. Electronic Health Records (EHR) has been increasingly used as a tool to monitor population health. However, subject-level errors in the records can yield biased estimates of health indicators. There is an urgent need for methods to estimate the prevalence of health indicators using large and real-time EHR while correcting the potential bias.BACKGROUNDElectronic Health Records (EHR) has been increasingly used as a tool to monitor population health. However, subject-level errors in the records can yield biased estimates of health indicators. There is an urgent need for methods to estimate the prevalence of health indicators using large and real-time EHR while correcting the potential bias.We demonstrate joint analyses of EHR and a smaller gold-standard health survey. We first adopted Mosteller's method that pools two estimators, among which one is potentially biased. It only requires knowing the prevalence estimates from two data sources and their standard errors. Then, we adopted the method of Schenker et al., which uses multiple imputations of subject-level health outcomes that are missing for the subjects in EHR. This procedure requires information to link some subjects between two sources and modeling the mechanism of misclassification in EHR as well as modeling inclusion probabilities to both sources.METHODSWe demonstrate joint analyses of EHR and a smaller gold-standard health survey. We first adopted Mosteller's method that pools two estimators, among which one is potentially biased. It only requires knowing the prevalence estimates from two data sources and their standard errors. Then, we adopted the method of Schenker et al., which uses multiple imputations of subject-level health outcomes that are missing for the subjects in EHR. This procedure requires information to link some subjects between two sources and modeling the mechanism of misclassification in EHR as well as modeling inclusion probabilities to both sources.In a simulation study, both estimators yielded negligible bias even when EHR was biased. They performed as well as health survey estimator when EHR bias was large and better than health survey estimator when EHR bias was moderate. It may be challenging to model the misclassification mechanism in real data for the subject-level imputation estimator. We illustrated the methods analyzing six health indicators from 2013 to 14 NYC HANES and the 2013 NYC Macroscope, and a study that linked some subjects in both data sources.RESULTSIn a simulation study, both estimators yielded negligible bias even when EHR was biased. They performed as well as health survey estimator when EHR bias was large and better than health survey estimator when EHR bias was moderate. It may be challenging to model the misclassification mechanism in real data for the subject-level imputation estimator. We illustrated the methods analyzing six health indicators from 2013 to 14 NYC HANES and the 2013 NYC Macroscope, and a study that linked some subjects in both data sources.When a small gold-standard health survey exists, it can serve as a safeguard against potential bias in EHR through the joint analysis of the two sources.CONCLUSIONSWhen a small gold-standard health survey exists, it can serve as a safeguard against potential bias in EHR through the joint analysis of the two sources. Abstract Background Electronic Health Records (EHR) has been increasingly used as a tool to monitor population health. However, subject-level errors in the records can yield biased estimates of health indicators. There is an urgent need for methods to estimate the prevalence of health indicators using large and real-time EHR while correcting the potential bias. Methods We demonstrate joint analyses of EHR and a smaller gold-standard health survey. We first adopted Mosteller’s method that pools two estimators, among which one is potentially biased. It only requires knowing the prevalence estimates from two data sources and their standard errors. Then, we adopted the method of Schenker et al., which uses multiple imputations of subject-level health outcomes that are missing for the subjects in EHR. This procedure requires information to link some subjects between two sources and modeling the mechanism of misclassification in EHR as well as modeling inclusion probabilities to both sources. Results In a simulation study, both estimators yielded negligible bias even when EHR was biased. They performed as well as health survey estimator when EHR bias was large and better than health survey estimator when EHR bias was moderate. It may be challenging to model the misclassification mechanism in real data for the subject-level imputation estimator. We illustrated the methods analyzing six health indicators from 2013 to 14 NYC HANES and the 2013 NYC Macroscope, and a study that linked some subjects in both data sources. Conclusions When a small gold-standard health survey exists, it can serve as a safeguard against potential bias in EHR through the joint analysis of the two sources. Background Electronic Health Records (EHR) has been increasingly used as a tool to monitor population health. However, subject-level errors in the records can yield biased estimates of health indicators. There is an urgent need for methods to estimate the prevalence of health indicators using large and real-time EHR while correcting the potential bias. Methods We demonstrate joint analyses of EHR and a smaller gold-standard health survey. We first adopted Mosteller's method that pools two estimators, among which one is potentially biased. It only requires knowing the prevalence estimates from two data sources and their standard errors. Then, we adopted the method of Schenker et al., which uses multiple imputations of subject-level health outcomes that are missing for the subjects in EHR. This procedure requires information to link some subjects between two sources and modeling the mechanism of misclassification in EHR as well as modeling inclusion probabilities to both sources. Results In a simulation study, both estimators yielded negligible bias even when EHR was biased. They performed as well as health survey estimator when EHR bias was large and better than health survey estimator when EHR bias was moderate. It may be challenging to model the misclassification mechanism in real data for the subject-level imputation estimator. We illustrated the methods analyzing six health indicators from 2013 to 14 NYC HANES and the 2013 NYC Macroscope, and a study that linked some subjects in both data sources. Conclusions When a small gold-standard health survey exists, it can serve as a safeguard against potential bias in EHR through the joint analysis of the two sources. Keywords: Big data, Electronic health records, Multiple imputations, Measurement error, Selection bias, Population health surveillance Electronic Health Records (EHR) has been increasingly used as a tool to monitor population health. However, subject-level errors in the records can yield biased estimates of health indicators. There is an urgent need for methods to estimate the prevalence of health indicators using large and real-time EHR while correcting the potential bias. We demonstrate joint analyses of EHR and a smaller gold-standard health survey. We first adopted Mosteller's method that pools two estimators, among which one is potentially biased. It only requires knowing the prevalence estimates from two data sources and their standard errors. Then, we adopted the method of Schenker et al., which uses multiple imputations of subject-level health outcomes that are missing for the subjects in EHR. This procedure requires information to link some subjects between two sources and modeling the mechanism of misclassification in EHR as well as modeling inclusion probabilities to both sources. In a simulation study, both estimators yielded negligible bias even when EHR was biased. They performed as well as health survey estimator when EHR bias was large and better than health survey estimator when EHR bias was moderate. It may be challenging to model the misclassification mechanism in real data for the subject-level imputation estimator. We illustrated the methods analyzing six health indicators from 2013 to 14 NYC HANES and the 2013 NYC Macroscope, and a study that linked some subjects in both data sources. When a small gold-standard health survey exists, it can serve as a safeguard against potential bias in EHR through the joint analysis of the two sources.
ArticleNumber	77
Audience	Academic
Author	Kim, Ryung S. Shankar, Viswanathan
Author_xml	– sequence: 1 givenname: Ryung S. surname: Kim fullname: Kim, Ryung S. – sequence: 2 givenname: Viswanathan surname: Shankar fullname: Shankar, Viswanathan
BackLink	https://www.ncbi.nlm.nih.gov/pubmed/32252642$$D View this record in MEDLINE/PubMed
BookMark	eNp9Uk1v1DAUjFARbRf-AAdkiQuXlPg5sR0OSFXFR6UKOMCBk-XYL7tesnaxk0X7D_jZeHfb0q0Q8sHW88zY896cFkc-eCyK57Q6o1Ty14mCFHVZQVVWVdvwkj8qTmgtaAkg5dG983FxmtKyqqiQjD8pjhlAA7yGk-L3l4hrPaA3SDCNbqVHFzzpNmQZnB_JlJCEnnRuTqweNdHekgXqYVyQNMU1bt4QTSyugk9j3HPTONlNJjo_JzigGWPwztyyIpoQbSLOk0_4i3wP8Qcxbtw8LR73ekj47GafFd_ev_t68bG8-vzh8uL8qjQNZ2OpO9sI2YMxomFtdkOZkYCy1RYktqJvuTUWaN3yXnPEnjWdtFWDYBrT8Y7Nisu9rg16qa5jdhw3KmindoUQ50rH0ZkBVaNrYfum4Tq3GSh2ALSDts9HDjWyrPV2r3U9dSu0Bn3uwXAgenjj3ULNw1oJygSjPAu8uhGI4eeU-69WLhkcBu0xTEkBkwJ4xfPYZsXLB9BlmKLPrcqoljIqKcBf1DzPVDnfh_yu2Yqqcw6CgWypyKizf6DyyoN0Joesd7l-QHhx3-idw9scZQDsASaGlCL2dxBaqW1Y1T6sKodV7cKqtu7lA1IOwi5D-Ttu-B_1D8FB7oQ
CitedBy_id	crossref_primary_10_1093_ajh_hpad081 crossref_primary_10_1186_s12889_022_13809_2
Cites_doi	10.1093/biomet/asr063 10.1002/sim.3809 10.1093/biomet/asx073 10.1093/biomet/asn048 10.1111/j.1467-985X.2010.00648.x 10.18637/jss.v009.i08 10.1007/s40300-017-0124-0 10.1017/CBO9780511790942 10.1198/016214506000001293 10.1002/cjs.11153 10.5888/pcd15.180371 10.1080/01621459.1948.10483259 10.1093/biomet/86.4.948 10.18637/jss.v045.i03 10.1080/01621459.1993.10594298 10.1089/pop.2014.0093 10.1016/j.pmedr.2015.06.019 10.1002/sim.6173
ContentType	Journal Article
Copyright	COPYRIGHT 2020 BioMed Central Ltd. 2020. This work is licensed under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. The Author(s) 2020
Copyright_xml	– notice: COPYRIGHT 2020 BioMed Central Ltd. – notice: 2020. This work is licensed under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. – notice: The Author(s) 2020
DBID	AAYXX CITATION NPM 3V. 7X7 7XB 88E 8FI 8FJ 8FK ABUWG AFKRA AZQEC BENPR CCPQU DWQXO FYUFA GHDGH K9. M0S M1P PHGZM PHGZT PIMPY PJZUB PKEHL PPXIY PQEST PQQKQ PQUKI PRINS 7X8 5PM DOA
DOI	10.1186/s12874-020-00956-6
DatabaseName	CrossRef PubMed ProQuest Central (Corporate) Health & Medical Collection ProQuest Central (purchase pre-March 2016) Medical Database (Alumni Edition) Hospital Premium Collection Hospital Premium Collection (Alumni Edition) ProQuest Central (Alumni) (purchase pre-March 2016) ProQuest Central (Alumni) ProQuest Central UK/Ireland ProQuest Central Essentials ProQuest Central ProQuest One ProQuest Central Korea Health Research Premium Collection Health Research Premium Collection (Alumni) ProQuest Health & Medical Complete (Alumni) ProQuest Health & Medical Collection Medical Database ProQuest Central Premium ProQuest One Academic (New) Publicly Available Content Database ProQuest Health & Medical Research Collection ProQuest One Academic Middle East (New) ProQuest One Health & Nursing ProQuest One Academic Eastern Edition (DO NOT USE) ProQuest One Academic ProQuest One Academic UKI Edition ProQuest Central China MEDLINE - Academic PubMed Central (Full Participant titles) DOAJ Directory of Open Access Journals
DatabaseTitle	CrossRef PubMed Publicly Available Content Database ProQuest One Academic Middle East (New) ProQuest Central Essentials ProQuest Health & Medical Complete (Alumni) ProQuest Central (Alumni Edition) ProQuest One Community College ProQuest One Health & Nursing ProQuest Central China ProQuest Central Health Research Premium Collection Health and Medicine Complete (Alumni Edition) ProQuest Central Korea Health & Medical Research Collection ProQuest Central (New) ProQuest Medical Library (Alumni) ProQuest One Academic Eastern Edition ProQuest Hospital Collection Health Research Premium Collection (Alumni) ProQuest Hospital Collection (Alumni) ProQuest Health & Medical Complete ProQuest Medical Library ProQuest One Academic UKI Edition ProQuest One Academic ProQuest One Academic (New) ProQuest Central (Alumni) MEDLINE - Academic
DatabaseTitleList	Publicly Available Content Database MEDLINE - Academic PubMed
Database_xml	– sequence: 1 dbid: DOA name: DOAJ Directory of Open Access Journals url: https://www.doaj.org/ sourceTypes: Open Website – sequence: 2 dbid: NPM name: PubMed url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed sourceTypes: Index Database – sequence: 3 dbid: BENPR name: ProQuest Central url: https://www.proquest.com/central sourceTypes: Aggregation Database
DeliveryMethod	fulltext_linktorsrc
Discipline	Medicine Public Health
EISSN	1471-2288
EndPage	10
ExternalDocumentID	oai_doaj_org_article_5a47df556a87421eb221b29f21e624e3 PMC7137316 A627328917 32252642 10_1186_s12874_020_00956_6
Genre	Journal Article
GeographicLocations	United States
GeographicLocations_xml	– name: United States
GrantInformation_xml	– fundername: New York City Department of Health and Mental Hygiene (US) grantid: The study began as the authors worked as paid statistical consultants for NYC DOHMH to analyze NYC Macroscope jointly with NYC HANES. When the work ended, the authors continued further simulation studies to write the manuscript. – fundername: New York City Department of Health and Mental Hygine grantid: The study began as the authors worked as paid statistical consultants for NYC DOHMH to analyze NYC Macroscope jointly with NYC HANES. When the work ended, the authors continued further simulation studies to write the manuscript. – fundername: ; grantid: The study began as the authors worked as paid statistical consultants for NYC DOHMH to analyze NYC Macroscope jointly with NYC HANES. When the work ended, the authors continued further simulation studies to write the manuscript.
GroupedDBID	--- 0R~ 23N 2WC 53G 5VS 6J9 6PF 7X7 88E 8FI 8FJ AAFWJ AAJSJ AASML AAWTL AAYXX ABDBF ABUWG ACGFO ACGFS ACIHN ACUHS ADBBV ADRAZ ADUKV AEAQA AENEX AFKRA AFPKN AHBYD AHMBA AHYZX ALIPV ALMA_UNASSIGNED_HOLDINGS AMKLP AMTXH AOIJS BAPOH BAWUL BCNDV BENPR BFQNJ BMC BPHCQ BVXVI C6C CCPQU CITATION CS3 DIK DU5 E3Z EAD EAP EAS EBD EBLON EBS EMB EMK EMOBN ESX F5P FYUFA GROUPED_DOAJ GX1 HMCUK HYE IAO IHR INH INR ITC KQ8 M1P M48 MK0 M~E O5R O5S OK1 OVT P2P PGMZT PHGZM PHGZT PIMPY PQQKQ PROAC PSQYO RBZ RNS ROL RPM RSV SMD SOJ SV3 TR2 TUS UKHRP W2D WOQ WOW XSB -A0 3V. ACRMQ ADINQ C24 NPM PMFND 7XB 8FK AZQEC DWQXO K9. PJZUB PKEHL PPXIY PQEST PQUKI PRINS 7X8 5PM PUEGO
ID	FETCH-LOGICAL-c563t-abd578f2cc753983613c82e89ad28e97f96dcd21496fa6eef35b8d05e2c5cb6b3
IEDL.DBID	M48
ISSN	1471-2288
IngestDate	Wed Aug 27 01:31:31 EDT 2025 Thu Aug 21 18:24:47 EDT 2025 Fri Jul 11 00:35:49 EDT 2025 Fri Jul 25 07:21:27 EDT 2025 Tue Jun 17 21:25:49 EDT 2025 Tue Jun 10 20:39:10 EDT 2025 Thu Jan 02 22:59:11 EST 2025 Tue Jul 01 04:30:56 EDT 2025 Thu Apr 24 22:58:54 EDT 2025
IsDoiOpenAccess	true
IsOpenAccess	true
IsPeerReviewed	true
IsScholarly	true
Issue	1
Keywords	Selection bias Population health surveillance Big data Multiple imputations Electronic health records Measurement error
Language	English
License	Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
LinkModel	DirectLink
MergedId	FETCHMERGED-LOGICAL-c563t-abd578f2cc753983613c82e89ad28e97f96dcd21496fa6eef35b8d05e2c5cb6b3
Notes	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23
OpenAccessLink	http://journals.scholarsportal.info/openUrl.xqy?doi=10.1186/s12874-020-00956-6
PMID	32252642
PQID	2391318122
PQPubID	42579
PageCount	10
ParticipantIDs	doaj_primary_oai_doaj_org_article_5a47df556a87421eb221b29f21e624e3 pubmedcentral_primary_oai_pubmedcentral_nih_gov_7137316 proquest_miscellaneous_2387260678 proquest_journals_2391318122 gale_infotracmisc_A627328917 gale_infotracacademiconefile_A627328917 pubmed_primary_32252642 crossref_primary_10_1186_s12874_020_00956_6 crossref_citationtrail_10_1186_s12874_020_00956_6
ProviderPackageCode	CITATION AAYXX
PublicationCentury	2000
PublicationDate	2020-04-06
PublicationDateYYYYMMDD	2020-04-06
PublicationDate_xml	– month: 04 year: 2020 text: 2020-04-06 day: 06
PublicationDecade	2020
PublicationPlace	England
PublicationPlace_xml	– name: England – name: London
PublicationTitle	BMC medical research methodology
PublicationTitleAlternate	BMC Med Res Methodol
PublicationYear	2020
Publisher	BioMed Central Ltd BioMed Central BMC
Publisher_xml	– name: BioMed Central Ltd – name: BioMed Central – name: BMC
References	LE Thorpe (956_CR24) 2015; 2 R Newton-Dame (956_CR2) 2016; 4 TE Raghunathan (956_CR10) 2007; 102 A Gelman (956_CR17) 2011 MM Paul (956_CR1) 2015; 18 LE Thorpe (956_CR3) 2016; 4 S Park (956_CR13) 2017; 75 ME Thompson (956_CR6) 2008; 34 Z Wang (956_CR19) 2017; 105 Y He (956_CR16) 2014; 20 R Valliant (956_CR26) 1993; 88 G Manzi (956_CR8) 2011; 174 A Gelman (956_CR20) 2006 T Lumley (956_CR25) 2004; 9 LMR Ybarra (956_CR11) 2008; 95 KH McVeigh (956_CR5) 2017; 5 N Schenker (956_CR14) 2010; 29 J Barnard (956_CR22) 1999; 86 PY Chan (956_CR27) 2018; 15 KH McVeigh (956_CR4) 2016; 4 TE Raghunathan (956_CR28) 2006; 90 F Mosteller (956_CR9) 1948; 43 J Kim (956_CR12) 2012; 99 R Core Team (956_CR18) 2016 A Gelman (956_CR15) 1998; 93 S van Buuren (956_CR23) 2011; 45 SL Lohr (956_CR7) 2012; 40 956_CR21
References_xml	– volume: 99 start-page: 85 issue: 1 year: 2012 ident: 956_CR12 publication-title: Biometrika doi: 10.1093/biomet/asr063 – volume: 4 start-page: 1267 issue: 1 year: 2016 ident: 956_CR4 publication-title: EGEMS (Washington, DC) – volume: 29 start-page: 533 issue: 5 year: 2010 ident: 956_CR14 publication-title: Stat Med doi: 10.1002/sim.3809 – volume: 105 start-page: 91 issue: 1 year: 2017 ident: 956_CR19 publication-title: Biometrika doi: 10.1093/biomet/asx073 – volume: 95 start-page: 919 issue: 4 year: 2008 ident: 956_CR11 publication-title: Biometrika doi: 10.1093/biomet/asn048 – volume: 4 start-page: 1265 issue: 1 year: 2016 ident: 956_CR2 publication-title: EGEMS (Washington, DC) – volume: 174 start-page: 31 year: 2011 ident: 956_CR8 publication-title: J Royal Stat Soc Ser A doi: 10.1111/j.1467-985X.2010.00648.x – volume: 93 start-page: 869 issue: 443 year: 1998 ident: 956_CR15 publication-title: J Am Stat Assoc – volume: 9 start-page: 19 issue: 8 year: 2004 ident: 956_CR25 publication-title: J Stat Softw doi: 10.18637/jss.v009.i08 – volume: 75 start-page: 345 year: 2017 ident: 956_CR13 publication-title: Metron doi: 10.1007/s40300-017-0124-0 – volume-title: Arm : data analysis using regression and multilevel/hierarchical models year: 2011 ident: 956_CR17 – volume-title: Data analysis using regression and multilevel/hierarchical models year: 2006 ident: 956_CR20 doi: 10.1017/CBO9780511790942 – volume: 102 start-page: 474 issue: 478 year: 2007 ident: 956_CR10 publication-title: J Am Stat Assoc doi: 10.1198/016214506000001293 – volume: 40 start-page: 679 issue: 4 year: 2012 ident: 956_CR7 publication-title: Can J Stat doi: 10.1002/cjs.11153 – volume: 15 year: 2018 ident: 956_CR27 publication-title: Prev Chronic Dis doi: 10.5888/pcd15.180371 – volume: 43 start-page: 231 issue: 242 year: 1948 ident: 956_CR9 publication-title: J Am Stat Assoc doi: 10.1080/01621459.1948.10483259 – volume: 5 start-page: 25 issue: 1 year: 2017 ident: 956_CR5 publication-title: EGEMS (Washington, DC) – ident: 956_CR21 – volume: 4 start-page: 1266 issue: 1 year: 2016 ident: 956_CR3 publication-title: EGEMS (Washington, DC) – volume: 86 start-page: 948 issue: 4 year: 1999 ident: 956_CR22 publication-title: Biometrika doi: 10.1093/biomet/86.4.948 – volume: 45 start-page: 67 issue: 3 year: 2011 ident: 956_CR23 publication-title: J Stat Softw doi: 10.18637/jss.v045.i03 – volume: 34 start-page: 131 issue: 2 year: 2008 ident: 956_CR6 publication-title: Surv Methodol – volume: 88 start-page: 89 issue: 421 year: 1993 ident: 956_CR26 publication-title: J Am Stat Assoc doi: 10.1080/01621459.1993.10594298 – volume-title: R: a language and environment for statistical computing year: 2016 ident: 956_CR18 – volume: 18 start-page: 209 issue: 3 year: 2015 ident: 956_CR1 publication-title: Popul Health Manag doi: 10.1089/pop.2014.0093 – volume: 2 start-page: 580 year: 2015 ident: 956_CR24 publication-title: Prev Med Rep doi: 10.1016/j.pmedr.2015.06.019 – volume: 90 start-page: 515 year: 2006 ident: 956_CR28 publication-title: Allg Stat Arch – volume: 20 start-page: 3710 issue: 33 year: 2014 ident: 956_CR16 publication-title: Stat Med doi: 10.1002/sim.6173
SSID	ssj0017836
Score	2.2815576
SecondaryResourceType	review_article
Snippet	Electronic Health Records (EHR) has been increasingly used as a tool to monitor population health. However, subject-level errors in the records can yield... Background Electronic Health Records (EHR) has been increasingly used as a tool to monitor population health. However, subject-level errors in the records can... Abstract Background Electronic Health Records (EHR) has been increasingly used as a tool to monitor population health. However, subject-level errors in the...
SourceID	doaj pubmedcentral proquest gale pubmed crossref
SourceType	Open Website Open Access Repository Aggregation Database Index Database Enrichment Source
StartPage	77
SubjectTerms	Bias Big Data Electronic health records Electronic records Estimates Health surveillance Health surveys Immunization Influenza Measurement error Medical records Medical research Multiple imputations Population health surveillance Primary care Public health Research methodology Sample variance Selection bias Software Victimization
SummonAdditionalLinks	– databaseName: DOAJ Directory of Open Access Journals dbid: DOA link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV3fi9QwEA5yD-KL-NvqKSMIPki5TdpMW99O8TiE88mDewtJmpwr2pXrrnD_gX-2M0l33SLoi2_LJoEmM5OZr535RoiXHh3HEchtUquy9oglOcG6XKCSVVOFWmuuHT77iKfn9YcLfbHX6otzwjI9cD64I23rpo9ao20JxUkCgko61UX6iaoOieeTfN4WTE3fD7g2YVsi0-LRKJnWvWSotEjMezhzQ4mt_887ec8pzRMm9zzQyR1xewod4Tg_8l1xIwz3xM2z6eP4ffGT6ZhsKiIC5s7IRYngruHLajmsYTMGWEVwy0vgvFCwQw-5DBLGzdWPcP0GLPThG0eMWS8gkc8C58Zfwu-GOdtV-QXPCMsB6K4EZn8CT1H9A3F-8v7Tu9NyarRQeo3VurSuJ8ONynsCLx0dnax8q0Lb2V61oWtih73vFYEpjBZDiJV2bb_QQXntHbrqoTgYVkN4LIBEZWUkITeWoKP0NnpnI6E61aJVviuE3J678RMLOTfD-GoSGmnRZFkZkpVJsjJYiNe7Nd8zB8dfZ79lce5mMn92-oO0ykxaZf6lVYV4xcpg2Mrp8bydihVok8yXZY5RMcsRYd1CHM5mknX6-fBWncx0O4xGVZ2sOLRShXixG-aVnPE2hNWG57QNYU2KJQrxKGvfbkt8CVMgS6ubmV7O9jwfGZafE3d4wxYo8cn_OKSn4pZKJsWmfSgO1leb8IxCtLV7nqzxF4JFN40 priority: 102 providerName: Directory of Open Access Journals – databaseName: Health & Medical Collection dbid: 7X7 link: http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwfV3di9QwEA96gggien5VTxlB8EHKbZI2bX2RUzwO4XzyYN9CkqbrHtqe213h_gP_bGfStHdFuLfSZCDpfGQmnfkNY2-dsuRHKGqTKtPMKZXiIZilCyW4LKTP8pxqh0-_qZOz7OsyX8YLtz6mVY42MRjqunN0R34oZMUlHUfi48XvlLpG0d_V2ELjNrtD0GWU0lUsp4CLU4XCWChTqsOeE7h7SgHTIuDvqdlhFDD7_7fM146medrktXPo-CF7EB1IOBo4_ojd8u0-u3saf5Hvs_vDRRwM9UWP2V8CaTKhtAgIUWMoVQR7Cefdut3CrvfQNWDXK6BsUTBtDUNxJPS7zR9_-QEM1P4X-ZGDtECApAXKmF_BVRudkWq49ulh3QJaUCBMKHDo6z9hZ8dfvn8-SWP7hdTlSm5TY2tU50Y4hyFNhZ-SS1cKX1amFqWviqZStasFhliqMcr7Rua2rBe5Fy53Vln5lO21XeufM6ib3PAGWV8YDCi5M42zpsFYT5TKCFcljI980C5ik1OLjJ86xCil0gPvNPJOB95plbD3E83FgMxx4-xPxN5pJqFqhxfdZqWjkurcZAUuNVcGqQX3VghuRdXgoxKZlwl7R8KhSfdxec7EEgbcJKFo6SMlCPsII-CEHcxmos66-fAoXjrajF5fSXjC3kzDREl5cK3vdjSnLDACRQ8jYc8GaZy2RKYZ3VukLmZyOtvzfKRd_wiI4gXpJVcvbl7WS3ZPBOUhVT5ge9vNzr9Cl2xrXwe9-wfVUzSj priority: 102 providerName: ProQuest
Title	Prevalence estimation by joint use of big data and health survey: a demonstration study using electronic health records in New York city
URI	https://www.ncbi.nlm.nih.gov/pubmed/32252642 https://www.proquest.com/docview/2391318122 https://www.proquest.com/docview/2387260678 https://pubmed.ncbi.nlm.nih.gov/PMC7137316 https://doaj.org/article/5a47df556a87421eb221b29f21e624e3
Volume	20
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV1bi9QwFD7sBZZ9Ee9W1yGC4INUJ2mbtoLIjuyyCLPI4sDgS0jSdBxZW52LOP_An-05aTu7xcUHX4YyyYEm-b7mnDbnOwDPrTTkR0gqkxqFsZUyxE0wDodS8CiNXJwklDs8Ppdnk_jDNJnuQFfuqJ3A5Y2hHdWTmiwuX_36sXmHhH_rCZ_J10tOou0hBUJDr6snd2Efd6aUiDqOr74qUMZClzhzo90hHBDA0UkQvX3Ky_n__dC-tmv1T1Re26JOb8Ot1rdkxw0Y7sCOq-7Cwbj9en4PfpNek_ZZRozENZqsRWY27Gs9r1ZsvXSsLpmZzxgdHGW6KliTJ8mW68VPt3nDNCvcN3IpG-Awr07L6PD8jF1V1OmsmjdASzavGD5MGclDMYtu_32YnJ58en8WtpUYQpvIaBVqUyCzS2EtRjc5ziKPbCZclutCZC5Py1wWthAYbclSS-fKKDFZMUycsIk10kQPYK-qK_cIWFEmmpeIglRjbMmtLq3RJYZ9IpNa2DwA3s27sq1MOVXLuFQ-XMmkapZN4bIpv2xKBvBya_O9Een4Z-8RLee2Jwls-z_qxUy1fFWJjlO81URqtBbcGSG4EXmJl1LELgrgBYFBETDx9qxusxlwkCSopY6lIBkkDIYDOOr1RPrafnMHJ9WhX4ko5xH5XiKAZ9tmsqQjcZWr19QnSzEYRWcjgIcN-rZD6kAcQNrDZW_M_ZZq_sWLi6dEUS4f_7flEzgUnlJE-CPYWy3W7ik6biszgN10mg5gf3Ry_vFi4F9_DDxD8fdi9PkPPHNEuw
linkProvider	Scholars Portal
linkToHtml	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwtV1ta9RAEB7qFVQQ0foWrTqC4gcJvWySTSKItNpytb1DpIV-2242m_NEk3ovyv0Df42_0Zm8XBuEfuu3I7tz7DKz87I78wzASyNT9iMkt0n13cBI6ZIRDNy-FJ4f-TYIQ64dHo7k4Dj4dBKerMHfthaG0ypbnVgp6qw0fEe-JfzE89kcifdnP13uGsWvq20LjVosDuzyN4Vss3f7H4m_r4TY2z36MHCbrgKuCaU_d3WakZTmwhjy1JPYJ3tmYmHjRGcitkmUJzIzmaDIQeZaWpv7YRpn_dAKE5pUpj797zVYD3wKZXqwvrM7-vxl9W7BNRFtaU4st2Yew8m7HKL1K8Q_2TF_VZeA_23BBWPYTdS8YPn27sDtxmXF7VrG7sKaLTbg-rB5lN-AW_XVH9YVTffgD8NC6aqYCRnDoy6OxHSJ38pJMcfFzGKZYzoZI-enoi4yrMsxcbaY_rLLt6gxsz_Yc63lEysQXOQc_TGeN-5pqeqLphlOCiSdjYxChYaii_twfCWseQC9oizsI8AsD7WXk7BFmkJYz-jcpDqn6FLEUguTOOC1fFCmQUPnphzfVRUVxVLVvFPEO1XxTkkH3qxozmoskEtn7zB7VzMZx7v6UE7HqlELKtRBREsNpSZq4dlUCC8VSU4_pQis78BrFg7F2oaWZ3RTNEGbZNwutS0Foy1RzO3AZmcmaQnTHW7FSzVaaqbOz5QDL1bDTMmZd4UtFzwnjijmJZ_GgYe1NK62xMaAHGqijjpy2tlzd6SYfK0wzCPWBJ58fPmynsONwdHwUB3ujw6ewE1RHSRWJJvQm08X9ik5hPP0WXMKEU6v-uD_A41ldBA
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Prevalence+estimation+by+joint+use+of+big+data+and+health+survey%3A+a+demonstration+study+using+electronic+health+records+in+New+York+city&rft.jtitle=BMC+medical+research+methodology&rft.au=Kim%2C+Ryung+S.&rft.au=Shankar%2C+Viswanathan&rft.date=2020-04-06&rft.pub=BioMed+Central&rft.eissn=1471-2288&rft.volume=20&rft_id=info:doi/10.1186%2Fs12874-020-00956-6&rft_id=info%3Apmid%2F32252642&rft.externalDocID=PMC7137316
thumbnail_l	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1471-2288&client=summon
thumbnail_m	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1471-2288&client=summon
thumbnail_s	http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1471-2288&client=summon