Improved generalized raking estimators to address dependent covariate and failure‐time outcome error

Biomedical studies that use electronic health records (EHR) data for inference are often subject to bias due to measurement error. The measurement error present in EHR data is typically complex, consisting of errors of unknown functional form in covariates and the outcome, which can be dependent. To...

Full description

Saved in:
Bibliographic Details
Published inBiometrical journal Vol. 63; no. 5; pp. 1006 - 1027
Main Authors Oh, Eric J., Shepherd, Bryan E., Lumley, Thomas, Shaw, Pamela A.
Format Journal Article
LanguageEnglish
Published Germany Wiley - VCH Verlag GmbH & Co. KGaA 01.06.2021
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Biomedical studies that use electronic health records (EHR) data for inference are often subject to bias due to measurement error. The measurement error present in EHR data is typically complex, consisting of errors of unknown functional form in covariates and the outcome, which can be dependent. To address the bias resulting from such errors, generalized raking has recently been proposed as a robust method that yields consistent estimates without the need to model the error structure. We provide rationale for why these previously proposed raking estimators can be expected to be inefficient in failure‐time outcome settings involving misclassification of the event indicator. We propose raking estimators that utilize multiple imputation, to impute either the target variables or auxiliary variables, to improve the efficiency. We also consider outcome‐dependent sampling designs and investigate their impact on the efficiency of the raking estimators, either with or without multiple imputation. We present an extensive numerical study to examine the performance of the proposed estimators across various measurement error settings. We then apply the proposed methods to our motivating setting, in which we seek to analyze HIV outcomes in an observational cohort with EHR data from the Vanderbilt Comprehensive Care Clinic.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ISSN:0323-3847
1521-4036
1521-4036
DOI:10.1002/bimj.202000187