Three‐phase generalized raking and multiple imputation estimators to address error‐prone data


Bibliographic Details
Published in Statistics in Medicine Vol. 43; no. 2; pp. 379-394
Main Authors Amorim, Gustavo; Tao, Ran; Lotspeich, Sarah; Shaw, Pamela A.; Lumley, Thomas; Patel, Rena C.; Shepherd, Bryan E.
Format Journal Article
Language English
Published Hoboken, USA John Wiley & Sons, Inc 30.01.2024
Wiley Subscription Services, Inc
Summary: Validation studies are often used to obtain more reliable information in settings with error‐prone data. Validated data on a subsample of subjects can be used together with error‐prone data on all subjects to improve estimation. In practice, more than one round of data validation may be required, and direct application of standard approaches for combining validation data into analyses may lead to inefficient estimators since the information available from intermediate validation steps is only partially considered or even completely ignored. In this paper, we present two novel extensions of multiple imputation and generalized raking estimators that make full use of all available data. We show through simulations that incorporating information from intermediate steps can lead to substantial gains in efficiency. This work is motivated by and illustrated in a study of contraceptive effectiveness among 83 671 women living with HIV, whose data were originally extracted from electronic medical records, of whom 4732 had their charts reviewed, and a subsequent 1210 also had a telephone interview to validate key study variables.
ISSN: 0277-6715, 1097-0258
DOI: 10.1002/sim.9967