Two‐wave two‐phase outcome‐dependent sampling designs, with applications to longitudinal binary data
Two‐phase outcome‐dependent sampling (ODS) designs are useful when resource constraints prohibit expensive exposure ascertainment on all study subjects. One class of ODS designs for longitudinal binary data stratifies subjects into three strata according to those who experience the event at none, so...
Saved in:
Published in | Statistics in medicine Vol. 40; no. 8; pp. 1863 - 1876 |
---|---|
Main Authors | , , , , , , |
Format | Journal Article |
Language | English |
Published |
England
Wiley Subscription Services, Inc
15.04.2021
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | Two‐phase outcome‐dependent sampling (ODS) designs are useful when resource constraints prohibit expensive exposure ascertainment on all study subjects. One class of ODS designs for longitudinal binary data stratifies subjects into three strata according to those who experience the event at none, some, or all follow‐up times. For time‐varying covariate effects, exclusively selecting subjects with response variation can yield highly efficient estimates. However, if interest lies in the association of a time‐invariant covariate, or the joint associations of time‐varying and time‐invariant covariates with the outcome, then the optimal design is unknown. Therefore, we propose a class of two‐wave two‐phase ODS designs for longitudinal binary data. We split the second‐phase sample selection into two waves, between which an interim design evaluation analysis is conducted. The interim design evaluation analysis uses first‐wave data to conduct a simulation‐based search for the optimal second‐wave design that will improve the likelihood of study success. Although we focus on longitudinal binary response data, the proposed design is general and can be applied to other response distributions. We believe that the proposed designs can be useful in settings where (1) the expected second‐phase sample size is fixed and one must tailor stratum‐specific sampling probabilities to maximize estimation efficiency, or (2) relative sampling probabilities are fixed across sampling strata and one must tailor sample size to achieve a desired precision. We describe the class of designs, examine finite sampling operating characteristics, and apply the designs to an exemplar longitudinal cohort study, the Lung Health Study. |
---|---|
Bibliography: | Funding information National Heart, Lung, and Blood Institute, R01HL094786 ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23 |
ISSN: | 0277-6715 1097-0258 1097-0258 |
DOI: | 10.1002/sim.8876 |