Assessing Covariate Balance With Small Sample Sizes

ABSTRACT Propensity score adjustment addresses confounding by balancing covariates in subject treatment groups through matching, stratification, or weighting. Diagnostics test the success of adjustment. For example, if the standardized mean difference (SMD) for a relevant covariate exceeds a thresho...

Full description

Saved in:
Bibliographic Details
Published inStatistics in medicine Vol. 44; no. 18-19; pp. e70212 - n/a
Main Authors Hripcsak, George, Zhang, Linying, Chen, Yong, Li, Kelly, Suchard, Marc A., Ryan, Patrick B., Schuemie, Martijn J.
Format Journal Article
LanguageEnglish
Published Hoboken, USA John Wiley & Sons, Inc 01.08.2025
Wiley Subscription Services, Inc
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:ABSTRACT Propensity score adjustment addresses confounding by balancing covariates in subject treatment groups through matching, stratification, or weighting. Diagnostics test the success of adjustment. For example, if the standardized mean difference (SMD) for a relevant covariate exceeds a threshold like 0.1, the covariate is considered imbalanced and the study may be biased. Unfortunately, for studies with small or moderate numbers of subjects, the probability of identifying a study as biased because of chance imbalance can be grossly larger than a given nominal level like 0.05, yet that chance imbalance may not cause significant bias. In this paper, we illustrate that chance imbalance is operative in real‐world settings even for moderate sample sizes of 2000. We identify a previously unrecognized challenge that as meta‐analyses increase the precision of an effect estimate, the diagnostics must also undergo meta‐analysis for a corresponding increase in precision. We propose an alternative diagnostic that checks whether the SMD statistically significantly exceeds the threshold. Through simulation and real‐world data, we find that this diagnostic achieves a better trade‐off of type 1 error rate and power than standard nominal threshold tests and not testing for sample sizes from 250 to 4000 and for 20 to 100 000 covariates. We confirm that in network studies, meta‐analysis of effect estimates must be accompanied by meta‐analysis of the diagnostics or else systematic confounding may overwhelm the estimated effect. Our procedure supports the review of large numbers of covariates, enabling more rigorous diagnostics.
Bibliography:Funding
This work was supported by National Institutes of Health (Grant Nos. T15 LM007079, R01 LM006910, R01 HL169954).
ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ISSN:0277-6715
1097-0258
1097-0258
DOI:10.1002/sim.70212