Assessing Covariate Balance With Small Sample Sizes

ABSTRACT Propensity score adjustment addresses confounding by balancing covariates in subject treatment groups through matching, stratification, or weighting. Diagnostics test the success of adjustment. For example, if the standardized mean difference (SMD) for a relevant covariate exceeds a thresho...

Full description

Saved in:

Bibliographic Details
Published in	Statistics in medicine Vol. 44; no. 18-19; pp. e70212 - n/a
Main Authors	Hripcsak, George, Zhang, Linying, Chen, Yong, Li, Kelly, Suchard, Marc A., Ryan, Patrick B., Schuemie, Martijn J.
Format	Journal Article
Language	English
Published	Hoboken, USA John Wiley & Sons, Inc 01.08.2025 Wiley Subscription Services, Inc
Subjects	Bias Computer Simulation confounding covariate balance Data Interpretation, Statistical Humans Meta-Analysis as Topic meta‐analysis Models, Statistical observational research Propensity Score Sample Size observational research confounding meta‐analysis covariate balance propensity score
Online Access	Get full text

Cover

Loading…

More Information
Summary:	ABSTRACT Propensity score adjustment addresses confounding by balancing covariates in subject treatment groups through matching, stratification, or weighting. Diagnostics test the success of adjustment. For example, if the standardized mean difference (SMD) for a relevant covariate exceeds a threshold like 0.1, the covariate is considered imbalanced and the study may be biased. Unfortunately, for studies with small or moderate numbers of subjects, the probability of identifying a study as biased because of chance imbalance can be grossly larger than a given nominal level like 0.05, yet that chance imbalance may not cause significant bias. In this paper, we illustrate that chance imbalance is operative in real‐world settings even for moderate sample sizes of 2000. We identify a previously unrecognized challenge that as meta‐analyses increase the precision of an effect estimate, the diagnostics must also undergo meta‐analysis for a corresponding increase in precision. We propose an alternative diagnostic that checks whether the SMD statistically significantly exceeds the threshold. Through simulation and real‐world data, we find that this diagnostic achieves a better trade‐off of type 1 error rate and power than standard nominal threshold tests and not testing for sample sizes from 250 to 4000 and for 20 to 100 000 covariates. We confirm that in network studies, meta‐analysis of effect estimates must be accompanied by meta‐analysis of the diagnostics or else systematic confounding may overwhelm the estimated effect. Our procedure supports the review of large numbers of covariates, enabling more rigorous diagnostics.
Bibliography:	Funding This work was supported by National Institutes of Health (Grant Nos. T15 LM007079, R01 LM006910, R01 HL169954). ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23
ISSN:	0277-6715 1097-0258 1097-0258
DOI:	10.1002/sim.70212