The performance of different propensity-score methods for estimating differences in proportions (risk differences or absolute risk reductions) in observational studies

Propensity score methods are increasingly being used to estimate the effects of treatments on health outcomes using observational data. There are four methods for using the propensity score to estimate treatment effects: covariate adjustment using the propensity score, stratification on the propensi...

Full description

Saved in:
Bibliographic Details
Published inStatistics in medicine Vol. 29; no. 20; pp. 2137 - 2148
Main Author Austin, Peter C.
Format Journal Article
LanguageEnglish
Published Chichester, UK John Wiley & Sons, Ltd 10.09.2010
Wiley Subscription Services, Inc
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Propensity score methods are increasingly being used to estimate the effects of treatments on health outcomes using observational data. There are four methods for using the propensity score to estimate treatment effects: covariate adjustment using the propensity score, stratification on the propensity score, propensity‐score matching, and inverse probability of treatment weighting (IPTW) using the propensity score. When outcomes are binary, the effect of treatment on the outcome can be described using odds ratios, relative risks, risk differences, or the number needed to treat. Several clinical commentators suggested that risk differences and numbers needed to treat are more meaningful for clinical decision making than are odds ratios or relative risks. However, there is a paucity of information about the relative performance of the different propensity‐score methods for estimating risk differences. We conducted a series of Monte Carlo simulations to examine this issue. We examined bias, variance estimation, coverage of confidence intervals, mean‐squared error (MSE), and type I error rates. A doubly robust version of IPTW had superior performance compared with the other propensity‐score methods. It resulted in unbiased estimation of risk differences, treatment effects with the lowest standard errors, confidence intervals with the correct coverage rates, and correct type I error rates. Stratification, matching on the propensity score, and covariate adjustment using the propensity score resulted in minor to modest bias in estimating risk differences. Estimators based on IPTW had lower MSE compared with other propensity‐score methods. Differences between IPTW and propensity‐score matching may reflect that these two methods estimate the average treatment effect and the average treatment effect for the treated, respectively. Copyright © 2010 John Wiley & Sons, Ltd.
Bibliography:Ontario Ministry of Health and Long-Term Care (MOHLTC)
istex:F35DF484E290CC7DDDDDFB338941DEA457ABEA02
ArticleID:SIM3854
Heart and Stroke Foundation of Ontario
Re-use of this article is permitted in accordance with the Terms and Conditions set out at [http://wileyonlinelibrary.com/author resources/onlineopen.html]
ark:/67375/WNG-FQ398FMJ-1
Canadian Institutes of Health Research (CIHR) - No. MOP 86508
SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 14
ObjectType-Article-2
content type line 23
E-mail: peter.austin@ices.on.ca
Contract/grant sponsor: Canadian Institutes of Health Research (CIHR); contract/grant number: MOP 86508
Contract/grant sponsor: Ontario Ministry of Health and Long-Term Care (MOHLTC)
Contract/grant sponsor: Heart and Stroke Foundation of Ontario
ISSN:0277-6715
1097-0258
1097-0258
DOI:10.1002/sim.3854