Understanding and diagnosing the potential for bias when using machine learning methods with doubly robust causal estimators

Data-adaptive methods have been proposed to estimate nuisance parameters when using doubly robust semiparametric methods for estimating marginal causal effects. However, in the presence of near practical positivity violations, these methods can produce a separation of the two exposure groups in term...

Full description

Saved in:

Bibliographic Details
Published in	Statistical methods in medical research Vol. 28; no. 6; p. 1637
Main Authors	Bahamyirou, Asma, Blais, Lucie, Forget, Amélie, Schnitzer, Mireille E
Format	Journal Article
Language	English
Published	England 01.06.2019
Subjects	Causal inference IPTW doubly robust TMLE positivity super learner
Online Access	Get more information

Cover

Loading…

More Information
Summary:	Data-adaptive methods have been proposed to estimate nuisance parameters when using doubly robust semiparametric methods for estimating marginal causal effects. However, in the presence of near practical positivity violations, these methods can produce a separation of the two exposure groups in terms of propensity score densities which can lead to biased estimates of the treatment effect. To motivate the problem, we evaluated the Targeted Minimum Loss-based Estimation procedure using a simulation scenario to estimate the average treatment effect. We highlight the divergence in estimates obtained when using parametric and data-adaptive methods to estimate the propensity score. We then adapted an existing diagnostic tool based on a bootstrap resampling of the subjects and simulation of the outcome data in order to show that the estimation using data-adaptive methods for the propensity score in this study may lead to large bias and poor coverage. The adapted bootstrap procedure is able to identify this instability and can be used as a diagnostic tool.
ISSN:	1477-0334
DOI:	10.1177/0962280218772065