Introduction to computational causal inference using reproducible Stata, R, and Python code: A tutorial

The main purpose of many medical studies is to estimate the effects of a treatment or exposure on an outcome. However, it is not always possible to randomize the study participants to a particular treatment, therefore observational study designs may be used. There are major challenges with observati...

Full description

Saved in:

Bibliographic Details
Published in	Statistics in medicine Vol. 41; no. 2; pp. 407 - 432
Main Authors	Smith, Matthew J., Mansournia, Mohammad A., Maringe, Camille, Zivich, Paul N., Cole, Stephen R., Leyrat, Clémence, Belot, Aurélien, Rachet, Bernard, Luque‐Fernandez, Miguel A.
Format	Journal Article
Language	English
Published	England Wiley Subscription Services, Inc 30.01.2022
Subjects	causal inference Causality Clinical outcomes Clinical trials Computer Simulation double‐robust methods g‐formula G‐methods Humans inverse probability weighting machine learning Medical research Medical statistics Models, Statistical Probability Propensity Score regression adjustment Research Design Statistical analysis Statistical inference targeted maximum likelihood estimation double-robust methods targeted maximum likelihood estimation inverse probability weighting causal inference g-formula machine learning regression adjustment G-methods propensity score
Online Access	Get full text

Cover

Loading…

More Information
Summary:	The main purpose of many medical studies is to estimate the effects of a treatment or exposure on an outcome. However, it is not always possible to randomize the study participants to a particular treatment, therefore observational study designs may be used. There are major challenges with observational studies; one of which is confounding. Controlling for confounding is commonly performed by direct adjustment of measured confounders; although, sometimes this approach is suboptimal due to modeling assumptions and misspecification. Recent advances in the field of causal inference have dealt with confounding by building on classical standardization methods. However, these recent advances have progressed quickly with a relative paucity of computational‐oriented applied tutorials contributing to some confusion in the use of these methods among applied researchers. In this tutorial, we show the computational implementation of different causal inference estimators from a historical perspective where new estimators were developed to overcome the limitations of the previous estimators (ie, nonparametric and parametric g‐formula, inverse probability weighting, double‐robust, and data‐adaptive estimators). We illustrate the implementation of different methods using an empirical example from the Connors study based on intensive care medicine, and most importantly, we provide reproducible and commented code in Stata, R, and Python for researchers to adapt in their own observational study. The code can be accessed at https://github.com/migariane/Tutorial_Computational_Causal_Inference_Estimators.
Bibliography:	Funding information Cancer Research UK, Instituto de Salud Carlos III ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 ObjectType-Undefined-3
ISSN:	0277-6715 1097-0258 1097-0258
DOI:	10.1002/sim.9234