Causal Inference Is Not Just a Statistics Problem

This article introduces a collection of four datasets, similar to Anscombe’s quartet, that aim to highlight the challenges involved when estimating causal effects. Each of the four datasets is generated based on a distinct causal mechanism: the first involves a collider, the second involves a confou...

Full description

Saved in:
Bibliographic Details
Published inJournal of statistics and data science education Vol. 32; no. 2; pp. 150 - 155
Main Authors D’Agostino McGowan, Lucy, Gerke, Travis, Barrett, Malcolm
Format Journal Article
LanguageEnglish
Published Alexandria Taylor & Francis Ltd 03.05.2024
Taylor & Francis Group
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:This article introduces a collection of four datasets, similar to Anscombe’s quartet, that aim to highlight the challenges involved when estimating causal effects. Each of the four datasets is generated based on a distinct causal mechanism: the first involves a collider, the second involves a confounder, the third involves a mediator, and the fourth involves the induction of M-Bias by an included factor. The article includes a mathematical summary of each dataset, as well as directed acyclic graphs that depict the relationships between the variables. Despite the fact that the statistical summaries and visualizations for each dataset are identical, the true causal effect differs, and estimating it correctly requires knowledge of the data-generating mechanism. These example datasets can help practitioners gain a better understanding of the assumptions underlying causal inference methods and emphasize the importance of gathering more information beyond what can be obtained from statistical tools alone. The article also includes R code for reproducing all figures and provides access to the datasets themselves through an R package named “quartets.” Supplementary materials for this article are available online.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:2693-9169
2693-9169
DOI:10.1080/26939169.2023.2276446