Data integration in causal inference

Integrating data from multiple heterogeneous sources has become increasingly popular to achieve a large sample size and diverse study population. This article reviews development in causal inference methods that combines multiple datasets collected by potentially different designs from potentially h...

Full description

Saved in:
Bibliographic Details
Published inWiley interdisciplinary reviews. Computational statistics Vol. 15; no. 1
Main Authors Shi, Xu, Pan, Ziyang, Miao, Wang
Format Journal Article
LanguageEnglish
Published Hoboken, USA John Wiley & Sons, Inc 01.01.2023
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Integrating data from multiple heterogeneous sources has become increasingly popular to achieve a large sample size and diverse study population. This article reviews development in causal inference methods that combines multiple datasets collected by potentially different designs from potentially heterogeneous populations. We summarize recent advances on combining randomized clinical trials with external information from observational studies or historical controls, combining samples when no single sample has all relevant variables with application to two‐sample Mendelian randomization, distributed data setting under privacy concerns for comparative effectiveness and safety research using real‐world data, Bayesian causal inference, and causal discovery methods. This article is categorized under: Statistical Models > Semiparametric Models Applications of Computational Statistics > Clinical Trials Data missing patterns in the major settings discussed in Sections 3 and 4. For each variable in each sample, ✓ stands for observed, empty stands for unobserved, and ✓/✗ indicates different settings considered by different papers.
Bibliography:Funding Information
Edited by
James E. Gentle, Commissioning Editor and Editor‐in‐Chief and David W. Scott, Review Editor and Editor‐in‐Chief
Xu Shi is support by the NIH/NIGMS grant R01GM139926.
ISSN:1939-5108
1939-0068
DOI:10.1002/wics.1581