Data integration in causal inference

Integrating data from multiple heterogeneous sources has become increasingly popular to achieve a large sample size and diverse study population. This article reviews development in causal inference methods that combines multiple datasets collected by potentially different designs from potentially h...

Full description

Saved in:
Bibliographic Details
Published inWiley interdisciplinary reviews. Computational statistics Vol. 15; no. 1
Main Authors Shi, Xu, Pan, Ziyang, Miao, Wang
Format Journal Article
LanguageEnglish
Published Hoboken, USA John Wiley & Sons, Inc 01.01.2023
Subjects
Online AccessGet full text
ISSN1939-5108
1939-0068
DOI10.1002/wics.1581

Cover

Loading…
Abstract Integrating data from multiple heterogeneous sources has become increasingly popular to achieve a large sample size and diverse study population. This article reviews development in causal inference methods that combines multiple datasets collected by potentially different designs from potentially heterogeneous populations. We summarize recent advances on combining randomized clinical trials with external information from observational studies or historical controls, combining samples when no single sample has all relevant variables with application to two‐sample Mendelian randomization, distributed data setting under privacy concerns for comparative effectiveness and safety research using real‐world data, Bayesian causal inference, and causal discovery methods. This article is categorized under: Statistical Models > Semiparametric Models Applications of Computational Statistics > Clinical Trials Data missing patterns in the major settings discussed in Sections 3 and 4. For each variable in each sample, ✓ stands for observed, empty stands for unobserved, and ✓/✗ indicates different settings considered by different papers.
AbstractList Integrating data from multiple heterogeneous sources has become increasingly popular to achieve a large sample size and diverse study population. This article reviews development in causal inference methods that combines multiple datasets collected by potentially different designs from potentially heterogeneous populations. We summarize recent advances on combining randomized clinical trials with external information from observational studies or historical controls, combining samples when no single sample has all relevant variables with application to two‐sample Mendelian randomization, distributed data setting under privacy concerns for comparative effectiveness and safety research using real‐world data, Bayesian causal inference, and causal discovery methods. This article is categorized under: Statistical Models > Semiparametric Models Applications of Computational Statistics > Clinical Trials Data missing patterns in the major settings discussed in Sections 3 and 4. For each variable in each sample, ✓ stands for observed, empty stands for unobserved, and ✓/✗ indicates different settings considered by different papers.
Integrating data from multiple heterogeneous sources has become increasingly popular to achieve a large sample size and diverse study population. This paper reviews development in causal inference methods that combines multiple datasets collected by potentially different designs from potentially heterogeneous populations. We summarize recent advances on combining randomized clinical trial with external information from observational studies or historical controls, combining samples when no single sample has all relevant variables with application to two-sample Mendelian randomization, distributed data setting under privacy concerns for comparative effectiveness and safety research using real-world data, Bayesian causal inference, and causal discovery methods.
Author Shi, Xu
Miao, Wang
Pan, Ziyang
Author_xml – sequence: 1
  givenname: Xu
  orcidid: 0000-0001-8566-9552
  surname: Shi
  fullname: Shi, Xu
  email: shixu@umich.edu
  organization: University of Michigan
– sequence: 2
  givenname: Ziyang
  orcidid: 0000-0003-1600-5597
  surname: Pan
  fullname: Pan, Ziyang
  organization: University of Michigan
– sequence: 3
  givenname: Wang
  surname: Miao
  fullname: Miao, Wang
  organization: Peking University
BackLink https://www.ncbi.nlm.nih.gov/pubmed/36713955$$D View this record in MEDLINE/PubMed
BookMark eNo9kEtrwzAQhEVJaR7toX-g5NCrk13LkqVjcV-BQA8N9Cj0WBcXxwm2Q8i_r0zSMof92B2WYaZs1OwaYuweYYEA6fJY-W6BQuEVm6DmOgGQanRhgaDGbNp1P3GbR92wMZc5ci3EhD0-297Oq6an79b21a6JPPf20Nk6UkktNZ5u2XVp647uLnPGNq8vm-I9WX-8rYqndeIFaEykCzwjhOAzGXNpJMq5k1SqlGdOR8nSOh0CB45KkbBpkMqSQ_QSPJ-xh_Pb_cFtKZh9W21tezJ_aaNheTYcq5pO_3cEM9RghhrMUIP5WhWfA_BfLv5QjQ
CitedBy_id crossref_primary_10_1002_sim_10076
crossref_primary_10_1093_biomtc_ujae095
crossref_primary_10_1002_sim_10068
crossref_primary_10_1002_wps_21224
crossref_primary_10_1007_s10489_023_04667_5
crossref_primary_10_5194_acp_24_6197_2024
crossref_primary_10_1007_s42081_024_00285_8
crossref_primary_10_1214_23_AOAS1860
crossref_primary_10_14778_3705829_3705836
crossref_primary_10_3390_math12182801
crossref_primary_10_1515_ijb_2024_0018
ContentType Journal Article
Copyright 2022 The Authors. published by Wiley Periodicals LLC.
Copyright_xml – notice: 2022 The Authors. published by Wiley Periodicals LLC.
DBID 24P
NPM
DOI 10.1002/wics.1581
DatabaseName Wiley Online Library Open Access
PubMed
DatabaseTitle PubMed
DatabaseTitleList
PubMed
Database_xml – sequence: 1
  dbid: 24P
  name: Wiley Online Library Open Access
  url: https://authorservices.wiley.com/open-science/open-access/browse-journals.html
  sourceTypes: Publisher
– sequence: 2
  dbid: NPM
  name: PubMed
  url: https://proxy.k.utb.cz/login?url=http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=PubMed
  sourceTypes: Index Database
DeliveryMethod fulltext_linktorsrc
Discipline Mathematics
EISSN 1939-0068
EndPage n/a
ExternalDocumentID 36713955
WICS1581
Genre reviewArticle
Journal Article
GrantInformation_xml – fundername: National Institute of General Medical Sciences
  funderid: R01GM139926
– fundername: NIGMS NIH HHS
  grantid: R01 GM139926
GroupedDBID 05W
0R~
1OC
1VH
24P
33P
4.4
53G
5DZ
8-1
A00
AAESR
AAHHS
AAHQN
AAMNL
AANHP
AANLZ
AAONW
AASGY
AAXRX
AAYCA
AAZKR
ABCUV
ACAHQ
ACBWZ
ACCFJ
ACCZN
ACGFS
ACPOU
ACRPL
ACXBN
ACXQS
ACYXJ
ADBBV
ADEOM
ADKYN
ADMGS
ADNMO
ADZMN
AEEZP
AEIGN
AEQDE
AEUYR
AFBPY
AFFPM
AFGKR
AFPWT
AFRAH
AFWVQ
AHBTC
AITYG
AIURR
AIWBW
AJBDE
AJXKR
ALMA_UNASSIGNED_HOLDINGS
ALUQN
AMBMR
AMYDB
ASPBG
AUFTA
AVWKF
AZBYB
AZFZN
AZVAB
BDRZF
BHBCM
BMNLL
BRXPI
DCZOG
DRFUL
DRSTM
EBS
EJD
F5P
FEDTE
G-S
GODZA
HGLYW
HVGLF
HZ~
LATKE
LEEKS
LITHE
LOXES
LUTES
LYRES
MEWTI
MRFUL
MRSTM
MSFUL
MSSTM
MXFUL
MXSTM
MY.
MY~
O66
O9-
P2W
RNS
ROL
SUPJJ
WBKPD
WIH
WIK
WOHZO
WXSBR
WYISQ
WYJ
XBAML
XV2
ZZTAW
NPM
ID FETCH-LOGICAL-c5091-6bd34e10dc4610091ee73b6ef8234b9b9b6fab9dd303188e5a2d68aeb11c60c3
IEDL.DBID 24P
ISSN 1939-5108
IngestDate Wed Feb 19 02:24:35 EST 2025
Wed Jan 22 16:24:32 EST 2025
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 1
Keywords Causal inference
data fusion
generalizability
transportability
data integration
Language English
License Attribution
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c5091-6bd34e10dc4610091ee73b6ef8234b9b9b6fab9dd303188e5a2d68aeb11c60c3
Notes Funding Information
Edited by
James E. Gentle, Commissioning Editor and Editor‐in‐Chief and David W. Scott, Review Editor and Editor‐in‐Chief
Xu Shi is support by the NIH/NIGMS grant R01GM139926.
ORCID 0000-0003-1600-5597
0000-0001-8566-9552
OpenAccessLink https://onlinelibrary.wiley.com/doi/abs/10.1002%2Fwics.1581
PMID 36713955
PageCount 17
ParticipantIDs pubmed_primary_36713955
wiley_primary_10_1002_wics_1581_WICS1581
PublicationCentury 2000
PublicationDate January/February 2023
PublicationDateYYYYMMDD 2023-01-01
PublicationDate_xml – month: 01
  year: 2023
  text: January/February 2023
PublicationDecade 2020
PublicationPlace Hoboken, USA
PublicationPlace_xml – name: Hoboken, USA
– name: United States
PublicationTitle Wiley interdisciplinary reviews. Computational statistics
PublicationTitleAlternate Wiley Interdiscip Rev Comput Stat
PublicationYear 2023
Publisher John Wiley & Sons, Inc
Publisher_xml – name: John Wiley & Sons, Inc
References 1989; 42
2004; 23
2008; 36
2000; 95
2020; 12
1992; 59
2014; 29
2014; 23
2016; 34
2018; 9
2009; 10
1986; 7
2000; 15
2010; 28
2013; 51
2017; 79
2019; 28
2007; 6
2016; 40
2014; 13
2011; 67
1982
1998; 93
1980
2010; 2
2012; 21
2016; 45
2018; 181
2022; 191
2019; 34
1986; 15
1994; 89
2020; 39
2019; 39
2019; 38
1983; 70
2012; 107
2018; 27
2003; 32
1998; 66
1999
2018; 19
2018; 17
2010; 48
1974; 66
2020; 31
2015; 63
2019; 48
2013; 178
2010; 172
2002; 70
2020; 21
2018; 10
2020; 29
2014; 70
1997; 87
2000; 3
2017; 46
2003; 14
2014; 63
2017; 9
2016; 78
2015; 45
1976; 36
2020; 3
1980; 75
2001
2015; 178
2015; 44
2016; 113
2019; 115
2016; 111
2020; 48
1992; 87
2021; 40
2021; 109
2015; 16
2019; 75
2017; 2017
2011
2010
2017; 28
1995; 13
2020; 82
2011; 30
2007
1952
2006
2020; 107
2006; 1
2011; 174
2008; 95
2011; 173
2014; 82
2020; 190
2005; 162
2013; 38
2021
2020
1969; 64
2019
2018
2017
2017; 18
2017; 186
2013
2010; 92
References_xml – start-page: 592
  year: 2013
  end-page: 617
– volume: 78
  start-page: 947
  year: 2016
  end-page: 1012
  article-title: Causal inference by using invariant prediction: Identification and confidence intervals
  publication-title: Journal of the Royal Statistical Society, Series B: Statistical Methodology
– volume: 48
  start-page: 684
  year: 2019
  end-page: 690
  article-title: Software application profile: Mrrobust — A tool for performing two‐sample summary Mendelian randomization analyses
  publication-title: International Journal of Epidemiology
– volume: 82
  start-page: 811
  year: 2014
  end-page: 822
  article-title: Identifying treatment effects under data combination
  publication-title: Econometrica
– volume: 82
  start-page: 445
  year: 2020
  end-page: 465
  article-title: Doubly robust inference when combining probability and non‐probability samples with high dimensional data
  publication-title: Journal of the Royal Statistical Society, Series B: Statistical Methodology
– volume: 23
  start-page: R89
  year: 2014
  end-page: R98
  article-title: Mendelian randomization: Genetic anchors for causal inference in epidemiological studies
  publication-title: Human Molecular Genetics
– volume: 178
  start-page: 757
  year: 2015
  end-page: 778
  article-title: From SATE to PATT: Combining experimental with observational studies to estimate population treatment effects
  publication-title: Journal of the Royal Statistical Society: Series A (Statistics in Society)
– volume: 48
  start-page: S83
  year: 2010
  end-page: S39
  article-title: Privacy‐maintaining propensity score‐based pooling of multiple databases applied to a study of biologics
  publication-title: Medical Care
– volume: 87
  start-page: 328
  year: 1992
  end-page: 336
  article-title: The effect of age at school entry on educational attainment: An application of instrumental variables with moments from two samples
  publication-title: Journal of the American Statistical Association
– volume: 7
  start-page: 177
  year: 1986
  end-page: 188
  article-title: Meta‐analysis in clinical trials
  publication-title: Controlled Clinical Trials
– volume: 64
  start-page: 1183
  year: 1969
  end-page: 1210
  article-title: A theory for record linkage
  publication-title: Journal of the American Statistical Association
– volume: 34
  start-page: 288
  year: 2016
  end-page: 301
  article-title: Efficient estimation of data combination models by the method of auxiliary‐to‐study tilting (AST)
  publication-title: Journal of Business & Economic Statistics
– volume: 63
  start-page: 195
  year: 2014
  end-page: 210
  article-title: Generalizing from unrepresentative experiments: A stratified propensity score approach
  publication-title: Journal of the Royal Statistical Society: Series C: Applied Statistics
– year: 1952
– volume: 44
  start-page: 512
  year: 2015
  end-page: 525
  article-title: Mendelian randomization with invalid instruments: Effect estimation and bias detection through egger regression
  publication-title: International Journal of Epidemiology
– volume: 173
  start-page: 739
  year: 2011
  end-page: 742
  article-title: Invited commentary: G‐computation—Lost in translation?
  publication-title: American Journal of Epidemiology
– volume: 31
  start-page: 334
  year: 2020
  end-page: 344
  article-title: Toward causally interpretable meta‐analysis: Transporting inferences from multiple randomized trials to a new target population
  publication-title: Epidemiology
– volume: 40
  start-page: 597
  year: 2016
  end-page: 608
  article-title: Bias due to participant overlap in two‐sample Mendelian randomization
  publication-title: Genetic Epidemiology
– volume: 75
  start-page: 591
  year: 1980
  end-page: 593
  article-title: Randomization analysis of experimental data: The fisher randomization test comment
  publication-title: Journal of the American Statistical Association
– volume: 13
  start-page: 225
  year: 1995
  end-page: 235
  article-title: Split‐sample instrumental variables estimates of the return to schooling
  publication-title: Journal of Business & Economic Statistics
– volume: 178
  start-page: 1177
  year: 2013
  end-page: 1184
  article-title: Efficient design for Mendelian randomization studies: Subsample and 2‐sample instrumental variable estimators
  publication-title: American Journal of Epidemiology
– volume: 172
  start-page: 107
  year: 2010
  end-page: 115
  article-title: Generalizing evidence from randomized clinical trials to target populations: The ACTG 320 trial
  publication-title: American Journal of Epidemiology
– volume: 6
  start-page: 5469
  year: 2007
  end-page: 5547
  article-title: The econometrics of data combination
  publication-title: Handbook of Econometrics
– volume: 2017
  start-page: 1347
  year: 2017
  end-page: 1353
– volume: 67
  start-page: 1047
  year: 2011
  end-page: 1056
  article-title: Hierarchical commensurate and power prior models for adaptive incorporation of historical information in clinical trials
  publication-title: Biometrics
– start-page: 116
  year: 1999
  end-page: 125
– volume: 9
  start-page: 395
  year: 2018
  end-page: 440
  article-title: Identification, data combination, and the risk of disclosure
  publication-title: Quantitative Economics
– volume: 109
  start-page: 452
  year: 2021
  end-page: 461
  article-title: Use of real‐world data to emulate a clinical trial and support regulatory decision making: Assessing the impact of temporality, comparator choice, and method of adjustment
  publication-title: Clinical Pharmacology & Therapeutics
– volume: 45
  start-page: 908
  year: 2016
  end-page: 915
  article-title: Commentary: Two‐sample Mendelian randomization: Opportunities and challenges
  publication-title: International Journal of Epidemiology
– volume: 28
  start-page: 1293
  year: 2019
  end-page: 1310
  article-title: Bayesian hierarchical methods for meta‐analysis combining randomized‐controlled and single‐arm studies
  publication-title: Statistical Methods in Medical Research
– volume: 63
  start-page: 57
  year: 2015
  end-page: 68
  article-title: Integration and imputation of survey data in R: The StatMatch package
  publication-title: Romanian Statistical Review
– volume: 186
  start-page: 1010
  year: 2017
  end-page: 1014
  article-title: Transportability of trial results using inverse odds of sampling weights
  publication-title: American Journal of Epidemiology
– year: 2019
– volume: 18
  start-page: 553
  year: 2017
  end-page: 568
  article-title: Guided Bayesian imputation to adjust for confounding when combining heterogeneous data sources in comparative effectiveness research
  publication-title: Biostatistics
– volume: 36
  start-page: 808
  year: 2008
  end-page: 843
  article-title: Semiparametric efficiency in GMM models with auxiliary data
  publication-title: The Annals of Statistics
– volume: 79
  start-page: 1509
  year: 2017
  end-page: 1525
  article-title: Robust estimation of encouragement‐design intervention effects transported across sites
  publication-title: Journal of the Royal Statistical Society, Series B: Statistical Methodology
– volume: 48
  start-page: 259
  year: 2020
  end-page: 284
  article-title: Improved methods for moment restriction models with data combination and an application to two‐sample instrumental variable estimation
  publication-title: The Canadian Journal of Statistics
– start-page: 389
  year: 2020
  end-page: 398
– year: 2007
– volume: 32
  start-page: 1
  year: 2003
  end-page: 22
  article-title: ‘Mendelian randomization’: Can genetic epidemiology contribute to understanding environmental determinants of disease?
  publication-title: International Journal of Epidemiology
– volume: 66
  start-page: 315
  year: 1998
  end-page: 331
  article-title: On the role of the propensity score in efficient semiparametric estimation of average treatment effects
  publication-title: Econometrica
– volume: 48
  start-page: 1742
  year: 2020
  end-page: 1769
  article-title: Statistical inference in two‐sample summary‐data Mendelian randomization using robust adjusted profile score
  publication-title: The Annals of Statistics
– volume: 113
  start-page: 7345
  year: 2016
  end-page: 7352
  article-title: Causal inference and the data‐fusion problem
  publication-title: Proceedings of the National Academy of Sciences of the United States of America
– volume: 36
  start-page: 285
  year: 1976
  end-page: 294
  article-title: Inequalities for E ( , ) when the marginals are fixed
  publication-title: Zeitschrift für Wahrscheinlichkeitstheorie und verwandte Gebiete
– volume: 38
  start-page: 95
  year: 2019
  end-page: 123
  article-title: Two‐sample least squares projection
  publication-title: Econometric Reviews
– volume: 45
  start-page: 139
  year: 2015
  end-page: 145
  article-title: Meta‐analysis in clinical trials revisited
  publication-title: Contemporary Clinical Trials
– volume: 89
  start-page: 846
  year: 1994
  end-page: 866
  article-title: Estimation of regression coefficients when some regressors are not always observed
  publication-title: Journal of the American Statistical Association
– year: 2010
– volume: 29
  start-page: 1668
  year: 2020
  end-page: 1681
  article-title: Inverse probability weighted cox model in multi‐site studies without sharing individual‐level data
  publication-title: Statistical Methods in Medical Research
– volume: 115
  start-page: 1540
  year: 2019
  end-page: 1554
  article-title: Combining multiple observational data sources to estimate causal effects
  publication-title: Journal of the American Statistical Association
– volume: 107
  start-page: 806
  year: 2020
  end-page: 816
  article-title: Beyond randomized clinical trials: Use of external controls
  publication-title: Clinical Pharmacology & Therapeutics
– volume: 87
  start-page: 1009
  year: 1997
  end-page: 1018
  article-title: Intergenerational income mobility in Sweden compared to the United States
  publication-title: The American Economic Review
– volume: 30
  start-page: 1329
  year: 2011
  end-page: 1338
  article-title: The inclusion of historical control data may reduce the power of a confirmatory study
  publication-title: Statistics in Medicine
– volume: 181
  start-page: 1193
  year: 2018
  end-page: 1209
  article-title: Generalizing evidence from randomized trials using inverse probability of sampling weights
  publication-title: Journal of the Royal Statistical Society: Series A (Statistics in Society)
– volume: 107
  start-page: 40
  year: 2012
  end-page: 51
  article-title: Adjustment for missing confounders using external validation data and propensity scores
  publication-title: Journal of the American Statistical Association
– volume: 13
  start-page: 41
  year: 2014
  end-page: 54
  article-title: Use of historical control data for assessing treatment effects in clinical trials
  publication-title: Pharmaceutical Statistics
– volume: 75
  start-page: 685
  year: 2019
  end-page: 694
  article-title: Generalizing causal inferences from individuals in randomized trials to all trial‐eligible individuals
  publication-title: Biometrics
– volume: 42
  start-page: 317
  year: 1989
  end-page: 324
  article-title: Performance of tests of significance based on stratification by a multivariate confounder score or by a propensity score
  publication-title: Journal of Clinical Epidemiology
– volume: 66
  start-page: 688
  year: 1974
  end-page: 701
  article-title: Estimating causal effects of treatments in randomized and nonrandomized studies
  publication-title: Journal of Educational Psychology
– volume: 1
  start-page: 515
  year: 2006
  end-page: 534
  article-title: Prior distributions for variance parameters in hierarchical models (comment on article by Browne and Draper)
  publication-title: Bayesian Analysis
– volume: 10
  start-page: 1773
  year: 2018
  end-page: 1768
  article-title: Combining distributed regression and propensity scores: A doubly privacy‐protecting analytic method for multicenter research
  publication-title: Clinical Epidemiology
– volume: 39
  start-page: 369
  year: 2019
  end-page: 386
  article-title: Safety surveillance and the estimation of risk in select populations: Flexible methods to control for confounding while targeting marginal comparisons via standardization
  publication-title: Statistics in Medicine
– volume: 12
  start-page: 322
  year: 2020
  end-page: 333
  article-title: Target population statistical inference with data integration across multiple sources — An approach to mitigate information shortage in rare disease clinical trials
  publication-title: Statistics in Biopharmaceutical Research
– volume: 16
  start-page: 2147
  year: 2015
  end-page: 2205
  article-title: Constraint‐based causal discovery from multiple interventions over overlapping variable sets
  publication-title: Journal of Machine Learning Research
– volume: 3
  year: 2000
– volume: 14
  start-page: 300
  year: 2003
  end-page: 306
  article-title: Quantifying biases in causal models: Classical confounding vs collider‐stratification bias
  publication-title: Epidemiology
– year: 2021
– volume: 19
  start-page: 169
  year: 2018
  end-page: 184
  article-title: Bayesian hierarchical modeling based on multisource exchangeability
  publication-title: Biostatistics
– year: 2018
– volume: 174
  start-page: 369
  year: 2011
  end-page: 386
  article-title: The use of propensity scores to assess the generalizability of results from randomized trials
  publication-title: Journal of the Royal Statistical Society: Series A (Statistics in Society)
– volume: 40
  start-page: 5434
  year: 2021
  end-page: 5452
  article-title: Testing and correcting for weak and pleiotropic instruments in two‐sample multivariable Mendelian randomisation
  publication-title: Statistics in Medicine
– volume: 27
  start-page: 1034
  year: 2018
  end-page: 1041
  article-title: Comparison of privacy‐protecting analytic and data‐sharing methods: A simulation study
  publication-title: Pharmacoepidemiology and Drug Safety
– year: 1982
– volume: 95
  start-page: 415
  year: 2000
  end-page: 442
  article-title: Identification problems and decisions under ambiguity: Empirical analysis of treatment response and normative analysis of treatment choice
  publication-title: Journal of Econometrics
– volume: 29
  start-page: 579
  year: 2014
  end-page: 595
  article-title: External validity: From do‐calculus to transportability across populations
  publication-title: Statistical Science
– volume: 107
  start-page: 834
  year: 2020
  end-page: 842
  article-title: Analytic and data sharing options in real‐world multidatabase studies of comparative effectiveness and safety of medical products
  publication-title: Clinical Pharmacology & Therapeutics
– volume: 95
  start-page: 481
  year: 2008
  end-page: 488
  article-title: The prognostic analogue of the propensity score
  publication-title: Biometrika
– volume: 3
  start-page: 625
  year: 2020
  end-page: 650
  article-title: Statistical data integration in survey sampling: A review
  publication-title: Japanese Journal of Statistics and Data Science
– volume: 9
  start-page: 224
  year: 2018
  article-title: Causal associations between risk factors and common diseases inferred from GWAS summary data
  publication-title: Nature Communications
– volume: 2
  start-page: 535
  year: 2010
  end-page: 543
  article-title: Record linkage
  publication-title: WIREs Computational Statistics
– volume: 10
  start-page: 335
  year: 2009
  end-page: 351
  article-title: Bayesian graphical models for regression on multiple data sets with different variables
  publication-title: Biostatistics
– volume: 162
  start-page: 279
  year: 2005
  end-page: 289
  article-title: Adjusting effect estimates for unmeasured confounding with validation data using propensity score calibration
  publication-title: American Journal of Epidemiology
– volume: 40
  start-page: 304
  year: 2016
  end-page: 314
  article-title: Consistent estimation in Mendelian randomization with some invalid instruments using a weighted median estimator
  publication-title: Genetic Epidemiology
– volume: 21
  start-page: 1
  year: 2020
  end-page: 108
  article-title: Joint causal inference from multiple contexts
  publication-title: Journal of Machine Learning Research
– start-page: 512
  year: 2001
  end-page: 521
– volume: 190
  start-page: 1142
  issue: 6
  year: 2020
  end-page: 1147
  article-title: A warning about using predicted values from regression models for epidemiologic inquiry
  publication-title: American Journal of Epidemiology
– volume: 39
  start-page: 1999
  year: 2020
  end-page: 2014
  article-title: Extending inferences from a randomized trial to a new target population
  publication-title: Statistics in Medicine
– volume: 59
  start-page: 537
  year: 1992
  end-page: 559
  article-title: Female labour supply and on‐the‐job search: An empirical model estimated using complementary data sets
  publication-title: The Review of Economic Studies
– volume: 17
  start-page: 329
  year: 2018
  end-page: 341
  article-title: How to use prior knowledge and still give new data a chance?
  publication-title: Pharmaceutical Statistics
– volume: 70
  start-page: 1023
  year: 2014
  end-page: 1032
  article-title: Robust meta‐analytic‐predictive priors in clinical trials with historical control information
  publication-title: Biometrics
– volume: 92
  start-page: 557
  year: 2010
  end-page: 561
  article-title: Two‐sample instrumental variables estimators
  publication-title: The Review of Economics and Statistics
– volume: 191
  start-page: 674
  year: 2022
  end-page: 678
  article-title: Invited commentary: Estimation and bounds under Data fusion
  publication-title: American Journal of Epidemiology
– volume: 70
  start-page: 41
  year: 1983
  end-page: 55
  article-title: The central role of the propensity score in observational studies for causal effects
  publication-title: Biometrika
– volume: 89
  start-page: 81
  year: 1994
  end-page: 87
  article-title: Nonparametric estimation of mean functionals with data missing at random
  publication-title: Journal of the American Statistical Association
– volume: 31
  start-page: 353
  year: 2020
  end-page: 355
  article-title: Importance of homogeneous effect modification for causal interpretation of meta‐analyses
  publication-title: Epidemiology
– volume: 70
  start-page: 357
  year: 2002
  end-page: 368
  article-title: Regressions, short and long
  publication-title: Econometrica
– volume: 39
  start-page: 1054
  year: 2020
  end-page: 1067
  article-title: Causal data fusion methods using summary‐level statistics for a continuous outcome
  publication-title: Statistics in Medicine
– volume: 28
  start-page: 553
  year: 2017
  end-page: 561
  article-title: Generalizing study results: A potential outcomes perspective
  publication-title: Epidemiology
– start-page: 107
  year: 2007
  end-page: 114
– volume: 45
  start-page: 954
  year: 2016
  end-page: 964
  article-title: Probabilistic record linkage
  publication-title: International Journal of Epidemiology
– volume: 111
  start-page: 1466
  year: 2016
  end-page: 1479
  article-title: Multiple imputation of missing categorical and continuous values via Bayesian mixture models with local dependence
  publication-title: Journal of the American Statistical Association
– year: 1980
– volume: 51
  start-page: S4
  year: 2013
  end-page: S10
  article-title: Confounding adjustment in comparative effectiveness research conducted within distributed research networks
  publication-title: Medical Care
– volume: 7
  start-page: 1393
  year: 1986
  end-page: 1512
  article-title: A new approach to causal inference in mortality studies with a sustained exposure period—Application to control of the healthy worker survivor effect
  publication-title: Mathematical Modelling
– volume: 46
  start-page: 1985
  year: 2017
  end-page: 1998
  article-title: Robust inference in summary data Mendelian randomization via the zero modal pleiotropy assumption
  publication-title: International Journal of Epidemiology
– volume: 15
  start-page: 46
  year: 2000
  end-page: 60
  article-title: Power prior distributions for regression models
  publication-title: Statistical Science
– year: 2006
– year: 2020
– volume: 28
  start-page: 935
  year: 2010
  end-page: 945
  article-title: Comparative effectiveness without head‐to‐head trials
  publication-title: PharmacoEconomics
– start-page: 3
  year: 2011
  end-page: 15
– volume: 38
  start-page: 239
  year: 2013
  end-page: 266
  article-title: Improving generalizations from experiments using propensity score subclassification: Assumptions, properties, and contexts
  publication-title: Journal of Educational and Behavioral Statistics
– volume: 9
  start-page: 284
  year: 2017
  end-page: 297
  article-title: The myth of making inferences for an overall treatment efficacy with data from multiple comparative studies via meta‐analysis
  publication-title: Statistics in Biosciences
– volume: 34
  start-page: 317
  year: 2019
  end-page: 333
  article-title: Two‐sample instrumental variable analyses using heterogeneous samples
  publication-title: Statistical Science
– volume: 34
  start-page: 719
  year: 2019
  end-page: 722
  article-title: Extending inferences from a randomized trial to a target population
  publication-title: European Journal of Epidemiology
– volume: 93
  start-page: 846
  year: 1998
  end-page: 857
  article-title: Not asked and not answered: Multiple imputation for multiple surveys
  publication-title: Journal of the American Statistical Association
– volume: 21
  start-page: 41
  year: 2012
  end-page: 49
  article-title: Using high‐dimensional propensity scores to automate confounding control in a distributed medical product safety surveillance system
  publication-title: Pharmacoepidemiology and Drug Safety
– volume: 15
  start-page: 413
  year: 1986
  end-page: 419
  article-title: Identifiability, exchangeability, and epidemiological confounding
  publication-title: International Journal of Epidemiology
– year: 2017
– volume: 23
  start-page: 2937
  year: 2004
  end-page: 2960
  article-title: Stratification and weighting via the propensity score in estimation of causal treatment effects: A comparative study
  publication-title: Statistics in Medicine
– year: 1999
SSID ssj0067676
Score 2.450423
SecondaryResourceType review_article
Snippet Integrating data from multiple heterogeneous sources has become increasingly popular to achieve a large sample size and diverse study population. This article...
Integrating data from multiple heterogeneous sources has become increasingly popular to achieve a large sample size and diverse study population. This paper...
SourceID pubmed
wiley
SourceType Index Database
Publisher
SubjectTerms causal inference
data fusion
data integration
generalizability
transportability
Title Data integration in causal inference
URI https://onlinelibrary.wiley.com/doi/abs/10.1002%2Fwics.1581
https://www.ncbi.nlm.nih.gov/pubmed/36713955
Volume 15
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LS8NAEF5KvehBfL8lhx68rE321SyepFqq0FKwYm9hX4FeqpgU_74zSRN7lEBYstnDzuabb3bCfEtITzqf5LFzVNg0UCENQMrEnJrgge6sT3nAhP5kqsbv4nUhFx3y0NTC1PoQbcINkVH5awS4sUX_TzT0Z-mK-0Ri2fUOltaicD4Ts8YNoxCZqn8pawofXtrICsWs3w7dop3t0LTiltEB2d8EhdFjvYqHpBNWR2Rv0iqqFsek92RKEzXiDmBMaEfOrAsYt2yK9k7IfPQ8H47p5oQD6pCoqbKei5DE3qHsOTwJYcCtCnnKuLAaLpUbq73niL00SMO8Sg3418Sp2PFT0l19rsI5iZjwuRWWaUCgGEhrtTNcQ7hinU5cLC_IWT3T7KtWsci4gv2pltBzV0297aiFjFmGVsrQStnHy_ANG5f_f_WK7OLB7HWy4pp0y-91uAH6Lu1ttUxwn84mv1LEll8
linkProvider Wiley-Blackwell
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV07T8MwED5VMAAD4v2GDB1YTBO_GkssqFC10FZIFNEt8itSl4JoK_4-56QJHVEWK7YHn_Pdd77oPgM0hXVJHltLuEk94UIjpHTMiPYO6c64lPmQ0B-OZO-dP0_EpAH3VS1MqQ9RJ9wCMgp_HQAeEtKtP9XQn6md3yUi1F1vcknbAZaUv1Z-OCiRyfKfsiL45aWVrlBMW_XUNd5Zj00Lcunuwe4qKoweym3ch4afHcDOsJZUnR9C81EvdFSpO6A1sR1ZvZzjvGlVtXcE4-7TuNMjqysOiA1MTaRxjPskdjbonuMb79vMSJ-nlHGj8JG5Nso5FsCXeqGpk6lGB5tYGVt2DBuzz5k_hYhylxtuqEII8rYwRlnNFMYrxqrExuIMTsqVZl-ljEXGJB5QlcCe22LpdUepZEyzYKUsWCn76HfeQuP8_0NvYKs3Hg6yQX_0cgHb4Zb2MnNxCRuL76W_Qi5fmOtiy34B_g2Yvw
linkToPdf http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1LT8JAEJ4QTIwejG9898DBy0rbfdCNJwMSUCEkYuTW7KsJFyQC8e8729LK0fSy6XYPO7vffNPZzLcATW5slIXGEKYTRxhXCCkVUqKcRbrTNqHOJ_SHI9H_YC9TPq3BY1kLU-hDVAk3j4zcX3uAL2zW-hMN_ZmZ5UPEfdn1Tn7Y52Wd2bh0w16ITBRHypLgxktKWaEwblVDt2hnOzTNuaV3CAeboDB4KlbxCGpufgz7w0pRdXkCza5aqaAUd0BjYjswar3EcbOyaO8UJr3nSadPNjccEOOJmghtKXNRaI2XPcc3zrWpFi5LYsq0xEdkSktrqcde4riKrUgU-tfIiNDQM6jPv-auAUHMbKaZjiUikLW51tIoKjFc0UZGJuQXcF7MNF0UKhYpFfh_Kjn23OdTrzoKIeM49VZKvZXSz0Hn3Tcu___pHeyOu730bTB6vYI9f0d7kbe4hvrqe-1ukMlX-jZfsV9xk5fx
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Data+integration+in+causal+inference&rft.jtitle=Wiley+interdisciplinary+reviews.+Computational+statistics&rft.au=Shi%2C+Xu&rft.au=Pan%2C+Ziyang&rft.au=Miao%2C+Wang&rft.date=2023-01-01&rft.pub=John+Wiley+%26+Sons%2C+Inc&rft.issn=1939-5108&rft.eissn=1939-0068&rft.volume=15&rft.issue=1&rft.epage=n%2Fa&rft_id=info:doi/10.1002%2Fwics.1581&rft.externalDBID=10.1002%252Fwics.1581&rft.externalDocID=WICS1581
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1939-5108&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1939-5108&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1939-5108&client=summon