Combining information on multiple instrumental variables in Mendelian randomization: comparison of allele score and summarized data methods

Mendelian randomization is the use of genetic instrumental variables to obtain causal inferences from observational data. Two recent developments for combining information on multiple uncorrelated instrumental variables (IVs) into a single causal estimate are as follows: (i) allele scores, in which...

Full description

Saved in:

Bibliographic Details
Published in	Statistics in medicine Vol. 35; no. 11; pp. 1880 - 1906
Main Authors	Burgess, Stephen, Dudbridge, Frank, Thompson, Simon G.
Format	Journal Article
Language	English
Published	England Blackwell Publishing Ltd 20.05.2016 Wiley Subscription Services, Inc John Wiley and Sons Inc
Subjects	aggregated data allele score Alleles Cardiovascular disease causal inference Causality Cholesterol, LDL - blood Cholesterol, LDL - genetics Coronary Disease - blood Coronary Disease - genetics Estimating techniques Estimation bias genetic risk score genetic variants Humans instrumental variables Low density lipoprotein Mendelian randomization Mendelian Randomization Analysis - methods Models, Genetic Models, Statistical Risk Factors Statistical inference summarized data weak instruments genetic variants instrumental variables summarized data allele score genetic risk score Mendelian randomization weak instruments causal inference aggregated data
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Mendelian randomization is the use of genetic instrumental variables to obtain causal inferences from observational data. Two recent developments for combining information on multiple uncorrelated instrumental variables (IVs) into a single causal estimate are as follows: (i) allele scores, in which individual‐level data on the IVs are aggregated into a univariate score, which is used as a single IV, and (ii) a summary statistic method, in which causal estimates calculated from each IV using summarized data are combined in an inverse‐variance weighted meta‐analysis. To avoid bias from weak instruments, unweighted and externally weighted allele scores have been recommended. Here, we propose equivalent approaches using summarized data and also provide extensions of the methods for use with correlated IVs. We investigate the impact of different choices of weights on the bias and precision of estimates in simulation studies. We show that allele score estimates can be reproduced using summarized data on genetic associations with the risk factor and the outcome. Estimates from the summary statistic method using external weights are biased towards the null when the weights are imprecisely estimated; in contrast, allele score estimates are unbiased. With equal or external weights, both methods provide appropriate tests of the null hypothesis of no causal effect even with large numbers of potentially weak instruments. We illustrate these methods using summarized data on the causal effect of low‐density lipoprotein cholesterol on coronary heart disease risk. It is shown that a more precise causal estimate can be obtained using multiple genetic variants from a single gene region, even if the variants are correlated. © 2015 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd.
Bibliography:	ArticleID:SIM6835 ark:/67375/WNG-9ZN85Z8P-B Wellcome Trust - No. 100114 Medical Research Council - No. K006215 British Heart Foundation - No. CH/12/2/29428 istex:9BAF9E488870C6F9349B6286BDB84EF74F5B94FE ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23
ISSN:	0277-6715 1097-0258
DOI:	10.1002/sim.6835