Fine‐mapping and QTL tissue‐sharing information improves the reliability of causal gene identification

The integration of transcriptomic studies and genome‐wide association studies (GWAS) via imputed expression has seen extensive application in recent years, enabling the functional characterization and causal gene prioritization of GWAS loci. However, the techniques for imputing transcriptomic traits...

Full description

Saved in:
Bibliographic Details
Published inGenetic epidemiology Vol. 44; no. 8; pp. 854 - 867
Main Authors Barbeira, Alvaro N., Melia, Owen J., Liang, Yanyu, Bonazzola, Rodrigo, Wang, Gao, Wheeler, Heather E., Aguet, François, Ardlie, Kristin G., Wen, Xiaoquan, Im, Hae K.
Format Journal Article
LanguageEnglish
Published United States Wiley Subscription Services, Inc 01.11.2020
John Wiley and Sons Inc
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:The integration of transcriptomic studies and genome‐wide association studies (GWAS) via imputed expression has seen extensive application in recent years, enabling the functional characterization and causal gene prioritization of GWAS loci. However, the techniques for imputing transcriptomic traits from DNA variation remain underdeveloped. Furthermore, associations found when linking eQTL studies to complex traits through methods like PrediXcan can lead to false positives due to linkage disequilibrium between distinct causal variants. Therefore, the best prediction performance models may not necessarily lead to more reliable causal gene discovery. With the goal of improving discoveries without increasing false positives, we develop and compare multiple transcriptomic imputation approaches using the most recent GTEx release of expression and splicing data on 17,382 RNA‐sequencing samples from 948 post‐mortem donors in 54 tissues. We find that informing prediction models with posterior causal probability from fine‐mapping (dap‐g) and borrowing information across tissues (mashr) can lead to better performance in terms of number and proportion of significant associations that are colocalized and the proportion of silver standard genes identified as indicated by precision‐recall and receiver operating characteristic curves. All prediction models are made publicly available at predictdb.org.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:0741-0395
1098-2272
1098-2272
DOI:10.1002/gepi.22346