MT-HESS: an efficient Bayesian approach for simultaneous association detection in OMICS datasets, with application to eQTL mapping in multiple tissues

Analysing the joint association between a large set of responses and predictors is a fundamental statistical task in integrative genomics, exemplified by numerous expression Quantitative Trait Loci (eQTL) studies. Of particular interest are the so-called ': hotspots ': , important genetic...

Full description

Saved in:
Bibliographic Details
Published inBioinformatics Vol. 32; no. 4; pp. 523 - 532
Main Authors Lewin, Alex, Saadi, Habib, Peters, James E, Moreno-Moral, Aida, Lee, James C, Smith, Kenneth G C, Petretto, Enrico, Bottolo, Leonardo, Richardson, Sylvia
Format Journal Article
LanguageEnglish
Published England Oxford University Press 15.02.2016
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Analysing the joint association between a large set of responses and predictors is a fundamental statistical task in integrative genomics, exemplified by numerous expression Quantitative Trait Loci (eQTL) studies. Of particular interest are the so-called ': hotspots ': , important genetic variants that regulate the expression of many genes. Recently, attention has focussed on whether eQTLs are common to several tissues, cell-types or, more generally, conditions or whether they are specific to a particular condition. We have implemented MT-HESS, a Bayesian hierarchical model that analyses the association between a large set of predictors, e.g. SNPs, and many responses, e.g. gene expression, in multiple tissues, cells or conditions. Our Bayesian sparse regression algorithm goes beyond ': one-at-a-time ': association tests between SNPs and responses and uses a fully multivariate model search across all linear combinations of SNPs, coupled with a model of the correlation between condition/tissue-specific responses. In addition, we use a hierarchical structure to leverage shared information across different genes, thus improving the detection of hotspots. We show the increase of power resulting from our new approach in an extensive simulation study. Our analysis of two case studies highlights new hotspots that would remain undetected by standard approaches and shows how greater prediction power can be achieved when several tissues are jointly considered. C[Formula: see text] source code and documentation including compilation instructions are available under GNU licence at http://www.mrc-bsu.cam.ac.uk/software/.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
The authors wish it to be known that, in their opinion, these authors should be regarded as Joint First Authors.
The authors wish it to be known that, in their opinion, these authors should be regarded as Joint Second Authors.
Associate Editor: Janet Kelso
ISSN:1367-4803
1367-4811
1460-2059
DOI:10.1093/bioinformatics/btv568