Bayesian method to predict individual SNP genotypes from gene expression data

Eric Schadt and colleagues report a Bayesian method to predict individual SNP genotypes based on RNA expression data. Using simulations and empirical data sets, they show that it is possible to infer a genotypic barcode specific to an individual, although the identification of an individual as a par...

Full description

Saved in:
Bibliographic Details
Published inNature genetics Vol. 44; no. 5; pp. 603 - 608
Main Authors Schadt, Eric E, Woo, Sangsoon, Hao, Ke
Format Journal Article
LanguageEnglish
Published New York Nature Publishing Group US 01.05.2012
Nature Publishing Group
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Eric Schadt and colleagues report a Bayesian method to predict individual SNP genotypes based on RNA expression data. Using simulations and empirical data sets, they show that it is possible to infer a genotypic barcode specific to an individual, although the identification of an individual as a participant in a study is limited by factors such as the availability of large-scale expression quantitative trait loci (eQTLs) and expression data sets. RNA profiling can be used to capture the expression patterns of many genes that are associated with expression quantitative trait loci (eQTLs). Employing published putative cis eQTLs, we developed a Bayesian approach to predict SNP genotypes that is based only on RNA expression data. We show that predicted genotypes can accurately and uniquely identify individuals in large populations. When inferring genotypes from an expression data set using eQTLs of the same tissue type (but from an independent cohort), we were able to resolve 99% of the identities of individuals in the cohort at P adjusted ≤ 1 × 10 −5 . When eQTLs derived from one tissue were used to predict genotypes using expression data from a different tissue, the identities of 90% of the study subjects could be resolved at P adjusted ≤ 1 × 10 −5 . We discuss the implications of deriving genotypic information from RNA data deposited in the public domain.
Bibliography:ObjectType-Article-2
SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 14
ObjectType-Article-1
ObjectType-Feature-2
content type line 23
ISSN:1061-4036
1546-1718
1546-1718
DOI:10.1038/ng.2248