Predicting the Functional, Molecular, and Phenotypic Consequences of Amino Acid Substitutions using Hidden Markov Models

ABSTRACT The rate at which nonsynonymous single nucleotide polymorphisms (nsSNPs) are being identified in the human genome is increasing dramatically owing to advances in whole‐genome/whole‐exome sequencing technologies. Automated methods capable of accurately and reliably distinguishing between pat...

Full description

Saved in:
Bibliographic Details
Published inHuman mutation Vol. 34; no. 1; pp. 57 - 65
Main Authors Shihab, Hashem A., Gough, Julian, Cooper, David N., Stenson, Peter D., Barker, Gary L. A., Edwards, Keith J., Day, Ian N. M., Gaunt, Tom R.
Format Journal Article
LanguageEnglish
Published United States Blackwell Publishing Ltd 01.01.2013
John Wiley & Sons, Inc
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:ABSTRACT The rate at which nonsynonymous single nucleotide polymorphisms (nsSNPs) are being identified in the human genome is increasing dramatically owing to advances in whole‐genome/whole‐exome sequencing technologies. Automated methods capable of accurately and reliably distinguishing between pathogenic and functionally neutral nsSNPs are therefore assuming ever‐increasing importance. Here, we describe the Functional Analysis Through Hidden Markov Models (FATHMM) software and server: a species‐independent method with optional species‐specific weightings for the prediction of the functional effects of protein missense variants. Using a model weighted for human mutations, we obtained performance accuracies that outperformed traditional prediction methods (i.e., SIFT, PolyPhen, and PANTHER) on two separate benchmarks. Furthermore, in one benchmark, we achieve performance accuracies that outperform current state‐of‐the‐art prediction methods (i.e., SNPs&GO and MutPred). We demonstrate that FATHMM can be efficiently applied to high‐throughput/large‐scale human and nonhuman genome sequencing projects with the added benefit of phenotypic outcome associations. To illustrate this, we evaluated nsSNPs in wheat (Triticum spp.) to identify some of the important genetic variants responsible for the phenotypic differences introduced by intense selection during domestication. A Web‐based implementation of FATHMM, including a high‐throughput batch facility and a downloadable standalone package, is available at http://fathmm.biocompute.org.uk.
Bibliography:UK Medical Research Council - No. G1000427
ArticleID:HUMU22225
UK Biotechnology and Biological Sciences Research Council - No. BB/G022771
ark:/67375/WNG-6TDJL37K-7
BIOBASE GmbH
istex:926F9D7D8720820F32952A8DE3BACD96716CDA30
Joint last authorship.
Contract grant sponsors: UK Medical Research Council (G1000427 to T.R.G. and I.N.M.D.); UK Biotechnology and Biological Sciences Research Council (BB/G022771 to J.G.); BIOBASE GmbH (to D.N.C. and P.D.S.).
Joint first authorship.
Communicated by Christophe Béroud
ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ObjectType-Article-2
ObjectType-Feature-1
ISSN:1059-7794
1098-1004
1098-1004
DOI:10.1002/humu.22225