The Evaluation of Tools Used to Predict the Impact of Missense Variants Is Hindered by Two Types of Circularity

ABSTRACT Prioritizing missense variants for further experimental investigation is a key challenge in current sequencing studies for exploring complex and Mendelian diseases. A large number of in silico tools have been employed for the task of pathogenicity prediction, including PolyPhen‐2, SIFT, Fat...

Full description

Saved in:

Bibliographic Details
Published in	Human mutation Vol. 36; no. 5; pp. 513 - 523
Main Authors	Grimm, Dominik G., Azencott, Chloé-Agathe, Aicheler, Fabian, Gieraths, Udo, MacArthur, Daniel G., Samocha, Kaitlin E., Cooper, David N., Stenson, Peter D., Daly, Mark J., Smoller, Jordan W., Duncan, Laramie E., Borgwardt, Karsten M.
Format	Journal Article
Language	English
Published	United States Blackwell Publishing Ltd 01.05.2015 John Wiley & Sons, Inc Wiley John Wiley and Sons Inc
Subjects	Computational Biology - methods Datasets as Topic exome sequencing Genetic disorders Humans Internet Life Sciences Mutation Mutation, Missense pathogenicity prediction tools Quantitative Methods Reproducibility of Results Software Web Browser pathogenicity prediction tools exome sequencing
Online Access	Get full text

Cover

Loading…

More Information
Summary:	ABSTRACT Prioritizing missense variants for further experimental investigation is a key challenge in current sequencing studies for exploring complex and Mendelian diseases. A large number of in silico tools have been employed for the task of pathogenicity prediction, including PolyPhen‐2, SIFT, FatHMM, MutationTaster‐2, MutationAssessor, Combined Annotation Dependent Depletion, LRT, phyloP, and GERP++, as well as optimized methods of combining tool scores, such as Condel and Logit. Due to the wealth of these methods, an important practical question to answer is which of these tools generalize best, that is, correctly predict the pathogenic character of new variants. We here demonstrate in a study of 10 tools on five datasets that such a comparative evaluation of these tools is hindered by two types of circularity: they arise due to (1) the same variants or (2) different variants from the same protein occurring both in the datasets used for training and for evaluation of these tools, which may lead to overly optimistic results. We show that comparative evaluations of predictors that do not address these types of circularity may erroneously conclude that circularity confounded tools are most accurate among all tools, and may even outperform optimized combinations of tools. In a study of ten in silico pathogenicity prediction tools on five datasets we demonstrate that two types of circularity hinder a comparative evaluation of these prediction tools. We further show that comparative evaluations of predictors that do not address these types of circularity may erroneously conclude that circularity confounded tools are most accurate among all tools, and may even outperform optimized combinations of tools.
Bibliography:	istex:D573BC28BB908F766F7473D3AE06DE23CB9B0F06 NIH/NIMH - No. K24MH094614 ArticleID:HUMU22768 ark:/67375/WNG-LMKPD09L-6 Communicated by Mauno Vihinen These authors made equal contributions. Contract grant sponsors: The research of Professor Dr. Karsten Borgwardt was supported by the Alfried Krupp Prize for Young University Teachers of the Alfried Krupp von Bohlen und Halbach‐Stiftung. Dr. Smoller is supported in part by NIH/NIMH grant K24MH094614. ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23 PMCID: PMC4409520
ISSN:	1059-7794 1098-1004 1098-1004
DOI:	10.1002/humu.22768