The Importance of Being Earnest: Validation is the Absolute Essential for Successful Application and Interpretation of QSPR Models
This paper emphasizes the importance of rigorous validation as a crucial, integral component of Quantitative Structure Property Relationship (QSPR) model development. We consider some examples of published QSPR models, which in spite of their high fitted accuracy for the training sets and apparent m...
Saved in:
Published in | QSAR & combinatorial science Vol. 22; no. 1; pp. 69 - 77 |
---|---|
Main Authors | , , |
Format | Journal Article |
Language | English |
Published |
Weinheim
WILEY-VCH Verlag
01.04.2003
WILEY‐VCH Verlag |
Subjects | |
Online Access | Get full text |
ISSN | 1611-020X 1611-0218 |
DOI | 10.1002/qsar.200390007 |
Cover
Summary: | This paper emphasizes the importance of rigorous validation as a crucial, integral component of Quantitative Structure Property Relationship (QSPR) model development. We consider some examples of published QSPR models, which in spite of their high fitted accuracy for the training sets and apparent mechanistic appeal, fail rigorous validation tests, and, thus, may lack practical utility as reliable screening tools. We present a set of simple guidelines for developing validated and predictive QSPR models. To this end, we discuss several validation strategies including (1) randomization of the modelled property, also called Y‐scrambling, (2) multiple leave‐many‐out cross‐validations, and (3) external validation using rational division of a dataset into training and test sets. We also highlight the need to establish the domain of model applicability in the chemical space to flag molecules for which predictions may be unreliable, and discuss some algorithms that can be used for this purpose. We advocate the broad use of these guidelines in the development of predictive QSPR models. |
---|---|
Bibliography: | istex:C1D1227F8FE7F38B4E1F71698807C44037AAC145 ark:/67375/WNG-LTQF3TF2-2 ArticleID:QSAR200390007 to receive correspondence (All authors equally contributed to this paper) |
ISSN: | 1611-020X 1611-0218 |
DOI: | 10.1002/qsar.200390007 |