Assessment of applicability domain for multivariate counter-propagation artificial neural network predictive models by minimum Euclidean distance space analysis: A case study

[Display omitted] ► The concept of applicability domain (AD) in QSAR modeling is discussed. ► The AD assessment method for nonlinear neural network predictive models is proposed. ► The counter-propagation artificial neural network (CP-ANN) was applied for modeling. ► Minimal Euclidean distance space...

Full description

Saved in:
Bibliographic Details
Published inAnalytica chimica acta Vol. 759; pp. 28 - 42
Main Authors Minovski, Nikola, Župerl, Špela, Drgan, Viktor, Novič, Marjana
Format Journal Article
LanguageEnglish
Published Amsterdam Elsevier B.V 08.01.2013
Elsevier
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:[Display omitted] ► The concept of applicability domain (AD) in QSAR modeling is discussed. ► The AD assessment method for nonlinear neural network predictive models is proposed. ► The counter-propagation artificial neural network (CP-ANN) was applied for modeling. ► Minimal Euclidean distance space (MEDS) of CP-ANN model was defined and analyzed. ► The resulting outliers coincide with those from linear models (leverage based AD). Alongside the validation, the concept of applicability domain (AD) is probably one of the most important aspects which determine the quality as well as reliability of the established quantitative structure–activity relationship (QSAR) models. To date, a variety of approaches for AD estimation have been devised which can be applied to particular type of QSAR models and their practical utilization is extensively elaborated in the literature. The present study introduces a novel, simple, and effective distance-based method for estimation of the AD in case of developed and validated predictive counter-propagation artificial neural network (CP ANN) models through a proficient exploitation of the Euclidean distance (ED) metric in the structure-representation vector space. The performance of the method was evaluated and explained in a case study by using a pre-built and validated CP ANN model for prediction of the transport activity of the transmembrane protein bilitranslocase for a diverse set of compounds. The method was tested on two more datasets in order to confirm its performance for evaluation of the applicability domain in CP ANN models. The chemical compounds determined as potential outliers, i.e., outside of the CP ANN model AD, were confirmed in a comparative AD assessment by using the leverage approach. Moreover, the method offers a graphical depiction of the AD for fast and simple determination of the extreme points.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ObjectType-Article-2
ObjectType-Feature-1
ISSN:0003-2670
1873-4324
DOI:10.1016/j.aca.2012.11.002