Variable selection and specification of robust QSAR models from multicollinear data: arylpiperazinyl derivatives with affinity and selectivity for alpha2-adrenoceptors

Two QSAR models have been identified that predict the affinity and selectivity of arylpiperazinyl derivatives for alpha1 and alpha2 adrenoceptors (ARs). The models have been specified and validated using 108 compounds whose structures and inhibition constants (Ki) are available in the literature [Ba...

Full description

Saved in:
Bibliographic Details
Published inJournal of computer-aided molecular design Vol. 18; no. 7-9; pp. 495 - 509
Main Authors Salt, D W, Maccari, L, Botta, M, Ford, M G
Format Journal Article
LanguageEnglish
Published Netherlands 01.07.2004
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Two QSAR models have been identified that predict the affinity and selectivity of arylpiperazinyl derivatives for alpha1 and alpha2 adrenoceptors (ARs). The models have been specified and validated using 108 compounds whose structures and inhibition constants (Ki) are available in the literature [Barbaro et al., J. Med. Chem., 44 (2001) 2118; Betti et al., J. Med. Chem., 45 (2002) 3603; Barbaro et al., Bioorg. Med. Chem., 10 (2002) 361; Betti et al., J. Med. Chem., 46 (2003) 3555]. One hundred and forty-seven predictors have been calculated using the Cerius 2 software available from Accelrys. This set of variables exhibited redundancy and severe multicollinearity, which had to be identified and removed as appropriate in order to obtain robust regression models free of inflated errors for the beta estimates - so-called bouncing betas. Those predictors that contained information relevant to the alpha2 response were identified on the basis of their pairwise linear correlations with affinity (-log Ki) for alpha2 adrenoceptors; the remaining variables were discarded. Subsequent variable selection made use of Factor Analysis (FA) and Unsupervised Variable Selection (UzFS). The data was divided into test and training sets using cluster analysis. These two sets were characterised by similar and consistent distributions of compounds in a high dimensional, but relevant predictor space. Multiple regression was then used to determine a subset of predictors from which to determine QSAR models for affinity to alpha2-ARs. Two multivariate procedures, Continuum Regression (the Portsmouth formulation) and Canonical Correlation Analysis (CCA), have been used to specify models for affinity and selectivity, respectively. Reasonable predictions were obtained using these in silico screening tools.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:0920-654X