Predicting activities without computing descriptors: graph machines for QSAR

We describe graph machines, an alternative approach to traditional machine-learning-based QSAR, which circumvents the problem of designing, computing and selecting molecular descriptors. In that approach, which is similar in spirit to recursive networks, molecules are considered as structured data,...

Full description

Saved in:
Bibliographic Details
Published inSAR and QSAR in environmental research Vol. 18; no. 1-2; pp. 141 - 153
Main Authors Goulon, A., Picot, T., Duprat, A., Dreyfus, G.
Format Journal Article
LanguageEnglish
Published England Taylor & Francis Group 01.01.2007
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:We describe graph machines, an alternative approach to traditional machine-learning-based QSAR, which circumvents the problem of designing, computing and selecting molecular descriptors. In that approach, which is similar in spirit to recursive networks, molecules are considered as structured data, represented as graphs. For each example of the data set, a mathematical function (graph machine) is built, whose structure reflects the structure of the molecule under consideration; it is the combination of identical parameterised functions, called "node functions" (e.g. a feedforward neural network). The parameters of the node functions, shared both within and across the graph machines, are adjusted during training with the "shared weights" technique. Model selection is then performed by traditional cross-validation. Therefore, the designer's main task consists in finding the optimal complexity for the node function. The efficiency of this new approach has been demonstrated in many QSAR or QSPR tasks, as well as in modelling the activities of complex chemicals (e.g. the toxicity of a family of phenols or the anti-HIV activities of HEPT derivatives). It generally outperforms traditional techniques without requiring the selection and computation of descriptors. §Presented at the 12th International Workshop on Quantitative Structure-Activity Relationships in Environmental Toxicology (QSAR2006), 8-12 May 2006, Lyon, France.
ISSN:1062-936X
1029-046X
DOI:10.1080/10629360601054313