Optimal compounds discovery by design of experiments and algorithmic evolution of linear models

Based on the premise that, for a given class of related chemical compounds, there exists a relationship between their structure and their properties (i.e. activity), it is demonstrated herein that an elementary algorithm can readily identify, with simplistic models and without recourse to molecular...

Full description

Saved in:
Bibliographic Details
Published inChimia Vol. 67; no. 1-2; p. 71
Main Authors Dumeunier, Raphaël, Hascher, Florent A
Format Journal Article
LanguageEnglish
German
Published Switzerland Swiss Chemical Society 01.01.2013
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Based on the premise that, for a given class of related chemical compounds, there exists a relationship between their structure and their properties (i.e. activity), it is demonstrated herein that an elementary algorithm can readily identify, with simplistic models and without recourse to molecular descriptors, the most active compounds of a categorical, pre-defined space of molecules. In an actual case study using public experimental data on two thousand related molecules, D-optimal design of experiments initially identified the best subset of compounds considered for the construction of simple models. Subsequently, predictions of a first generation of best candidates, their preparation and inclusion into a new data set, allowed the exploration of the most active region within the space of interest. Survival of the algorithm by iterative generations ensured that most of the best (active) compounds had been prepared. A certain partial survival condition, followed by a complete termination criterion, helped to minimize the total amount of compounds to prepare while identifying the n best individuals of the matrix.
ISSN:0009-4293
2673-2424
DOI:10.2533/chimia.2013.71