Machine learning methods in chemoinformatics

Machine learning algorithms are generally developed in computer science or adjacent disciplines and find their way into chemical modeling by a process of diffusion. Though particular machine learning methods are popular in chemoinformatics and quantitative structure–activity relationships (QSAR), ma...

Full description

Saved in:
Bibliographic Details
Published inWiley interdisciplinary reviews. Computational molecular science Vol. 4; no. 5; pp. 468 - 481
Main Author Mitchell, John B. O.
Format Journal Article
LanguageEnglish
Published Hoboken, USA Wiley Periodicals, Inc 01.09.2014
Online AccessGet full text

Cover

Loading…
More Information
Summary:Machine learning algorithms are generally developed in computer science or adjacent disciplines and find their way into chemical modeling by a process of diffusion. Though particular machine learning methods are popular in chemoinformatics and quantitative structure–activity relationships (QSAR), many others exist in the technical literature. This discussion is methods‐based and focused on some algorithms that chemoinformatics researchers frequently use. It makes no claim to be exhaustive. We concentrate on methods for supervised learning, predicting the unknown property values of a test set of instances, usually molecules, based on the known values for a training set. Particularly relevant approaches include Artificial Neural Networks, Random Forest, Support Vector Machine, k‐Nearest Neighbors and naïve Bayes classifiers. WIREs Comput Mol Sci 2014, 4:468–481. This article is categorized under: Computer and Information Science > Chemoinformatics
Bibliography:Conflict of interest: The author has declared no conflicts of interest for this article.
ISSN:1759-0876
1759-0884
DOI:10.1002/wcms.1183