Interpretation of Compound Activity Predictions from Complex Machine Learning Models Using Local Approximations and Shapley Values

In qualitative or quantitative studies of structure–activity relationships (SARs), machine learning (ML) models are trained to recognize structural patterns that differentiate between active and inactive compounds. Understanding model decisions is challenging but of critical importance to guide comp...

Full description

Saved in:

Bibliographic Details
Published in	Journal of medicinal chemistry Vol. 63; no. 16; pp. 8761 - 8777
Main Authors	Rodríguez-Pérez, Raquel, Bajorath, Jürgen
Format	Journal Article
Language	English
Published	United States American Chemical Society 27.08.2020
Subjects	Deep Learning - statistics & numerical data Organic Chemicals - chemistry Support Vector Machine - statistics & numerical data
Online Access	Get full text

Cover

Loading…

More Information
Summary:	In qualitative or quantitative studies of structure–activity relationships (SARs), machine learning (ML) models are trained to recognize structural patterns that differentiate between active and inactive compounds. Understanding model decisions is challenging but of critical importance to guide compound design. Moreover, the interpretation of ML results provides an additional level of model validation based on expert knowledge. A number of complex ML approaches, especially deep learning (DL) architectures, have distinctive black-box character. Herein, a locally interpretable explanatory method termed Shapley additive explanations (SHAP) is introduced for rationalizing activity predictions of any ML algorithm, regardless of its complexity. Models resulting from random forest (RF), nonlinear support vector machine (SVM), and deep neural network (DNN) learning are interpreted, and structural patterns determining the predicted probability of activity are identified and mapped onto test compounds. The results indicate that SHAP has high potential for rationalizing predictions of complex ML models.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23
ISSN:	0022-2623 1520-4804
DOI:	10.1021/acs.jmedchem.9b01101