Application of machine learning models for property prediction to targeted protein degraders

Bibliographic Details
Published in	Nature Communications Vol. 15; no. 1; pp. 5764 - 15
Main Authors	Peteani, Giulia; Huynh, Minh Tam Davide; Gerebtzoff, Grégori; Rodríguez-Pérez, Raquel
Format	Journal Article
Language	English
Published	London: Nature Publishing Group UK; Nature Publishing Group; Nature Portfolio, 09.07.2024
Subjects	631/114; 631/154; Adhesives; Animals; Biodegradation; Clearances; Cytochrome P-450 CYP3A - chemistry; Cytochrome P-450 CYP3A - metabolism; Cytochrome P450; Cytochromes P450; Drug development; Drug discovery; Errors; Glues; Humanities and Social Sciences; Humans; Learning algorithms; Machine Learning; Molecular structure; multidisciplinary; Permeability; Physicochemical properties; Predictions; Protein Binding; Protein structure; Proteins; Proteolysis; Quantitative Structure-Activity Relationship; Rats; Science; Science (multidisciplinary); Structure-activity relationships; Transfer learning

More Information
Summary:	Machine learning (ML) systems can model quantitative structure-property relationships (QSPR) using existing experimental data and make property predictions for new molecules. With the advent of modalities such as targeted protein degraders (TPD), the applicability of QSPR models is questioned and ML usage in TPD-centric projects remains limited. Herein, ML models are developed and evaluated for TPDs’ property predictions, including passive permeability, metabolic clearance, cytochrome P450 inhibition, plasma protein binding, and lipophilicity. Interestingly, performance on TPDs is comparable to that of other modalities. Predictions for glues and heterobifunctionals often yield lower and higher errors, respectively. For permeability, CYP3A4 inhibition, and human and rat microsomal clearance, misclassification errors into high and low risk categories are lower than 4% for glues and 15% for heterobifunctionals. For all modalities, misclassification errors range from 0.8% to 8.1%. Investigated transfer learning strategies improve predictions for heterobifunctionals. This is the first comprehensive evaluation of ML for the prediction of absorption, distribution, metabolism, and excretion (ADME) and physicochemical properties of TPD molecules, including heterobifunctional and molecular glue sub-modalities. Taken together, our investigations show that ML-based QSPR models are applicable to TPDs and support ML usage for TPDs’ design, to potentially accelerate drug discovery. Targeted protein degraders are a recently developed drug modality for which it is unclear whether traditional QSAR can be applied. Here the authors show that classical ML can be used to predict properties of these drugs.
ISSN:	2041-1723
DOI:	10.1038/s41467-024-49979-3