From intuition to AI: evolution of small molecule representations in drug discovery

Within drug discovery, the goal of AI scientists and cheminformaticians is to help identify molecular starting points that will develop into safe and efficacious drugs while reducing costs, time and failure rates. To achieve this goal, it is crucial to represent molecules in a digital format that ma...

Full description

Saved in:
Bibliographic Details
Published inBriefings in bioinformatics Vol. 25; no. 1
Main Authors McGibbon, Miles, Shave, Steven, Dong, Jie, Gao, Yumiao, Houston, Douglas R, Xie, Jiancong, Yang, Yuedong, Schwaller, Philippe, Blay, Vincent
Format Journal Article
LanguageEnglish
Published England Oxford University Press 22.11.2023
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Within drug discovery, the goal of AI scientists and cheminformaticians is to help identify molecular starting points that will develop into safe and efficacious drugs while reducing costs, time and failure rates. To achieve this goal, it is crucial to represent molecules in a digital format that makes them machine-readable and facilitates the accurate prediction of properties that drive decision-making. Over the years, molecular representations have evolved from intuitive and human-readable formats to bespoke numerical descriptors and fingerprints, and now to learned representations that capture patterns and salient features across vast chemical spaces. Among these, sequence-based and graph-based representations of small molecules have become highly popular. However, each approach has strengths and weaknesses across dimensions such as generality, computational cost, inversibility for generative applications and interpretability, which can be critical in informing practitioners' decisions. As the drug discovery landscape evolves, opportunities for innovation continue to emerge. These include the creation of molecular representations for high-value, low-data regimes, the distillation of broader biological and chemical knowledge into novel learned representations and the modeling of up-and-coming therapeutic modalities.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
Miles McGibbon, Steven Shave and Jie Dong contributed equally to this work.
ISSN:1467-5463
1477-4054
DOI:10.1093/bib/bbad422