Cross‐Modal Graph Contrastive Learning with Cellular Images

Constructing discriminative representations of molecules lies at the core of a number of domains such as drug discovery, chemistry, and medicine. State‐of‐the‐art methods employ graph neural networks and self‐supervised learning (SSL) to learn unlabeled data for structural representations, which can...

Full description

Saved in:
Bibliographic Details
Published inAdvanced science Vol. 11; no. 32; pp. e2404845 - n/a
Main Authors Zheng, Shuangjia, Rao, Jiahua, Zhang, Jixian, Zhou, Lianyu, Xie, Jiancong, Cohen, Ethan, Lu, Wei, Li, Chengtao, Yang, Yuedong
Format Journal Article
LanguageEnglish
Published Weinheim John Wiley & Sons, Inc 01.08.2024
John Wiley and Sons Inc
Wiley
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Constructing discriminative representations of molecules lies at the core of a number of domains such as drug discovery, chemistry, and medicine. State‐of‐the‐art methods employ graph neural networks and self‐supervised learning (SSL) to learn unlabeled data for structural representations, which can then be fine‐tuned for downstream tasks. Albeit powerful, these methods are pre‐trained solely on molecular structures and thus often struggle with tasks involved in intricate biological processes. Here, it is proposed to assist the learning of molecular representation by using the perturbed high‐content cell microscopy images at the phenotypic level. To incorporate the cross‐modal pre‐training, a unified framework is constructed to align them through multiple types of contrastive loss functions, which is proven effective in the formulated novel tasks to retrieve the molecules and corresponding images mutually. More importantly, the model can infer functional molecules according to cellular images generated by genetic perturbations. In parallel, the proposed model can transfer non‐trivially to molecular property predictions, and has shown great improvement over clinical outcome predictions. These results suggest that such cross‐modality learning can bridge molecules and phenotype to play important roles in drug discovery. This study introduces a novel approach to enhance molecular representation learning by integrating high‐content cell microscopy images at the phenotypic level. The proposed unified framework employs contrastive loss functions for cross‐modal pre‐training, enabling mutual retrieval of molecules and images. The model improves not only molecular properties predictions but also clinical outcome predictions, highlighting the potential of cross‐modality learning in bridging molecular structures to phenotypes for drug discovery.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:2198-3844
2198-3844
DOI:10.1002/advs.202404845