SORBET: Automated cell-neighborhood analysis of spatial transcriptomics or proteomics for interpretable sample classification via GNN

Spatially resolved transcriptomics or proteomics data have the potential to contribute fundamental insights into the mechanisms underlying physiologic and pathological processes. However, analysis of these data capable of relating spatial information, multiplexed markers, and their observed phenotyp...

Full description

Saved in:
Bibliographic Details
Published inbioRxiv
Main Authors Shimonov, Shay, Cunningham, Joseph M, Talmon, Ronen, Aizenbud, Lilach, Desai, Shruti J, Rimm, David, Schalper, Kurt, Kluger, Harriet, Kluger, Yuval
Format Journal Article Paper
LanguageEnglish
Published United States Cold Spring Harbor Laboratory Press 01.01.2024
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Spatially resolved transcriptomics or proteomics data have the potential to contribute fundamental insights into the mechanisms underlying physiologic and pathological processes. However, analysis of these data capable of relating spatial information, multiplexed markers, and their observed phenotypes remains technically challenging. To analyze these relationships, we developed SORBET, a deep learning framework that leverages recent advances in graph neural networks (GNN). We apply SORBET to predict tissue phenotypes, such as response to immunotherapy, across different disease processes and different technologies including both spatial proteomics and transcriptomics methods. Our results show that SORBET accurately learns biologically meaningful relationships across distinct tissue structures and data acquisition methods. Furthermore, we demonstrate that SORBET facilitates understanding of the spatially-resolved biological mechanisms underlying the inferred phenotypes. In sum, our method facilitates mapping between the rich spatial and marker information acquired from spatial 'omics technologies to emergent biological phenotypes. Moreover, we provide novel techniques for identifying the biological processes that comprise the predicted phenotypes.
DOI:10.1101/2023.12.30.573739