A pathology foundation model for cancer diagnosis and prognosis prediction

Histopathology image evaluation is indispensable for cancer diagnoses and subtype classification. Standard artificial intelligence methods for histopathology image analyses have focused on optimizing specialized models for each diagnostic task 1 , 2 . Although such methods have achieved some success...

Full description

Saved in:
Bibliographic Details
Published inNature (London) Vol. 634; no. 8035; pp. 970 - 978
Main Authors Wang, Xiyue, Zhao, Junhan, Marostica, Eliana, Yuan, Wei, Jin, Jietian, Zhang, Jiayu, Li, Ruijiang, Tang, Hongping, Wang, Kanran, Li, Yu, Wang, Fang, Peng, Yulong, Zhu, Junyou, Zhang, Jing, Jackson, Christopher R., Zhang, Jun, Dillon, Deborah, Lin, Nancy U., Sholl, Lynette, Denize, Thomas, Meredith, David, Ligon, Keith L., Signoretti, Sabina, Ogino, Shuji, Golden, Jeffrey A., Nasrallah, MacLean P., Han, Xiao, Yang, Sen, Yu, Kun-Hsing
Format Journal Article
LanguageEnglish
Published London Nature Publishing Group UK 24.10.2024
Nature Publishing Group
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Histopathology image evaluation is indispensable for cancer diagnoses and subtype classification. Standard artificial intelligence methods for histopathology image analyses have focused on optimizing specialized models for each diagnostic task 1 , 2 . Although such methods have achieved some success, they often have limited generalizability to images generated by different digitization protocols or samples collected from different populations 3 . Here, to address this challenge, we devised the Clinical Histopathology Imaging Evaluation Foundation (CHIEF) model, a general-purpose weakly supervised machine learning framework to extract pathology imaging features for systematic cancer evaluation. CHIEF leverages two complementary pretraining methods to extract diverse pathology representations: unsupervised pretraining for tile-level feature identification and weakly supervised pretraining for whole-slide pattern recognition. We developed CHIEF using 60,530 whole-slide images spanning 19 anatomical sites. Through pretraining on 44 terabytes of high-resolution pathology imaging datasets, CHIEF extracted microscopic representations useful for cancer cell detection, tumour origin identification, molecular profile characterization and prognostic prediction. We successfully validated CHIEF using 19,491 whole-slide images from 32 independent slide sets collected from 24 hospitals and cohorts internationally. Overall, CHIEF outperformed the state-of-the-art deep learning methods by up to 36.1%, showing its ability to address domain shifts observed in samples from diverse populations and processed by different slide preparation methods. CHIEF provides a generalizable foundation for efficient digital pathology evaluation for patients with cancer. A study describes the development of a generalizable foundation machine learning framework to extract pathology imaging features for cancer diagnosis and prognosis prediction.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ObjectType-Undefined-3
Lead contact: Kun-Hsing Yu, MD, PhD, 10 Shattuck Street, Boston, MA 02115
X.W., J.Zhao, Y.S., and K.-H.Y. conceived and designed the study. J.Zhao, E.M., D.D., N.U.L., L.S., T.D., D.M, K.L.L., S.S., S.O., J.A.G., M.P.N, K.-H.Y., F.W., H.T., Jing Z., K.W, and Y.L, curated the data from their respective institutes. X.W., J.Zhao, Y.S., W.Y, Jiayu Z., and K.-H.Y. developed, validated, and evaluated the models. J.J., F.W., K.W., Y.L, Y.P., J.Zhu, C.R.J, J.A.G., M.P.N, and K.-H.Y. interpreted the pathological images. Jun Z., Jing Z., X.H., and R.L. contributed to the technical discussion. X.W., J.Zhao, E.M., C.R.J., J.A.G, J.J., F.W., Y.S., and K.-H.Y interpreted the analytical results. X.W., J.Zhao, Y.S., and K.H.Y wrote the manuscript. All authors contributed to the edits of the manuscript. K.-H.Y. supervised the project.
These authors contributed equally to this manuscript.
Author Contributions
ISSN:0028-0836
1476-4687
1476-4687
DOI:10.1038/s41586-024-07894-z