Development and validation of machine learning models for predicting prognosis and guiding individualized postoperative chemotherapy: A real-world study of distal cholangiocarcinoma

Distal cholangiocarcinoma (dCCA), originating from the common bile duct, is greatly associated with a dismal prognosis. A series of different studies based on cancer classification have been developed, aimed to optimize therapy and predict and improve prognosis. In this study, we explored and compar...

Full description

Saved in:
Bibliographic Details
Published inFrontiers in oncology Vol. 13; p. 1106029
Main Authors Wang, Di, Pan, Bing, Huang, Jin-Can, Chen, Qing, Cui, Song-Ping, Lang, Ren, Lyu, Shao-Cheng
Format Journal Article
LanguageEnglish
Published Switzerland Frontiers Media S.A 15.03.2023
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Distal cholangiocarcinoma (dCCA), originating from the common bile duct, is greatly associated with a dismal prognosis. A series of different studies based on cancer classification have been developed, aimed to optimize therapy and predict and improve prognosis. In this study, we explored and compared several novel machine learning models that might lead to an improvement in prediction accuracy and treatment options for patients with dCCA. In this study, 169 patients with dCCA were recruited and randomly divided into the training cohort (n = 118) and the validation cohort (n = 51), and their medical records were reviewed, including survival outcomes, laboratory values, treatment strategies, pathological results, and demographic information. Variables identified as independently associated with the primary outcome by least absolute shrinkage and selection operator (LASSO) regression, the random survival forest (RSF) algorithm, and univariate and multivariate Cox regression analyses were introduced to establish the following different machine learning models and canonical regression model: support vector machine (SVM), SurvivalTree, Coxboost, RSF, DeepSurv, and Cox proportional hazards (CoxPH). We measured and compared the performance of models using the receiver operating characteristic (ROC) curve, integrated Brier score (IBS), and concordance index (C-index) following cross-validation. The machine learning model with the best performance was screened out and compared with the TNM Classification using ROC, IBS, and C-index. Finally, patients were stratified based on the model with the best performance to assess whether they benefited from postoperative chemotherapy through the log-rank test. Among medical features, five variables, including tumor differentiation, T-stage, lymph node metastasis (LNM), albumin-to-fibrinogen ratio (AFR), and carbohydrate antigen 19-9 (CA19-9), were used to develop machine learning models. In the training cohort and the validation cohort, C-index achieved 0.763 0.686 (SVM), 0.749 0.692 (SurvivalTree), 0.747 0.690 (Coxboost), 0.745 0.690 (RSF), 0.746 0.711 (DeepSurv), and 0.724 0.701 (CoxPH), respectively. The DeepSurv model (0.823 0.754) had the highest mean area under the ROC curve (AUC) than other models, including SVM (0.819 0.736), SurvivalTree (0.814 0.737), Coxboost (0.816 0.734), RSF (0.813 0.730), and CoxPH (0.788 0.753). The IBS of the DeepSurv model (0.132 0.147) was lower than that of SurvivalTree (0.135 0.236), Coxboost (0.141 0.207), RSF (0.140 0.225), and CoxPH (0.145 0.196). Results of the calibration chart and decision curve analysis (DCA) also demonstrated that DeepSurv had a satisfactory predictive performance. In addition, the performance of the DeepSurv model was better than that of the TNM Classification in C-index, mean AUC, and IBS (0.746 0.598, 0.823 0.613, and 0.132 0.186, respectively) in the training cohort. Patients were stratified and divided into high- and low-risk groups based on the DeepSurv model. In the training cohort, patients in the high-risk group would not benefit from postoperative chemotherapy (p = 0.519). In the low-risk group, patients receiving postoperative chemotherapy might have a better prognosis (p = 0.035). In this study, the DeepSurv model was good at predicting prognosis and risk stratification to guide treatment options. AFR level might be a potential prognostic factor for dCCA. For the low-risk group in the DeepSurv model, patients might benefit from postoperative chemotherapy.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
This article was submitted to Gastrointestinal Cancers: Hepato Pancreatic Biliary Cancers, a section of the journal Frontiers in Oncology
Reviewed by: Junyu Long, Peking Union Medical College Hospital (CAMS), China; Xing Niu, China Medical University, China
Edited by: Wenqing Cao, New York University, United States
These authors have contributed equally to this work and share first authorship
ISSN:2234-943X
2234-943X
DOI:10.3389/fonc.2023.1106029