Which model is better in predicting the survival of laryngeal squamous cell carcinoma?: Comparison of the random survival forest based on machine learning algorithms to Cox regression: analyses based on SEER database

Prediction of postoperative survival for laryngeal carcinoma patients is very important. This study attempts to demonstrate the utilization of the random survival forest (RSF) and Cox regression model to predict overall survival of laryngeal squamous cell carcinoma (LSCC) and compare their performan...

Full description

Saved in:
Bibliographic Details
Published inMedicine (Baltimore) Vol. 102; no. 10; p. e33144
Main Authors Sun, Haili, Wu, Shuangshuang, Li, Shaoxiao, Jiang, Xiaohua
Format Journal Article
LanguageEnglish
Published United States Lippincott Williams & Wilkins 10.03.2023
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Prediction of postoperative survival for laryngeal carcinoma patients is very important. This study attempts to demonstrate the utilization of the random survival forest (RSF) and Cox regression model to predict overall survival of laryngeal squamous cell carcinoma (LSCC) and compare their performance. A total of 8677 patients diagnosed with LSCC from 2004 to 2015 were obtained from surveillance, epidemiology, and end results database. Multivariate imputation by chained equations was applied to filling the missing data. Lasso regression algorithm was conducted to find potential predictors. RSF and Cox regression were used to develop the survival prediction models. Harrell's concordance index (C-index), area under the curve (AUC), Brier score, and calibration plot were used to evaluate the predictive performance of the 2 models. For 3-year survival prediction, the C-index in training set were 0.74 (0.011) and 0.84 (0.013) for Cox and RSF respectively. For 5-year survival prediction, the C-index in training set were 0.75 (0.022) and 0.80 (0.011) for Cox and RSF respectively. Similar results were found in validation set. The AUC were 0.795 for RSF and 0.715 for Cox in the training set while the AUC were 0.765 for RSF and 0.705 for Cox in the validation set. The prediction error curves for each model based on Brier score showed the RSF model had lower prediction errors both in training group and validation group. What's more, the calibration curve displayed similar results of 2 models both in training set and validation set. The performance of RSF model were better than Cox regression model. The RSF algorithms provide a relatively better alternatives to be of clinical use for estimating the survival probability of LSCC patients.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:0025-7974
1536-5964
DOI:10.1097/MD.0000000000033144