Development and Validation of Models to Predict Lymph Node Metastasis in Early Gastric Cancer Using Logistic Regression and Gradient Boosting Machine Methods

PurposeTo identify important features of lymph node metastasis (LNM) and develop a prediction model for early gastric cancer (EGC) using a gradient boosting machine (GBM) method. Materials and MethodsThe clinicopathologic data of 2556 patients with EGC who underwent gastrectomy were used as training...

Full description

Saved in:
Bibliographic Details
Published inCancer research and treatment Vol. 55; no. 4; pp. 1240 - 1249
Main Authors Lee, Hae Dong, Nam, Kyung Han, Shin, Cheol Min, Lee, Hye Seung, Chang, Young Hoon, Yoon, Hyuk, Park, Young Soo, Kim, Nayoung, Lee, Dong Ho, Ahn, Sang-Hoon, Kim, Hyung-Ho
Format Journal Article
LanguageEnglish
Published Korea (South) Korean Cancer Association 01.10.2023
대한암학회
Subjects
Online AccessGet full text
ISSN1598-2998
2005-9256
2005-9256
DOI10.4143/crt.2022.1330

Cover

Loading…
More Information
Summary:PurposeTo identify important features of lymph node metastasis (LNM) and develop a prediction model for early gastric cancer (EGC) using a gradient boosting machine (GBM) method. Materials and MethodsThe clinicopathologic data of 2556 patients with EGC who underwent gastrectomy were used as training set and the internal validation set (set 1) at a ratio of 8:2. Additionally, 548 patients with EGC who underwent endoscopic submucosal dissection (ESD) as the initial treatment were included in the external validation set (set 2). The GBM model was constructed, and its performance was compared with that of the Japanese guidelines.ResultsLNM was identified in 12.6% (321/2556) of the gastrectomy group (training set & set 1) and 4.3% (24/548) of the ESD group (set 2). In the GBM analysis, the top five features that most affected LNM were lymphovascular invasion, depth, differentiation, size, and location. The accuracy, sensitivity, specificity, and the area under the receiver operating characteristics of set 1 were 0.566, 0.922, 0.516, and 0.867, while those of set 2 were 0.810, 0.958, 0.803, and 0.944, respectively. When the sensitivity of GBM was adjusted to that of Japanese guidelines (beyond the expanded criteria in set 1 [0.922] and eCuraC-2 in set 2 [0.958]), the specificities of GBM in sets 1 and 2 were 0.516 (95% confidence interval, 0.502-0.523) and 0.803 (0.795-0.805), while those of the Japanese guidelines were 0.502 (0.488-0.509) and 0.788 (0.780-0.790), respectively.ConclusionThe GBM model showed good performance comparable with the eCura system in predicting LNM risk in EGCs.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:1598-2998
2005-9256
2005-9256
DOI:10.4143/crt.2022.1330