Artificial intelligence aided precise detection of local recurrence on MRI for nasopharyngeal carcinoma: a multicenter cohort studyResearch in context

Background: MRI is the routine examination to surveil the recurrence of nasopharyngeal carcinoma, but it has relatively lower sensitivity than PET/CT. We aimed to find if artificial intelligence (AI) could be competent pre-inspector for MRI radiologists and whether AI-aided MRI could perform better...

Full description

Saved in:
Bibliographic Details
Published inEClinicalMedicine Vol. 63; p. 102202
Main Authors Pu-Yun OuYang, Yun He, Jian-Gui Guo, Jia-Ni Liu, Zhi-Long Wang, Anwei Li, Jiajian Li, Shan-Shan Yang, Xu Zhang, Wei Fan, Yi-Shan Wu, Zhi-Qiao Liu, Bao-Yu Zhang, Ya-Nan Zhao, Ming-Yong Gao, Wei-Jun Zhang, Chuan-Miao Xie, Fang-Yun Xie
Format Journal Article
LanguageEnglish
Published Elsevier 01.09.2023
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Background: MRI is the routine examination to surveil the recurrence of nasopharyngeal carcinoma, but it has relatively lower sensitivity than PET/CT. We aimed to find if artificial intelligence (AI) could be competent pre-inspector for MRI radiologists and whether AI-aided MRI could perform better or even equal to PET/CT. Methods: This multicenter study enrolled 6916 patients from five hospitals between September 2009 and October 2020. A 2.5D convolutional neural network diagnostic model and a nnU-Net contouring model were developed in the training and test cohorts and used to independently predict and visualize the recurrence of patients in the internal and external validation cohorts. We evaluated the area under the ROC curve (AUC) of AI and compared AI with MRI and PET/CT in sensitivity and specificity using the McNemar test. The prospective cohort was randomized into the AI and non-AI groups, and their sensitivity and specificity were compared using the Chi-square test. Findings: The AI model achieved AUCs of 0.92 and 0.88 in the internal and external validation cohorts, corresponding to the sensitivity of 79.5% and 74.3% and specificity of 91.0% and 92.8%. It had comparable sensitivity to MRI (e.g., 74.3% vs. 74.7%, P = 0.89) but lower sensitivity than PET/CT (77.9% vs. 92.0%, P < 0.0001) at the same individual-specificities. The AI model achieved moderate precision with a median dice similarity coefficient of 0.67. AI-aided MRI improved specificity (92.5% vs. 85.0%, P = 0.034), equaled PET/CT in the internal validation subcohort, and increased sensitivity (81.9% vs. 70.8%, P = 0.021) in the external validation subcohort. In the prospective cohort of 1248 patients, the AI group had higher sensitivity than the non-AI group (78.6% vs. 67.3%, P = 0.23), albeit nonsignificant. In future randomized controlled trials, a sample size of 3943 patients in each arm would be required to demonstrate the statistically significant difference. Interpretation: The AI model equaled MRI by expert radiologists, and AI-aided MRI by expert radiologists equaled PET/CT. A larger randomized controlled trial is warranted to demonstrate the AI's benefit sufficiently. Funding: The Sun Yat-sen University Clinical Research 5010 Program (2015020), Guangdong Basic and Applied Basic Research Foundation (2022A1515110356), and Guangzhou Science and Technology Program (2023A04J1788).
ISSN:2589-5370
2589-5370