Diagnostic assessment of deep learning for melanocytic lesions using whole-slide pathological images

•The performance of the deep learning algorithm was on par with that of 7 expert pathologists in discriminating melanoma from nevus using whole-slide pathological images (WSIs).•Deep learning algorithm might function as a supplemental tool to assist pathologist by automatically pre-screening and hig...

Full description

Saved in:

Bibliographic Details
Published in	Translational oncology Vol. 14; no. 9; p. 101161
Main Authors	Ba, Wei, Wang, Rui, Yin, Guang, Song, Zhigang, Zou, Jinyi, Zhong, Cheng, Yang, Jingrun, Yu, Guanzhen, Yang, Hongyu, Zhang, Litao, Li, Chengxin
Format	Journal Article
Language	English
Published	Elsevier Inc 01.09.2021 Neoplasia Press Elsevier
Subjects	Artificial intelligence Deep learning algorithm Melanoma Nevus Original Research Whole-slide pathological images Whole-slide pathological images Artificial intelligence Melanoma Nevus Deep learning algorithm
Online Access	Get full text

Cover

Loading…

More Information
Summary:	•The performance of the deep learning algorithm was on par with that of 7 expert pathologists in discriminating melanoma from nevus using whole-slide pathological images (WSIs).•Deep learning algorithm might function as a supplemental tool to assist pathologist by automatically pre-screening and highlighting interest regions prior to review. Deep learning has the potential to improve diagnostic accuracy and efficiency in medical image recognition. In the current study, we developed a deep learning algorithm and assessed its performance in discriminating melanoma from nevus using whole-slide pathological images (WSIs). The deep learning algorithm was trained and validated using a set of 781 WSIs (86 melanomas, 695 nevi) from PLA General Hospital. The diagnostic performance of the algorithm was tested on an independent test set of 104 WSIs (29 melanomas, 75 nevi) from Tianjin Chang Zheng Hospital. The same test set was also diagnostically classified by 7 expert dermatopathologists. The deep learning algorithm receiver operating characteristic (ROC) curve achieved a sensitivity 100% at the specificity of 94.7% in the classification of melanoma and nevus on the test set. The area under ROC curve was 0.99. Dermatopathologists achieved a mean sensitivity and specificity of 95.1% (95% confidence interval [CI]: 92.0%-98.2%) and 96.0% (95% CI: 94.2%-97.8%), respectively. At the operating point of sensitivity of 95.1%, the algorithm revealed a comparable specificity with 7 dermatopathologists (97.3% vs. 96.0%, P = 0.11). At the operating point of specificity of 96.0%, the algorithm also achieved a comparable sensitivity with 7 dermatopathologists (96.5% vs. 95.1%, P = 0.30). A more transparent and interpretable diagnosis could be generated by highlighting the regions of interest recognized by the algorithm in WSIs. The performance of the deep learning algorithm was on par with that of 7 expert dermatopathologists in interpreting WSIs with melanocytic lesions. By pre-screening the suspicious melanoma regions, it might serve as a supplemental diagnostic tool to improve working efficiency of pathologists.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23
ISSN:	1936-5233 1936-5233
DOI:	10.1016/j.tranon.2021.101161