Comprehensive analysis of clinical images contributions for melanoma classification using convolutional neural networks
Background Timely diagnosis plays a critical role in determining melanoma prognosis, prompting the development of deep learning models to aid clinicians. Questions persist regarding the efficacy of clinical images alone or in conjunction with dermoscopy images for model training. This study aims to...
Saved in:
Published in | Skin research and technology Vol. 30; no. 5; pp. e13607 - n/a |
---|---|
Main Authors | , , , , |
Format | Journal Article |
Language | English |
Published |
England
John Wiley & Sons, Inc
01.05.2024
John Wiley and Sons Inc |
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | Background
Timely diagnosis plays a critical role in determining melanoma prognosis, prompting the development of deep learning models to aid clinicians. Questions persist regarding the efficacy of clinical images alone or in conjunction with dermoscopy images for model training. This study aims to compare the classification performance for melanoma of three types of CNN models: those trained on clinical images, dermoscopy images, and a combination of paired clinical and dermoscopy images from the same lesion.
Materials and Methods
We divided 914 image pairs into training, validation, and test sets. Models were built using pre‐trained Inception‐ResNetV2 convolutional layers for feature extraction, followed by binary classification. Training comprised 20 models per CNN type using sets of random hyperparameters. Best models were chosen based on validation AUC‐ROC.
Results
Significant AUC‐ROC differences were found between clinical versus dermoscopy models (0.661 vs. 0.869, p < 0.001) and clinical versus clinical + dermoscopy models (0.661 vs. 0.822, p = 0.001). Significant sensitivity differences were found between clinical and dermoscopy models (0.513 vs. 0.799, p = 0.01), dermoscopy versus clinical + dermoscopy models (0.799 vs. 1.000, p = 0.02), and clinical versus clinical + dermoscopy models (0.513 vs. 1.000, p < 0.001). Significant specificity differences were found between dermoscopy versus clinical + dermoscopy models (0.800 vs. 0.288, p < 0.001) and clinical versus clinical + dermoscopy models (0.650 vs. 0.288, p < 0.001).
Conclusion
CNN models trained on dermoscopy images outperformed those relying solely on clinical images under our study conditions. The potential advantages of incorporating paired clinical and dermoscopy images for CNN‐based melanoma classification appear less clear based on our findings. |
---|---|
Bibliography: | During the preparation of this work the authors used ChatGPT to enhance readability. After using this tool, the authors reviewed and edited the content as needed and take full responsibility for the content of this publication. Declaration of Generative AI and AI‐assisted technologies in the writing process : ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 |
ISSN: | 0909-752X 1600-0846 |
DOI: | 10.1111/srt.13607 |