Structural Visual Guidance Attention Networks In Retinopathy Of Prematurity

Convolutional neural networks (CNNs) have shown great performance in medical diagnostic applications. However, because their black-box nature, clinicians are reluctant to trust CNN diagnostic outcomes. Incorporating visual attention capabilities in CNNs enhances interpretability by highlighting regi...

Full description

Saved in:
Bibliographic Details
Published in2021 IEEE 18th International Symposium on Biomedical Imaging (ISBI) pp. 353 - 357
Main Authors Yildiz, V., Ioannidis, S., Yildiz, I., Tian, P., Campbell, J. P., Ostmo, S., Kalpathy-Cramer, J., Chiang, M. F., Erdogmus, D., Dy, J.
Format Conference Proceeding
LanguageEnglish
Published IEEE 13.04.2021
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Convolutional neural networks (CNNs) have shown great performance in medical diagnostic applications. However, because their black-box nature, clinicians are reluctant to trust CNN diagnostic outcomes. Incorporating visual attention capabilities in CNNs enhances interpretability by highlighting regions in the images that CNNs utilize for prediction. Clinicians can often provide domain knowledge on relevant features: e.g., to diagnose retinopathy of prematurity (ROP), structural information such as tortuosity of vessels aid clinicians in diagnosing ROP. We propose a Structural Visual Guidance Attention Networks (SVGA-Net) method, that leverages structural domain knowledge to guide visual attention in CNNs. Experiments on a dataset of 5512 posterior retinal images, taken using a wide-angle fundus camera, show that SVGA-Net achieves 0.987 and 0.979 AUC to predict plus and normal categories, respectively. SVGA-Net consistently results in higher AUC compared to visual attention CNNs without guidance, baseline CNNs, and CNNs with structured masks.
ISSN:1945-8452
DOI:10.1109/ISBI48211.2021.9433881