Histology segmentation using active learning on regions of interest in oral cavity squamous cell carcinoma

In digital pathology, deep learning has been shown to have a wide range of applications, from cancer grading to segmenting structures like glomeruli. One of the main hurdles for digital pathology to be truly effective is the size of the dataset needed for generalization to address the spectrum of po...

Full description

Saved in:
Bibliographic Details
Published inJournal of pathology informatics Vol. 13; p. 100146
Main Authors Folmsbee, Jonathan, Zhang, Lei, Lu, Xulei, Rahman, Jawaria, Gentry, John, Conn, Brendan, Vered, Marilena, Roy, Paromita, Gupta, Ruta, Lin, Diana, Samankan, Shabnam, Dhorajiva, Pooja, Peter, Anu, Wang, Minhua, Israel, Anna, Brandwein-Weber, Margaret, Doyle, Scott
Format Journal Article
LanguageEnglish
Published United States Elsevier Inc 01.01.2022
Elsevier
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:In digital pathology, deep learning has been shown to have a wide range of applications, from cancer grading to segmenting structures like glomeruli. One of the main hurdles for digital pathology to be truly effective is the size of the dataset needed for generalization to address the spectrum of possible morphologies. Small datasets limit classifiers’ ability to generalize. Yet, when we move to larger datasets of whole slide images (WSIs) of tissue, these datasets may cause network bottlenecks as each WSI at its original magnification can be upwards of 100 000 by 100 000 pixels, and over a gigabyte in file size. Compounding this problem, high quality pathologist annotations are difficult to obtain, as the volume of necessary annotations to create a classifier that can generalize would be extremely costly in terms of pathologist-hours. In this work, we use Active Learning (AL), a process for iterative interactive training, to create a modified U-net classifier on the region of interest (ROI) scale. We then compare this to Random Learning (RL), where images for addition to the dataset for retraining are randomly selected. Our hypothesis is that AL shows benefits for generating segmentation results versus randomly selecting images to annotate. We show that after 3 iterations, that AL, with an average Dice coefficient of 0.461, outperforms RL, with an average Dice Coefficient of 0.375, by 0.086.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ISSN:2153-3539
2229-5089
2153-3539
DOI:10.1016/j.jpi.2022.100146