Structured crowdsourcing enables convolutional segmentation of histology images

Bibliographic Details
Published in Bioinformatics Vol. 35; no. 18; pp. 3461-3467
Main Authors Amgad, Mohamed; Elfandy, Habiba; Hussein, Hagar; Atteya, Lamees A; Elsebaie, Mai A T; Abo Elnasr, Lamia S; Sakr, Rokia A; Salem, Hazem S E; Ismail, Ahmed F; Saad, Anas M; Ahmed, Joumana; Elsebaie, Maha A T; Rahman, Mustafijur; Ruhban, Inas A; Elgazar, Nada M; Alagha, Yahya; Osman, Mohamed H; Alhusseiny, Ahmed M; Khalaf, Mariam M; Younes, Abo-Alela F; Abdulkarim, Ali; Younes, Duaa M; Gadallah, Ahmed M; Elkashash, Ahmad M; Fala, Salma Y; Zaki, Basma M; Beezley, Jonathan; Chittajallu, Deepak R; Manthey, David; Gutman, David A; Cooper, Lee A D
Format Journal Article
Language English
Published England Oxford University Press 15.09.2019
Summary: Motivation: While deep-learning algorithms have demonstrated outstanding performance in semantic image segmentation tasks, large annotation datasets are needed to create accurate models. Annotation of histology images is challenging due to the effort and experience required to carefully delineate tissue structures, and difficulties related to sharing and markup of whole-slide images. Results: We recruited 25 participants, ranging in experience from senior pathologists to medical students, to delineate tissue regions in 151 breast cancer slides using the Digital Slide Archive. Inter-participant discordance was systematically evaluated, revealing low discordance for tumor and stroma, and higher discordance for more subjectively defined or rare tissue classes. Feedback provided by senior participants enabled the generation and curation of 20 000+ annotated tissue regions. Fully convolutional networks trained using these annotations were highly accurate (mean AUC = 0.945), and the scale of annotation data provided notable improvements in image classification accuracy. Availability and implementation: Dataset is freely available at https://goo.gl/cNM4EL. Supplementary information: Supplementary data are available at Bioinformatics online.
ISSN: 1367-4803
1367-4811
1460-2059
DOI: 10.1093/bioinformatics/btz083