Semi-weakly Supervised Contrastive Representation Learning for Retinal Fundus Images

Bibliographic Details
Published in	arXiv.org
Main Authors	Yap, Boon Peng; Ng, Beng Koon
Format	Paper; Journal Article
Language	English
Published	Ithaca: Cornell University Library, arXiv.org, 04.08.2021
Subjects	Annotations; Artificial neural networks; Computer Science - Computer Vision and Pattern Recognition; Computer Science - Learning; Datasets; Image acquisition; Image segmentation; Labels; Machine learning; Medical imaging; Representations; Training

Summary:	We explore the value of weak labels in learning transferable representations for medical images. Compared to hand-labeled datasets, weak or inexact labels can be acquired in large quantities at significantly lower cost and can provide useful training signals for data-hungry models such as deep neural networks. We consider weak labels in the form of pseudo-labels and propose a semi-weakly supervised contrastive learning (SWCL) framework for representation learning using semi-weakly annotated images. Specifically, we train a semi-supervised model to propagate labels from a small dataset consisting of diverse image-level annotations to a large unlabeled dataset. Using the propagated labels, we generate a patch-level dataset for pretraining and formulate a multi-label contrastive learning objective to capture position-specific features encoded in each patch. We empirically validate the transfer learning performance of SWCL on seven public retinal fundus datasets, covering three disease classification tasks and two anatomical structure segmentation tasks. Our experimental results suggest that, in a very low data regime, large-scale ImageNet pretraining on an improved architecture remains a very strong baseline, and that recently proposed self-supervised methods falter in segmentation tasks, possibly due to the strong invariance constraint they impose. Our method surpasses all prior self-supervised methods and standard cross-entropy training, while narrowing the gap to ImageNet pretraining.
ISSN:	2331-8422
DOI:	10.48550/arxiv.2108.02122
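
Illustrative note: the summary mentions a multi-label contrastive learning objective over patches but does not state its formula. The sketch below is not the paper's loss. It is one plausible formulation, assumed here purely for illustration, in which each pair of patches is treated as a positive with a weight equal to the Jaccard overlap of their multi-hot label sets. The function name `multilabel_contrastive_loss`, the Jaccard weighting, and the `temperature` default are all hypothetical choices.

```python
import numpy as np


def multilabel_contrastive_loss(z, y, temperature=0.1):
    """Hypothetical multi-label contrastive loss (not the paper's formula).

    z: (n, d) array of patch embeddings, n >= 2, no all-zero rows.
    y: (n, c) array of binary multi-hot labels (e.g. propagated pseudo-labels).
    """
    z = np.asarray(z, dtype=np.float64)
    y = np.asarray(y, dtype=np.float64)
    n = z.shape[0]

    # Cosine similarities scaled by the temperature.
    z = z / np.linalg.norm(z, axis=1, keepdims=True)
    logits = z @ z.T / temperature

    # Exclude each anchor from its own softmax denominator.
    self_mask = np.eye(n, dtype=bool)
    logits = np.where(self_mask, -np.inf, logits)

    # Numerically stable log-softmax over the other samples in the batch.
    row_max = logits.max(axis=1, keepdims=True)
    shifted = logits - row_max
    log_prob = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))
    log_prob = np.where(self_mask, 0.0, log_prob)  # avoid 0 * -inf below

    # Assumed pair weights: Jaccard overlap between label sets.
    inter = y @ y.T
    union = y.sum(axis=1, keepdims=True) + y.sum(axis=1, keepdims=True).T - inter
    weights = np.where(union > 0, inter / np.maximum(union, 1.0), 0.0)
    weights[self_mask] = 0.0

    # Average over anchors that share at least one label with another sample.
    weight_sum = weights.sum(axis=1)
    valid = weight_sum > 0
    if not valid.any():
        return 0.0
    per_anchor = -(weights * log_prob).sum(axis=1)[valid] / weight_sum[valid]
    return float(per_anchor.mean())
```

Under these assumptions, embeddings whose geometry agrees with the label overlap should give a lower loss than embeddings that disagree with it, which is the behavior a pretraining objective of this kind would reward.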