Diversity matters: Cross-head mutual mean-teaching for semi-supervised medical image segmentation

Semi-supervised medical image segmentation (SSMIS) has witnessed substantial advancements by leveraging limited labeled data and abundant unlabeled data. Nevertheless, existing state-of-the-art (SOTA) methods encounter challenges in accurately predicting labels for the unlabeled data, giving rise to...

Full description

Saved in:

Bibliographic Details
Published in	Medical image analysis Vol. 97; p. 103302
Main Authors	Li, Wei, Bian, Ruifeng, Zhao, Wenyi, Xu, Weijin, Yang, Huihua
Format	Journal Article
Language	English
Published	Netherlands Elsevier B.V 01.10.2024
Subjects	Medical image segmentation Mutual learning Semi-supervised learning Virtual adversarial learning Virtual adversarial learning Mutual learning Semi-supervised learning Medical image segmentation
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Semi-supervised medical image segmentation (SSMIS) has witnessed substantial advancements by leveraging limited labeled data and abundant unlabeled data. Nevertheless, existing state-of-the-art (SOTA) methods encounter challenges in accurately predicting labels for the unlabeled data, giving rise to disruptive noise during training and susceptibility to erroneous information overfitting. Moreover, applying perturbations to inaccurate predictions further impedes consistent learning. To address these concerns, we propose a novel cross-head mutual mean-teaching network (CMMT-Net) incorporated weak-strong data augmentations, thereby benefiting both co-training and consistency learning. More concretely, our CMMT-Net extends the cross-head co-training paradigm by introducing two auxiliary mean teacher models, which yield more accurate predictions and provide supplementary supervision. The predictions derived from weakly augmented samples generated by one mean teacher are leveraged to guide the training of another student with strongly augmented samples. Furthermore, two distinct yet synergistic data perturbations at the pixel and region levels are introduced. We propose mutual virtual adversarial training (MVAT) to smooth the decision boundary and enhance feature representations, and a cross-set CutMix strategy to generate more diverse training samples for capturing inherent structural data information. Notably, CMMT-Net simultaneously implements data, feature, and network perturbations, amplifying model diversity and generalization performance. Experimental results on three publicly available datasets indicate that our approach yields remarkable improvements over previous SOTA methods across various semi-supervised scenarios. The code is available at https://github.com/Leesoon1984/CMMT-Net. •Proposed CMMT-Net for semi-supervised medical image segmentation (SSMIS).•CMMT-Net integrates data, feature, and network perturbations for SSMIS tasks.•Our mutual virtual adversarial training smoothes the model’s decision boundary.•CMMT-Net adopts cross-set CutMix to facilitate cross-view consistency learning.•Extensive experiments proved that CMMT-Net has excellent SSMIS performance.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23
ISSN:	1361-8415 1361-8423 1361-8423
DOI:	10.1016/j.media.2024.103302