Boundary-refined prototype generation: A general end-to-end paradigm for semi-supervised semantic segmentation

Semi-supervised semantic segmentation has attracted increasing attention in computer vision, aiming to leverage unlabeled data through latent supervision. To achieve this goal, prototype-based classification has been introduced and achieved lots of success. However, the current approaches isolate pr...

Full description

Saved in:

Bibliographic Details
Published in	Engineering applications of artificial intelligence Vol. 137; p. 109021
Main Authors	Dong, Junhao, Meng, Zhu, Liu, Delong, Liu, Jiaxuan, Zhao, Zhicheng, Su, Fei
Format	Journal Article
Language	English
Published	Elsevier Ltd 01.11.2024
Subjects	Mean teacher Prototype-based contrastive learning Semantic segmentation Semi-supervised learning Mean teacher Semantic segmentation Semi-supervised learning Prototype-based contrastive learning
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Semi-supervised semantic segmentation has attracted increasing attention in computer vision, aiming to leverage unlabeled data through latent supervision. To achieve this goal, prototype-based classification has been introduced and achieved lots of success. However, the current approaches isolate prototype generation from the main training framework, presenting a non-end-to-end workflow. Furthermore, most methods directly perform the K-Means clustering on features to generate prototypes, resulting in their proximity to category semantic centers, while overlooking the clear delineation of class boundaries. To address the above problems, we propose a novel end-to-end boundary-refined prototype generation (BRPG) method. Specifically, we perform online clustering on sampled features to incorporate the prototype generation into the whole training framework. In addition, to enhance the classification boundaries, we sample and cluster high- and low-confidence features separately based on confidence estimation, facilitating the generation of prototypes closer to the class boundaries. Moreover, an adaptive prototype optimization strategy is proposed to increase the number of prototypes for categories with scattered feature distributions, which further refines the class boundaries. Extensive experiments demonstrate the remarkable robustness and scalability of our method across diverse datasets, segmentation networks, and semi-supervised frameworks, outperforming the state-of-the-art approaches on three benchmark datasets: PASCAL VOC 2012, Cityscapes and MS COCO. The code is available at https://github.com/djh-dzxw/BRPG.
ISSN:	0952-1976
DOI:	10.1016/j.engappai.2024.109021