Reviving Iterative Training with Mask Guidance for Interactive Segmentation

Recent works on click-based interactive segmentation have demonstrated state-of-the-art results by using various inference-time optimization schemes. These methods are significantly more computationally expensive than feedforward approaches, as they run backward gradient passes during inference. Mor...

Full description

Saved in:

Bibliographic Details
Published in	Proceedings - International Conference on Image Processing pp. 3141 - 3145
Main Authors	Sofiiuk, Konstantin, Petrov, Ilya A., Konushin, Anton
Format	Conference Proceeding
Language	English
Published	IEEE 16.10.2022
Subjects	Analytical models Annotations Benchmark testing Codes Computer architecture Image segmentation interactive segmentation mask refinement segmentation Training
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Recent works on click-based interactive segmentation have demonstrated state-of-the-art results by using various inference-time optimization schemes. These methods are significantly more computationally expensive than feedforward approaches, as they run backward gradient passes during inference. Moreover, backward passes are not supported in popular mobile frameworks, which complicates the deployment of such methods on embedded devices. In this paper, we study design choices for interactive segmentation and discover that state-of-the-art results can be obtained without any additional optimization schemes. We propose a simple feedforward model for click-based interactive segmentation that employs the segmentation masks from previous steps. It allows not only segmenting an entirely new object but also correcting an existing mask. We analyze the performance of models trained on different datasets and observe that the choice of a training dataset has a large impact on the quality of interactive segmentation. We find that the models trained on a combination of COCO and LVIS with diverse and high-quality annotations outperform all existing models. The code and trained models are available at https://github.com/saic-vul/ritm_interactive_segmentation.
ISSN:	2381-8549
DOI:	10.1109/ICIP46576.2022.9897365