Expandable-RCNN: toward high-efficiency incremental few-shot object detection

This study aims at addressing the challenging incremental few-shot object detection (iFSOD) problem toward online adaptive detection. iFSOD targets to learn novel categories in a sequential manner, and eventually, the detection is performed on all learned categories. Moreover, only a few training sa...

Full description

Saved in:

Bibliographic Details
Published in	Frontiers in artificial intelligence Vol. 7; p. 1377337
Main Authors	Li, Yiting, Tian, Sichao, Zhu, Haiyue, Jin, Yeying, Wang, Keqing, Ma, Jun, Xiang, Cheng, Vadakkepat, Prahlad
Format	Journal Article
Language	English
Published	Switzerland Frontiers Media S.A 23.04.2024
Subjects	Artificial Intelligence few-shot learning incremental learning long-tailed recognition object detection zero-shot learning object detection incremental learning long-tailed recognition zero-shot learning few-shot learning
Online Access	Get full text

Cover

Loading…

More Information
Summary:	This study aims at addressing the challenging incremental few-shot object detection (iFSOD) problem toward online adaptive detection. iFSOD targets to learn novel categories in a sequential manner, and eventually, the detection is performed on all learned categories. Moreover, only a few training samples are available for all sequential novel classes in these situations. In this study, we propose an efficient yet suitably simple framework, Expandable-RCNN, as a solution for the iFSOD problem, which allows online sequentially adding new classes with zero retraining of the base network. We achieve this by adapting the Faster R-CNN to the few-shot learning scenario with two elegant components to effectively address the overfitting and category bias. First, an IOU-aware weight imprinting strategy is proposed to directly determine the classifier weights for incremental novel classes and the background class, which is with zero training to avoid the notorious overfitting issue in few-shot learning. Second, since the above zero-retraining imprinting approach may lead to undesired category bias in the classifier, we develop a bias correction module for iFSOD, named the group soft-max layer (GSL), that efficiently calibrates the biased prediction of the imprinted classifier to organically improve classification performance for the few-shot classes, preventing catastrophic forgetting. Extensive experiments on MS-COCO show that our method can significantly outperform the state-of-the-art method ONCE by 5.9 points in commonly encountered few-shot classes.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 Nian Liu, Mohamed bin Zayed University of Artificial Intelligence (MBZUAI), United Arab Emirates These authors have contributed equally to this work Reviewed by: Sang Jun Lee, Jeonbuk National University, Republic of Korea Edited by: Dongpo Xu, Northeast Normal University, China
ISSN:	2624-8212 2624-8212
DOI:	10.3389/frai.2024.1377337