Expandable-RCNN: toward high-efficiency incremental few-shot object detection
This study aims at addressing the challenging incremental few-shot object detection (iFSOD) problem toward online adaptive detection. iFSOD targets to learn novel categories in a sequential manner, and eventually, the detection is performed on all learned categories. Moreover, only a few training sa...
Saved in:
Published in | Frontiers in artificial intelligence Vol. 7; p. 1377337 |
---|---|
Main Authors | , , , , , , , |
Format | Journal Article |
Language | English |
Published |
Switzerland
Frontiers Media S.A
23.04.2024
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | This study aims at addressing the challenging incremental few-shot object detection (iFSOD) problem toward online adaptive detection. iFSOD targets to learn novel categories in a sequential manner, and eventually, the detection is performed on all learned categories. Moreover, only a few training samples are available for all sequential novel classes in these situations. In this study, we propose an efficient yet suitably simple framework, Expandable-RCNN, as a solution for the iFSOD problem, which allows online sequentially adding new classes with zero retraining of the base network. We achieve this by adapting the Faster R-CNN to the few-shot learning scenario with two elegant components to effectively address the overfitting and category bias. First, an IOU-aware weight imprinting strategy is proposed to directly determine the classifier weights for incremental novel classes and the background class, which is with zero training to avoid the notorious overfitting issue in few-shot learning. Second, since the above zero-retraining imprinting approach may lead to undesired category bias in the classifier, we develop a bias correction module for iFSOD, named the group soft-max layer (GSL), that efficiently calibrates the biased prediction of the imprinted classifier to organically improve classification performance for the few-shot classes, preventing catastrophic forgetting. Extensive experiments on MS-COCO show that our method can significantly outperform the state-of-the-art method ONCE by 5.9 points in commonly encountered few-shot classes. |
---|---|
Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 Nian Liu, Mohamed bin Zayed University of Artificial Intelligence (MBZUAI), United Arab Emirates These authors have contributed equally to this work Reviewed by: Sang Jun Lee, Jeonbuk National University, Republic of Korea Edited by: Dongpo Xu, Northeast Normal University, China |
ISSN: | 2624-8212 2624-8212 |
DOI: | 10.3389/frai.2024.1377337 |