Expandable-RCNN: toward high-efficiency incremental few-shot object detection

This study aims at addressing the challenging incremental few-shot object detection (iFSOD) problem toward online adaptive detection. iFSOD targets to learn novel categories in a sequential manner, and eventually, the detection is performed on all learned categories. Moreover, only a few training sa...

Full description

Saved in:
Bibliographic Details
Published inFrontiers in artificial intelligence Vol. 7; p. 1377337
Main Authors Li, Yiting, Tian, Sichao, Zhu, Haiyue, Jin, Yeying, Wang, Keqing, Ma, Jun, Xiang, Cheng, Vadakkepat, Prahlad
Format Journal Article
LanguageEnglish
Published Switzerland Frontiers Media S.A 23.04.2024
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:This study aims at addressing the challenging incremental few-shot object detection (iFSOD) problem toward online adaptive detection. iFSOD targets to learn novel categories in a sequential manner, and eventually, the detection is performed on all learned categories. Moreover, only a few training samples are available for all sequential novel classes in these situations. In this study, we propose an efficient yet suitably simple framework, Expandable-RCNN, as a solution for the iFSOD problem, which allows online sequentially adding new classes with zero retraining of the base network. We achieve this by adapting the Faster R-CNN to the few-shot learning scenario with two elegant components to effectively address the overfitting and category bias. First, an IOU-aware weight imprinting strategy is proposed to directly determine the classifier weights for incremental novel classes and the background class, which is with zero training to avoid the notorious overfitting issue in few-shot learning. Second, since the above zero-retraining imprinting approach may lead to undesired category bias in the classifier, we develop a bias correction module for iFSOD, named the group soft-max layer (GSL), that efficiently calibrates the biased prediction of the imprinted classifier to organically improve classification performance for the few-shot classes, preventing catastrophic forgetting. Extensive experiments on MS-COCO show that our method can significantly outperform the state-of-the-art method ONCE by 5.9 points in commonly encountered few-shot classes.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
Nian Liu, Mohamed bin Zayed University of Artificial Intelligence (MBZUAI), United Arab Emirates
These authors have contributed equally to this work
Reviewed by: Sang Jun Lee, Jeonbuk National University, Republic of Korea
Edited by: Dongpo Xu, Northeast Normal University, China
ISSN:2624-8212
2624-8212
DOI:10.3389/frai.2024.1377337