Learn Discriminative Features for Small Object Detection through Multi-Scale Image Degradation with Contrastive Learning
To address challenges such as small target sizes, blurred target features, and difficulty in distinguishing between targets and backgrounds in small object detection, we propose a method based on Multi-Scale Image Degradation combined with the Contrastive Learning model. By leveraging contrastive le...
Saved in:
Published in | IEICE Transactions on Information and Systems Vol. E108.D; no. 4; pp. 371 - 383 |
---|---|
Main Authors | , , , , , , , , , |
Format | Journal Article |
Language | English |
Published |
Tokyo
The Institute of Electronics, Information and Communication Engineers
01.04.2025
Japan Science and Technology Agency |
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | To address challenges such as small target sizes, blurred target features, and difficulty in distinguishing between targets and backgrounds in small object detection, we propose a method based on Multi-Scale Image Degradation combined with the Contrastive Learning model. By leveraging contrastive learning techniques, our approach aims to enhance the discriminative features necessary for accurately distinguishing objects from backgrounds. To specifically target small objects, we subject target samples to various multi-scale image degradation modes before inputting them into the contrastive learning model. Augmentation techniques are then applied to these degraded samples to facilitate effective contrastive feature learning. Consequently, the model is better equipped to uncover the differences between small targets and backgrounds, thereby improving small object detection performance. Furthermore, considering that spatial domain features are sensitive to local changes in the image, while frequency domain features are sensitive to global structural changes, our approach applies the contrastive learning model in both spatial and frequency domains, aiming to acquire more robust features for small object detection. Extensive experiments conducted on the MS COCO dataset and the VisDrone2019 dataset validate the effectiveness of our proposed method in significantly enhancing small object detection accuracy. |
---|---|
Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 |
ISSN: | 0916-8532 1745-1361 |
DOI: | 10.1587/transinf.2024EDP7204 |