Object detection in crowded scenes via joint prediction

Bibliographic Details
Published in Defence Technology Vol. 21; pp. 103-115
Main Authors Xu, Hong-hui, Wang, Xin-qing, Wang, Dong, Duan, Bao-guo, Rui, Ting
Format Journal Article
Language English
Published Elsevier B.V. 01.03.2023
KeAi Communications Co., Ltd
Summary: Detecting highly overlapped objects in crowded scenes remains a challenging problem, especially for one-stage detectors. In this paper, we extricate YOLOv4 from this dilemma by fine-tuning its detection scheme; the resulting detector is named YOLO-CS. Specifically, we give YOLOv4 the ability to detect multiple objects in one cell. Central to our method is a carefully designed joint prediction scheme, which is executed through an assignment of bounding boxes and a joint loss. Equipped with the derived joint-object augmentation (DJA), refined regression loss (RL) and Score-NMS (SN), YOLO-CS achieves detection performance competitive with state-of-the-art detectors on the CrowdHuman and CityPersons benchmarks at little cost in time. Furthermore, on the widely used general benchmark COCO, YOLO-CS still performs well, indicating its robustness to various scenes.
ISSN: 2214-9147
DOI: 10.1016/j.dt.2021.10.007