Expanding Receptive Field YOLO for Small Object Detection

State-of-art object detection networks like YOLO, SSD and Faster R-CNN all have achieved great success in object detection. However, these algorithms have a low performance in small object detection. So, we produce the Expanding receptive field YOLO (ERF-YOLO) to deal with this problem. At first, we...

Full description

Saved in:

Bibliographic Details
Published in	Journal of physics. Conference series Vol. 1314; no. 1; pp. 12202 - 12207
Main Authors	Du, Zexing, Yin, Jinyong, Yang, Jian
Format	Journal Article
Language	English
Published	Bristol IOP Publishing 01.10.2019
Subjects	Algorithms Datasets Object recognition Physics Remote sensing
Online Access	Get full text

Cover

Loading…

More Information
Summary:	State-of-art object detection networks like YOLO, SSD and Faster R-CNN all have achieved great success in object detection. However, these algorithms have a low performance in small object detection. So, we produce the Expanding receptive field YOLO (ERF-YOLO) to deal with this problem. At first, we propose an efficient block which is called expanding receptive field block (ERF-block) to capture more information in larger areas. Base on YOLOv2, we down-sample the low-level location information by ERF-block, and up-sample feature information by deconvolution. Then we further assemble these two parts together to make the prediction. After training the network on VOC dataset, we have a good result with 82.6% mAP (mean Average Precision) which is 4.0% higher than the original YOLOv2 network. Thanks to the efficient block, it takes 62fps to detect one image when the input size is 416×416, which could keep a real-time speed. In addition, we also evaluate the model on a remote sensing dataset which contains many small targets, and it also shows that ours model has a better performance.
ISSN:	1742-6588 1742-6596
DOI:	10.1088/1742-6596/1314/1/012202