Expanding Receptive Field YOLO for Small Object Detection


Bibliographic Details
Published in: Journal of physics. Conference series, Vol. 1314, no. 1, pp. 12202 - 12207
Main Authors: Du, Zexing; Yin, Jinyong; Yang, Jian
Format: Journal Article
Language: English
Published: Bristol, IOP Publishing, 01.10.2019
Summary: State-of-the-art object detection networks such as YOLO, SSD and Faster R-CNN have achieved great success in object detection. However, these algorithms perform poorly on small objects. We therefore propose the Expanding Receptive Field YOLO (ERF-YOLO) to address this problem. First, we propose an efficient block, called the expanding receptive field block (ERF-block), to capture more information over larger areas. Based on YOLOv2, we down-sample the low-level location information with the ERF-block and up-sample the feature information by deconvolution. We then combine these two parts to make the prediction. After training the network on the VOC dataset, we obtain 82.6% mAP (mean Average Precision), which is 4.0% higher than the original YOLOv2 network. Thanks to the efficient block, the network runs at 62 fps when the input size is 416×416, which maintains real-time speed. In addition, we evaluate the model on a remote sensing dataset that contains many small targets, and the results also show that our model performs better.
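
The down-sample / up-sample fusion described in the summary only works if both paths end at the same spatial size. The following is a minimal sketch of that size arithmetic, not the authors' implementation. The 416×416 input and the 62 fps figure come from the summary, and the overall stride of 32 is the standard YOLOv2 backbone stride. Everything else is an assumption made for illustration: the stride-8 low-level map, the kernel, stride and padding values, and the helper names `conv_out` and `deconv_out`. The internal layers of the ERF-block are not given in this record and are not modeled here.

```python
def conv_out(size, kernel, stride=1, padding=0, dilation=1):
    """Spatial output size of a standard (optionally dilated) convolution."""
    effective_kernel = dilation * (kernel - 1) + 1
    return (size + 2 * padding - effective_kernel) // stride + 1


def deconv_out(size, kernel, stride=1, padding=0, output_padding=0):
    """Spatial output size of a transposed convolution (deconvolution)."""
    return (size - 1) * stride - 2 * padding + kernel + output_padding


input_size = 416               # input resolution quoted in the summary
deep = input_size // 32        # YOLOv2 backbone output at overall stride 32
shallow = input_size // 8      # ASSUMED low-level feature map at stride 8

# ASSUMED down-sampling path: one 3x3 convolution, stride 2, padding 1.
down = conv_out(shallow, kernel=3, stride=2, padding=1)

# ASSUMED up-sampling path: one 2x2 transposed convolution, stride 2.
up = deconv_out(deep, kernel=2, stride=2)

# Both paths must agree spatially before they can be combined.
can_fuse = down == up

# Per-image latency implied by the reported 62 fps.
ms_per_image = 1000 / 62

print(shallow, deep, down, up, can_fuse, round(ms_per_image, 1))
```

Under these assumed layer settings both paths arrive at a 26×26 map, so they can be combined channel-wise; different kernel or stride choices would change the intermediate sizes but not the requirement that the two paths match.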
ISSN: 1742-6588, 1742-6596
DOI: 10.1088/1742-6596/1314/1/012202