A Simple and Efficient Network for Small Target Detection

Target detection based on deep learning is developing rapidly. However, small target detection is still a challenge. In this paper, a simple and efficient network for small target detection is proposed. We put forward to improve the detection performance of the small targets in three aspects. First,...

Full description

Saved in:

Bibliographic Details
Published in	IEEE access Vol. 7; pp. 85771 - 85781
Main Authors	Ju, Moran, Luo, Jiangning, Zhang, Panpan, He, Miao, Luo, Haibo
Format	Journal Article
Language	English
Published	Piscataway IEEE 2019 The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects	Algorithms Convolution Datasets Deep learning dilated convolution Feature extraction Frames per second Machine learning Modules Object detection passthrough layer Performance evaluation Real-time systems Semantics Target detection
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Target detection based on deep learning is developing rapidly. However, small target detection is still a challenge. In this paper, a simple and efficient network for small target detection is proposed. We put forward to improve the detection performance of the small targets in three aspects. First, as the contextual information is important to detect the small targets, we proposed to use "dilated module" to expand the receptive field without loss of resolution or coverage. Second, we applied feature fusion in different dilated modules to improve the ability of the network in detecting small targets. Finally, we used "passthrough module" to get the finer-grained information from the earlier layer and combined it with the semantic information from the deeper layer. To improve the detection speed of the network, it is proposed to use <inline-formula> <tex-math notation="LaTeX">1\times 1 </tex-math></inline-formula> convolution to reduce the dimension of the network. We composed small vehicle dataset based on VEDAI dataset and DOTA dataset, respectively, and also analyzed the distribution of the small targets in each dataset. To evaluate the performance of the proposed network, we trained the model on the dataset above and compared with the state-of-the-art target detection algorithms, our approach achieved 80.16% average precision (AP) on VEDAI dataset and 88.63% AP on DOTA dataset and the frames per second (FPS) is 75.4. The AP of our network is much better than the result of the tiny YOLO V3 and is nearly the same as the result of the YOLO V3. However, the FPS of our network is almost the same as that of the tiny YOLO V3.
ISSN:	2169-3536 2169-3536
DOI:	10.1109/ACCESS.2019.2924960