Non-dense Feature Aggregation for Video Object Detection

Video object detection is a challenging task in computer vision. Dynamic background change, motion blur, motion occlusion, and other problems bring great interference to the detection results. The existing detection methods often sacrifice the detection speed for the sake of accuracy or sacrifice th...

Full description

Saved in:
Bibliographic Details
Published in2021 International Conference on Artificial Intelligence and Electromechanical Automation (AIEA) pp. 312 - 316
Main Authors Liang, Zhubin, Liu, Weibin, Xing, Weiwei
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.05.2021
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Video object detection is a challenging task in computer vision. Dynamic background change, motion blur, motion occlusion, and other problems bring great interference to the detection results. The existing detection methods often sacrifice the detection speed for the sake of accuracy or sacrifice the accuracy for the sake of faster detection speed. Therefore, we propose a non-dense feature aggregation method based on an optical flow network. The feature extraction is only carried out for keyframes, and then the feature aggregation of non-key frames is completed by the non-dense connection of adjacent frames, and the idea of dynamic programming is used to achieve the transmission of historical information, so as to obtain more effective features for detection. Our algorithm achieves a real-time detection effect without significantly increasing the amount of computation. And the algorithm achieves competitive detection results on the ImageNet VID dataset.
DOI:10.1109/AIEA53260.2021.00073