A Part-Aware Multi-Scale Fully Convolutional Network for Pedestrian Detection

Pedestrian detection is a crucial task in intelligent transportation systems, which can be applied in autonomous vehicles and traffic scene video surveillance systems. The past few years have witnessed much progress on the research of pedestrian detection methods, especially through the successful u...

Full description

Saved in:
Bibliographic Details
Published inIEEE transactions on intelligent transportation systems Vol. 22; no. 2; pp. 1125 - 1137
Main Authors Yang, Peiyu, Zhang, Guofeng, Wang, Lu, Xu, Lisheng, Deng, Qingxu, Yang, Ming-Hsuan
Format Journal Article
LanguageEnglish
Published New York IEEE 01.02.2021
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Pedestrian detection is a crucial task in intelligent transportation systems, which can be applied in autonomous vehicles and traffic scene video surveillance systems. The past few years have witnessed much progress on the research of pedestrian detection methods, especially through the successful use of the deep learning based techniques. However, occlusion and large scale variation remain the challenging issues for pedestrian detection. In this work, we propose a Part-Aware Multi-Scale Fully Convolutional Network (PAMS-FCN) to tackle these difficulties. Specifically, we present a part-aware Region-of-Interest (RoI) pooling module to mine body parts with different responses, and select the part with the strongest response via voting. As such, a partially visible pedestrian instance can receive a high detection confidence score, making it less likely to become a missing detection. This module operates in parallel with an instance RoI pooling module to combine local parts and global context information. To handle vast scale variation, we construct a fully convolutional network in which multi-scale feature maps are generated efficiently, and small-scale and large-scale pedestrians are detected separately. By integrating these structures, the proposed detector achieves the state-of-the-art performance on the Caltech, KITTI, INRIA and ETH pedestrian detection datasets.
ISSN:1524-9050
1558-0016
DOI:10.1109/TITS.2019.2963700