PAHD-YOLOv5:Parallel Attention and Hybrid Dilated Convolution for Autonomous Driving Object Detection

Object detection in autonomous driving scenarios is a complicated task. As onboard cameras always move, the object scale varies violently, which makes object distinction and localization difficult. To solve the issues mentioned above, we propose PAHD-YOLOv5 by improving YOLOv5. Firstly, the backbone...

Full description

Saved in:

Bibliographic Details
Published in	2022 5th International Conference on Pattern Recognition and Artificial Intelligence (PRAI) pp. 418 - 425
Main Authors	Zhao, Yang, Lu, Jinzheng, Li, Qiang, Peng, Bo, Han, Jiaojiao, Huang, Bingsen
Format	Conference Proceeding
Language	English
Published	IEEE 19.08.2022
Subjects	autonomous driving Cameras Convolution Data mining Detectors Feature extraction hybrid dilated convolution Location awareness Object detection parallel attention YOLOv5
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Object detection in autonomous driving scenarios is a complicated task. As onboard cameras always move, the object scale varies violently, which makes object distinction and localization difficult. To solve the issues mentioned above, we propose PAHD-YOLOv5 by improving YOLOv5. Firstly, the backbone is reconstructed using hybrid dilated convolution combined with a cross-stage partial network, whose role is to extract the contour information of objects and maintain the resolution of feature maps. Secondly, we give the same priority to channel attention and spatial attention, causing the network to modulate the channel and space weights simultaneously, whose role is to focus on the interest region of feature maps. Our proposed PAHD-YOLOv5 improves YOLOv5s by 2.3 mAP and 2.89 mAP on KITTI and BSTLD benchmarks, and the inference speed has reached 159FPS and 124.9FPS. PAHD-YOLOv5 shows its outstanding advantages in speed and accuracy compared with other advanced detectors.
DOI:	10.1109/PRAI55851.2022.9904182