PAHD-YOLOv5:Parallel Attention and Hybrid Dilated Convolution for Autonomous Driving Object Detection

Object detection in autonomous driving scenarios is a complicated task. As onboard cameras always move, the object scale varies violently, which makes object distinction and localization difficult. To solve the issues mentioned above, we propose PAHD-YOLOv5 by improving YOLOv5. Firstly, the backbone...

Full description

Saved in:
Bibliographic Details
Published in2022 5th International Conference on Pattern Recognition and Artificial Intelligence (PRAI) pp. 418 - 425
Main Authors Zhao, Yang, Lu, Jinzheng, Li, Qiang, Peng, Bo, Han, Jiaojiao, Huang, Bingsen
Format Conference Proceeding
LanguageEnglish
Published IEEE 19.08.2022
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Object detection in autonomous driving scenarios is a complicated task. As onboard cameras always move, the object scale varies violently, which makes object distinction and localization difficult. To solve the issues mentioned above, we propose PAHD-YOLOv5 by improving YOLOv5. Firstly, the backbone is reconstructed using hybrid dilated convolution combined with a cross-stage partial network, whose role is to extract the contour information of objects and maintain the resolution of feature maps. Secondly, we give the same priority to channel attention and spatial attention, causing the network to modulate the channel and space weights simultaneously, whose role is to focus on the interest region of feature maps. Our proposed PAHD-YOLOv5 improves YOLOv5s by 2.3 mAP and 2.89 mAP on KITTI and BSTLD benchmarks, and the inference speed has reached 159FPS and 124.9FPS. PAHD-YOLOv5 shows its outstanding advantages in speed and accuracy compared with other advanced detectors.
DOI:10.1109/PRAI55851.2022.9904182