PVI-Net: Point–Voxel–Image Fusion for Semantic Segmentation of Point Clouds in Large-Scale Autonomous Driving Scenarios

Bibliographic Details
Published in: Information (Basel), Vol. 15, No. 3, p. 148
Main Authors: Wang, Zongshun; Li, Ce; Ma, Jialin; Feng, Zhiqiang; Xiao, Limei
Format: Journal Article
Language: English
Published: Basel: MDPI AG, 01.03.2024
Summary: In this study, we introduce a novel framework for the semantic segmentation of point clouds in autonomous driving scenarios, termed PVI-Net. The framework integrates three data perspectives (point clouds, voxels, and distance maps), performing feature extraction through three parallel branches. We design a point cloud–voxel cross-attention mechanism and a multi-perspective feature fusion strategy for point images. These strategies enable information interaction across the feature dimensions of the different perspectives, optimizing the fusion of information from the various viewpoints and significantly enhancing the overall performance of the model. The network employs a U-Net structure with residual connections, effectively merging and encoding information to improve the precision and efficiency of semantic segmentation. We validated PVI-Net on the SemanticKITTI and nuScenes datasets; the results show that it surpasses most previous methods on a range of performance metrics.
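
For orientation, the following is a minimal PyTorch sketch of what a point cloud–voxel cross-attention block of this kind could look like, with per-point features attending to voxel features. The class name, tensor shapes, and hyperparameters are illustrative assumptions, not the authors' published implementation.

# Hypothetical sketch of a point-voxel cross-attention block; not the
# authors' code. Point features act as queries and attend to voxel features.
import torch
import torch.nn as nn

class PointVoxelCrossAttention(nn.Module):
    def __init__(self, dim: int = 64, num_heads: int = 4):
        super().__init__()
        self.attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        self.norm = nn.LayerNorm(dim)

    def forward(self, point_feats, voxel_feats):
        # point_feats: (B, N, C) per-point features from the point branch
        # voxel_feats: (B, M, C) features of occupied voxels from the voxel branch
        fused, _ = self.attn(point_feats, voxel_feats, voxel_feats)
        # Residual connection, echoing the residual design noted in the abstract
        return self.norm(point_feats + fused)

pts = torch.randn(2, 1024, 64)  # 2 scans, 1024 points, 64-dim features
vox = torch.randn(2, 256, 64)   # 2 scans, 256 occupied voxels, 64-dim features
print(PointVoxelCrossAttention()(pts, vox).shape)  # torch.Size([2, 1024, 64])

The output has one fused feature vector per point, so such a block could slot into the point branch while drawing context from the voxel branch; the abstract does not specify where or how often the mechanism is applied.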
ISSN: 2078-2489
DOI: 10.3390/info15030148