Direction-induced convolution for point cloud analysis

Bibliographic Details
Published in: Multimedia Systems, Vol. 28, No. 2, pp. 457-468
Main Authors: Fang, Yuan; Xu, Chunyan; Zhou, Chuanwei; Cui, Zhen; Hu, Chunlong
Format: Journal Article
Language: English
Published: Berlin/Heidelberg: Springer Berlin Heidelberg; Springer Nature B.V., 01.04.2022
Summary: Point cloud analysis has become a fundamental but challenging problem in the field of 3D scene understanding. To deal with unstructured and unordered point clouds embedded in 3D space, we propose a novel direction-induced convolution (DIConv) to obtain hierarchical representations of point clouds and thereby boost the performance of point cloud analysis. Specifically, we first construct a direction set as the basis of spatial direction information, whose entries denote the latent direction components of 3D points. For each neighbor point, we project its direction information onto the constructed direction set to obtain an array of direction-dependent weights, and then transform its features into the canonical, ordered direction-set space. After that, a standard image-like convolution can be applied to encode the unordered neighborhood regions of point cloud data. We further develop a residual DIConv (Res_DIConv) module and a farthest point sampling residual DIConv (FPS_Res_DIConv) module for jointly capturing the hierarchical features of input point clouds. By alternately stacking Res_DIConv and FPS_Res_DIConv modules, a direction-induced convolution network (DICNet) can be built to perform point cloud analysis in an end-to-end fashion. Comprehensive experiments on three benchmark datasets (ModelNet40, ShapeNet Part, and S3DIS) demonstrate that the proposed DIConv method achieves encouraging performance on both point cloud classification and semantic segmentation tasks.
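As a rough illustration of the pipeline described in the summary, the PyTorch sketch below assigns each neighbor's unit offset direction to a small direction set via dot-product projection and a softmax (the normalization scheme is an assumption), bins neighbor features into the resulting ordered direction space, and applies a standard convolution over the ordered bins. All names here (DIConvSketch, num_directions) are hypothetical, and the direction set is made learnable purely for simplicity; this is a minimal sketch under those assumptions, not the authors' implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class DIConvSketch(nn.Module):
    """Hypothetical sketch of a direction-induced convolution layer.

    Neighbor offsets are normalized to unit directions, softly assigned to a
    small set of basis directions, and the neighbor features are aggregated
    into that ordered direction space, where an image-like convolution applies.
    """

    def __init__(self, in_channels, out_channels, num_directions=8):
        super().__init__()
        # Direction set: D basis directions in R^3 (made learnable here; assumption).
        self.directions = nn.Parameter(torch.randn(num_directions, 3))
        # Standard convolution over the D ordered direction bins.
        self.conv = nn.Conv2d(in_channels, out_channels, kernel_size=(num_directions, 1))

    def forward(self, xyz, features, neighbor_idx):
        # xyz:          (B, N, 3)  point coordinates
        # features:     (B, N, C)  per-point features
        # neighbor_idx: (B, N, K)  indices of K neighbors per point
        B, N, K = neighbor_idx.shape
        C = features.shape[-1]

        # Gather neighbor coordinates and features.
        idx = neighbor_idx.reshape(B, N * K)
        nbr_xyz = torch.gather(xyz, 1, idx.unsqueeze(-1).expand(-1, -1, 3)).view(B, N, K, 3)
        nbr_feat = torch.gather(features, 1, idx.unsqueeze(-1).expand(-1, -1, C)).view(B, N, K, C)

        # Unit direction of each neighbor relative to its center point.
        dirs = F.normalize(nbr_xyz - xyz.unsqueeze(2), dim=-1)        # (B, N, K, 3)

        # Project onto the direction set -> direction-dependent weights.
        basis = F.normalize(self.directions, dim=-1)                  # (D, 3)
        weights = F.softmax(dirs @ basis.t(), dim=-1)                 # (B, N, K, D)

        # Transform neighbor features into the ordered direction-set space.
        binned = torch.einsum('bnkd,bnkc->bndc', weights, nbr_feat)   # (B, N, D, C)

        # Image-like convolution over the ordered bins.
        out = self.conv(binned.permute(0, 3, 2, 1))                   # (B, C_out, 1, N)
        return out.squeeze(2).transpose(1, 2)                         # (B, N, C_out)
```

A full DICNet, as described above, would wrap such layers in residual blocks (Res_DIConv) and interleave farthest point sampling stages (FPS_Res_DIConv) to build a hierarchical, end-to-end network; those details are omitted here.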
ISSN: 0942-4962, 1432-1882
DOI: 10.1007/s00530-021-00770-0