Fine-Grained Semantic Information Preservation and Misclassification-Aware Loss for 3D Point Cloud

Encoder-Decoder structure is a popular choice in point cloud processing for dense multi-classification tasks, e.g., 3D semantic segmentation. Though existing techniques that follow this structure achieve high performance, they are known to suffer from fine-grained information loss, especially when t...

Full description

Saved in:
Bibliographic Details
Published inIEEE robotics and automation letters Vol. 9; no. 9; pp. 7947 - 7954
Main Authors Zou, Yanmei, Lin, Xuefei, Yu, Hongshan, Yang, Zhengeng, Akhtar, Naveed
Format Journal Article
LanguageEnglish
Published IEEE 01.09.2024
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Encoder-Decoder structure is a popular choice in point cloud processing for dense multi-classification tasks, e.g., 3D semantic segmentation. Though existing techniques that follow this structure achieve high performance, they are known to suffer from fine-grained information loss, especially when the underlying networks are deep. To alleviate this, we propose a bilateral attention fusion module (BAFM) that leverages multi-resolution feature fusion to allow more effective fine-grained information flow in the network. Moreover, we also introduce a misclassification-aware loss (MAL) as a more potent alternative to the widely used cross-entropy (CE) loss for multi-classification tasks. MAL enables an explicit penalization of misclassification. Empirical experiments reveal that our method achieves state-of-the-art performance for several challenging datasets, such as 91.9% mAcc on ModelNet40, 88.4% OA and 87.0% mAcc on ScanObjectNN and 71.3% mIoU on S3DIS Area-5.
ISSN:2377-3766
2377-3766
DOI:10.1109/LRA.2024.3404751