Activating Frequency and VIT for 3D Point Cloud Quality Assessment Without Reference

Bibliographic Details
Published in	2023 IEEE International Conference on Image Processing Challenges and Workshops (ICIPCW), pp. 3636-3640
Main Authors	Messai, Oussama; Bentamou, Abdelouahid; Zein-Eddine, Abbass; Gavet, Yann
Format	Conference Proceeding
Language	English
Published	IEEE 08.10.2023
Subjects	deep learning; Deformable models; Image coding; Image color analysis; No-reference 3D point cloud quality assessment; Point cloud compression; Solid modeling; Three-dimensional displays; Vision Transformer (ViT); Visualization
Summary:	Deep learning-based methods have significantly advanced perceptual multimedia quality assessment; however, such assessment is still in its early stages for 3D visual data such as 3D point clouds (PCs). Because 3D-PCs are large, they are frequently compressed for transmission and viewing, which may affect perceived quality. Therefore, we propose a no-reference quality metric for a given 3D-PC. Compared with existing methods, which mostly focus on geometry or color aspects, we propose integrating frequency magnitudes as an indicator of the spatial degradation patterns caused by compression. To map the input attributes to a quality score, we use a lightweight hybrid deep model that combines a Deformable Convolutional Network (DCN) and Vision Transformers (ViT). Experiments are carried out on the ICIP20 [1] and PointXR [2] datasets, and on a new large dataset called BASICS [3]. The results show that our approach outperforms state-of-the-art NR-PCQA measures and even some FR-PCQA measures on PointXR. The implementation code can be found at: https://github.com/o-messai/3D-PCQA
DOI:	10.1109/ICIPC59416.2023.10328373
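
Illustrative note: the summary mentions frequency magnitudes as an indicator of spatial degradation but does not say how they are computed. The sketch below is not the authors' pipeline; it only shows, under stated assumptions, what a frequency-magnitude feature might look like. It assumes a small 2D grayscale patch (for example, one taken from a projection of the point cloud) and uses a naive discrete Fourier transform so that it needs no external libraries. The function names `dft2_magnitude` and `non_dc_energy_ratio` are hypothetical and do not come from the paper or its repository.

```python
import cmath
import math


def dft2_magnitude(patch):
    """Return |F(u, v)| for a small 2D patch given as a list of equal-length rows.

    Naive O(N^4) DFT, for illustration only; a real pipeline would use an FFT library.
    """
    rows, cols = len(patch), len(patch[0])
    mags = [[0.0] * cols for _ in range(rows)]
    for u in range(rows):
        for v in range(cols):
            acc = 0j
            for y in range(rows):
                for x in range(cols):
                    angle = -2.0 * math.pi * (u * y / rows + v * x / cols)
                    acc += patch[y][x] * cmath.exp(1j * angle)
            mags[u][v] = abs(acc)
    return mags


def non_dc_energy_ratio(patch):
    """Fraction of spectral energy outside the DC term.

    Hypothetical summary statistic, assumed here only to compare patches.
    """
    mags = dft2_magnitude(patch)
    total = sum(m * m for row in mags for m in row)
    if total == 0.0:
        return 0.0
    return 1.0 - (mags[0][0] ** 2) / total


# Toy 4x4 patches: a checkerboard texture and a flat patch with the same mean,
# standing in for a detailed region and a version that has lost its detail.
checker = [[(x + y) % 2 for x in range(4)] for y in range(4)]
flat = [[0.5] * 4 for _ in range(4)]

checker_ratio = non_dc_energy_ratio(checker)
flat_ratio = non_dc_energy_ratio(flat)
```

For these toy inputs, half of the checkerboard's spectral energy lies outside the DC term, while the flat patch has essentially none, so `checker_ratio` is about 0.5 and `flat_ratio` is about 0. This is the kind of contrast a frequency-based feature can expose when compression removes fine detail.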