Deep learning of volumetric representation for 3D object recognition

Robust 3D object detection and pose estimation is still a big challenging for robot vision. In this paper, we propose a new framework for 3D object detection and pose estimation. Rather than using RGB-D image as the original data, we propose to use volumetric representation with the help of unsuperv...

Full description

Saved in:
Bibliographic Details
Published in2017 32nd Youth Academic Annual Conference of Chinese Association of Automation (YAC) pp. 663 - 668
Main Authors Hongsen Liu, Yang Cong, Yandong Tang
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.05.2017
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Robust 3D object detection and pose estimation is still a big challenging for robot vision. In this paper, we propose a new framework for 3D object detection and pose estimation. Rather than using RGB-D image as the original data, we propose to use volumetric representation with the help of unsupervised deep learning network to extract low dimensional feature from 3D point cloud directly. The volumetric representation can not only eliminate the dense scale sampling for offline model training, but also reduce the distortion by mapping the 3D shape to 2D plane and overcome the dependence on texture information. Depending on the Hough forest, we can achieve multi-object detection and pose estimation simultaneously. In compare with the state-of-the-arts using public datasets, we justify the effectiveness of our proposed method.
DOI:10.1109/YAC.2017.7967493