Deep learning of volumetric representation for 3D object recognition

Robust 3D object detection and pose estimation is still a big challenging for robot vision. In this paper, we propose a new framework for 3D object detection and pose estimation. Rather than using RGB-D image as the original data, we propose to use volumetric representation with the help of unsuperv...

Full description

Saved in:

Bibliographic Details
Published in	2017 32nd Youth Academic Annual Conference of Chinese Association of Automation (YAC) pp. 663 - 668
Main Authors	Hongsen Liu, Yang Cong, Yandong Tang
Format	Conference Proceeding
Language	English
Published	IEEE 01.05.2017
Subjects	Deep Learning Hough Forest and 3D Object Recognition Volumetric Representation
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Robust 3D object detection and pose estimation is still a big challenging for robot vision. In this paper, we propose a new framework for 3D object detection and pose estimation. Rather than using RGB-D image as the original data, we propose to use volumetric representation with the help of unsupervised deep learning network to extract low dimensional feature from 3D point cloud directly. The volumetric representation can not only eliminate the dense scale sampling for offline model training, but also reduce the distortion by mapping the 3D shape to 2D plane and overcome the dependence on texture information. Depending on the Hough forest, we can achieve multi-object detection and pose estimation simultaneously. In compare with the state-of-the-arts using public datasets, we justify the effectiveness of our proposed method.
DOI:	10.1109/YAC.2017.7967493