Characterization of 3-D Volumetric Probabilistic Scenes for Object Recognition

This paper presents a new volumetric representation for categorizing objects in large-scale 3-D scenes reconstructed from image sequences. This work uses a probabilistic volumetric model (PVM) that combines the ideas of background modeling and volumetric multi-view reconstruction to handle the uncer...

Full description

Saved in:

Bibliographic Details
Published in	IEEE journal of selected topics in signal processing Vol. 6; no. 5; pp. 522 - 537
Main Authors	Restrepo, Maria I., Mayer, Brandon A., Ulusoy, Ali O., Mundy, Joseph L.
Format	Journal Article
Language	English
Published	IEEE 01.09.2012
Subjects	3-D data processing 3-D object recognition Bayesian learning Feature extraction Geometry Image reconstruction machine vision Probabilistic logic Shape Solid modeling Surface reconstruction
Online Access	Get full text

Cover

Loading…

More Information
Summary:	This paper presents a new volumetric representation for categorizing objects in large-scale 3-D scenes reconstructed from image sequences. This work uses a probabilistic volumetric model (PVM) that combines the ideas of background modeling and volumetric multi-view reconstruction to handle the uncertainty inherent in the problem of reconstructing 3-D structures from 2-D images. The advantages of probabilistic modeling have been demonstrated by recent application of the PVM representation to video image registration, change detection and classification of changes based on PVM context. The applications just mentioned, operate on 2-D projections of the PVM. This paper presents the first work to characterize and use the local 3-D information in the scenes. Two approaches to local feature description are proposed and compared: 1) features derived from a PCA analysis of model neighborhoods; and 2) features derived from the coefficients of a 3-D Taylor series expansion within each neighborhood. The resulting description is used in a bag-of-features approach to classify buildings, houses, cars, planes, and parking lots learned from aerial imagery collected over Providence, RI. It is shown that both feature descriptions explain the data with similar accuracy and their effectiveness for dense-feature categorization is compared for the different classes. Finally, 3-D extensions of the Harris corner detector and a Hessian-based detector are used to detect salient features. Both types of salient features are evaluated through object categorization experiments, where only features with maximal response are retained. For most saliency criteria tested, features based on the determinant of the Hessian achieved higher classification accuracy than Harris-based features.
ISSN:	1932-4553 1941-0484
DOI:	10.1109/JSTSP.2012.2201693