Bag of spatio-visual words for context inference in scene classification

In the “bag of visual words (BoVW)” representation each image is represented by an unordered set of visual words. In this paper, a novel approach to encode ordered spatial configurations of visual words in order to add context in the representation is presented. The proposed method introduces a bag...

Full description

Saved in:
Bibliographic Details
Published inPattern recognition Vol. 46; no. 3; pp. 1039 - 1053
Main Authors Bolovinou, A., Pratikakis, I., Perantonis, S.
Format Journal Article
LanguageEnglish
Published Kidlington Elsevier Ltd 01.03.2013
Elsevier
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:In the “bag of visual words (BoVW)” representation each image is represented by an unordered set of visual words. In this paper, a novel approach to encode ordered spatial configurations of visual words in order to add context in the representation is presented. The proposed method introduces a bag of spatio-visual words representation (BoSVW) obtained by clustering of visual words' correlogram ensembles. Specifically, the spherical K-means clustering algorithm is employed accounting for the large dimensionality and the sparsity of the proposed spatio-visual descriptors. Experimental results on four standard datasets show that the proposed method significantly improves a state-of-the-art BoVW model and compares favorably to existing context-based scene classification approaches. ► Reform BoVw representation to include spatio-contextual information. ► Spherical k-means for high-dimentional spatio-visual data clustering. ► Improves a state-of-the-art BoVw model on 4 reference datasets. ► Compares favorably to existing context-based scene classification approaches.
Bibliography:ObjectType-Article-2
SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 23
ISSN:0031-3203
1873-5142
DOI:10.1016/j.patcog.2012.07.024