The inverted multi-index

A new data structure for efficient similarity search in very large datasets of high-dimensional vectors is introduced. This structure, called the inverted multi-index, generalizes the inverted index idea by replacing the standard quantization within inverted indices with product quantization. For very...

Bibliographic Details
Published in	2012 IEEE Conference on Computer Vision and Pattern Recognition, pp. 3069-3076
Main Authors	Babenko, A., Lempitsky, V.
Format	Conference Proceeding
Language	English
Published	IEEE 01.06.2012

Summary:	A new data structure for efficient similarity search in very large datasets of high-dimensional vectors is introduced. This structure, called the inverted multi-index, generalizes the inverted index idea by replacing the standard quantization within inverted indices with product quantization. For very similar retrieval complexity and preprocessing time, inverted multi-indices achieve a much denser subdivision of the search space compared to inverted indices, while retaining their memory efficiency. Our experiments with large datasets of SIFT and GIST vectors demonstrate that, because of the denser subdivision, inverted multi-indices are able to return much shorter candidate lists with higher recall. Augmented with a suitable reranking procedure, multi-indices were able to improve the speed of approximate nearest neighbor search on the dataset of 1 billion SIFT vectors by an order of magnitude compared to the best previously published systems, while achieving better recall and incurring only a few percent of memory overhead.
ISBN:	9781467312264 1467312266
ISSN:	1063-6919
DOI:	10.1109/CVPR.2012.6248038
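
The idea in the summary, replacing a single quantizer with a product quantizer so that K codewords per half yield K x K cells, can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration: the class `InvertedMultiIndex`, its `add` and `candidates` methods, the two-way split of each vector, and the tiny hand-picked codebooks (a real system would presumably learn them, for example with k-means). The heap-based order in which cells are visited is one plausible way to produce a candidate list and is not taken from the summary.

```python
import heapq
from collections import defaultdict


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length sequences."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def nearest(codebook, sub):
    """Index of the codeword closest to the sub-vector."""
    return min(range(len(codebook)), key=lambda k: sq_dist(codebook[k], sub))


class InvertedMultiIndex:
    """Hypothetical two-block multi-index: a cell is a pair of codeword ids."""

    def __init__(self, codebook_a, codebook_b):
        self.cb_a = codebook_a            # codewords for the first half
        self.cb_b = codebook_b            # codewords for the second half
        self.half = len(codebook_a[0])    # dimensionality of the first half
        self.cells = defaultdict(list)    # (i, j) -> list of vector ids

    def add(self, vec_id, vec):
        # Quantize each half independently; the pair of ids names the cell.
        a, b = vec[:self.half], vec[self.half:]
        cell = (nearest(self.cb_a, a), nearest(self.cb_b, b))
        self.cells[cell].append(vec_id)

    def candidates(self, query, max_candidates):
        qa, qb = query[:self.half], query[self.half:]
        # Rank the codewords of each half by distance to the query half.
        da = sorted((sq_dist(c, qa), i) for i, c in enumerate(self.cb_a))
        db = sorted((sq_dist(c, qb), j) for j, c in enumerate(self.cb_b))
        # Visit cells in order of increasing summed distance (best first).
        heap = [(da[0][0] + db[0][0], 0, 0)]
        seen = {(0, 0)}
        out = []
        while heap and len(out) < max_candidates:
            _, r, s = heapq.heappop(heap)
            out.extend(self.cells.get((da[r][1], db[s][1]), []))
            for nr, ns in ((r + 1, s), (r, s + 1)):
                if nr < len(da) and ns < len(db) and (nr, ns) not in seen:
                    seen.add((nr, ns))
                    heapq.heappush(heap, (da[nr][0] + db[ns][0], nr, ns))
        return out[:max_candidates]


# Toy data: 2-D vectors, each half is 1-D, 2 codewords per half -> 4 cells.
index = InvertedMultiIndex([[0.0], [10.0]], [[0.0], [10.0]])
for vec_id, vec in enumerate([[1, 1], [9, 1], [1, 9], [9, 9]]):
    index.add(vec_id, vec)

result = index.candidates([2, 1], max_candidates=2)
```

With only two codewords per half, the four toy vectors fall into four distinct cells, whereas a single two-codeword quantizer over the whole vector could separate them into at most two lists. In a full pipeline the returned candidate ids would then be passed to a reranking step, as the summary describes.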