IMI-GPU: Inverted multi-index for billion-scale approximate nearest neighbor search with GPUs

Similarity search is utilized in specialized database systems designed to handle multimedia data, often represented by high-dimensional features. In this paper, we focus on speeding up the search process with GPUs. This problem has been previously approached by accelerating the Inverted File with As...

Full description

Saved in:
Bibliographic Details
Published inJournal of parallel and distributed computing Vol. 200; p. 105066
Main Authors Araujo, Alan, Barreiros, Willian, Kong, Jun, Ferreira, Renato, Teodoro, George
Format Journal Article
LanguageEnglish
Published Elsevier Inc 01.06.2025
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Similarity search is utilized in specialized database systems designed to handle multimedia data, often represented by high-dimensional features. In this paper, we focus on speeding up the search process with GPUs. This problem has been previously approached by accelerating the Inverted File with Asymmetric Distance Computation algorithm on GPUs (IVFADC-GPU). However, the most recent algorithm for CPU, Inverted Multi-Index (IMI), was not considered for parallelization, being found too challenging for efficient GPU deployment. Thus, we propose a novel and efficient version of IMI for GPUs called IMI-GPU. We propose a new design of the multi-sequence algorithm of IMI, enabling efficient GPU execution. We compared IMI-GPU with IVFADC-GPU using a billion-scale dataset in which IMI-GPU achieved speedups of about 3.2× and 1.9× at Recall@1 and at Recall@16 respectively. The algorithms have been compared in a variety of scenarios and our novel IMI-GPU has shown to significantly outperform IVFADC on GPUs for the majority of tested cases. •Similarity search is compute intensive, being necessary to use approximate algorithms.•Approximate nearest neighbor search can be accelerated with GPUs.•State-of-the-art algorithm Inverted Multi-Index challenging for GPU implementation.•IMI can be efficiently implemented for GPU as a novel multi-sequence version.•IMI-GPU solution outperforms previous state-of-the-art GPU solution IVFADC.
ISSN:0743-7315
DOI:10.1016/j.jpdc.2025.105066