Parallel incremental power mean SVM for the classification of large-scale image datasets

The amount of image data becomes larger and larger, both image size (due the higher resolution) and image number. It is estimated for personal use only, an average single user will take 100,000 images during his life. The growth of image data is illustrated by the dataset size, for example ImageNet...

Full description

Saved in:
Bibliographic Details
Published inInternational journal of multimedia information retrieval Vol. 3; no. 2; pp. 89 - 96
Main Authors Doan, Thanh-Nghi, Do, Thanh-Nghi, Poulet, François
Format Journal Article
LanguageEnglish
Published London Springer London 01.06.2014
Springer Nature B.V
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:The amount of image data becomes larger and larger, both image size (due the higher resolution) and image number. It is estimated for personal use only, an average single user will take 100,000 images during his life. The growth of image data is illustrated by the dataset size, for example ImageNet benchmark dataset is made of more than 14 million images and more than 21,000 classes. This is very challenging for classification algorithms. They have to deal with time and space complexity and very imbalanced data when using SVM algorithms. We present extensions of Power Mean SVM to deal with such data. The first one is an incremental version to deal with the space complexity, the second one is a parallel version of the incremental version to deal with time complexity and the last one is the use of a balanced bagging algorithm for training binary classifiers to deal with imbalanced data. We evaluate our parallel incremental version of balanced bagging PmSVM on the 1,000 classes of ImageNet (ILSVRC 2010). The results show that our algorithm can be run on standard PC (with eg. 2 or 4 GB RAM); it is 255 times faster than the original version and 1,276 times faster than state-of-the-art linear classifier, LIBLINEAR with 80 cores.
ISSN:2192-6611
2192-662X
DOI:10.1007/s13735-014-0053-0