Unsupervised learning of invariant features using video

We present an algorithm that learns invariant features from real data in an entirely unsupervised fashion. The principal benefit of our method is that it can be applied without human intervention to a particular application or data set, learning the specific invariances necessary for excellent featu...

Full description

Saved in:

Bibliographic Details
Published in	2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition pp. 1649 - 1656
Main Authors	Stavens, D, Thrun, S
Format	Conference Proceeding
Language	English
Published	IEEE 01.06.2010
Subjects	Artificial intelligence Computer science Image motion analysis Laboratories Machine learning algorithms Optimization methods Streaming media Surgery Testing Unsupervised learning
Online Access	Get full text
ISBN	1424469848 9781424469840
ISSN	1063-6919 1063-6919
DOI	10.1109/CVPR.2010.5539773

Cover

Loading…

More Information
Summary:	We present an algorithm that learns invariant features from real data in an entirely unsupervised fashion. The principal benefit of our method is that it can be applied without human intervention to a particular application or data set, learning the specific invariances necessary for excellent feature performance on that data. Our algorithm relies on the ability to track image patches over time using optical flow. With the wide availability of high frame rate video (eg: on the web, from a robot), good tracking is straightforward to achieve. The algorithm then optimizes feature parameters such that patches corresponding to the same physical location have feature descriptors that are as similar as possible while simultaneously maximizing the distinctness of descriptors for different locations. Thus, our method captures data or application specific invariances yet does not require any manual supervision. We apply our algorithm to learn domain-optimized versions of SIFT and HOG. SIFT and HOG features are excellent and widely used. However, they are general and by definition not tailored to a specific domain. Our domain-optimized versions offer a substantial performance increase for classification and correspondence tasks we consider. Furthermore, we show that the features our method learns are near the optimal that would be achieved by directly optimizing the test set performance of a classifier. Finally, we demonstrate that the learning often allows fewer features to be used for some tasks, which has the potential to dramatically improve computational concerns for very large data sets.
ISBN:	1424469848 9781424469840
ISSN:	1063-6919 1063-6919
DOI:	10.1109/CVPR.2010.5539773