Addressing Feature Suppression in Unsupervised Visual Representations

Contrastive learning is one of the fastest growing research areas in machine learning due to its ability to learn useful representations without labeled data. However, contrastive learning is susceptible to feature suppression - i.e., it may discard important information relevant to the task of inte...

Full description

Saved in:
Bibliographic Details
Published in2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) pp. 1411 - 1420
Main Authors Li, Tianhong, Fan, Lijie, Yuan, Yuan, He, Hao, Tian, Yonglong, Feris, Rogerio, Indyk, Piotr, Katabi, Dina
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.01.2023
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Contrastive learning is one of the fastest growing research areas in machine learning due to its ability to learn useful representations without labeled data. However, contrastive learning is susceptible to feature suppression - i.e., it may discard important information relevant to the task of interest, and learn irrelevant features. Past work has addressed this limitation via handcrafted data augmentations that eliminate irrelevant information. This approach however does not work across all datasets and tasks. Further, data augmentations fail in addressing feature suppression in multi-attribute classification when one attribute can suppress features relevant to other attributes. In this paper, we analyze the objective function of contrastive learning and formally prove that it is vulnerable to feature suppression. We then present Predictive Contrastive Learning (PrCL), a framework for learning unsupervised representations that are robust to feature suppression. The key idea is to force the learned representation to predict the input, and hence prevent it from discarding important information. Extensive experiments verify that PrCL is robust to feature suppression and outperforms state-of-the-art contrastive learning methods on a variety of datasets and tasks.
ISSN:2642-9381
DOI:10.1109/WACV56688.2023.00146