Delving into Identify-Emphasize Paradigm for Combating Unknown Bias

Dataset biases are notoriously detrimental to model robustness and generalization. The identify-emphasize paradigm appears to be effective in dealing with unknown biases. However, we discover that it is still plagued by two challenges: A, the quality of the identified bias-conflicting samples is far...

Full description

Saved in:

Bibliographic Details
Published in	International journal of computer vision Vol. 132; no. 6; pp. 2310 - 2330
Main Authors	Zhao, Bowen, Chen, Chen, Wang, Qian-Wei, He, Anfeng, Xia, Shu-Tao
Format	Journal Article
Language	English
Published	New York Springer US 01.06.2024 Springer Nature B.V
Subjects	Accuracy Artificial Intelligence Bias Computer Imaging Computer Science Datasets Gradients Image Processing and Computer Vision Leverage Optimization Paradigms Pattern Recognition Pattern Recognition and Graphics Reliability Robustness Samples Special Issue on Robust Vision Vision Gradient alignment Bias-conflicting scoring Identify-emphasize Self-supervision Unknown bias
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Dataset biases are notoriously detrimental to model robustness and generalization. The identify-emphasize paradigm appears to be effective in dealing with unknown biases. However, we discover that it is still plagued by two challenges: A, the quality of the identified bias-conflicting samples is far from satisfactory; B, the emphasizing strategies only produce suboptimal performance. In this paper, for challenge A, we propose an effective bias-conflicting scoring method (ECS) to boost the identification accuracy, along with two practical strategies — peer-picking and epoch-ensemble. For challenge B, we point out that the gradient contribution statistics can be a reliable indicator to inspect whether the optimization is dominated by bias-aligned samples. Then, we propose gradient alignment (GA), which employs gradient statistics to balance the contributions of the mined bias-aligned and bias-conflicting samples dynamically throughout the learning process, forcing models to leverage intrinsic features to make fair decisions. Furthermore, we incorporate self-supervised (SS) pretext tasks into training, which enable models to exploit richer features rather than the simple shortcuts, resulting in more robust models. Experiments are conducted on multiple datasets in various settings, demonstrating that the proposed solution can mitigate the impact of unknown biases and achieve state-of-the-art performance.
ISSN:	0920-5691 1573-1405
DOI:	10.1007/s11263-023-01969-6