CA-FER: Mitigating Spurious Correlation With Counterfactual Attention in Facial Expression Recognition

Although facial expression recognition based on deep learning has become a major trend, existing methods have been found to prefer learning spurious statistical correlations and non-robust features during training. This degenerates the model's generalizability in practical situations. One of th...

Full description

Saved in:
Bibliographic Details
Published inIEEE transactions on affective computing Vol. 15; no. 3; pp. 977 - 989
Main Authors Huang, Pin-Jui, Xie, Hongxia, Huang, Hung-Cheng, Shuai, Hong-Han, Cheng, Wen-Huang
Format Journal Article
LanguageEnglish
Published Piscataway IEEE 01.07.2024
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Although facial expression recognition based on deep learning has become a major trend, existing methods have been found to prefer learning spurious statistical correlations and non-robust features during training. This degenerates the model's generalizability in practical situations. One of the research fields mitigating such misperception of correlations as causality is causal reasoning. In this article, we propose a learnable counterfactual attention mechanism, CA-FER, that uses causal reasoning to simultaneously optimize feature discrimination and diversity to mitigate spurious correlations in expression datasets. To the best of our knowledge, this is the first work to study the spurious correlations in facial expression recognition from a counterfactual attention perspective. Extensive experiments on a synthetic dataset and four public datasets demonstrate that our method outperforms previous methods, which shows the effectiveness and generalizability of our learnable counterfactual attention mechanism for the expression recognition task.
ISSN:1949-3045
1949-3045
DOI:10.1109/TAFFC.2023.3312768