Subgroup Invariant Perturbation for Unbiased Pre-Trained Model Prediction

Modern deep learning systems have achieved unparalleled success and several applications have significantly benefited due to these technological advancements. However, these systems have also shown vulnerabilities with strong implications on the fairness and trustability of such systems. Among these...

Full description

Saved in:

Bibliographic Details
Published in	Frontiers in big data Vol. 3; p. 590296
Main Authors	Majumdar, Puspita, Chhabra, Saheb, Singh, Richa, Vatsa, Mayank
Format	Journal Article
Language	English
Published	Switzerland Frontiers Media S.A 18.02.2021
Subjects	bias estimation bias mitigation Big Data Fairness gender classification subgroup invariant perturbation trustability gender classification bias estimation subgroup invariant perturbation bias mitigation trustability Fairness race classification
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Modern deep learning systems have achieved unparalleled success and several applications have significantly benefited due to these technological advancements. However, these systems have also shown vulnerabilities with strong implications on the fairness and trustability of such systems. Among these vulnerabilities, bias has been an . Many applications such as face recognition and language translation have shown high levels of bias in the systems towards particular demographic sub-groups. Unbalanced representation of these sub-groups in the training data is one of the primary reasons of biased behavior. To address this important challenge, we propose a two-fold contribution: a bias estimation metric termed as to jointly measure the bias in model prediction and the overall model performance. Secondly, we propose a novel bias mitigation algorithm which is inspired from adversarial perturbation and uses the PSE metric. The mitigation algorithm learns a single uniform perturbation termed as which is added to the input dataset to generate a transformed dataset. The transformed dataset, when given as input to the pre-trained model reduces the bias in model prediction. Multiple experiments performed on four publicly available face datasets showcase the effectiveness of the proposed algorithm for race and gender prediction.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 Reviewed by: Chulin Xie, University of Illinois at Urbana-Champaign, United States Ren Wang, University of Michigan, United States Edited by: Pin-Yu Chen, IBM Research, United States This article was submitted to Machine Learning and Artificial Intelligence, a section of the journal Frontiers in Big Data
ISSN:	2624-909X 2624-909X
DOI:	10.3389/fdata.2020.590296