Explicit Bias Discovery in Visual Question Answering Models

Bibliographic Details
Published in: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 9554-9563
Main Authors: Manjunatha, Varun; Saini, Nirat; Davis, Larry S.
Format: Conference Proceeding
Language: English
Published: IEEE, 01.06.2019

Summary: Researchers have observed that Visual Question Answering (VQA) models tend to answer questions by learning statistical biases in the data. For example, their answer to the question "What is the color of the grass?" is usually "Green", whereas a question like "What is the title of the book?" cannot be answered by inferring statistical biases. It is of interest to the community to explicitly discover such biases, both for understanding the behavior of such models and for debugging them. Our work addresses this problem. In a database, we store the words of the question, the answer, and the visual words corresponding to regions of interest in attention maps. By running simple rule mining algorithms on this database, we discover human-interpretable rules which give us unique insight into the behavior of such models. Our results also show examples of unusual behaviors learned by models in attempting VQA tasks.
ISSN: 2575-7075
DOI: 10.1109/CVPR.2019.00979
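
The summary describes treating each VQA example as a record of question words, visual words from attended regions, and the model's answer, then mining frequent co-occurrence rules from those records. The snippet below is a minimal pure-Python sketch of that general idea, not the authors' implementation: the transactions, the "vis:"/"ans:" prefixes, and the support/confidence thresholds are all illustrative assumptions.

```python
# Sketch of association-rule mining over VQA "transactions" (illustrative only;
# not the method from the paper). Each transaction holds question words,
# visual words (prefixed "vis:"), and the model's answer (prefixed "ans:").
from itertools import combinations
from collections import Counter

# Hypothetical transactions mirroring the grass/book examples in the abstract.
transactions = [
    {"what", "color", "grass", "vis:field", "ans:green"},
    {"what", "color", "grass", "vis:lawn", "ans:green"},
    {"what", "color", "sky", "vis:clouds", "ans:blue"},
    {"what", "title", "book", "vis:text", "ans:unknown"},
]

def mine_rules(transactions, min_support=0.5, min_confidence=0.8, max_len=3):
    """Return (antecedent -> answer) rules with their support and confidence."""
    n = len(transactions)
    counts = Counter()
    for t in transactions:
        non_answers = sorted(w for w in t if not w.startswith("ans:"))
        answers = [w for w in t if w.startswith("ans:")]
        for size in range(1, max_len + 1):
            for antecedent in combinations(non_answers, size):
                counts[antecedent] += 1                    # antecedent alone
                for ans in answers:
                    counts[antecedent + (ans,)] += 1       # antecedent + answer
    rules = []
    for itemset, count in counts.items():
        # Itemsets ending in an answer token encode a candidate rule.
        if itemset[-1].startswith("ans:") and count / n >= min_support:
            antecedent = itemset[:-1]
            confidence = count / counts[antecedent]
            if confidence >= min_confidence:
                rules.append((antecedent, itemset[-1], count / n, confidence))
    return rules

# Prints human-readable rules such as {'color', 'grass'} -> ans:green.
for antecedent, answer, support, confidence in mine_rules(transactions):
    print(f"{set(antecedent)} -> {answer}  (support={support:.2f}, conf={confidence:.2f})")
```

With these toy transactions, the miner surfaces rules like {grass} -> ans:green with confidence 1.0, which is the kind of human-interpretable statistical bias the paper sets out to expose; a real analysis would use the full VQA dataset and a standard frequent-itemset algorithm such as Apriori or FP-Growth.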