Counterfactual explanations as interventions in latent space

Bibliographic Details
Published in	Data mining and knowledge discovery Vol. 38; no. 5; pp. 2733 - 2769
Main Authors	Crupi, Riccardo; Castelnovo, Alessandro; Regoli, Daniele; San Miguel Gonzalez, Beatriz
Format	Journal Article
Language	English
Published	New York: Springer US, 01.09.2024; Springer Nature B.V.
Subjects	Algorithms; Artificial Intelligence; Chemistry and Earth Sciences; Computer Science; Data Mining and Knowledge Discovery; Datasets; End users; Explainable artificial intelligence; Feasibility; Information Storage and Retrieval; Physics; Special Issue on Explainable and Interpretable Machine Learning and Data Mining; Statistics for Engineering; Counterfactual explanations; Explainable AI; Causality; Artificial intelligence; Machine learning; Algorithmic recourse

More Information
Summary:	Explainable Artificial Intelligence (XAI) is a set of techniques for understanding both technical and non-technical aspects of Artificial Intelligence (AI) systems. XAI is crucial to satisfying the increasingly important demand for trustworthy Artificial Intelligence, characterized by fundamental aspects such as respect for human autonomy, prevention of harm, transparency, accountability, etc. Within XAI techniques, counterfactual explanations aim to provide end users with a set of features (and their corresponding values) that need to be changed in order to achieve a desired outcome. Current approaches rarely take into account the feasibility of the actions needed to achieve the proposed explanations and, in particular, they fall short of considering the causal impact of such actions. In this paper, we present Counterfactual Explanations as Interventions in Latent Space (CEILS), a methodology that generates counterfactual explanations capturing by design the underlying causal relations from the data, and at the same time provides feasible recommendations to reach the proposed profile. Moreover, our methodology has the advantage that it can be set on top of existing counterfactual generator algorithms, thus minimising the complexity of imposing additional causal constraints. We demonstrate the effectiveness of our approach with a set of different experiments using synthetic and real datasets (including a proprietary dataset from the financial domain).
ISSN:	1384-5810 1573-756X
DOI:	10.1007/s10618-022-00889-2
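
Illustrative sketch (not part of the catalogued record): the Summary describes generating counterfactuals in a latent space that encodes causal relations, using an existing counterfactual generator underneath. The toy example below shows one way that idea can be read. Everything in it is an assumption made for illustration: the two-variable causal model, the coefficient `A`, the linear classifier `W`, `B`, and all function names are hypothetical, and the closed-form linear generator is only a stand-in for whatever generator would be used in practice. It is not the authors' implementation.

```python
# Toy structural causal model (assumed for illustration):
#   x1 = u1
#   x2 = A * x1 + u2
# u1, u2 are latent (exogenous) variables; x1, x2 are observed features.
A = 0.5


def to_features(u):
    """Map latent variables u to observed features x through the causal model."""
    u1, u2 = u
    x1 = u1
    x2 = A * x1 + u2
    return (x1, x2)


def to_latent(x):
    """Invert the causal model: recover the latent variables behind x."""
    x1, x2 = x
    return (x1, x2 - A * x1)


# Hypothetical linear classifier: favourable outcome when score(x) >= 0.
W = (1.0, 2.0)
B = -4.0


def score(x):
    return W[0] * x[0] + W[1] * x[1] + B


def closest_linear_counterfactual(weights, bias, point, margin=1e-6):
    """Stand-in generator: smallest L2 change that crosses a linear boundary."""
    s = sum(w * p for w, p in zip(weights, point)) + bias
    if s >= 0:
        return tuple(point)  # already on the favourable side
    norm_sq = sum(w * w for w in weights)
    step = (margin - s) / norm_sq
    return tuple(p + step * w for w, p in zip(weights, point))


def latent_counterfactual(x):
    """Run the generator on the latent variables, then map back to features."""
    u = to_latent(x)
    # Classifier composed with the causal model, written in terms of u:
    #   score = W[0]*u1 + W[1]*(A*u1 + u2) + B
    latent_weights = (W[0] + W[1] * A, W[1])
    u_cf = closest_linear_counterfactual(latent_weights, B, u)
    action = tuple(c - o for c, o in zip(u_cf, u))  # change applied to u
    return action, to_features(u_cf)


x = (1.0, 1.0)
action, x_cf = latent_counterfactual(x)
print("original score:", score(x))
print("latent action:", action)
print("counterfactual features:", x_cf)
print("counterfactual score:", score(x_cf))
```

In this sketch the action is expressed on the latent variables, so a change to `u1` also moves `x2` through the assumed causal link. A generator that works directly on the features would treat `x1` and `x2` as independently adjustable.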