Counterfactual explanation generation with minimal feature boundary

Bibliographic Details
Published in	Information sciences Vol. 625; pp. 342 - 366
Main Authors	You, Dianlong, Niu, Shina, Dong, Siqi, Yan, Huigui, Chen, Zhen, Wu, Di, Shen, Limin, Wu, Xindong
Format	Journal Article
Language	English
Published	Elsevier Inc 01.05.2023
Subjects	Causal discovery; Counterfactual explanation; Feature boundary; Interpretability
Summary:	The complex and opaque decision-making process of machine learning models limits the interpretability of their predictions and prevents them from inferring results that lie outside their learning experience. The causality between features and the target variable can be traced by injecting counterfactual explanations into the prediction model and generating counterfactual instances whose adjusted features reverse the prediction result. Existing algorithms, such as Diverse Counterfactual Explanations (DiCE) and Counterfactual Explanations Guided by Prototypes (Proto), generate multiple counterfactuals or a single counterfactual for a data point by global optimization over the range of all features to reach a local decision range. However, these methods cannot clearly identify which features are the key causes. Moreover, the Random Forest Optimal Counterfactual Set Extractor (RF-OCSE) extracts counterfactual sets from a random forest and needs to manipulate all the internal nodes of the trees, which restricts it to tree ensemble models. To address these shortcomings, we proposed a Counterfactual Explanation Generation method with the Minimal Feature Boundary (MFB), named CEGMFB. The proposed CEGMFB algorithm consists of two stages: 1) mining the MFB, the feature boundary that can reverse the prediction result, in order to restrict the generation range of counterfactual instances, and 2) constructing a counterfactual generative method that generates counterfactual instances within the MFB at the minimum reversing cost. To evaluate its performance, we compared the proposed CEGMFB algorithm with six baseline algorithms on 16 datasets and conducted a case study in a real scenario. The results indicate that the proposed CEGMFB algorithm outperforms the compared algorithms.
ISSN:	0020-0255 1872-6291
DOI:	10.1016/j.ins.2023.01.012
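
Illustration:	The two-stage idea in the summary (first restrict the search to a small set of features that can reverse the prediction, then look for the cheapest reversal inside that set) can be sketched roughly as below. This is a toy illustration only and not the CEGMFB algorithm from the article: the linear model, the brute-force subset search, the value grid, the L1 cost, and every function name are assumptions made up for the example.

```python
from itertools import combinations, product


def predict(x):
    # Hypothetical stand-in model: a fixed linear rule, not the article's classifier.
    weights = (2.0, -1.0, 0.5)
    return int(sum(w * v for w, v in zip(weights, x)) - 1.0 > 0)


def candidates(x, subset, grid):
    # Yield copies of x in which only the features in `subset` take grid values.
    for values in product(grid, repeat=len(subset)):
        z = list(x)
        for i, v in zip(subset, values):
            z[i] = v
        yield tuple(z)


def minimal_feature_subset(x, grid):
    # Stage 1 (toy): smallest feature subset whose changes can flip the prediction.
    original = predict(x)
    for size in range(1, len(x) + 1):
        for subset in combinations(range(len(x)), size):
            if any(predict(z) != original for z in candidates(x, subset, grid)):
                return subset
    return None  # no flip is reachable on this grid


def cheapest_counterfactual(x, subset, grid):
    # Stage 2 (toy): lowest L1-cost flip that changes only the chosen features.
    original = predict(x)
    flips = [z for z in candidates(x, subset, grid) if predict(z) != original]
    return min(flips, key=lambda z: sum(abs(a - b) for a, b in zip(x, z)), default=None)


grid = [i / 4 for i in range(9)]  # assumed allowed values: 0.0 to 2.0 in steps of 0.25
x = (0.0, 0.5, 1.0)               # made-up query instance
subset = minimal_feature_subset(x, grid)
cf = cheapest_counterfactual(x, subset, grid)
print(subset, cf)
```

The brute-force search over subsets is exponential in the number of features and is used here only to keep the sketch short; the article's own boundary-mining and generation procedures are described in the full text.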