How the different explanation classes impact trust calibration: The case of clinical decision support systems

•This paper evalutes four explanation classes during Human-AI decision-making task.•Two explanation classes were perceived more understandable.•Providing explanations, in general, can introduce over-reliance of AI recommendations.•Explainable AI (XAI) designers need to equip their XAI interfaces wit...

Full description

Saved in:
Bibliographic Details
Published inInternational journal of human-computer studies Vol. 169; p. 102941
Main Authors Naiseh, Mohammad, Al-Thani, Dena, Jiang, Nan, Ali, Raian
Format Journal Article
LanguageEnglish
Published Elsevier Ltd 01.01.2023
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:•This paper evalutes four explanation classes during Human-AI decision-making task.•Two explanation classes were perceived more understandable.•Providing explanations, in general, can introduce over-reliance of AI recommendations.•Explainable AI (XAI) designers need to equip their XAI interfaces with additional requirements to help successful trust calibration. Machine learning has made rapid advances in safety-critical applications, such as traffic control, finance, and healthcare. With the criticality of decisions they support and the potential consequences of following their recommendations, it also became critical to provide users with explanations to interpret machine learning models in general, and black-box models in particular. However, despite the agreement on explainability as a necessity, there is little evidence on how recent advances in eXplainable Artificial Intelligence literature (XAI) can be applied in collaborative decision-making tasks, i.e., human decision-maker and an AI system working together, to contribute to the process of trust calibration effectively. This research conducts an empirical study to evaluate four XAI classes for their impact on trust calibration. We take clinical decision support systems as a case study and adopt a within-subject design followed by semi-structured interviews. We gave participants clinical scenarios and XAI interfaces as a basis for decision-making and rating tasks. Our study involved 41 medical practitioners who use clinical decision support systems frequently. We found that users perceive the contribution of explanations to trust calibration differently according to the XAI class and to whether XAI interface design fits their job constraints and scope. We revealed additional requirements on how explanations shall be instantiated and designed to help a better trust calibration. Finally, we build on our findings and present guidelines for designing XAI interfaces.
ISSN:1071-5819
1095-9300
DOI:10.1016/j.ijhcs.2022.102941