Explainable AI improves task performance in human–AI collaboration

Artificial intelligence (AI) provides considerable opportunities to assist human work. However, one crucial challenge of human–AI collaboration is that many AI algorithms operate in a black-box manner where the way how the AI makes predictions remains opaque. This makes it difficult for humans to va...

Full description

Saved in:

Bibliographic Details
Published in	Scientific reports Vol. 14; no. 1; pp. 31150 - 13
Main Authors	Senoner, Julian, Schallmoser, Simon, Kratzwald, Bernhard, Feuerriegel, Stefan, Netland, Torbjørn
Format	Journal Article
Language	English
Published	London Nature Publishing Group UK 28.12.2024 Nature Publishing Group Nature Portfolio
Subjects	639/166/988 639/705/117 692/1807/1812 Adult Algorithms Artificial Intelligence Black lung Collaboration Decision making Experiments Explainable AI Female Human-centered AI Humanities and Social Sciences Humans Human–AI collaboration Inspection Male Mental task performance multidisciplinary Predictions Science Science (multidisciplinary) Subject specialists Task performance Task Performance and Analysis Decision-making Task performance Explainable AI Human–AI collaboration Human-centered AI
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Artificial intelligence (AI) provides considerable opportunities to assist human work. However, one crucial challenge of human–AI collaboration is that many AI algorithms operate in a black-box manner where the way how the AI makes predictions remains opaque. This makes it difficult for humans to validate a prediction made by AI against their own domain knowledge. For this reason, we hypothesize that augmenting humans with explainable AI improves task performance in human–AI collaboration. To test this hypothesis, we implement explainable AI in the form of visual heatmaps in inspection tasks conducted by domain experts. Visual heatmaps have the advantage that they are easy to understand and help to localize relevant parts of an image. We then compare participants that were either supported by (a) black-box AI or (b) explainable AI, where the latter supports them to follow AI predictions when the AI is accurate or overrule the AI when the AI predictions are wrong. We conducted two preregistered experiments with representative, real-world visual inspection tasks from manufacturing and medicine. The first experiment was conducted with factory workers from an electronics factory, who performed assessments of whether electronic products have defects. The second experiment was conducted with radiologists, who performed assessments of chest X-ray images to identify lung lesions. The results of our experiments with domain experts performing real-world tasks show that task performance improves when participants are supported by explainable AI with heatmaps instead of black-box AI. We find that explainable AI as a decision aid improved the task performance by 7.7 percentage points (95% confidence interval [CI]: 3.3% to 12.0%, ) in the manufacturing experiment and by 4.7 percentage points (95% CI: 1.1% to 8.3%, ) in the medical experiment compared to black-box AI. These gains represent a significant improvement in task performance.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23
ISSN:	2045-2322 2045-2322
DOI:	10.1038/s41598-024-82501-9