Towards more sustainable and trustworthy reporting in machine learning

With machine learning (ML) becoming a popular tool across all domains, practitioners are in dire need of comprehensive reporting on the state-of-the-art. Benchmarks and open databases provide helpful insights for many tasks, however suffer from several phenomena: Firstly, they overly focus on predic...

Full description

Saved in:

Bibliographic Details
Published in	Data mining and knowledge discovery Vol. 38; no. 4; pp. 1909 - 1928
Main Authors	Fischer, Raphael, Liebig, Thomas, Morik, Katharina
Format	Journal Article
Language	English
Published	New York Springer US 01.07.2024 Springer Nature B.V
Subjects	Artificial Intelligence Chemistry and Earth Sciences Computer Science Data Mining and Knowledge Discovery Information Storage and Retrieval Machine learning Physics Statistics for Engineering Sustainability Trustworthiness Resource-awareness Benchmarking Trustworthiness Reporting Sustainability
Online Access	Get full text

Cover

Loading…

More Information
Summary:	With machine learning (ML) becoming a popular tool across all domains, practitioners are in dire need of comprehensive reporting on the state-of-the-art. Benchmarks and open databases provide helpful insights for many tasks, however suffer from several phenomena: Firstly, they overly focus on prediction quality, which is problematic considering the demand for more sustainability in ML. Depending on the use case at hand, interested users might also face tight resource constraints and thus should be allowed to interact with reporting frameworks, in order to prioritize certain reported characteristics. Furthermore, as some practitioners might not yet be well-skilled in ML, it is important to convey information on a more abstract, comprehensible level. Usability and extendability are key for moving with the state-of-the-art and in order to be trustworthy, frameworks should explicitly address reproducibility. In this work, we analyze established reporting systems under consideration of the aforementioned issues. Afterwards, we propose STREP, our novel framework that aims at overcoming these shortcomings and paves the way towards more sustainable and trustworthy reporting. We use STREP’s (publicly available) implementation to investigate various existing report databases. Our experimental results unveil the need for making reporting more resource-aware and demonstrate our framework’s capabilities of overcoming current reporting limitations. With our work, we want to initiate a paradigm shift in reporting and help with making ML advances more considerate of sustainability and trustworthiness.
ISSN:	1384-5810 1573-756X
DOI:	10.1007/s10618-024-01020-3