Harnessing Uncertainty - Multi-label Dysfluency Classification with Uncertain Labels

Manually labelled datasets inherently contain errors or uncertain/imprecise labelling as sometimes experts cannot agree or are not sure. This issue is even more prominent in multi-label datasets as some labels may be missing. However, discarding samples with high uncertainty may lead to the loss of...

Full description

Saved in:

Bibliographic Details
Published in	Speech and Computer Vol. 13721; pp. 302 - 311
Main Authors	Jouaiti, Melanie, Dautenhahn, Kerstin
Format	Book Chapter
Language	English
Published	Switzerland Springer International Publishing AG 2022 Springer International Publishing
Series	Lecture Notes in Computer Science
Subjects	Dysfluency classification Transfer learning Uncertainty
Online Access	Get full text
ISBN	3031209796 9783031209796
ISSN	0302-9743 1611-3349
DOI	10.1007/978-3-031-20980-2_26

Cover

Loading…

More Information
Summary:	Manually labelled datasets inherently contain errors or uncertain/imprecise labelling as sometimes experts cannot agree or are not sure. This issue is even more prominent in multi-label datasets as some labels may be missing. However, discarding samples with high uncertainty may lead to the loss of valuable data. In this paper, we study two datasets where the uncertainty is explicit in the expert annotations. We give an overview of the different approaches available to deal with uncertainty and evaluate them on two dysfluency datasets. Our results show that adopting methods that embrace uncertainty leads to better results than using only labels with high certainty and performs better than current state of the art results.
ISBN:	3031209796 9783031209796
ISSN:	0302-9743 1611-3349
DOI:	10.1007/978-3-031-20980-2_26