Harnessing Uncertainty - Multi-label Dysfluency Classification with Uncertain Labels
Manually labelled datasets inherently contain errors or uncertain/imprecise labelling as sometimes experts cannot agree or are not sure. This issue is even more prominent in multi-label datasets as some labels may be missing. However, discarding samples with high uncertainty may lead to the loss of...
Saved in:
Published in | Speech and Computer Vol. 13721; pp. 302 - 311 |
---|---|
Main Authors | , |
Format | Book Chapter |
Language | English |
Published |
Switzerland
Springer International Publishing AG
2022
Springer International Publishing |
Series | Lecture Notes in Computer Science |
Subjects | |
Online Access | Get full text |
ISBN | 3031209796 9783031209796 |
ISSN | 0302-9743 1611-3349 |
DOI | 10.1007/978-3-031-20980-2_26 |
Cover
Loading…
Summary: | Manually labelled datasets inherently contain errors or uncertain/imprecise labelling as sometimes experts cannot agree or are not sure. This issue is even more prominent in multi-label datasets as some labels may be missing. However, discarding samples with high uncertainty may lead to the loss of valuable data.
In this paper, we study two datasets where the uncertainty is explicit in the expert annotations. We give an overview of the different approaches available to deal with uncertainty and evaluate them on two dysfluency datasets. Our results show that adopting methods that embrace uncertainty leads to better results than using only labels with high certainty and performs better than current state of the art results. |
---|---|
ISBN: | 3031209796 9783031209796 |
ISSN: | 0302-9743 1611-3349 |
DOI: | 10.1007/978-3-031-20980-2_26 |