Harnessing Uncertainty - Multi-label Dysfluency Classification with Uncertain Labels

Manually labelled datasets inherently contain errors or uncertain/imprecise labelling as sometimes experts cannot agree or are not sure. This issue is even more prominent in multi-label datasets as some labels may be missing. However, discarding samples with high uncertainty may lead to the loss of...

Full description

Saved in:
Bibliographic Details
Published inSpeech and Computer Vol. 13721; pp. 302 - 311
Main Authors Jouaiti, Melanie, Dautenhahn, Kerstin
Format Book Chapter
LanguageEnglish
Published Switzerland Springer International Publishing AG 2022
Springer International Publishing
SeriesLecture Notes in Computer Science
Subjects
Online AccessGet full text
ISBN3031209796
9783031209796
ISSN0302-9743
1611-3349
DOI10.1007/978-3-031-20980-2_26

Cover

Loading…
More Information
Summary:Manually labelled datasets inherently contain errors or uncertain/imprecise labelling as sometimes experts cannot agree or are not sure. This issue is even more prominent in multi-label datasets as some labels may be missing. However, discarding samples with high uncertainty may lead to the loss of valuable data. In this paper, we study two datasets where the uncertainty is explicit in the expert annotations. We give an overview of the different approaches available to deal with uncertainty and evaluate them on two dysfluency datasets. Our results show that adopting methods that embrace uncertainty leads to better results than using only labels with high certainty and performs better than current state of the art results.
ISBN:3031209796
9783031209796
ISSN:0302-9743
1611-3349
DOI:10.1007/978-3-031-20980-2_26