Deep Speaker Embeddings Based Online Diarization
This paper describes our experiments with the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) model for development of an online diarization system. For this task several UIS-RNN models based on different speaker embeddings extractors were trained. These systems were evaluated in term...
Saved in:
Published in | Speech and Computer Vol. 13721; pp. 24 - 32 |
---|---|
Main Authors | , |
Format | Book Chapter |
Language | English |
Published |
Switzerland
Springer International Publishing AG
2022
Springer International Publishing |
Series | Lecture Notes in Computer Science |
Subjects | |
Online Access | Get full text |
ISBN | 3031209796 9783031209796 |
ISSN | 0302-9743 1611-3349 |
DOI | 10.1007/978-3-031-20980-2_3 |
Cover
Summary: | This paper describes our experiments with the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) model for development of an online diarization system. For this task several UIS-RNN models based on different speaker embeddings extractors were trained. These systems were evaluated in terms of Diarization Error Rate (DER) metric on public and private test datasets. Also systems were tested on real dialogue data recorded in a bank office. Proposed online models outperform standard offline Agglomerative Hierarchical Clustering (AHC) approach and are compatible with the state-of-the-art Bayesian HMM (VBx) offline method. |
---|---|
ISBN: | 3031209796 9783031209796 |
ISSN: | 0302-9743 1611-3349 |
DOI: | 10.1007/978-3-031-20980-2_3 |