Non-intrusive Speech Intelligibility Prediction of Speech with Additive Noise and Reverberation Using Multiple Deep Learning-Based Speech Enhancement
In the field of non-intrusive speech intelligibility estimation, we have proposed a model that uses speech enhancement as an auxiliary signal for prediction. In this study, we performed intelligibility estimation using multiple speech enhancers, not just a single one, with the expectation that combi...
Saved in:
Published in | 2023 IEEE 12th Global Conference on Consumer Electronics (GCCE) pp. 918 - 920 |
---|---|
Main Authors | , |
Format | Conference Proceeding |
Language | English |
Published |
IEEE
10.10.2023
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | In the field of non-intrusive speech intelligibility estimation, we have proposed a model that uses speech enhancement as an auxiliary signal for prediction. In this study, we performed intelligibility estimation using multiple speech enhancers, not just a single one, with the expectation that combining multiple enhancers would allow for error interpolation in the presence of various degradation factors such as additive noise and reverberation. As a result, using multiple speech enhancers improved the estimation accuracy compared to using a single model, and we were able to achieve a maximum correlation coefficient of 0.674 for estimation. |
---|---|
ISSN: | 2693-0854 |
DOI: | 10.1109/GCCE59613.2023.10315264 |