An Experimental Study on Speech Enhancement Based on Deep Neural Networks

This letter presents a regression-based speech enhancement framework using deep neural networks (DNNs) with a multiple-layer deep architecture. In the DNN learning process, a large training set ensures a powerful modeling capability to estimate the complicated nonlinear mapping from observed noisy s...

Full description

Saved in:

Bibliographic Details
Published in	IEEE signal processing letters Vol. 21; no. 1; pp. 65 - 68
Main Authors	Xu, Yong, Du, Jun, Dai, Li-Rong, Lee, Chin-Hui
Format	Journal Article
Language	English
Published	IEEE 01.01.2014
Subjects	Data models Deep neural networks Neural networks Noise Noise measurement noise reduction regression model Speech Speech enhancement Training
Online Access	Get full text

Cover

Loading…

More Information
Summary:	This letter presents a regression-based speech enhancement framework using deep neural networks (DNNs) with a multiple-layer deep architecture. In the DNN learning process, a large training set ensures a powerful modeling capability to estimate the complicated nonlinear mapping from observed noisy speech to desired clean signals. Acoustic context was found to improve the continuity of speech to be separated from the background noises successfully without the annoying musical artifact commonly observed in conventional speech enhancement algorithms. A series of pilot experiments were conducted under multi-condition training with more than 100 hours of simulated speech data, resulting in a good generalization capability even in mismatched testing conditions. When compared with the logarithmic minimum mean square error approach, the proposed DNN-based algorithm tends to achieve significant improvements in terms of various objective quality measures. Furthermore, in a subjective preference evaluation with 10 listeners, 76.35% of the subjects were found to prefer DNN-based enhanced speech to that obtained with other conventional technique.
ISSN:	1070-9908 1558-2361
DOI:	10.1109/LSP.2013.2291240