Noisy speech enhancement using discrete cosine transform

This paper illustrates the advantages of using the Discrete Cosine Transform (DCT) as compared to the standard Discrete Fourier Transform (DFT) for the purpose of removing noise embedded in a speech signal. The derivation of the Minimum Mean Square Error (MMSE) filter based on the statistical modell...

Full description

Saved in:

Bibliographic Details
Published in	Speech communication Vol. 24; no. 3; pp. 249 - 257
Main Authors	Soon, Ing Yann, Koh, Soo Ngee, Yeo, Chai Kiat
Format	Journal Article
Language	English
Published	Amsterdam Elsevier B.V 01.06.1998 Elsevier
Subjects	Applied sciences Discrete cosine transform (DCT) Exact sciences and technology Information, signal and communications theory MMSE amplitude estimation Noise removal Signal processing Speech enhancement Speech processing Telecommunications and information theory Speech enhancement Noise removal MMSE amplitude estimation Discrete cosine transform (DCT) Spectral data Speech analysis Filtering Gaussian noise Noise reduction Speech recognition White noise Speech processing Cosine transform Mean square error
Online Access	Get full text

Cover

Loading…

More Information
Summary:	This paper illustrates the advantages of using the Discrete Cosine Transform (DCT) as compared to the standard Discrete Fourier Transform (DFT) for the purpose of removing noise embedded in a speech signal. The derivation of the Minimum Mean Square Error (MMSE) filter based on the statistical modelling of the DCT coefficients is shown. Also shown is the derivation of an over-attenuation factor based on the fact that speech energy is not always present in the noisy signal at all times or in all coefficients. This over-attenuation factor is useful in suppressing any musical residual noise which may be present. The proposed methods are evaluated against the noise reduction filter proposed by Y. Ephraim and D. Malah (1984), using both Gaussian distributed white noise as well as recorded fan noise, with favourable results. Cet article illustre les avantages apportés par l'utilisation de la Transformation Cosinus Discrète (DCT) par rapport à celle de la Transformée de Fourier Discrète (DFT) standard, pour le débruitage de la parole bruitée. On montre comment dériver un filtre MMSE à partir de la modélisation statistique des coefficients DCT. On montre également comment dériver un facteur de sur-atténuation basé sur le fait que, dans les signaux bruités, l'énergie de la parole n'est pas toujours présente à chaque instant ni dans chaque coefficient. Ce facteur de sur-atténuation est utile pour supprimer tout bruit résiduel musical. Les méthods proposées ont été évaluées favorablement par rapport du filtre de réduction de bruit proposé par Ephraim et Malah (1994), en utilisant tant du bruit blanc guassien que du bruit de ventilateur enregistré
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23
ISSN:	0167-6393 1872-7182
DOI:	10.1016/S0167-6393(98)00019-3