Speech Enhancement via Mel-Scale Wiener Filtering with a Frequency-wise Voice Activity Detector

This paper presents a speech enhancement system that enables a comfortable communication inside an automobile. A couple of novel concepts are proposed in an effort to improve two major building blocks in the existing speech enhancement systems: a voice activity detector (VAD) and a noise filtering a...

Full description

Saved in:
Bibliographic Details
Published inJournal of mechanical science and technology Vol. 21; no. 5; pp. 708 - 722
Main Authors Kim, Hwa-Soo, Cho, Young-Man, Kim, Han-Jun
Format Journal Article
LanguageKorean
Published 2007
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:This paper presents a speech enhancement system that enables a comfortable communication inside an automobile. A couple of novel concepts are proposed in an effort to improve two major building blocks in the existing speech enhancement systems: a voice activity detector (VAD) and a noise filtering algorithm. The proposed VAD classifies a given data frame as speech or noise at each frequency, enabling the frequency-wise updates of noise statistics and thereby improving the effectiveness of the noise filtering algorithms by providing more up-to-date noise statistics. The celebrated Wiener filter is adopted in this paper as the accompanying noise filtering algorithm, which results in significant noise suppression. Yet, the musical noise present in most Wiener filter-based systems prompts the idea of applying the Wiener filter in the Mel-scale in which the human auditory system responds to the external stimulation. It turns out that the Mel-scale Wiener filter creates some masking effects and thereby reduces musical noise significantly, leading to smooth transition between data frames.
Bibliography:KISTI1.1003/JNL.JAKO200727543088794
ISSN:1738-494X
1976-3824