Dual-Channel Speech Enhancement Using Neural Network Adaptive Beamforming

Dual-channel speech enhancement based on traditional beamforming is difficult to effectively suppress noise. In recent years, it is promising to replace beamforming with a neural network that learns spectral characteristic. This paper proposes a neural network adaptive beamforming end-to-end dual-ch...

Full description

Saved in:
Bibliographic Details
Published inCommunications and Networking pp. 497 - 506
Main Authors Jiang, Tao, Liu, Hongqing, Shuai, Chenhao, Wang, Mingtian, Zhou, Yi, Gan, Lu
Format Book Chapter
LanguageEnglish
Published Cham Springer International Publishing
SeriesLecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Dual-channel speech enhancement based on traditional beamforming is difficult to effectively suppress noise. In recent years, it is promising to replace beamforming with a neural network that learns spectral characteristic. This paper proposes a neural network adaptive beamforming end-to-end dual-channel model for speech enhancement task. First, the LSTM layer is used to directly process the original speech waveform to estimate the time-domain beamforming filter coefficients of each channel and convolve and sum it with the input speech. Second, we modified a fully-convolutional time-domain audio separation network (Conv-TasNet) into a network suitable for speech enhancement which is called Denoising-TasNet to further enhance the output of the beamforming. The experimental results show that the proposed method is better than convolutional recurrent network (CRN) model and several popular noise reduction methods.
Bibliography:Supplementary InformationThe online version contains supplementary material available at https://doi.org/10.1007/978-3-030-99200-2_37.
ISBN:3030991997
9783030991999
ISSN:1867-8211
1867-822X
DOI:10.1007/978-3-030-99200-2_37