Automatic speech recognition in presence of music noise on multichannel far-field recordings

Subject of Research. The paper considers a method of music noise reduction in a multichannel speech signal based on noise mask estimation. The method is applied for automatic speech recognition in presence of music noise. Method. The study is performed using an acoustic model implemented in artiﬁcia...

Full description

Saved in:

Bibliographic Details
Published in	Nauchno-tekhnicheskiĭ vestnik informat͡s︡ionnykh tekhnologiĭ, mekhaniki i optiki Vol. 19; no. 3; pp. 557 - 559
Main Authors	Astapov, S.S., Shuranov, E.V., Lavrentyev, A.V., Kabarov, V.I.
Format	Journal Article
Language	English
Published	Saint Petersburg National Research University of Information Technologies, Mechanics and Optics (ITMO University) 03.06.2019
Subjects	acoustic model automatic speech recognition microphone array music noise reduction MVDR noise mask estimation
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Subject of Research. The paper considers a method of music noise reduction in a multichannel speech signal based on noise mask estimation. The method is applied for automatic speech recognition in presence of music noise. Method. The study is performed using an acoustic model implemented in artiﬁcial neural networks and real life recordings performed in reverberant conditions. Main Results. It is shown that the acoustic model is capable of estimating the noise mask on a multichannel mixture for different music genres. The application of such mask to covariance matrix estimation for MVDR (Minimum Variance Distortionless Response) beamforming algorithm results in increasing the recognition accuracy by at least 4.9 % at signal-noise ratio levels of 10–30 dB. Practical Relevance. The method of MVDR coefﬁcient estimation based on noise mask estimation by an acoustic model serves to suppress non-stationary noise, such as music, thus increasing the robustness of automatic speech recognition systems.
ISSN:	2226-1494 2500-0373
DOI:	10.17586/2226-1494-2019-19-3-557-559