Phase Constraint and Deep Neural Network for Speech Separation

Bibliographic Details
Published in Advances in Neural Networks - ISNN 2017, Vol. 10262, pp. 266-273
Main Authors Miao, Zhuangguo; Ma, Xiaohong; Ding, Shuxue
Format Book Chapter
Language English; Japanese
Published Switzerland Springer International Publishing AG 2017
Springer International Publishing
Series Lecture Notes in Computer Science
ISBN 9783319590806; 3319590804
ISSN 0302-9743; 1611-3349
DOI 10.1007/978-3-319-59081-3_32

Summary: The phase response of speech plays an important role in speech separation. In this paper, we apply a complex mask to speech separation, which enhances both the magnitude and the phase of the speech. Specifically, we use a deep neural network to estimate the complex masks of two sources. Considering the importance of the phase, we also explore a phase constraint objective function, which ensures that the phase of the sum of the estimated sources is close to the phase of the mixture. We demonstrate the efficiency of the method on the TIMIT speech corpus for single-channel speech separation.
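
The two ideas named in the summary can be illustrated with a minimal sketch. This record does not give the chapter's actual formulas, so everything below is an assumption: the function names `apply_complex_mask` and `phase_discrepancy` are hypothetical, the penalty `1 - cos(phase difference)` is one plausible way to measure how far the phase of the summed estimates is from the mixture phase, and the toy bin values are made up. The masks here are fixed numbers standing in for the output of a network.

```python
import cmath
import math


def apply_complex_mask(mask, mixture):
    """Multiply each time-frequency bin of the mixture by a complex mask value.

    A complex mask can change both the magnitude and the phase of a bin,
    unlike a real-valued mask, which only scales the magnitude.
    """
    return [m * y for m, y in zip(mask, mixture)]


def phase_discrepancy(est1, est2, mixture):
    """Mean of 1 - cos(phase(est1 + est2) - phase(mixture)) over all bins.

    This is an illustrative penalty (an assumption, not the chapter's loss):
    it is 0 when the summed estimates share the mixture phase and grows as
    the two phases move apart.
    """
    total = 0.0
    for s1, s2, y in zip(est1, est2, mixture):
        diff = cmath.phase(s1 + s2) - cmath.phase(y)
        total += 1.0 - math.cos(diff)
    return total / len(mixture)


# Toy mixture: three complex STFT bins with made-up values.
mixture = [1 + 1j, -2 + 0.5j, 0.3 - 0.7j]

# Hypothetical masks for two sources, chosen so that they sum to one.
mask1 = [0.6 + 0.1j, 0.2 - 0.3j, 0.5 + 0.0j]
mask2 = [1 - m for m in mask1]

est1 = apply_complex_mask(mask1, mixture)
est2 = apply_complex_mask(mask2, mixture)

# The estimates add back up to the mixture, so the penalty is about zero.
penalty_consistent = phase_discrepancy(est1, est2, mixture)

# Rotating the second estimate by 90 degrees breaks that agreement.
est2_rotated = [s * cmath.exp(1j * math.pi / 2) for s in est2]
penalty_rotated = phase_discrepancy(est1, est2_rotated, mixture)

print(penalty_consistent, penalty_rotated)
```

In a training setup, a term of this kind would typically be added to a reconstruction loss on the estimated sources; how the chapter weights or formulates it is not stated in this record.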