DEEP SOURCE SEPARATION ARCHITECTURE

A speech separation server comprises a deep-learning encoder with nonlinear activation. The encoder is programmed to take a mixture audio waveform in the time domain, learn generalized patterns from the mixture audio waveform, and generate an encoded representation that effectively characterizes the...

Full description

Saved in:
Bibliographic Details
Main Authors LIU, Xiaoyu, KADIOGLU, Berkan, PUIG, Jordi Pons, HORGAN, Michael Getty
Format Patent
LanguageEnglish
French
German
Published 18.09.2024
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:A speech separation server comprises a deep-learning encoder with nonlinear activation. The encoder is programmed to take a mixture audio waveform in the time domain, learn generalized patterns from the mixture audio waveform, and generate an encoded representation that effectively characterizes the mixture audio waveform for speech separation.
Bibliography:Application Number: EP20200804119