Audio Source Separation using Hyperbolic Embeddings

There is provided an audio processing system and method comprising an input interface that receives an input audio mixture and transforms it into a time-frequency representation defined by values of time-frequency bins, a processor that maps the values of time-frequency bins into a hyperbolic space...

Full description

Saved in:
Bibliographic Details
Main Authors Le Roux, Jonathan, Subramanian, Aswin Shanmugam, Petermann, Darius, Wichern, Gordon
Format Patent
LanguageEnglish
Published 13.06.2024
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:There is provided an audio processing system and method comprising an input interface that receives an input audio mixture and transforms it into a time-frequency representation defined by values of time-frequency bins, a processor that maps the values of time-frequency bins into a hyperbolic space by executing an embedding neural network trained to associate each time-frequency bin to a high-dimensional embedding and projecting each high-dimensional embedding into the hyperbolic space, and an output interface that accepts a selection of at least a portion of the hyperbolic space and renders selected hyperbolic embeddings falling within the selected portion of the hyperbolic space.
Bibliography:Application Number: US202318191417