Audio Source Separation using Hyperbolic Embeddings

There is provided an audio processing system and method comprising an input interface that receives an input audio mixture and transforms it into a time-frequency representation defined by values of time-frequency bins, a processor that maps the values of time-frequency bins into a hyperbolic space...

Full description

Saved in:

Bibliographic Details
Main Authors	Le Roux, Jonathan, Subramanian, Aswin Shanmugam, Petermann, Darius, Wichern, Gordon
Format	Patent
Language	English
Published	13.06.2024
Subjects	ACOUSTICS MEASUREMENT OF MECHANICAL VIBRATIONS OR ULTRASONIC, SONIC ORINFRASONIC WAVES MEASURING MUSICAL INSTRUMENTS PHYSICS SPEECH ANALYSIS OR SYNTHESIS SPEECH OR AUDIO CODING OR DECODING SPEECH OR VOICE PROCESSING SPEECH RECOGNITION TESTING
Online Access	Get full text

Cover

Loading…

More Information
Summary:	There is provided an audio processing system and method comprising an input interface that receives an input audio mixture and transforms it into a time-frequency representation defined by values of time-frequency bins, a processor that maps the values of time-frequency bins into a hyperbolic space by executing an embedding neural network trained to associate each time-frequency bin to a high-dimensional embedding and projecting each high-dimensional embedding into the hyperbolic space, and an output interface that accepts a selection of at least a portion of the hyperbolic space and renders selected hyperbolic embeddings falling within the selected portion of the hyperbolic space.
Bibliography:	Application Number: US202318191417