Direction of Arrival Estimation In Highly Reverberant Environments Using Soft Time-Frequency Mask

A recent approach to improving the robustness of sound localization in reverberant environments is based on pre-selection of time- frequency pixels that are dominated by direct sound. This approach is equivalent to applying a binary time-frequency mask prior to the localization stage. Although the b...

Full description

Saved in:
Bibliographic Details
Published in2019 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) pp. 383 - 387
Main Authors Tourbabin, Vladimir, Donley, Jacob, Rafaely, Boaz, Mehra, Ravish
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.10.2019
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:A recent approach to improving the robustness of sound localization in reverberant environments is based on pre-selection of time- frequency pixels that are dominated by direct sound. This approach is equivalent to applying a binary time-frequency mask prior to the localization stage. Although the binary mask approach was shown to be effective, it may not exploit the information available in the captured signal to its full extent. In an attempt to overcome this limitation, it is hereby proposed to employ a soft mask instead of the binary mask. The proposed weighting scheme is based directly on a metric of the direct-to-reverberant sound ratio in each individual time-frequency pixel. Evaluation using simulated reverberant speech recordings indicates substantial improvement in the localization performance when using the proposed soft mask weighting.
ISSN:1947-1629
DOI:10.1109/WASPAA.2019.8937233