SRP-PHAT methods of locating simultaneous multiple talkers using a frame of microphone array data

Two new methods for locating multiple sound sources using a single segment of data from a large-aperture microphone array are presented. Both methods employ the proven-robust steered response power using the phase transform (SRP-PHAT) as a functional. To cluster the data points into highly probable...

Full description

Saved in:
Bibliographic Details
Published in2010 IEEE International Conference on Acoustics, Speech and Signal Processing pp. 125 - 128
Main Authors Hoang Do, Silverman, Harvey F
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.03.2010
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Two new methods for locating multiple sound sources using a single segment of data from a large-aperture microphone array are presented. Both methods employ the proven-robust steered response power using the phase transform (SRP-PHAT) as a functional. To cluster the data points into highly probable regions containing global peaks, the first method fits a Gaussian mixture model (GMM), whereas the second one sequentially finds the points with highest SRP-PHAT values that most likely represent different clusters. Then the low-cost global optimization method, stochastic region contraction (SRC), is applied to each cluster to find the global peaks. We test the two methods using real data from five simultaneous talkers in a room with high noise and reverberation. Results are presented and discussed.
ISBN:9781424442959
1424442958
ISSN:1520-6149
2379-190X
DOI:10.1109/ICASSP.2010.5496133