SRP-PHAT methods of locating simultaneous multiple talkers using a frame of microphone array data
Published in | 2010 IEEE International Conference on Acoustics, Speech and Signal Processing pp. 125 - 128 |
---|---|
Main Authors | , |
Format | Conference Proceeding |
Language | English |
Published | IEEE, 01.03.2010 |
Summary: | Two new methods for locating multiple sound sources using a single segment of data from a large-aperture microphone array are presented. Both methods employ the proven-robust steered response power using the phase transform (SRP-PHAT) as a functional. To cluster the data points into highly probable regions containing global peaks, the first method fits a Gaussian mixture model (GMM), whereas the second one sequentially finds the points with highest SRP-PHAT values that most likely represent different clusters. Then the low-cost global optimization method, stochastic region contraction (SRC), is applied to each cluster to find the global peaks. We test the two methods using real data from five simultaneous talkers in a room with high noise and reverberation. Results are presented and discussed. |
---|---|
ISBN: | 9781424442959, 1424442958 |
ISSN: | 1520-6149, 2379-190X |
DOI: | 10.1109/ICASSP.2010.5496133 |
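
The summary names SRP-PHAT as the functional that both methods evaluate. As a rough illustration only, and not the authors' implementation, the sketch below computes a basic SRP-PHAT value at a set of candidate points by summing PHAT-weighted cross-correlations over all microphone pairs. The function name `srp_phat`, the sampling rate, the speed of sound, the array geometry, and the noise-free test signal with whole-sample circular delays are all assumptions chosen to keep the example small.

```python
from itertools import combinations

import numpy as np


def srp_phat(signals, mic_pos, points, fs, c=343.0):
    """Return the SRP-PHAT value at each candidate point (hypothetical helper).

    signals: (n_mics, n_samples) one frame of data per microphone
    mic_pos: (n_mics, 3) microphone coordinates in metres
    points:  (n_points, 3) candidate source locations in metres
    """
    n_mics, n = signals.shape
    spectra = np.fft.rfft(signals, n=n, axis=1)
    # Distance from every candidate point to every microphone.
    dists = np.linalg.norm(points[:, None, :] - mic_pos[None, :, :], axis=2)
    power = np.zeros(len(points))
    for i, j in combinations(range(n_mics), 2):
        cross = spectra[i] * np.conj(spectra[j])
        cross /= np.abs(cross) + 1e-12  # PHAT: keep phase, discard magnitude
        gcc = np.fft.irfft(cross, n=n)  # circular GCC; index k holds lag k
        # Expected delay of mic i relative to mic j, rounded to whole samples.
        lags = np.rint((dists[:, i] - dists[:, j]) * fs / c).astype(int)
        power += gcc[lags % n]  # negative lags wrap to the end of the array
    return power


# Toy check (assumed setup): one noise source, four microphones,
# no additive noise and no reverberation.
fs, c = 34300, 343.0  # chosen so that one sample of delay equals 1 cm
mic_pos = np.array([[3.0, 4.0, 0.0], [0.0, 1.5, 2.0],
                    [-1.2, 0.0, 0.5], [0.0, -2.0, 0.0]])
source = np.zeros(3)
rng = np.random.default_rng(0)
s = rng.standard_normal(4096)
delays = np.rint(np.linalg.norm(mic_pos - source, axis=1) * fs / c).astype(int)
signals = np.stack([np.roll(s, d) for d in delays])  # circular integer delays

points = np.array([[0.0, 0.0, 0.0], [1.0, 1.0, 0.0],
                   [-1.0, 0.5, 0.3], [2.0, -1.0, 0.0]])
power = srp_phat(signals, mic_pos, points, fs, c)
best = points[np.argmax(power)]
print(best, power)
```

In this idealized setup the candidate at the true source position should receive the largest value. The clustering step (GMM fitting or sequential peak selection) and the SRC refinement described in the summary are not shown here.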