A robust audio searching method for cellular-phone-based music information retrieval

We propose a search method for detecting a query audio signal fragment in long audio recordings. The query signal is assumed to be captured by a portable terminal, such as a cellular phone, in the real world. A major problem in this kind of search is that the features of the query sound may include...

Full description

Saved in:
Bibliographic Details
Published inObject recognition supported by user interaction for service robots Vol. 3; pp. 991 - 994 vol.3
Main Authors Kurozumi, T., Kashino, K., Murase, H.
Format Conference Proceeding
LanguageEnglish
Published IEEE 2002
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:We propose a search method for detecting a query audio signal fragment in long audio recordings. The query signal is assumed to be captured by a portable terminal, such as a cellular phone, in the real world. A major problem in this kind of search is that the features of the query sound may include distortions due to terminal characteristics or environment noise. The method proposed comprises local time-frequency-region normalization and robust subspace spanning. The former is used to make features invariant to additive noise and frequency characteristics, and the latter to choose frequency bands that minimize the effect of feature distortions. Experiments using cellular phones in the real world show the proposed method is effective.
ISBN:076951695X
ISSN:1051-4651
2831-7475
DOI:10.1109/ICPR.2002.1048204