Extracting noise-robust features from audio data

A key problem faced by audio identification, classification, and retrieval systems is the mapping of high-dimensional audio input data into informative lower-dimensional feature vectors. This paper explores an automatic dimensionality reduction algorithm called Distortion Discriminant Analysis (DDA)...

Full description

Saved in:

Bibliographic Details
Published in	2002 IEEE International Conference on Acoustics, Speech, and Signal Processing Vol. 1; pp. I-1021 - I-1024
Main Authors	Burges, Christopher J. C., Platt, John C., Jana, Soumya
Format	Conference Proceeding
Language	English
Published	IEEE 01.01.2002
Subjects	Data mining Ferroelectric films Nonvolatile memory Random access memory Robustness
Online Access	Get full text

Cover

Loading…

More Information
Summary:	A key problem faced by audio identification, classification, and retrieval systems is the mapping of high-dimensional audio input data into informative lower-dimensional feature vectors. This paper explores an automatic dimensionality reduction algorithm called Distortion Discriminant Analysis (DDA). Each layer of DDA projects its input into directions which maximize the SNR for a given set of distortions. Multiple layers efficiently extract features over a wide temporal window. The audio input to DDA undergoes perceptually-relevant preprocessing and de-equalization, to further suppress distortions. We apply DDA to the task of identifying audio clips in an incoming audio stream, based on matching stored audio fingerprints. We show excellent test results on matching input fingerprints against 36 hours of stored audio data.
ISBN:	9780780374027 0780374029
ISSN:	1520-6149 2379-190X
DOI:	10.1109/ICASSP.2002.5743968