Environmental sound recognition with CELP-based features

In this work, we propose the use of a set of new features based on CELP (Code Excited Linear Prediction) to enhance the performance of the environmental sound recognition (ESR) problem. Traditionally, Mel Frequency Cepstral Coefficients (MFCC) have been used for the recognition of structured data li...

Full description

Saved in:
Bibliographic Details
Published inISSCS 2011 - International Symposium on Signals, Circuits and Systems pp. 1 - 4
Main Authors EnShuo Tsau, Seung-Hwan Kim, Kuo, C-C J.
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.06.2011
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:In this work, we propose the use of a set of new features based on CELP (Code Excited Linear Prediction) to enhance the performance of the environmental sound recognition (ESR) problem. Traditionally, Mel Frequency Cepstral Coefficients (MFCC) have been used for the recognition of structured data like speech and music. However, their performance for the ESR problem is limited. An audio signal can be well preserved by its highly compressed CELP bit streams, which motivates us to study the CELP-based features for the audio scene recognition problem. We present a way to extract a set of features from the CELP bit streams and compare the performance of ESR using different feature sets with the Bayesian network classifier. It is shown by experimental results that the CELP-based features outperform the MFCC features in the ESR problem by a significant 9% margin in average and the integrated MFCC and CELP-based feature set can even reach a correct classification rate of 95.2% using the Bayesian network classifier.
ISBN:9781612849447
161284944X
DOI:10.1109/ISSCS.2011.5978729