Environmental sound recognition with CELP-based features

In this work, we propose the use of a set of new features based on CELP (Code Excited Linear Prediction) to enhance the performance of the environmental sound recognition (ESR) problem. Traditionally, Mel Frequency Cepstral Coefficients (MFCC) have been used for the recognition of structured data li...

Full description

Saved in:

Bibliographic Details
Published in	ISSCS 2011 - International Symposium on Signals, Circuits and Systems pp. 1 - 4
Main Authors	EnShuo Tsau, Seung-Hwan Kim, Kuo, C-C J.
Format	Conference Proceeding
Language	English
Published	IEEE 01.06.2011
Subjects	Bayesian methods Feature extraction Image analysis Mel frequency cepstral coefficient Performance evaluation
Online Access	Get full text

Cover

Loading…

More Information
Summary:	In this work, we propose the use of a set of new features based on CELP (Code Excited Linear Prediction) to enhance the performance of the environmental sound recognition (ESR) problem. Traditionally, Mel Frequency Cepstral Coefficients (MFCC) have been used for the recognition of structured data like speech and music. However, their performance for the ESR problem is limited. An audio signal can be well preserved by its highly compressed CELP bit streams, which motivates us to study the CELP-based features for the audio scene recognition problem. We present a way to extract a set of features from the CELP bit streams and compare the performance of ESR using different feature sets with the Bayesian network classifier. It is shown by experimental results that the CELP-based features outperform the MFCC features in the ESR problem by a significant 9% margin in average and the integrated MFCC and CELP-based feature set can even reach a correct classification rate of 95.2% using the Bayesian network classifier.
ISBN:	9781612849447 161284944X
DOI:	10.1109/ISSCS.2011.5978729