Improved Speaker Identification System Based on MFCC and DMFCC Feature Extraction Technique

Speaker Identification (SI) pertains to the method of understanding person voice by utilizing the techniques of machine learning algorithms. Extracting the feature information from speaker utterances is an essential activity in the speaker identification process to classify the speakers accurately....

Full description

Saved in:

Bibliographic Details
Published in	2021 Fourth International Conference on Electrical, Computer and Communication Technologies (ICECCT) pp. 1 - 5
Main Authors	Jaffino, G., Raman, R., Jose, J Prabin
Format	Conference Proceeding
Language	English
Published	IEEE 15.09.2021
Subjects	Bayes methods Bayesian Classifier Cepstral Coefficients Computational modeling Feature extraction GMM Machine learning algorithms Speaker recognition Speech signal Systematics windowing
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Speaker Identification (SI) pertains to the method of understanding person voice by utilizing the techniques of machine learning algorithms. Extracting the feature information from speaker utterances is an essential activity in the speaker identification process to classify the speakers accurately. In many speaker identification systems Mel Frequency Cepstral Coefficients (MFCC) and Dynamic Mel Frequency Cepstral Coefficients (DMFCC) are used as features because of its potential to represent repetitive nature of speech signal. This work aims to consider the MFCC and DMFCC coefficient which is used to improve the recognition of speaker systems in precision. To construct the speaker identity model, the extracted MFCC and DMFCC features were fed into a Gaussian Mixture Model (GMM) and Bayesian Classifier as input. The Performance of GMM and Bayesian classifier is analyzed for different number of mixtures. The GNN Model can reach maximum accuracy of 82.7% for MFCC and 80.12% for DMFCC. On the other hand the Bayesian classifier performance is 79.43 for MFCC and 83.92% for DMFCC. To classify a speaker, the extraction of features and the classification model can be applied extensively to different types of speaker datasets.
DOI:	10.1109/ICECCT52121.2021.9616805