Improved Speaker Identification System Based on MFCC and DMFCC Feature Extraction Technique
Published in | 2021 Fourth International Conference on Electrical, Computer and Communication Technologies (ICECCT), pp. 1-5 |
---|---|
Main Authors | , , |
Format | Conference Proceeding |
Language | English |
Published | IEEE, 15.09.2021 |
Subjects | |
Summary: | Speaker Identification (SI) is the task of recognizing a person from their voice using machine learning algorithms. Extracting feature information from speaker utterances is an essential step in the speaker identification process, since accurate classification of speakers depends on it. In many speaker identification systems, Mel Frequency Cepstral Coefficients (MFCC) and Dynamic Mel Frequency Cepstral Coefficients (DMFCC) are used as features because of their potential to represent the repetitive nature of the speech signal. This work considers MFCC and DMFCC coefficients with the aim of improving the precision of speaker recognition systems. To construct the speaker identity model, the extracted MFCC and DMFCC features were fed as input into a Gaussian Mixture Model (GMM) and a Bayesian classifier. The performance of the GMM and the Bayesian classifier is analyzed for different numbers of mixtures. The GMM reaches a maximum accuracy of 82.7% for MFCC and 80.12% for DMFCC. The Bayesian classifier, on the other hand, reaches 79.43% for MFCC and 83.92% for DMFCC. The feature extraction and the classification model can be applied broadly to different types of speaker datasets to classify speakers. |
---|---|
DOI: | 10.1109/ICECCT52121.2021.9616805 |
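
The summary does not say how the dynamic (DMFCC) coefficients are derived from the static MFCC frames. One common convention, assumed here and not confirmed by this record, is a regression-based delta over neighbouring frames. The sketch below illustrates that convention in plain Python; the function name `delta_features`, the `width` parameter, and the toy input are hypothetical and not taken from the paper.

```python
def delta_features(frames, width=2):
    """Append regression-based delta coefficients to each static frame.

    frames: list of equal-length lists, one cepstral vector per frame.
    width:  neighbouring frames used on each side (assumed value, not
            taken from the paper).
    """
    if not frames:
        return []
    num_frames = len(frames)
    dim = len(frames[0])
    # Normalisation term of the standard delta regression formula.
    denom = 2 * sum(n * n for n in range(1, width + 1))
    out = []
    for t in range(num_frames):
        delta = [0.0] * dim
        for n in range(1, width + 1):
            # Clamp indices so edge frames reuse the first/last frame.
            ahead = frames[min(t + n, num_frames - 1)]
            behind = frames[max(t - n, 0)]
            for k in range(dim):
                delta[k] += n * (ahead[k] - behind[k])
        # Static coefficients followed by their deltas.
        out.append(list(frames[t]) + [d / denom for d in delta])
    return out


# Toy static features: one coefficient rising linearly, one constant.
static = [[float(t), 5.0] for t in range(6)]
dynamic = delta_features(static, width=2)
```

In a pipeline like the one the summary describes, vectors of this kind would be computed per utterance and then passed to the per-speaker GMM or Bayesian classifier. The real inputs would be MFCC frames extracted from audio, not the toy values used here.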