Deep Convolution Neural Network Based Speech Recognition for Chhattisgarhi

The existing ASR for Chhattisgarhi using conventional machine learning technique was implemented for speaker dependent speech recognition. However, the conventional machine learning based speech recognition is incapable to handle the spectral variations as well as the spectral correlation of acousti...

Full description

Saved in:
Bibliographic Details
Published in2018 5th International Conference on Signal Processing and Integrated Networks (SPIN) pp. 667 - 671
Main Authors Londhe, Narendra D., Kshirsagar, Ghanahshyam B., Tekchandani, Hitesh
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.02.2018
Subjects
Online AccessGet full text
DOI10.1109/SPIN.2018.8474064

Cover

Loading…
More Information
Summary:The existing ASR for Chhattisgarhi using conventional machine learning technique was implemented for speaker dependent speech recognition. However, the conventional machine learning based speech recognition is incapable to handle the spectral variations as well as the spectral correlation of acoustic signals. Therefore, to overcome the aforementioned limitations, authors have implemented the deep convolution neural network (DCNN) based ASR for Chhattisgarhi dialect. Unlike other deep learning models, DCNN can efficiently handle the spectral variations and spectral correlation of speech signal with the less computational burden. The experiment of isolated Chhattisgarhi word recognition was implemented on self-recorded dataset acquired from 150 subjects from various geographical parts of Chhattisgarh state. The implemented algorithm is promisingly achieving 99.49% of accuracy for isolated word recognition. The different performance paraments are presented to validate the performed experiment.
DOI:10.1109/SPIN.2018.8474064