Amazigh speech recognition based on the Kaldi ASR toolkit

In this work, we offer a new approach to integrating the Amazigh language, which is a less-resourced language, into an isolated speech recognition system by exploiting the Kaldi open-source platform. Our designed system is able to recognize the ten first Amazigh digits and ten daily must-used Amazig...

Full description

Saved in:
Bibliographic Details
Published inInternational journal of information technology (Singapore. Online) Vol. 15; no. 7; pp. 3533 - 3540
Main Authors Barkani, Fatima, Hamidi, Mohamed, Laaidi, Naouar, Zealouk, Ouissam, Satori, Hassan, Satori, Khalid
Format Journal Article
LanguageEnglish
Published Singapore Springer Nature Singapore 01.10.2023
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:In this work, we offer a new approach to integrating the Amazigh language, which is a less-resourced language, into an isolated speech recognition system by exploiting the Kaldi open-source platform. Our designed system is able to recognize the ten first Amazigh digits and ten daily must-used Amazigh isolated words, which present typical syllabic structure and which are considered a good representative sample of the Amazigh language. The designed speech system was implemented using Hidden Markov Models (HMMs) with different number of Gaussian distributions. In addition, we evaluated our created system performance by varying the feature extraction methods in order to determine the optimal method for maximum performance. The best-obtained result is 93.96% was obtained with Mel Frequency Cepstral Coefficients (MFCCs) technique.
ISSN:2511-2104
2511-2112
DOI:10.1007/s41870-023-01354-z