Sinhala Speech Recognition for Interactive Voice Response Systems Accessed Through Mobile Phones

This paper presents the development of a Sinhala Speech Recognition System to be deployed in an Interactive Voice Response (IVR) system of a telecommunication service provider. The main objectives are to recognize Sinhala digits and names of Sinhala songs to be set up as ringback tones. Sinhala bein...

Full description

Saved in:
Bibliographic Details
Published in2018 Moratuwa Engineering Research Conference (MERCon) pp. 241 - 246
Main Authors Manamperi, Wageesha, Karunathilake, Dinesha, Madhushani, Thilini, Galagedara, Nimasha, Dias, Dileeka
Format Conference Proceeding
LanguageEnglish
Published IEEE 01.05.2018
Subjects
Online AccessGet full text

Cover

Loading…
Abstract This paper presents the development of a Sinhala Speech Recognition System to be deployed in an Interactive Voice Response (IVR) system of a telecommunication service provider. The main objectives are to recognize Sinhala digits and names of Sinhala songs to be set up as ringback tones. Sinhala being a phonetic language, its features are studied to develop a list of 47 phonemes. A continuous speech recognition system is developed based on Hidden Markov Model (HMM). The acoustic model is trained using the voice through mobile phone. The outcome is a speaker independent speech recognition system which is capable of recognizing 10 digits and 50 Sinhala songs. A word error rate (WER) of 11.2% using a speech corpus of 0.862 hours and a sentence error rate (SER) of 5.7% using a speech corpus of 1.388 hours are achieved for digits and songs respectively.
AbstractList This paper presents the development of a Sinhala Speech Recognition System to be deployed in an Interactive Voice Response (IVR) system of a telecommunication service provider. The main objectives are to recognize Sinhala digits and names of Sinhala songs to be set up as ringback tones. Sinhala being a phonetic language, its features are studied to develop a list of 47 phonemes. A continuous speech recognition system is developed based on Hidden Markov Model (HMM). The acoustic model is trained using the voice through mobile phone. The outcome is a speaker independent speech recognition system which is capable of recognizing 10 digits and 50 Sinhala songs. A word error rate (WER) of 11.2% using a speech corpus of 0.862 hours and a sentence error rate (SER) of 5.7% using a speech corpus of 1.388 hours are achieved for digits and songs respectively.
Author Karunathilake, Dinesha
Galagedara, Nimasha
Manamperi, Wageesha
Dias, Dileeka
Madhushani, Thilini
Author_xml – sequence: 1
  givenname: Wageesha
  surname: Manamperi
  fullname: Manamperi, Wageesha
  organization: Department of Electronic and Telecommunication Engineering, University of Moratuwa, Sri Lanka
– sequence: 2
  givenname: Dinesha
  surname: Karunathilake
  fullname: Karunathilake, Dinesha
  organization: Department of Electronic and Telecommunication Engineering, University of Moratuwa, Sri Lanka
– sequence: 3
  givenname: Thilini
  surname: Madhushani
  fullname: Madhushani, Thilini
  organization: Department of Electronic and Telecommunication Engineering, University of Moratuwa, Sri Lanka
– sequence: 4
  givenname: Nimasha
  surname: Galagedara
  fullname: Galagedara, Nimasha
  organization: Department of Electronic and Telecommunication Engineering, University of Moratuwa, Sri Lanka
– sequence: 5
  givenname: Dileeka
  surname: Dias
  fullname: Dias, Dileeka
  organization: Department of Electronic and Telecommunication Engineering, University of Moratuwa, Sri Lanka
BookMark eNotkNFOwjAYRmuiiYI8ATd9gWG7tuu_S0JQSSAaht5i1_1lNaMl6zTh7SWRq-_m5OTkG5HbEAMSMuVsxjkrnzbL7SKGWc44zEDmHABuyIgrAYWUXOt7MknpmzGWFyCLonwgX5UPrekMrU6ItqVbtPEQ_OBjoC72dBUG7I0d_C_Sz-gtXoh0iiEhrc5pwGOic2sxJWzoru3jz6Glm1j7Dul7e4lLj-TOmS7h5Lpj8vG83C1es_Xby2oxX2eeazVkNXLDa8Mgd5CLXJYgnDZOSM0aWcraOSiQC2MVKMkRldXaYiMLphpQzokxmf57PSLuT70_mv68v54g_gBSKlZn
ContentType Conference Proceeding
DBID 6IE
6IL
CBEJK
RIE
RIL
DOI 10.1109/MERCon.2018.8421888
DatabaseName IEEE Electronic Library (IEL) Conference Proceedings
IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume
IEEE Xplore All Conference Proceedings
IEEE
IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE
  url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
EISBN 1538644177
9781538644171
EndPage 246
ExternalDocumentID 8421888
Genre orig-research
GroupedDBID 6IE
6IF
6IL
6IN
AAJGR
ABLEC
ALMA_UNASSIGNED_HOLDINGS
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CBEJK
IEGSK
OCL
RIE
RIL
ID FETCH-LOGICAL-i175t-be1a1ba082f82324983f7af3470d494bff86e13ac58541ee5c77ced4605d85ff3
IEDL.DBID RIE
IngestDate Thu Jun 29 18:39:11 EDT 2023
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-i175t-be1a1ba082f82324983f7af3470d494bff86e13ac58541ee5c77ced4605d85ff3
PageCount 6
ParticipantIDs ieee_primary_8421888
PublicationCentury 2000
PublicationDate 2018-May
PublicationDateYYYYMMDD 2018-05-01
PublicationDate_xml – month: 05
  year: 2018
  text: 2018-May
PublicationDecade 2010
PublicationTitle 2018 Moratuwa Engineering Research Conference (MERCon)
PublicationTitleAbbrev MERCon
PublicationYear 2018
Publisher IEEE
Publisher_xml – name: IEEE
SSID ssj0002684669
Score 1.749279
Snippet This paper presents the development of a Sinhala Speech Recognition System to be deployed in an Interactive Voice Response (IVR) system of a telecommunication...
SourceID ieee
SourceType Publisher
StartPage 241
SubjectTerms Acoustics
Decoding
Dictionaries
Hidden Markov models
HMM
IVR
Phonetics
sinhala
Speech recognition
Training
Title Sinhala Speech Recognition for Interactive Voice Response Systems Accessed Through Mobile Phones
URI https://ieeexplore.ieee.org/document/8421888
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV07a8MwEBZJpk5tSUrfaOhYO3F8jqWxhIRScAl5lGypHicSWuxQnKW_vpLspLR06GaMLQkd3Hc6ffcdIXdCgdDcWUAoFUCiIeCGxQH0hUxjLSzou3rn7HnwuICnZbJskPtDLQwievIZhu7R3-XrQu1cqqzLwAISY03SZL1-Vat1yKc41ZLBgNfCQlGPd7PRdFg4idOIhfWfP1qoeAQZH5NsP3dFHHkLd6UM1ecvWcb_Lu6EdL5r9ejkgEKnpIF5m7zONvlavAs62yKqNZ3uaUJFTm2USn0eUHhXR18K6yvsF54ri7SWMKcPvpMiajqvOvnQrJDWg9DJ2qn7d8hiPJoPH4O6l0KwsQFCGUiMRCSFBXzDXBDFWWxSYWJIexo4SGPYAKNYKHt8gAgxUWmqULtbU80SY-Iz0srt-OeEAtfSjqOty1bAMZVSG4REoT2rJAbwgrTd7qy2lVzGqt6Yy79fX5EjZ6GKQ3hNWuXHDm8szpfy1hv4C5qsq2k
link.rule.ids 310,311,786,790,795,796,802,27956,55107
linkProvider IEEE
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV07T8MwELZKGWAC1CLeeGAkaUOcxB5RVVSgqao-ULfix1mtQEmF0oVfj-2kRSAGtihKbMsn3Xc-f_cdQjdcEq6YtQCX0iORIh7TNPTIHRdJqLgBfVvvnA7i3pQ8zaJZDd1ua2EAwJHPwLeP7i5f5XJtU2UtSgwgUbqDdg3Ot5OyWmubUbG6JXHMKmmhoM1aaXfUya3IaUD96t8fTVQchjwcoHQze0kdefPXhfDl5y9hxv8u7xA1v6v18HCLQ0eoBlkDvY6X2YK_czxeAcgFHm2IQnmGTZyKXSaQO2eHX3LjLcwXji0LuBIxx_eulyIoPCl7-eA0F8aH4OHC6vs30fShO-n0vKqbgrc0IULhCQh4ILiBfE1tGMVoqBOuQ5K0FWFEaE1jCEIuzQGCBACRTBIJyt6bKhppHR6jembGP0GYMCXMOMo4bUkYJEIoDSSSYE4rkSZwihp2d-arUjBjXm3M2d-vr9Feb5L25_3HwfM52rfWKhmFF6hefKzh0qB-Ia6csb8AziSuvQ
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2018+Moratuwa+Engineering+Research+Conference+%28MERCon%29&rft.atitle=Sinhala+Speech+Recognition+for+Interactive+Voice+Response+Systems+Accessed+Through+Mobile+Phones&rft.au=Manamperi%2C+Wageesha&rft.au=Karunathilake%2C+Dinesha&rft.au=Madhushani%2C+Thilini&rft.au=Galagedara%2C+Nimasha&rft.date=2018-05-01&rft.pub=IEEE&rft.spage=241&rft.epage=246&rft_id=info:doi/10.1109%2FMERCon.2018.8421888&rft.externalDocID=8421888