Sinhala Speech Recognition for Interactive Voice Response Systems Accessed Through Mobile Phones
This paper presents the development of a Sinhala Speech Recognition System to be deployed in an Interactive Voice Response (IVR) system of a telecommunication service provider. The main objectives are to recognize Sinhala digits and names of Sinhala songs to be set up as ringback tones. Sinhala bein...
Saved in:
Published in | 2018 Moratuwa Engineering Research Conference (MERCon) pp. 241 - 246 |
---|---|
Main Authors | , , , , |
Format | Conference Proceeding |
Language | English |
Published |
IEEE
01.05.2018
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Abstract | This paper presents the development of a Sinhala Speech Recognition System to be deployed in an Interactive Voice Response (IVR) system of a telecommunication service provider. The main objectives are to recognize Sinhala digits and names of Sinhala songs to be set up as ringback tones. Sinhala being a phonetic language, its features are studied to develop a list of 47 phonemes. A continuous speech recognition system is developed based on Hidden Markov Model (HMM). The acoustic model is trained using the voice through mobile phone. The outcome is a speaker independent speech recognition system which is capable of recognizing 10 digits and 50 Sinhala songs. A word error rate (WER) of 11.2% using a speech corpus of 0.862 hours and a sentence error rate (SER) of 5.7% using a speech corpus of 1.388 hours are achieved for digits and songs respectively. |
---|---|
AbstractList | This paper presents the development of a Sinhala Speech Recognition System to be deployed in an Interactive Voice Response (IVR) system of a telecommunication service provider. The main objectives are to recognize Sinhala digits and names of Sinhala songs to be set up as ringback tones. Sinhala being a phonetic language, its features are studied to develop a list of 47 phonemes. A continuous speech recognition system is developed based on Hidden Markov Model (HMM). The acoustic model is trained using the voice through mobile phone. The outcome is a speaker independent speech recognition system which is capable of recognizing 10 digits and 50 Sinhala songs. A word error rate (WER) of 11.2% using a speech corpus of 0.862 hours and a sentence error rate (SER) of 5.7% using a speech corpus of 1.388 hours are achieved for digits and songs respectively. |
Author | Karunathilake, Dinesha Galagedara, Nimasha Manamperi, Wageesha Dias, Dileeka Madhushani, Thilini |
Author_xml | – sequence: 1 givenname: Wageesha surname: Manamperi fullname: Manamperi, Wageesha organization: Department of Electronic and Telecommunication Engineering, University of Moratuwa, Sri Lanka – sequence: 2 givenname: Dinesha surname: Karunathilake fullname: Karunathilake, Dinesha organization: Department of Electronic and Telecommunication Engineering, University of Moratuwa, Sri Lanka – sequence: 3 givenname: Thilini surname: Madhushani fullname: Madhushani, Thilini organization: Department of Electronic and Telecommunication Engineering, University of Moratuwa, Sri Lanka – sequence: 4 givenname: Nimasha surname: Galagedara fullname: Galagedara, Nimasha organization: Department of Electronic and Telecommunication Engineering, University of Moratuwa, Sri Lanka – sequence: 5 givenname: Dileeka surname: Dias fullname: Dias, Dileeka organization: Department of Electronic and Telecommunication Engineering, University of Moratuwa, Sri Lanka |
BookMark | eNotkNFOwjAYRmuiiYI8ATd9gWG7tuu_S0JQSSAaht5i1_1lNaMl6zTh7SWRq-_m5OTkG5HbEAMSMuVsxjkrnzbL7SKGWc44zEDmHABuyIgrAYWUXOt7MknpmzGWFyCLonwgX5UPrekMrU6ItqVbtPEQ_OBjoC72dBUG7I0d_C_Sz-gtXoh0iiEhrc5pwGOic2sxJWzoru3jz6Glm1j7Dul7e4lLj-TOmS7h5Lpj8vG83C1es_Xby2oxX2eeazVkNXLDa8Mgd5CLXJYgnDZOSM0aWcraOSiQC2MVKMkRldXaYiMLphpQzokxmf57PSLuT70_mv68v54g_gBSKlZn |
ContentType | Conference Proceeding |
DBID | 6IE 6IL CBEJK RIE RIL |
DOI | 10.1109/MERCon.2018.8421888 |
DatabaseName | IEEE Electronic Library (IEL) Conference Proceedings IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume IEEE Xplore All Conference Proceedings IEEE IEEE Proceedings Order Plans (POP All) 1998-Present |
DatabaseTitleList | |
Database_xml | – sequence: 1 dbid: RIE name: IEEE url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher |
DeliveryMethod | fulltext_linktorsrc |
EISBN | 1538644177 9781538644171 |
EndPage | 246 |
ExternalDocumentID | 8421888 |
Genre | orig-research |
GroupedDBID | 6IE 6IF 6IL 6IN AAJGR ABLEC ALMA_UNASSIGNED_HOLDINGS BEFXN BFFAM BGNUA BKEBE BPEOZ CBEJK IEGSK OCL RIE RIL |
ID | FETCH-LOGICAL-i175t-be1a1ba082f82324983f7af3470d494bff86e13ac58541ee5c77ced4605d85ff3 |
IEDL.DBID | RIE |
IngestDate | Thu Jun 29 18:39:11 EDT 2023 |
IsPeerReviewed | false |
IsScholarly | false |
Language | English |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-i175t-be1a1ba082f82324983f7af3470d494bff86e13ac58541ee5c77ced4605d85ff3 |
PageCount | 6 |
ParticipantIDs | ieee_primary_8421888 |
PublicationCentury | 2000 |
PublicationDate | 2018-May |
PublicationDateYYYYMMDD | 2018-05-01 |
PublicationDate_xml | – month: 05 year: 2018 text: 2018-May |
PublicationDecade | 2010 |
PublicationTitle | 2018 Moratuwa Engineering Research Conference (MERCon) |
PublicationTitleAbbrev | MERCon |
PublicationYear | 2018 |
Publisher | IEEE |
Publisher_xml | – name: IEEE |
SSID | ssj0002684669 |
Score | 1.749279 |
Snippet | This paper presents the development of a Sinhala Speech Recognition System to be deployed in an Interactive Voice Response (IVR) system of a telecommunication... |
SourceID | ieee |
SourceType | Publisher |
StartPage | 241 |
SubjectTerms | Acoustics Decoding Dictionaries Hidden Markov models HMM IVR Phonetics sinhala Speech recognition Training |
Title | Sinhala Speech Recognition for Interactive Voice Response Systems Accessed Through Mobile Phones |
URI | https://ieeexplore.ieee.org/document/8421888 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV07a8MwEBZJpk5tSUrfaOhYO3F8jqWxhIRScAl5lGypHicSWuxQnKW_vpLspLR06GaMLQkd3Hc6ffcdIXdCgdDcWUAoFUCiIeCGxQH0hUxjLSzou3rn7HnwuICnZbJskPtDLQwievIZhu7R3-XrQu1cqqzLwAISY03SZL1-Vat1yKc41ZLBgNfCQlGPd7PRdFg4idOIhfWfP1qoeAQZH5NsP3dFHHkLd6UM1ecvWcb_Lu6EdL5r9ejkgEKnpIF5m7zONvlavAs62yKqNZ3uaUJFTm2USn0eUHhXR18K6yvsF54ri7SWMKcPvpMiajqvOvnQrJDWg9DJ2qn7d8hiPJoPH4O6l0KwsQFCGUiMRCSFBXzDXBDFWWxSYWJIexo4SGPYAKNYKHt8gAgxUWmqULtbU80SY-Iz0srt-OeEAtfSjqOty1bAMZVSG4REoT2rJAbwgrTd7qy2lVzGqt6Yy79fX5EjZ6GKQ3hNWuXHDm8szpfy1hv4C5qsq2k |
link.rule.ids | 310,311,786,790,795,796,802,27956,55107 |
linkProvider | IEEE |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV07T8MwELZKGWAC1CLeeGAkaUOcxB5RVVSgqao-ULfix1mtQEmF0oVfj-2kRSAGtihKbMsn3Xc-f_cdQjdcEq6YtQCX0iORIh7TNPTIHRdJqLgBfVvvnA7i3pQ8zaJZDd1ua2EAwJHPwLeP7i5f5XJtU2UtSgwgUbqDdg3Ot5OyWmubUbG6JXHMKmmhoM1aaXfUya3IaUD96t8fTVQchjwcoHQze0kdefPXhfDl5y9hxv8u7xA1v6v18HCLQ0eoBlkDvY6X2YK_czxeAcgFHm2IQnmGTZyKXSaQO2eHX3LjLcwXji0LuBIxx_eulyIoPCl7-eA0F8aH4OHC6vs30fShO-n0vKqbgrc0IULhCQh4ILiBfE1tGMVoqBOuQ5K0FWFEaE1jCEIuzQGCBACRTBIJyt6bKhppHR6jembGP0GYMCXMOMo4bUkYJEIoDSSSYE4rkSZwihp2d-arUjBjXm3M2d-vr9Feb5L25_3HwfM52rfWKhmFF6hefKzh0qB-Ia6csb8AziSuvQ |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2018+Moratuwa+Engineering+Research+Conference+%28MERCon%29&rft.atitle=Sinhala+Speech+Recognition+for+Interactive+Voice+Response+Systems+Accessed+Through+Mobile+Phones&rft.au=Manamperi%2C+Wageesha&rft.au=Karunathilake%2C+Dinesha&rft.au=Madhushani%2C+Thilini&rft.au=Galagedara%2C+Nimasha&rft.date=2018-05-01&rft.pub=IEEE&rft.spage=241&rft.epage=246&rft_id=info:doi/10.1109%2FMERCon.2018.8421888&rft.externalDocID=8421888 |