Sinhala Speech Recognition for Interactive Voice Response Systems Accessed Through Mobile Phones

This paper presents the development of a Sinhala Speech Recognition System to be deployed in an Interactive Voice Response (IVR) system of a telecommunication service provider. The main objectives are to recognize Sinhala digits and names of Sinhala songs to be set up as ringback tones. Sinhala bein...

Full description

Saved in:

Bibliographic Details
Published in	2018 Moratuwa Engineering Research Conference (MERCon) pp. 241 - 246
Main Authors	Manamperi, Wageesha, Karunathilake, Dinesha, Madhushani, Thilini, Galagedara, Nimasha, Dias, Dileeka
Format	Conference Proceeding
Language	English
Published	IEEE 01.05.2018
Subjects	Acoustics Decoding Dictionaries Hidden Markov models HMM IVR Phonetics sinhala Speech recognition Training
Online Access	Get full text

Cover

Loading…

Abstract	This paper presents the development of a Sinhala Speech Recognition System to be deployed in an Interactive Voice Response (IVR) system of a telecommunication service provider. The main objectives are to recognize Sinhala digits and names of Sinhala songs to be set up as ringback tones. Sinhala being a phonetic language, its features are studied to develop a list of 47 phonemes. A continuous speech recognition system is developed based on Hidden Markov Model (HMM). The acoustic model is trained using the voice through mobile phone. The outcome is a speaker independent speech recognition system which is capable of recognizing 10 digits and 50 Sinhala songs. A word error rate (WER) of 11.2% using a speech corpus of 0.862 hours and a sentence error rate (SER) of 5.7% using a speech corpus of 1.388 hours are achieved for digits and songs respectively.
AbstractList	This paper presents the development of a Sinhala Speech Recognition System to be deployed in an Interactive Voice Response (IVR) system of a telecommunication service provider. The main objectives are to recognize Sinhala digits and names of Sinhala songs to be set up as ringback tones. Sinhala being a phonetic language, its features are studied to develop a list of 47 phonemes. A continuous speech recognition system is developed based on Hidden Markov Model (HMM). The acoustic model is trained using the voice through mobile phone. The outcome is a speaker independent speech recognition system which is capable of recognizing 10 digits and 50 Sinhala songs. A word error rate (WER) of 11.2% using a speech corpus of 0.862 hours and a sentence error rate (SER) of 5.7% using a speech corpus of 1.388 hours are achieved for digits and songs respectively.
Author	Karunathilake, Dinesha Galagedara, Nimasha Manamperi, Wageesha Dias, Dileeka Madhushani, Thilini
Author_xml	– sequence: 1 givenname: Wageesha surname: Manamperi fullname: Manamperi, Wageesha organization: Department of Electronic and Telecommunication Engineering, University of Moratuwa, Sri Lanka – sequence: 2 givenname: Dinesha surname: Karunathilake fullname: Karunathilake, Dinesha organization: Department of Electronic and Telecommunication Engineering, University of Moratuwa, Sri Lanka – sequence: 3 givenname: Thilini surname: Madhushani fullname: Madhushani, Thilini organization: Department of Electronic and Telecommunication Engineering, University of Moratuwa, Sri Lanka – sequence: 4 givenname: Nimasha surname: Galagedara fullname: Galagedara, Nimasha organization: Department of Electronic and Telecommunication Engineering, University of Moratuwa, Sri Lanka – sequence: 5 givenname: Dileeka surname: Dias fullname: Dias, Dileeka organization: Department of Electronic and Telecommunication Engineering, University of Moratuwa, Sri Lanka
BookMark	eNotkNFOwjAYRmuiiYI8ATd9gWG7tuu_S0JQSSAaht5i1_1lNaMl6zTh7SWRq-_m5OTkG5HbEAMSMuVsxjkrnzbL7SKGWc44zEDmHABuyIgrAYWUXOt7MknpmzGWFyCLonwgX5UPrekMrU6ItqVbtPEQ_OBjoC72dBUG7I0d_C_Sz-gtXoh0iiEhrc5pwGOic2sxJWzoru3jz6Glm1j7Dul7e4lLj-TOmS7h5Lpj8vG83C1es_Xby2oxX2eeazVkNXLDa8Mgd5CLXJYgnDZOSM0aWcraOSiQC2MVKMkRldXaYiMLphpQzokxmf57PSLuT70_mv68v54g_gBSKlZn
ContentType	Conference Proceeding
DBID	6IE 6IL CBEJK RIE RIL
DOI	10.1109/MERCon.2018.8421888
DatabaseName	IEEE Electronic Library (IEL) Conference Proceedings IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume IEEE Xplore All Conference Proceedings IEEE IEEE Proceedings Order Plans (POP All) 1998-Present
DatabaseTitleList
Database_xml	– sequence: 1 dbid: RIE name: IEEE url: https://proxy.k.utb.cz/login?url=https://ieeexplore.ieee.org/ sourceTypes: Publisher
DeliveryMethod	fulltext_linktorsrc
EISBN	1538644177 9781538644171
EndPage	246
ExternalDocumentID	8421888
Genre	orig-research
GroupedDBID	6IE 6IF 6IL 6IN AAJGR ABLEC ALMA_UNASSIGNED_HOLDINGS BEFXN BFFAM BGNUA BKEBE BPEOZ CBEJK IEGSK OCL RIE RIL
ID	FETCH-LOGICAL-i175t-be1a1ba082f82324983f7af3470d494bff86e13ac58541ee5c77ced4605d85ff3
IEDL.DBID	RIE
IngestDate	Thu Jun 29 18:39:11 EDT 2023
IsPeerReviewed	false
IsScholarly	false
Language	English
LinkModel	DirectLink
MergedId	FETCHMERGED-LOGICAL-i175t-be1a1ba082f82324983f7af3470d494bff86e13ac58541ee5c77ced4605d85ff3
PageCount	6
ParticipantIDs	ieee_primary_8421888
PublicationCentury	2000
PublicationDate	2018-May
PublicationDateYYYYMMDD	2018-05-01
PublicationDate_xml	– month: 05 year: 2018 text: 2018-May
PublicationDecade	2010
PublicationTitle	2018 Moratuwa Engineering Research Conference (MERCon)
PublicationTitleAbbrev	MERCon
PublicationYear	2018
Publisher	IEEE
Publisher_xml	– name: IEEE
SSID	ssj0002684669
Score	1.749279
Snippet	This paper presents the development of a Sinhala Speech Recognition System to be deployed in an Interactive Voice Response (IVR) system of a telecommunication...
SourceID	ieee
SourceType	Publisher
StartPage	241
SubjectTerms	Acoustics Decoding Dictionaries Hidden Markov models HMM IVR Phonetics sinhala Speech recognition Training
Title	Sinhala Speech Recognition for Interactive Voice Response Systems Accessed Through Mobile Phones
URI	https://ieeexplore.ieee.org/document/8421888
hasFullText	1
inHoldings	1
isFullTextHit
isPrint
link	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV07a8MwEBZJpk5tSUrfaOhYO3F8jqWxhIRScAl5lGypHicSWuxQnKW_vpLspLR06GaMLQkd3Hc6ffcdIXdCgdDcWUAoFUCiIeCGxQH0hUxjLSzou3rn7HnwuICnZbJskPtDLQwievIZhu7R3-XrQu1cqqzLwAISY03SZL1-Vat1yKc41ZLBgNfCQlGPd7PRdFg4idOIhfWfP1qoeAQZH5NsP3dFHHkLd6UM1ecvWcb_Lu6EdL5r9ejkgEKnpIF5m7zONvlavAs62yKqNZ3uaUJFTm2USn0eUHhXR18K6yvsF54ri7SWMKcPvpMiajqvOvnQrJDWg9DJ2qn7d8hiPJoPH4O6l0KwsQFCGUiMRCSFBXzDXBDFWWxSYWJIexo4SGPYAKNYKHt8gAgxUWmqULtbU80SY-Iz0srt-OeEAtfSjqOty1bAMZVSG4REoT2rJAbwgrTd7qy2lVzGqt6Yy79fX5EjZ6GKQ3hNWuXHDm8szpfy1hv4C5qsq2k
link.rule.ids	310,311,786,790,795,796,802,27956,55107
linkProvider	IEEE
linkToHtml	http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV07T8MwELZKGWAC1CLeeGAkaUOcxB5RVVSgqao-ULfix1mtQEmF0oVfj-2kRSAGtihKbMsn3Xc-f_cdQjdcEq6YtQCX0iORIh7TNPTIHRdJqLgBfVvvnA7i3pQ8zaJZDd1ua2EAwJHPwLeP7i5f5XJtU2UtSgwgUbqDdg3Ot5OyWmubUbG6JXHMKmmhoM1aaXfUya3IaUD96t8fTVQchjwcoHQze0kdefPXhfDl5y9hxv8u7xA1v6v18HCLQ0eoBlkDvY6X2YK_czxeAcgFHm2IQnmGTZyKXSaQO2eHX3LjLcwXji0LuBIxx_eulyIoPCl7-eA0F8aH4OHC6vs30fShO-n0vKqbgrc0IULhCQh4ILiBfE1tGMVoqBOuQ5K0FWFEaE1jCEIuzQGCBACRTBIJyt6bKhppHR6jembGP0GYMCXMOMo4bUkYJEIoDSSSYE4rkSZwihp2d-arUjBjXm3M2d-vr9Feb5L25_3HwfM52rfWKhmFF6hefKzh0qB-Ia6csb8AziSuvQ
openUrl	ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=proceeding&rft.title=2018+Moratuwa+Engineering+Research+Conference+%28MERCon%29&rft.atitle=Sinhala+Speech+Recognition+for+Interactive+Voice+Response+Systems+Accessed+Through+Mobile+Phones&rft.au=Manamperi%2C+Wageesha&rft.au=Karunathilake%2C+Dinesha&rft.au=Madhushani%2C+Thilini&rft.au=Galagedara%2C+Nimasha&rft.date=2018-05-01&rft.pub=IEEE&rft.spage=241&rft.epage=246&rft_id=info:doi/10.1109%2FMERCon.2018.8421888&rft.externalDocID=8421888