Web Service-Based Turkish Automatic Speech Recognition Platform

Bibliographic Details
Published in: 2020 International Congress on Human-Computer Interaction, Optimization and Robotic Applications (HORA), pp. 1-5
Main Authors: Oyucu, Saadin; Polat, Huseyin; Sever, Hayri
Format: Conference Proceeding
Language: English
Published: IEEE, 01.06.2020
Summary: In response to the challenges of building large-scale distributed applications and platforms on the Web, microservice architecture has emerged and gained considerable popularity in recent years. Therefore, a web-based platform was developed both to make use of microservices and to provide the necessary interface for Automatic Speech Recognition (ASR). First, within the scope of the study, a Turkish ASR system was developed. A web service structure was then created to facilitate access to the ASR system. Access to the methods and data in the web service structure was provided through Representational State Transfer (REST) web services and a service layer. An interface was developed to enable interaction with the web service. The platform combines several technologies: ASR, web services, microservices, and interface technologies. It can be used via a standard web browser or an Application Programming Interface (API). In this study, Docker packages were used to improve system performance instead of running different virtual machines on a single server. In the experiments, the Turkish ASR system achieved a word error rate of 24.70%. In web service performance tests, the platform responded in an average of 9.6 seconds for a 59-second speech recording. The user interface was tested in both mobile and desktop web browsers and functioned properly in each. Through the API support provided by the platform, applications and other services can access the platform without using the interface. As a result, a web service-based Turkish ASR platform was developed that works seamlessly on the ever-increasing number of mobile devices, the Internet of Things ecosystem, and other access devices.
DOI: 10.1109/HORA49412.2020.9152920
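
Note on the reported metrics: the Summary gives a word error rate (WER) and an average response time but does not say how either was computed. The sketch below uses the standard WER definition (word-level edit distance divided by the number of reference words) and a simple real-time factor (processing time divided by audio duration). The function names and the example sentence are hypothetical and are not taken from the paper; the authors' own scoring tools may differ.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Standard WER: (substitutions + deletions + insertions) / reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words (row-by-row Levenshtein distance).
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion: reference word missing from hypothesis
                curr[j - 1] + 1,     # insertion: extra word in hypothesis
                prev[j - 1] + cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


def real_time_factor(processing_seconds: float, audio_seconds: float) -> float:
    """Processing time per second of audio; values below 1 are faster than real time."""
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    return processing_seconds / audio_seconds


if __name__ == "__main__":
    # Hypothetical transcripts, for illustration only (one word dropped out of four).
    print(word_error_rate("bugün hava çok güzel", "bugün hava güzel"))
    # Figures from the Summary: 9.6 s average response for a 59 s recording.
    print(round(real_time_factor(9.6, 59.0), 3))
```

With the figures from the Summary, the real-time factor comes out at about 0.16, meaning the platform returned a result in roughly one sixth of the recording's duration.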