Quranic Optical Text Recognition Using Deep Learning Models

A Quranic optical character recognition (OCR) system based on convolutional neural network (CNN) followed by recurrent neural network (RNN) is introduced in this work. Six deep learning models are built to study the effect of different representations of the input and output, and the accuracy and pe...

Full description

Saved in:

Bibliographic Details
Published in	IEEE access Vol. 9; pp. 38318 - 38330
Main Authors	Mohd, Masnizah, Qamar, Faizan, Al-Sheikh, Idris, Salah, Ramzi
Format	Journal Article
Language	English
Published	Piscataway IEEE 2021 The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects	Arabic Artificial neural networks Character recognition Datasets Deep learning gated recurrent unit Hidden Markov models long short-term memory Machine learning Model accuracy Neural networks Object recognition Optical character recognition Optical character recognition software Quranic text Recurrent neural networks Text recognition Writing
Online Access	Get full text

Cover

Loading…

More Information
Summary:	A Quranic optical character recognition (OCR) system based on convolutional neural network (CNN) followed by recurrent neural network (RNN) is introduced in this work. Six deep learning models are built to study the effect of different representations of the input and output, and the accuracy and performance of the models, and compare long short-term memory (LSTM) and gated recurrent unit (GRU). A new Quranic OCR dataset is developed based on the most famous printed version of the Holy Quran (Mushaf Al-Madinah), and a page and line-text image with the corresponding labels is prepared. This work's contribution is a Quranic OCR model capable of recognizing the Quranic image's diacritic text. A better performance in word recognition rate (WRR) and character recognition rate (CRR) is achieved in the experiments. The LSTM and GRU are compared in the Arabic text recognition domain. In addition, a public database is built for research purposes in Arabic text recognition that contains the diacritics and the Uthmanic script, and is large enough to be used with the deep learning models. The outcome of this work shows that the proposed system obtains an accuracy of 98% on the validation data, and a WRR of 95% and a CRR of 99% in the test dataset.
ISSN:	2169-3536 2169-3536
DOI:	10.1109/ACCESS.2021.3064019