Recognizing Cursive Typewritten Text Using Segmentation-Free System

Feature extraction plays an important role in text recognition as it aims to capture essential characteristics of the text image. Feature extraction algorithms widely range between robust and hard to extract features and noise sensitive and easy to extract features. Among those feature types are sta...

Full description

Saved in:
Bibliographic Details
Published inTheScientificWorld Vol. 2015; no. 2015; pp. 1 - 7
Main Author Khorsheed, Mohammad S.
Format Journal Article
LanguageEnglish
Published Cairo, Egypt Hindawi Publishing Corporation 2015
John Wiley & Sons, Inc
Hindawi Limited
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Feature extraction plays an important role in text recognition as it aims to capture essential characteristics of the text image. Feature extraction algorithms widely range between robust and hard to extract features and noise sensitive and easy to extract features. Among those feature types are statistical features which are derived from the statistical distribution of the image pixels. This paper presents a novel method for feature extraction where simple statistical features are extracted from a one-pixel wide window that slides across the text line. The feature set is clustered in the feature space using vector quantization. The feature vector sequence is then injected to a classification engine for training and recognition purposes. The recognition system is applied to a data corpus which includes cursive Arabic text of more than 600 A4-size sheets typewritten in multiple computer-generated fonts. The system performance is compared to a previously published system from the literature with a similar engine but a different feature set.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
Academic Editor: Jzau Sheng Lin
ISSN:2356-6140
1537-744X
1537-744X
DOI:10.1155/2015/818432