Strokelets: A Learned Multi-scale Representation for Scene Text Recognition

Bibliographic Details
Published in	2014 IEEE Conference on Computer Vision and Pattern Recognition, pp. 4042-4049
Main Authors	Yao, Cong; Bai, Xiang; Shi, Baoguang; Liu, Wenyu
Format	Conference Proceeding; Journal Article
Language	English
Published	IEEE, 01.06.2014
Subjects	Character recognition; Clustering algorithms; Computer vision; Conferences; Interference; Noise; Pattern recognition; Prototypes; Recognition; Representations; Robustness; Text recognition; Texts; Training

More Information
Summary:	Driven by a wide range of applications, scene text detection and recognition have become active research topics in computer vision. Though extensively studied, localizing and reading text in uncontrolled environments remain extremely challenging due to various interference factors. In this paper, we propose a novel multi-scale representation for scene text recognition. This representation consists of a set of detectable primitives, termed strokelets, which capture the essential substructures of characters at different granularities. Strokelets possess four distinctive advantages: (1) Usability: they are learned automatically from bounding box labels; (2) Robustness: they are insensitive to interference factors; (3) Generality: they are applicable to various languages; and (4) Expressivity: they are effective at describing characters. Extensive experiments on standard benchmarks verify the advantages of strokelets and demonstrate the effectiveness of the proposed algorithm for text recognition.
ISSN:	1063-6919; 2575-7075
DOI:	10.1109/CVPR.2014.515