Document image decoding approach to character template estimation

An approach to supervised training of document-specific character templates from sample page images and unaligned transcriptions is presented. The template estimation problem is formulated as one of constrained maximum likelihood parameter estimation within the document image decoding (DID) framewor...

Full description

Saved in:

Bibliographic Details
Published in	Proceedings of 3rd IEEE International Conference on Image Processing Vol. 2; pp. 213 - 216 vol.2
Main Authors	Kopec, G.E., Lomelin, M.
Format	Conference Proceeding
Language	English
Published	IEEE 1996
Subjects	Error analysis Heart Image recognition Image segmentation Iterative algorithms Iterative decoding Labeling Maximum likelihood decoding Maximum likelihood estimation Parameter estimation
Online Access	Get full text

Cover

Loading…

More Information
Summary:	An approach to supervised training of document-specific character templates from sample page images and unaligned transcriptions is presented. The template estimation problem is formulated as one of constrained maximum likelihood parameter estimation within the document image decoding (DID) framework. This leads to a two-phase iterative training algorithm consisting of transcription alignment and aligned template estimation (ATE) steps. The ATE step is the heart of the algorithm and involves assigning template pixel colors to maximize likelihood while satisfying a template disjointness constraint. In one large-scale experiment, use of document-specific templates resulted in a character error rate that was about an order of magnitude less than that of a commercial omni-font OCR program.
ISBN:	9780780332591 0780332598
DOI:	10.1109/ICIP.1996.560730