Pixel-Accurate Representation and Evaluation of Page Segmentation in Document Images

Bibliographic Details
Published in 18th International Conference on Pattern Recognition (ICPR'06) Vol. 1; pp. 872-875
Main Authors Shafait, F., Keysers, D., Breuel, T.M.
Format Conference Proceeding
Language English
Published IEEE 2006
Summary: This paper presents a new representation and evaluation procedure of page segmentation algorithms and analyzes six widely-used layout analysis algorithms using the procedure. The method permits a detailed analysis of the behavior of page segmentation algorithms in terms of over- and undersegmentation at different layout levels, as well as determination of the geometric accuracy of the segmentation. The representation of document layouts relies on labeling each pixel according to its function in the overall segmentation, permitting pixel-accurate representation of layout information of arbitrary layouts and allowing background pixels to be classified as "don't care". Our representations can be encoded easily in standard color image formats like PNG, permitting easy interchange of segmentation results and ground truth.
ISBN: 0769525210, 9780769525211
ISSN: 1051-4651, 2831-7475
DOI: 10.1109/ICPR.2006.934
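
The pixel-labeling idea in the summary can be illustrated with a minimal sketch. It is not the authors' encoding: the channel assignment (red = column, green = paragraph, blue = text line), the white "don't care" color, the rectangular regions, and all function names below are assumptions made for illustration only. The record states just that each pixel carries a label and that the result fits in a standard color format such as PNG.

```python
import struct
import zlib

# Assumed convention: white marks background pixels classified as "don't care".
DONT_CARE = (255, 255, 255)


def encode_label(column, paragraph, line):
    """Pack three layout-level indices into one RGB triple (hypothetical scheme)."""
    for value in (column, paragraph, line):
        if not 0 <= value <= 254:
            raise ValueError("each index must fit in 0..254")
    return (column, paragraph, line)


def decode_label(pixel):
    """Return (column, paragraph, line), or None for a don't-care pixel."""
    return None if tuple(pixel) == DONT_CARE else tuple(pixel)


def paint(width, height, regions):
    """Build a label image as rows of RGB triples.

    regions is a list of ((x0, y0, x1, y1), label) with half-open bounds.
    Rectangles keep the sketch short; a per-pixel labeling can have any shape.
    """
    image = [[DONT_CARE] * width for _ in range(height)]
    for (x0, y0, x1, y1), label in regions:
        for y in range(y0, y1):
            for x in range(x0, x1):
                image[y][x] = label
    return image


def _chunk(tag, data):
    """Serialize one PNG chunk: length, tag, data, CRC over tag and data."""
    crc = zlib.crc32(tag + data) & 0xFFFFFFFF
    return struct.pack(">I", len(data)) + tag + data + struct.pack(">I", crc)


def to_png_bytes(image):
    """Serialize the label image as an 8-bit RGB PNG (filter type 0 on every row)."""
    height, width = len(image), len(image[0])
    raw = b"".join(
        b"\x00" + bytes(channel for pixel in row for channel in pixel)
        for row in image
    )
    header = struct.pack(">IIBBBBB", width, height, 8, 2, 0, 0, 0)
    return (
        b"\x89PNG\r\n\x1a\n"
        + _chunk(b"IHDR", header)
        + _chunk(b"IDAT", zlib.compress(raw))
        + _chunk(b"IEND", b"")
    )


# Usage: one text line in column 1, paragraph 1, covering two pixels of row 0.
label_image = paint(4, 2, [((0, 0, 2, 1), encode_label(1, 1, 1))])
png_data = to_png_bytes(label_image)
```

Because PNG is lossless, labels written this way survive a round trip unchanged, which is what makes an image file usable as an interchange format for ground truth and segmentation results.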