Implementation of Content Analysis System for Recognition of Journals Table of Contents
In this paper, we design the primary component, an automated segmentation method that extracts article titles, author names and page numbers from scanned image of a journal's table of contents. We expanded the method to five types of journal layout. An interactive tool was also developed with V...
Saved in:
Published in | Ninth International Conference on Document Analysis and Recognition (ICDAR 2007) Vol. 2; pp. 1018 - 1022 |
---|---|
Main Authors | , |
Format | Conference Proceeding |
Language | English |
Published |
IEEE
01.09.2007
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | In this paper, we design the primary component, an automated segmentation method that extracts article titles, author names and page numbers from scanned image of a journal's table of contents. We expanded the method to five types of journal layout. An interactive tool was also developed with Visual C++ to help the user manage the segmentation results, and manually control the table analysis. Its performance was tested over a panel of seventy images, which gather the most common types of layout encountered. |
---|---|
ISBN: | 0769528228 9780769528229 |
ISSN: | 1520-5363 2379-2140 |
DOI: | 10.1109/ICDAR.2007.4377069 |