EXTRACTING METHOD FOR CHARACTER SIZE INFORMATION
Format | Patent |
---|---|
Language | English |
Published | 21.01.1992 |
Summary: | PURPOSE: To extract highly accurate character size information by segmenting character lines and then rectangular areas (tentative characters), obtaining a histogram (distribution) of the widths and heights of the rectangles, and extracting from that distribution information on the character size of a document or the end of a paragraph, and on the deformation rates of characters. CONSTITUTION: When a document is input as an image, the image is projected in the vertical direction, and the parts containing black picture elements are segmented using a threshold value to separate the lines. The document image is then projected in the horizontal direction within each segmented line, and the parts containing black picture elements are segmented to separate rectangles. The image is further projected vertically within each rectangle, and the parts containing black picture elements are segmented to obtain a circumscribed rectangle 11 (tentative character). Next, 'en' characters are selected with the line width defined as a tentative standard character size, and the segmented rectangles are connected together or separated from each other to produce combined characters. Multiple lines are processed in this way to obtain circumscribed rectangles, and the distribution of the rectangle widths and heights is then used to extract highly accurate character size information. |
---|---|
Bibliography: | Application Number: JP19900111754 |
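
The projection-and-segmentation steps in the summary can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes a binary image stored as a list of rows (1 = black picture element) and horizontally written text, so lines are found from per-row pixel counts and rectangles from per-column counts; for vertical writing the two axes would be swapped. The names `runs`, `tentative_characters`, and `size_histograms` are hypothetical, the threshold is assumed to be non-negative, and the step that connects or separates rectangles around 'en' characters is omitted.

```python
from collections import Counter


def runs(profile, threshold=0):
    """Return half-open (start, end) intervals where profile > threshold."""
    out, start = [], None
    for i, value in enumerate(profile):
        if value > threshold and start is None:
            start = i
        elif value <= threshold and start is not None:
            out.append((start, i))
            start = None
    if start is not None:
        out.append((start, len(profile)))
    return out


def tentative_characters(image, threshold=0):
    """Return circumscribed rectangles as (top, left, height, width).

    Assumption: horizontal text, so lines come from row sums and
    rectangles within a line come from column sums.
    """
    boxes = []
    row_profile = [sum(row) for row in image]
    for top, bottom in runs(row_profile, threshold):
        band = image[top:bottom]
        col_profile = [sum(col) for col in zip(*band)]
        for left, right in runs(col_profile, threshold):
            # Project again inside the rectangle to tighten its height.
            # Threshold 0 here: any black pixel counts, so spans is non-empty.
            inner = [sum(row[left:right]) for row in band]
            spans = runs(inner, 0)
            t, b = spans[0][0], spans[-1][1]
            boxes.append((top + t, left, b - t, right - left))
    return boxes


def size_histograms(boxes):
    """Return (width histogram, height histogram) over the rectangles."""
    widths = Counter(w for _, _, _, w in boxes)
    heights = Counter(h for _, _, h, _ in boxes)
    return widths, heights
```

Calling `size_histograms(tentative_characters(image))` yields two counters; under the assumptions above, the most frequent width and height would serve as a rough estimate of the standard character size.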