Word recognition of text undergoing an OCR process
Main Authors | , , , |
---|---|
Format | Patent |
Language | Chinese, English |
Published | 09.11.2011 |
Summary: | The invention discloses word recognition of text undergoing an OCR process. A method for identifying words in a textual image undergoing optical character recognition includes receiving (410) a bitmap of an input image (15) that includes textual lines which have been segmented by a plurality of chop lines. Each chop line is associated with a confidence level reflecting the degree to which that chop line properly segments the textual line into individual characters. One or more words are identified (420) in one of the textual lines based at least in part on the textual lines and a first subset of the plurality of chop lines, namely those with a chop line confidence level above a first threshold value. If (430) the first identified word is not associated with a sufficiently high word confidence level, at least a second word in the textual line is identified (440) based at least in part on a second subset of the plurality of chop lines, namely those with a confidence level above a second threshold value that is lower than the first threshold value. |
---|---|
Bibliography: | Application Number: CN201110117322 |
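
The two-threshold procedure described in the summary can be sketched roughly as follows. This is only an illustrative sketch, not the patent's implementation: `ChopLine`, `select_chops`, `identify_words`, `toy_recognizer`, and all numeric values are hypothetical names and figures chosen for the example, and the rule "retry if any word in the line falls below the minimum word confidence" is a simplifying assumption about how the check at step 430 might be applied.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence, Tuple


@dataclass(frozen=True)
class ChopLine:
    """A candidate cut between characters (hypothetical structure)."""
    x: int             # horizontal position of the cut in the text line
    confidence: float  # how well the cut is believed to separate characters


Word = Tuple[str, float]  # (recognized text, word confidence)
Recognizer = Callable[[Sequence[ChopLine]], List[Word]]


def select_chops(chops: Sequence[ChopLine], threshold: float) -> List[ChopLine]:
    """Keep only chop lines whose confidence is above the threshold."""
    return [c for c in chops if c.confidence > threshold]


def identify_words(
    chops: Sequence[ChopLine],
    recognize: Recognizer,
    first_threshold: float,
    second_threshold: float,
    min_word_confidence: float,
) -> List[Word]:
    """Try the high-confidence chop lines first; relax the threshold if needed."""
    if second_threshold >= first_threshold:
        raise ValueError("second_threshold must be lower than first_threshold")

    # Step 420: identify words using the first (stricter) subset of chop lines.
    words = recognize(select_chops(chops, first_threshold))

    # Step 430: accept the result only if every word is confident enough
    # (assumption: one weak word triggers a retry for the whole line).
    if words and all(conf >= min_word_confidence for _, conf in words):
        return words

    # Step 440: identify words again using the second (larger) subset.
    return recognize(select_chops(chops, second_threshold))


def toy_recognizer(chops: Sequence[ChopLine]) -> List[Word]:
    """Stand-in recognizer: pretends three cuts are needed to read the word."""
    if len(chops) >= 3:
        return [("word", 0.9)]
    return [("wd", 0.3)]


if __name__ == "__main__":
    line_chops = [ChopLine(10, 0.95), ChopLine(20, 0.6), ChopLine(30, 0.92)]
    print(identify_words(line_chops, toy_recognizer, 0.9, 0.5, 0.8))
```

In this toy run the strict threshold keeps only two chop lines, the stand-in recognizer returns a low-confidence word, and the second pass with the lower threshold admits the third chop line. A real system would pass the line bitmap to the recognizer as well; it is left out here to keep the sketch short.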