Word recognition of text undergoing an OCR process

Bibliographic Details
Main Authors: NIJEMCEVIC DJORDJE, CIMPOI MIRCEA, ANTONIJEVIC ALEKSANDAR, MITIC IVAN
Format: Patent
Language: Chinese, English
Published: 09.11.2011
Summary: The invention relates to word recognition of text undergoing an OCR process. A method for identifying words in a textual image undergoing optical character recognition includes receiving (410) a bitmap of an input image (15) that includes textual lines which have been segmented by a plurality of chop lines. Each chop line is associated with a confidence level reflecting the degree to which that chop line properly segments the textual line into individual characters. One or more words are identified (420) in one of the textual lines based at least in part on the textual lines and a first subset of the plurality of chop lines, namely those having a chop line confidence level above a first threshold value. If (430) the first word is not associated with a sufficiently high word confidence level, at least a second word in the textual line is identified (440) based at least in part on a second subset of the plurality of chop lines, namely those having a confidence level above a second threshold value lower than the first threshold
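
The two-threshold flow described in the summary can be illustrated with a minimal Python sketch. It is not the patented implementation: the names ChopLine, select_chops and recognize_with_fallback, the threshold values 0.9 and 0.6, the word confidence cutoff 0.8, and the toy recognizer are all assumptions made for illustration. The word recognizer itself is passed in as a hypothetical callable, since the summary does not say how words are scored. Returning the best-scoring attempt when no pass is confident enough is likewise a design choice of this sketch.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence, Tuple


@dataclass(frozen=True)
class ChopLine:
    x: int             # horizontal position of the cut within the textual line
    confidence: float  # degree to which the cut separates two characters


# Hypothetical recognizer: takes cut positions, returns (word, word_confidence).
Recognizer = Callable[[Sequence[int]], Tuple[str, float]]


def select_chops(chops: Sequence[ChopLine], threshold: float) -> List[int]:
    """Return the positions of chop lines whose confidence is above the threshold."""
    return sorted(c.x for c in chops if c.confidence > threshold)


def recognize_with_fallback(
    chops: Sequence[ChopLine],
    recognize: Recognizer,
    thresholds: Sequence[float] = (0.9, 0.6),  # assumed values, highest first
    min_word_confidence: float = 0.8,          # assumed cutoff
) -> Tuple[str, float]:
    """Try the strictest chop-line subset first, then relax the threshold."""
    best: Tuple[str, float] = ("", 0.0)
    for threshold in thresholds:
        word, word_confidence = recognize(select_chops(chops, threshold))
        if word_confidence > best[1]:
            best = (word, word_confidence)
        if word_confidence >= min_word_confidence:
            return word, word_confidence  # confident enough, stop relaxing
    return best  # no pass was confident enough; return the best attempt


# Toy recognizer (illustration only): confident only when given both true cuts.
def toy_recognize(cuts: Sequence[int]) -> Tuple[str, float]:
    if list(cuts) == [10, 20]:
        return ("cat", 0.95)
    return ("c?t", 0.4)


chops = [ChopLine(10, 0.95), ChopLine(15, 0.3), ChopLine(20, 0.7)]
result = recognize_with_fallback(chops, toy_recognize)
```

In this toy run the first pass keeps only the cut at position 10, so the recognizer is unsure; the second pass admits the cut at position 20 and the word is accepted.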
Bibliography: Application Number: CN201110117322