Providing capitalization correction for unstructured excerpts

Providing capitalization correction for unstructured excerpts is described. An excerpt of unstructured content is tokenized into a set of words. The set of words is analyzed for correct capitalization. Individual characters constituting at least one such word in the set of words are evaluated. The a...

Full description

Saved in:
Bibliographic Details
Main Author ROHRS CHRISTOPHER
Format Patent
LanguageEnglish
Published 11.11.2008
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Providing capitalization correction for unstructured excerpts is described. An excerpt of unstructured content is tokenized into a set of words. The set of words is analyzed for correct capitalization. Individual characters constituting at least one such word in the set of words are evaluated. The at least one such word is skipped if determined to be of a predefined type.
Bibliography:Application Number: US20030716951