Systems and methods for spell correction of non-Roman characters and words

Bibliographic Details
Main Authors: WU JUN, ZHU HONGJUN, ZHU HUICAN, CHAN CHIU-KI, HUANG WEI-HWA
Format: Patent
Language: English
Published: 29.12.2005
Edition: 7
Summary: Systems and methods are disclosed for processing and correcting spelling errors in words written in non-Roman scripts, such as Chinese, Japanese, and Korean, using a rule-based classifier and a hidden Markov model. The method generally includes three steps. First, an input entry in a first language, such as Chinese, is converted to at least one intermediate entry in an intermediate representation, such as pinyin, that differs from the first language. Second, the intermediate entry is converted to at least one possible alternative spelling or form of the input in the first language. Third, the input entry is determined to be correct when a match for it is located among all possible alternative spellings, and questionable when no match is located. The questionable input entry may be classified using, for example, a transformation rule based classifier based on transformation rules generated by a transformation rules generator.
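The match step described in the summary can be sketched roughly as follows. This is only an illustrative sketch, not the patented implementation: the reading table `CHAR_READINGS`, the word list `LEXICON`, and the function names `to_intermediate`, `alternatives`, and `classify` are all hypothetical, and the rule-based classifier and hidden Markov model mentioned in the summary are not modeled here.

```python
from itertools import product

# Hypothetical toy data: toneless pinyin readings for a few characters.
CHAR_READINGS = {
    "北": ("bei",),
    "背": ("bei",),
    "京": ("jing",),
    "景": ("jing",),
    "经": ("jing",),
}

# Hypothetical lexicon of known-good words in the first language.
LEXICON = {"北京", "背景"}


def to_intermediate(entry):
    """Return every pinyin rendering of the entry as a tuple of syllables."""
    # Characters missing from the toy table stand in for themselves.
    readings = [CHAR_READINGS.get(ch, (ch,)) for ch in entry]
    return set(product(*readings))


def build_index(lexicon):
    """Map each pinyin rendering to the lexicon words that share it."""
    index = {}
    for word in lexicon:
        for key in to_intermediate(word):
            index.setdefault(key, set()).add(word)
    return index


INDEX = build_index(LEXICON)


def alternatives(entry):
    """Convert the entry to pinyin, then back to all known spellings."""
    found = set()
    for key in to_intermediate(entry):
        found |= INDEX.get(key, set())
    return found


def classify(entry):
    """Label the entry 'correct' if it matches an alternative spelling."""
    return "correct" if entry in alternatives(entry) else "questionable"


print(classify("北京"))  # the entry itself is among its alternatives
print(classify("北经"))  # same pinyin, but not a known spelling
```

In this sketch, an entry flagged as questionable would then be handed to a later stage (in the summary, the transformation rule based classifier), with `alternatives(entry)` supplying the candidate corrections.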
Bibliography: Application Number: US20040875449