Predicting results for input data based on a model generated from clusters

A method for predicting results for input data based on a model that is generated based on clusters of related characters, clusters of related segments, and training data. The method comprises receiving a data set that includes a plurality of words in a particular language. In the particular languag...

Full description

Saved in:
Bibliographic Details
Main Author PENG FUCHUN
Format Patent
LanguageEnglish
Published 06.12.2007
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:A method for predicting results for input data based on a model that is generated based on clusters of related characters, clusters of related segments, and training data. The method comprises receiving a data set that includes a plurality of words in a particular language. In the particular language, words are formed by characters. Clusters of related characters are formed from the data set. A model is generated based at least on the clusters of related characters and training data. The model may also be based on the clusters of related segments. The training data includes a plurality of entries, wherein each entry includes a character and a designated result for said character. A set of input data that includes characters that have not been associated with designated results is received. The model is applied to the input data to determine predicted results for characters within the input data.
Bibliography:Application Number: US20060445587