FINDING AND DISAMBIGUATING REFERENCES TO ENTITIES ON WEB PAGES

A system and method for disambiguating references to entities in a document. In one embodiment, an iterative process is used to disambiguate references to entities in documents. An initial model is used to identify documents referring to an entity based on features contained in those documents. The...

Full description

Saved in:
Bibliographic Details
Main Authors JEVTIC NIKOLA, REYNAR JEFFREY, LAROCO, JR. LEONARDO A, YAKOVENKO NIKOLAI V
Format Patent
LanguageEnglish
Published 25.12.2014
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:A system and method for disambiguating references to entities in a document. In one embodiment, an iterative process is used to disambiguate references to entities in documents. An initial model is used to identify documents referring to an entity based on features contained in those documents. The occurrence of various features in these documents is measured. From the number occurrences of features in these documents, a second model is constructed. The second model is used to identify documents referring to the entity based on features contained in the documents. The process can be repeated, iteratively identifying documents referring to the entity and improving subsequent models based on those identifications. Additional features of the entity can be extracted from documents identified as referring to the entity.
Bibliography:Application Number: US201414457869