CONFIDENCE LINKS BETWEEN NAME ENTITIES IN DISPARATE DOCUMENTS

The invention relates to cross-document entity co-reference systems in which naturally occurring entity mentions in a document corpus are analyzed and transformed into name clusters that represent global entities. In a first aspect of the invention, a name variation module analyzes naturally occurri...

Full description

Saved in:
Bibliographic Details
Main Authors FREEDMAN MARJORIE RUTH, BOSCHEE ELIZABETH MEGAN, BARON ALEX, WEISCHEDEL RALPH M
Format Patent
LanguageEnglish
Published 25.03.2010
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:The invention relates to cross-document entity co-reference systems in which naturally occurring entity mentions in a document corpus are analyzed and transformed into name clusters that represent global entities. In a first aspect of the invention, a name variation module analyzes naturally occurring names of entities extracted from the document corpus and provides an initial set of equivalent names that could refer to the same real world entity. In a second aspect of the invention, a disambiguation module takes the initial set of equivalent names and uses an agglomerative clustering algorithm to disambiguate the potentially co-referent named entities.
Bibliography:Application Number: US20080344871