USING NATURAL LANGUAGE PROCESSING (NLP) TO CREATE SUBJECT MATTER SYNONYMS FROM DEFINITIONS

Methods, apparatus and systems, including computer program products, for creating subject matter synonyms from definitions extracted from a subject matter glossary. Confidence scores, each representing a likelihood that two terms defined in the subject matter glossary are synonyms, are determined by...

Full description

Saved in:
Bibliographic Details
Main Authors GERARD SCOTT N, MEGERIAN MARK G
Format Patent
LanguageEnglish
Published 16.06.2016
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Methods, apparatus and systems, including computer program products, for creating subject matter synonyms from definitions extracted from a subject matter glossary. Confidence scores, each representing a likelihood that two terms defined in the subject matter glossary are synonyms, are determined by applying natural language processing (e.g., passage term matching, lexical matching, and syntactic matching) to the extracted definitions. A subject matter thesaurus is built based on the confidence scores. In one embodiment, a statement containing a first term is created based on an extracted definition of the first term, a modified statement is created by substituting a second term in the statement in lieu of the first term, a corpus is searched, and a confidence score is determined based on evidence in the corpus that the modified statement is accurate. The first and second terms are marked as synonyms if the confidence score is greater than a threshold.
Bibliography:Application Number: US201615043447