Natural language processing using an ontology-based concept embedding model

A method 300, system, or program for a computer system to perform natural language processing (NLP) by generating a vector space model based on an ontology of concepts 310: training examples are extracted for concepts of a hierarchical ontology (200, fig.2A), wherein the training examples are based...

Full description

Saved in:
Bibliographic Details
Main Authors Brendan Bull, Paul Lewis Felt, Andrew Hicks
Format Patent
LanguageEnglish
Published 13.09.2023
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:A method 300, system, or program for a computer system to perform natural language processing (NLP) by generating a vector space model based on an ontology of concepts 310: training examples are extracted for concepts of a hierarchical ontology (200, fig.2A), wherein the training examples are based on neighbouring concepts 330; a plurality of vectors, each including one or more features, are initialized 340 and each vector corresponds to a concept, then a vector space model is generated by iteratively modifying the concept vectors to optimize a loss function 350; finally, natural language processing 360 is performed using the vector space model. Concepts can be assessed using cosine similarity between a concept vector and a mean vector of at least one of each of a parent concept and child concept. The vector space model may be a continuous bag of words model. The iterative optimisation can use both positive and negative training examples in order to optimise the loss function. The adjustment of vectors may also use a gradient descent algorithm.
Bibliography:Application Number: GB202308265