Jointly learning word embeddings using a corpus and a knowledge base

Methods for representing the meaning of words in vector spaces purely using the information distributed in text corpora have proved to be very valuable in various text mining and natural language processing (NLP) tasks. However, these methods still disregard the valuable semantic relational structur...

Full description

Saved in:

Bibliographic Details
Published in	PLOS ONE Vol. 13; no. 3; p. e0193094
Main Authors	Alsuhaibani, Mohammed, Bollegala, Danushka, Maehara, Takanori, Kawarabayashi, Ken-ichi
Format	Journal Article
Language	English
Published	United States Public Library of Science (PLoS) 12.03.2018 Public Library of Science
Subjects	Algorithms Analysis Biology and Life Sciences Cats Computer and Information Sciences Computer science Data Mining Data Mining - methods Distance learning Humans Knowledge base Knowledge Bases Knowledge bases (artificial intelligence) Knowledge representation Learning Lexicology Linguistics Medicine Methods Natural Language Processing Objective function Physical Sciences Q R Research and Analysis Methods Research Article Science Semantics Social Sciences Texts Vector spaces United Kingdom > UK Japan
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Methods for representing the meaning of words in vector spaces purely using the information distributed in text corpora have proved to be very valuable in various text mining and natural language processing (NLP) tasks. However, these methods still disregard the valuable semantic relational structure between words in co-occurring contexts. These beneficial semantic relational structures are contained in manually-created knowledge bases (KBs) such as ontologies and semantic lexicons, where the meanings of words are represented by defining the various relationships that exist among those words. We combine the knowledge in both a corpus and a KB to learn better word embeddings. Specifically, we propose a joint word representation learning method that uses the knowledge in the KBs, and simultaneously predicts the co-occurrences of two words in a corpus context. In particular, we use the corpus to define our objective function subject to the relational constrains derived from the KB. We further utilise the corpus co-occurrence statistics to propose two novel approaches, Nearest Neighbour Expansion (NNE) and Hedged Nearest Neighbour Expansion (HNE), that dynamically expand the KB and therefore derive more constraints that guide the optimisation process. Our experimental results over a wide-range of benchmark tasks demonstrate that the proposed method statistically significantly improves the accuracy of the word embeddings learnt. It outperforms a corpus-only baseline and reports an improvement of a number of previously proposed methods that incorporate corpora and KBs in both semantic similarity prediction and word analogy detection tasks.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23 Competing Interests: The authors have declared that no competing interests exist.
ISSN:	1932-6203 1932-6203
DOI:	10.1371/journal.pone.0193094