Semantic Relata for the Evaluation of Distributional Models in Mandarin Chinese

Bibliographic Details
Published in	IEEE Access Vol. 7; p. 1
Main Authors	Liu, Hongchao; Chersoni, Emmanuele; Klyueva, Natalia; Santus, Enrico; Huang, Chu-Ren
Format	Journal Article
Language	English
Published	Piscataway: IEEE, 01.01.2019; The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Subjects	Chinese languages; Computational modeling; Computational Semantics; Data mining; Datasets; Evaluation; Lexical Resources; Linguistics; Natural language processing; Ontologies; Relation Classification; Semantic Relations; Semantics; Similarity; Task analysis; Vector Space Models; Words (language)
Summary:	Distributional Semantic Models (DSMs) have established themselves as a standard for the representation of word and sentence meaning. However, DSMs provide only a quantitative measure of how strongly two linguistic expressions are related, without being able to automatically classify different semantic relations. Hence the notion of semantic similarity is underspecified in DSMs. We introduce EVALution-MAN in this paper as an effort to address this underspecification problem. Following the EVALution 1.0 dataset for English, we present a dataset for evaluating DSMs on the task of identifying semantic relations in Mandarin Chinese. Moreover, we test different types of word vectors on the automatic learning of these semantic relations, and we evaluate them in both an unsupervised and a supervised setting, finding that distributional models tend, in general, to assign higher similarity scores to synonyms and that deep learning classifiers perform best in the identification of semantic relations.
ISSN:	2169-3536
DOI:	10.1109/ACCESS.2019.2945061
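
Illustration:	The unsupervised setting mentioned in the summary can be pictured as scoring each word pair with a vector similarity and then comparing the average score per relation. The following is a minimal sketch under stated assumptions, not the authors' procedure: cosine similarity is assumed as the measure, the three-dimensional vectors are invented toy values, and the relation labels other than "synonym" as well as the names `cosine`, `mean_similarity_by_relation`, `TOY_VECTORS`, and `TOY_PAIRS` are hypothetical. None of it is drawn from the dataset itself.

```python
import math
from collections import defaultdict


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def mean_similarity_by_relation(pairs, vectors):
    """Average the cosine score of all word pairs that share a relation label.

    Pairs with a word missing from `vectors` are skipped.
    """
    scores = defaultdict(list)
    for word1, word2, relation in pairs:
        if word1 in vectors and word2 in vectors:
            scores[relation].append(cosine(vectors[word1], vectors[word2]))
    return {rel: sum(vals) / len(vals) for rel, vals in scores.items()}


# Invented toy vectors (assumption): real word embeddings would have hundreds of dimensions.
TOY_VECTORS = {
    "快乐": [0.9, 0.1, 0.0],
    "高兴": [0.8, 0.2, 0.1],
    "悲伤": [0.1, 0.9, 0.0],
    "情绪": [0.5, 0.5, 0.3],
}

# Hypothetical (word1, word2, relation) triples, for illustration only.
TOY_PAIRS = [
    ("快乐", "高兴", "synonym"),
    ("快乐", "悲伤", "antonym"),
    ("快乐", "情绪", "hypernym"),
]

if __name__ == "__main__":
    for relation, score in mean_similarity_by_relation(TOY_PAIRS, TOY_VECTORS).items():
        print(f"{relation}: {score:.3f}")
```

With real embeddings and annotated pairs, the same per-relation averages would show whether a model scores synonyms higher than other relations, which is the tendency the summary reports. A similarity score alone still does not say which relation holds, which is why the summary also describes a supervised classification setting.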