Evolution of Semantic Similarity—A Survey

Estimating the semantic similarity between text data is one of the challenging and open research problems in the field of Natural Language Processing (NLP). The versatility of natural language makes it difficult to define rule-based methods for determining semantic similarity measures. To address th...

Full description

Saved in:

Bibliographic Details
Published in	ACM computing surveys Vol. 54; no. 2; pp. 1 - 37
Main Authors	Chandrasekaran, Dhivya, Mago, Vijay
Format	Journal Article
Language	English
Published	Baltimore Association for Computing Machinery 31.03.2022
Subjects	Artificial neural networks Computer science Evolution Natural language Natural language processing Semantics Similarity
Online Access	Get full text
ISSN	0360-0300 1557-7341
DOI	10.1145/3440755

Cover

Loading…

More Information
Summary:	Estimating the semantic similarity between text data is one of the challenging and open research problems in the field of Natural Language Processing (NLP). The versatility of natural language makes it difficult to define rule-based methods for determining semantic similarity measures. To address this issue, various semantic similarity methods have been proposed over the years. This survey article traces the evolution of such methods beginning from traditional NLP techniques such as kernel-based methods to the most recent research work on transformer-based models, categorizing them based on their underlying principles as knowledge-based, corpus-based, deep neural network–based methods, and hybrid methods. Discussing the strengths and weaknesses of each method, this survey provides a comprehensive view of existing systems in place for new researchers to experiment and develop innovative ideas to address the issue of semantic similarity.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14
ISSN:	0360-0300 1557-7341
DOI:	10.1145/3440755