Arabic Search Results Disambiguation: A Set of Benchmarks
Published in | Arabic Language Processing: From Theory to Practice, pp. 276-291 |
---|---|
Main Authors | , , |
Format | Book Chapter |
Language | English |
Published | Cham, Springer International Publishing |
Series | Communications in Computer and Information Science |
Subjects | |
Summary: | Web search engines aim to retrieve results relevant to a user's information need. The query expressing that need can be ambiguous, potentially referring to several different meanings or senses. Search results clustering (SRC) attempts to disambiguate query results by grouping them into clusters, one per sense. Little research has been done on Arabic SRC, and one important reason may be the lack of quality benchmarks for SRC testing and evaluation.
The main contribution of this paper is a set of benchmarks for Arabic SRC, called AMBIGArabic, intended to support SRC experiments. The benchmarks include manually labeled datasets and a dataset based on blind relevance feedback (BRF). The authors used the benchmarks in a series of SRC experiments and report encouraging results. The benchmarks are being made available to researchers working on Arabic SRC. |
---|---|
ISBN: | 3030329585 9783030329587 |
ISSN: | 1865-0929 1865-0937 |
DOI: | 10.1007/978-3-030-32959-4_20 |