Arabic Search Results Disambiguation: A Set of Benchmarks

Web search engines aim at retrieving relevant results in response to a user information need. The query expressing the user information need can be ambiguous by potentially referring to different meanings or senses. Search results clustering (SRC) attempts to disambiguate query results by grouping t...

Full description

Saved in:
Bibliographic Details
Published inArabic Language Processing: From Theory to Practice pp. 276 - 291
Main Authors Salhi, Haytham, Jarrar, Radi, Yahya, Adnan
Format Book Chapter
LanguageEnglish
Published Cham Springer International Publishing
SeriesCommunications in Computer and Information Science
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Web search engines aim at retrieving relevant results in response to a user information need. The query expressing the user information need can be ambiguous by potentially referring to different meanings or senses. Search results clustering (SRC) attempts to disambiguate query results by grouping them into groups of sense-relevant clusters. Little research was done on Arabic SRC, and one important reason may be the lack of quality benchmarks for SRC testing and evaluation. The main contribution of this paper is to introduce a set of benchmarks for Arabic SRC, called AMBIGArabic to aid in performing SRC experiments. The benchmarks include manually labeled datasets and a dataset based on blind relevance feedback (BRF). The designed benchmarks were used in a series of SRC experiments we performed and the results were encouraging. The benchmarks are being made available for use by researchers working on Arabic SRC.
ISBN:3030329585
9783030329587
ISSN:1865-0929
1865-0937
DOI:10.1007/978-3-030-32959-4_20