Arabic Search Results Disambiguation: A Set of Benchmarks

Bibliographic Details
Published in	Arabic Language Processing: From Theory to Practice, pp. 276-291
Main Authors	Salhi, Haytham; Jarrar, Radi; Yahya, Adnan
Format	Book Chapter
Language	English
Published	Cham: Springer International Publishing
Series	Communications in Computer and Information Science
Subjects	Ambiguous Arabic queries; Arabic SRC; Search results clustering; Search results disambiguation

Summary:	Web search engines aim to retrieve relevant results in response to a user's information need. The query expressing that need can be ambiguous, potentially referring to several different meanings or senses. Search results clustering (SRC) attempts to disambiguate query results by grouping them into sense-relevant clusters. Little research has been done on Arabic SRC, and one important reason may be the lack of quality benchmarks for SRC testing and evaluation. The main contribution of this paper is a set of benchmarks for Arabic SRC, called AMBIGArabic, intended to aid in performing SRC experiments. The benchmarks include manually labeled datasets and a dataset based on blind relevance feedback (BRF). The authors used the benchmarks in a series of SRC experiments and report encouraging results. The benchmarks are being made available to researchers working on Arabic SRC.
ISBN:	3030329585 9783030329587
ISSN:	1865-0929 1865-0937
DOI:	10.1007/978-3-030-32959-4_20