DCU and ISI@INEX 2010: Adhoc and Data-Centric Tracks

Bibliographic Details
Published in: Comparative Evaluation of Focused Retrieval, pp. 182-193
Main Authors: Ganguly, Debasis; Leveling, Johannes; Jones, Gareth J. F.; Palchowdhury, Sauparna; Pal, Sukomal; Mitra, Mandar
Format: Book Chapter
Language: English
Published: Berlin, Heidelberg: Springer Berlin Heidelberg
Series: Lecture Notes in Computer Science
Summary: We describe the participation of Dublin City University (DCU) and Indian Statistical Institute (ISI) in INEX 2010 for the Adhoc and Data-Centric tracks. The main contributions of this paper are: i) a simplified version of the Hierarchical Language Model (HLM), which scores each XML element by the combined probability of generating the given query from the element itself and from the top-level article node, is shown to outperform the baselines of LM and VSM scoring of XML elements; ii) Expectation Maximization (EM) feedback in LM is shown to be the most effective on the domain-specific IMDB collection; iii) automated removal of sentences indicating aspects of irrelevance from the narratives of INEX ad hoc topics is shown to improve retrieval effectiveness.
ISBN: 9783642235764; 364223576X
ISSN: 0302-9743; 1611-3349
DOI: 10.1007/978-3-642-23577-1_16