Integrating document and data retrieval based on XML

Bibliographic Details
Published in: The VLDB Journal, Vol. 15, no. 1, pp. 53-83
Main Authors: BREMER, Jan-Marco; GERTZ, Michael
Format: Journal Article
Language: English
Published: Heidelberg, Springer, 2006
Summary: For querying structured and semistructured data, data retrieval and document retrieval are two valuable and complementary techniques that have not yet been fully integrated. In this paper, we introduce integrated information retrieval (IIR), an XML-based retrieval approach that closes this gap. We introduce the syntax and semantics of an extension of the XQuery language called XQuery/IR. The extended language realizes IIR and thereby allows users to formulate new kinds of queries by nesting ranked document retrieval and precise data retrieval queries. Furthermore, we detail index structures and efficient query processing approaches for implementing XQuery/IR. Based on a new identification scheme for nodes in node-labeled tree structures, the extended index structures require only a fraction of the space of comparable index structures that only support data retrieval.
ISSN: 1066-8888, 0949-877X
DOI: 10.1007/s00778-004-0150-4