LexEQUAL: supporting multilexical queries in SQL

Current database systems offer support for storing multilingual data, but are not capable of querying across languages, an important consideration in today's global economy. We therefore propose a new multilexical operator called LexEQUAL that extends the standard lexicographic matching in data...

Full description

Saved in:
Bibliographic Details
Published inProceedings. 20th International Conference on Data Engineering p. 845
Main Authors Kumaran, A., Haritsa, J.R.
Format Conference Proceeding
LanguageEnglish
Published Los Alamitos CA IEEE 2004
IEEE Computer Society
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Current database systems offer support for storing multilingual data, but are not capable of querying across languages, an important consideration in today's global economy. We therefore propose a new multilexical operator called LexEQUAL that extends the standard lexicographic matching in database systems to matching of text data across languages, specifically for names, which form close to twenty percent of text corpora. The implementation of the LexEQUAL operator is based on transforming matches in language space into parameterized approximate matches in the equivalent phoneme space. A detailed evaluation of our approach on a real data set shows that there exist settings of the algorithm parameters with which it is possible to achieve both good recall and precision.
ISBN:9780769520650
0769520650
ISSN:1063-6382
2375-026X
DOI:10.1109/ICDE.2004.1320075