Massively Multi-Lingual Event Understanding: Extraction, Visualization, and Search

Bibliographic Details
Published in: arXiv.org
Main Authors: Jenkins, Chris; Agarwal, Shantanu; Barry, Joel; Fincke, Steven; Boschee, Elizabeth
Format: Paper
Language: English
Published: Ithaca: Cornell University Library, arXiv.org, 17.05.2023
Summary: In this paper, we present ISI-Clear, a state-of-the-art, cross-lingual, zero-shot event extraction system and accompanying user interface for event visualization and search. Using only English training data, ISI-Clear makes global events available on demand, processing user-supplied text in 100 languages ranging from Afrikaans to Yiddish. We provide multiple event-centric views of extracted events, including both a graphical representation and a document-level summary. We also integrate existing cross-lingual search algorithms with event extraction capabilities to provide cross-lingual event-centric search, allowing English-speaking users to search over events automatically extracted from a corpus of non-English documents, using either English natural language queries (e.g., "cholera outbreaks in Iran") or structured queries (e.g., "find all events of type Disease-Outbreak with agent cholera and location Iran").
ISSN: 2331-8422
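
Illustrative note: the structured query mentioned in the summary (events of type Disease-Outbreak with agent cholera and location Iran) can be pictured as a filter over extracted event records. The sketch below is only an assumption about what such a filter might look like; the `Event` class, the `search` function, the role names, and the sample records are all hypothetical and are not taken from ISI-Clear. It also assumes event arguments have already been rendered in English.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Event:
    """Hypothetical extracted event; not the ISI-Clear schema."""
    event_type: str
    arguments: Dict[str, str] = field(default_factory=dict)  # role -> English text
    source_language: str = "unknown"


def matches(event: Event, event_type: str, **roles: str) -> bool:
    """True if the event has the given type and every requested role value."""
    if event.event_type != event_type:
        return False
    return all(
        event.arguments.get(role, "").lower() == value.lower()
        for role, value in roles.items()
    )


def search(events: List[Event], event_type: str, **roles: str) -> List[Event]:
    """Return the events that satisfy a structured query."""
    return [e for e in events if matches(e, event_type, **roles)]


# Made-up sample records, for illustration only.
EVENTS = [
    Event("Disease-Outbreak", {"agent": "cholera", "location": "Iran"}, "fa"),
    Event("Disease-Outbreak", {"agent": "measles", "location": "Iran"}, "fa"),
    Event("Protest", {"location": "Iran"}, "ar"),
]

results = search(EVENTS, "Disease-Outbreak", agent="cholera", location="Iran")
```

Here `results` holds only the first sample record. A natural language query such as "cholera outbreaks in Iran" would need a separate retrieval step, which this sketch does not attempt.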