InteractiveIE: Towards Assessing the Strength of Human-AI Collaboration in Improving the Performance of Information Extraction

Learning template based information extraction from documents is a crucial yet difficult task. Prior template-based IE approaches assume foreknowledge of the domain templates; however, real-world IE do not have pre-defined schemas and it is a figure-out-as you go phenomena. To quickly bootstrap temp...

Full description

Saved in:
Bibliographic Details
Published inarXiv.org
Main Authors Mondal, Ishani, Yuan, Michelle, Anandhavelu, N, Garimella, Aparna, Ferraro, Francis, Blair-Stanek, Andrew, Benjamin Van Durme, Boyd-Graber, Jordan
Format Paper
LanguageEnglish
Published Ithaca Cornell University Library, arXiv.org 17.11.2023
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Learning template based information extraction from documents is a crucial yet difficult task. Prior template-based IE approaches assume foreknowledge of the domain templates; however, real-world IE do not have pre-defined schemas and it is a figure-out-as you go phenomena. To quickly bootstrap templates in a real-world setting, we need to induce template slots from documents with zero or minimal supervision. Since the purpose of question answering intersect with the goal of information extraction, we use automatic question generation to induce template slots from the documents and investigate how a tiny amount of a proxy human-supervision on-the-fly (termed as InteractiveIE) can further boost the performance. Extensive experiments on biomedical and legal documents, where obtaining training data is expensive, reveal encouraging trends of performance improvement using InteractiveIE over AI-only baseline.
ISSN:2331-8422