Heuristic Algorithm for Zero Subject Detection in Polish

This article describes a heuristic approach to zero subject detection in Polish. It focuses on the zero subject detection as a crucial step in end-to-end coreference resolution. The zero subject verbs are recognized using a set of manually created rules utilizing information from different sources,...

Full description

Saved in:
Bibliographic Details
Published inText, Speech, and Dialogue pp. 378 - 386
Main Authors Kaczmarek, Adam, Marcińczuk, Michał
Format Book Chapter
LanguageEnglish
Published Cham Springer International Publishing 2015
SeriesLecture Notes in Computer Science
Subjects
Online AccessGet full text
ISBN3319240323
9783319240329
ISSN0302-9743
1611-3349
DOI10.1007/978-3-319-24033-6_43

Cover

Loading…
More Information
Summary:This article describes a heuristic approach to zero subject detection in Polish. It focuses on the zero subject detection as a crucial step in end-to-end coreference resolution. The zero subject verbs are recognized using a set of manually created rules utilizing information from different sources, including: a dependency parser, a shallow relational parser and a valence dictionary. The rules were developed and evaluated on the Polish Coreference Corpus. The experimental results show that the presented method significantly outperforms the only machine learning-based alternative for Polish, i.e., MentionDetector. We also discuss and evaluate the importance of zero subject detection for existing coreference resolution tools for Polish.
ISBN:3319240323
9783319240329
ISSN:0302-9743
1611-3349
DOI:10.1007/978-3-319-24033-6_43