Heuristic Algorithm for Zero Subject Detection in Polish
This article describes a heuristic approach to zero subject detection in Polish. It focuses on the zero subject detection as a crucial step in end-to-end coreference resolution. The zero subject verbs are recognized using a set of manually created rules utilizing information from different sources,...
Saved in:
Published in | Text, Speech, and Dialogue pp. 378 - 386 |
---|---|
Main Authors | , |
Format | Book Chapter |
Language | English |
Published |
Cham
Springer International Publishing
2015
|
Series | Lecture Notes in Computer Science |
Subjects | |
Online Access | Get full text |
ISBN | 3319240323 9783319240329 |
ISSN | 0302-9743 1611-3349 |
DOI | 10.1007/978-3-319-24033-6_43 |
Cover
Loading…
Summary: | This article describes a heuristic approach to zero subject detection in Polish. It focuses on the zero subject detection as a crucial step in end-to-end coreference resolution. The zero subject verbs are recognized using a set of manually created rules utilizing information from different sources, including: a dependency parser, a shallow relational parser and a valence dictionary. The rules were developed and evaluated on the Polish Coreference Corpus. The experimental results show that the presented method significantly outperforms the only machine learning-based alternative for Polish, i.e., MentionDetector. We also discuss and evaluate the importance of zero subject detection for existing coreference resolution tools for Polish. |
---|---|
ISBN: | 3319240323 9783319240329 |
ISSN: | 0302-9743 1611-3349 |
DOI: | 10.1007/978-3-319-24033-6_43 |