Interesting Linguistic Features in Coreference Annotation of an Inflectional Language

This paper reports on linguistic features and decisions that we find vital in the process of annotation and resolution of coreference for highly inflectional languages. The presented results have been collected during preparation of a corpus of general direct nominal coreference of Polish. Starting...

Full description

Saved in:
Bibliographic Details
Published inChinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data pp. 97 - 108
Main Authors Ogrodniczuk, Maciej, Głowińska, Katarzyna, Kopeć, Mateusz, Savary, Agata, Zawisławska, Magdalena
Format Book Chapter
LanguageEnglish
Published Berlin, Heidelberg Springer Berlin Heidelberg 2013
SeriesLecture Notes in Computer Science
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:This paper reports on linguistic features and decisions that we find vital in the process of annotation and resolution of coreference for highly inflectional languages. The presented results have been collected during preparation of a corpus of general direct nominal coreference of Polish. Starting from the notion of a mention, its borders and potential vs. actual referentiality, we discuss the problem of complete and near-identity, zero subjects and dominant expressions. We also present interesting linguistic cases influencing the coreference resolution such as the difference between semantic and syntactic heads or the phenomenon of coreference chains made of indefinite pronouns.
Bibliography:The work reported here was carried out within the Computer-based methods for coreference resolution in Polish texts (CORE) project financed by the Polish National Science Centre (contract number 6505/B/T02/2011/40). The paper is also co-founded by the European Union from resources of the European Social Fund, Project PO KL “Information technologies: Research and their interdisciplinary applications”.
ISBN:9783642414909
3642414907
ISSN:0302-9743
1611-3349
DOI:10.1007/978-3-642-41491-6_10