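Korpus XIX w. Uniwersytetu Warszawskiego i IJP PAN (The 19th-Century Corpus of the University of Warsaw and the Institute of Polish Language of the Polish Academy of Sciences)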

Bibliographic Details
Published in LingVaria Vol. 18; no. 35; pp. 125-134
Main Authors Łaziński, Marek; Górski, Rafał L.; Woźniak, Michał
Format Journal Article
Language Polish, English, German
Published KSIĘGARNIA AKADEMICKA Sp. z o.o., 16.05.2023
Academia Publishers
Ksiegarnia Akademicka Publishing
Summary: The article describes a historical corpus that documents the 19th and early 20th centuries. The corpus was created as part of a research grant whose objective was to investigate the development of the aspectual system of Polish over the last 250 years against the background of Czech and Russian. An important resource for this investigation was a database of aspectual triplets, which in turn was based on materials such as text corpora. Because no large corpus of the 19th and early 20th centuries was available, this gap had to be filled. In the course of the project, such a corpus was created, and it is now publicly accessible without restrictions. This comprehensive corpus contains over 12 million contemporary words. Its texts originate from major Polish virtual libraries. It is POS-tagged with a tagger designed for 19th-century texts. A web-based concordancer, an adapted version of ParaVoz, allows the corpus to be queried. Queries may be constrained by metadata.
ISSN: 1896-2122, 2392-1226
DOI: 10.12797/LV.18.2023.35.09