Named-Entity Dataset for Medieval Latin, Middle High German and Old Norse
Published in | Journal of open humanities data Vol. 7 |
---|---|
Format | Journal Article |
Language | English |
Published | Ubiquity Press, 06.10.2021 |
Summary: | We present a dataset of named entities in three languages: Medieval Latin, Middle High German and Old Norse. The dataset, containing proper nouns of persons and places, was originally created to extract characters from three related medieval texts. Since the annotations cover low-resource pre-modern languages, the dataset may be important for building named-entity recognition tools for languages with little data and high linguistic variation. |
---|---|
ISSN: | 2059-481X |
DOI: | 10.5334/johd.36 |