Named-Entity Dataset for Medieval Latin, Middle High German and Old Norse

Bibliographic Details
Published in: Journal of Open Humanities Data, Vol. 7
Main Authors: Besnier, Clément; Mattingly, William
Format: Journal Article
Language: English
Published: Ubiquity Press, 06.10.2021
Summary: We present a dataset of named entities in three languages: Medieval Latin, Middle High German and Old Norse. The dataset, which contains proper nouns for persons and places, was originally created to extract characters from three related medieval texts. Because the annotations cover low-resource pre-modern languages, the dataset may be useful for building named-entity recognition tools for languages with little data and high linguistic variation.
ISSN: 2059-481X
DOI: 10.5334/johd.36