LowResourceEval-2019: a shared task on morphological analysis for low-resource languages

Dialog 2019, Issue 18, Supplementary volume, Pp. 45-62 The paper describes the results of the first shared task on morphological analysis for the languages of Russia, namely, Evenki, Karelian, Selkup, and Veps. For the languages in question, only small-sized corpora are available. The tasks include...

Full description

Saved in:
Bibliographic Details
Main Authors Klyachko, Elena, Sorokin, Alexey, Krizhanovskaya, Natalia, Krizhanovsky, Andrew, Ryazanskaya, Galina
Format Journal Article
LanguageEnglish
Published 30.01.2020
Subjects
Online AccessGet full text

Cover

Loading…
Abstract Dialog 2019, Issue 18, Supplementary volume, Pp. 45-62 The paper describes the results of the first shared task on morphological analysis for the languages of Russia, namely, Evenki, Karelian, Selkup, and Veps. For the languages in question, only small-sized corpora are available. The tasks include morphological analysis, word form generation and morpheme segmentation. Four teams participated in the shared task. Most of them use machine-learning approaches, outperforming the existing rule-based ones. The article describes the datasets prepared for the shared tasks and contains analysis of the participants' solutions. Language corpora having different formats were transformed into CONLL-U format. The universal format makes the datasets comparable to other language corpura and facilitates using them in other NLP tasks.
AbstractList Dialog 2019, Issue 18, Supplementary volume, Pp. 45-62 The paper describes the results of the first shared task on morphological analysis for the languages of Russia, namely, Evenki, Karelian, Selkup, and Veps. For the languages in question, only small-sized corpora are available. The tasks include morphological analysis, word form generation and morpheme segmentation. Four teams participated in the shared task. Most of them use machine-learning approaches, outperforming the existing rule-based ones. The article describes the datasets prepared for the shared tasks and contains analysis of the participants' solutions. Language corpora having different formats were transformed into CONLL-U format. The universal format makes the datasets comparable to other language corpura and facilitates using them in other NLP tasks.
Author Klyachko, Elena
Krizhanovskaya, Natalia
Sorokin, Alexey
Krizhanovsky, Andrew
Ryazanskaya, Galina
Author_xml – sequence: 1
  givenname: Elena
  surname: Klyachko
  fullname: Klyachko, Elena
– sequence: 2
  givenname: Alexey
  surname: Sorokin
  fullname: Sorokin, Alexey
– sequence: 3
  givenname: Natalia
  surname: Krizhanovskaya
  fullname: Krizhanovskaya, Natalia
– sequence: 4
  givenname: Andrew
  surname: Krizhanovsky
  fullname: Krizhanovsky, Andrew
– sequence: 5
  givenname: Galina
  surname: Ryazanskaya
  fullname: Ryazanskaya, Galina
BackLink https://doi.org/10.48550/arXiv.2001.11285$$DView paper in arXiv
BookMark eNotz7FOwzAUQFEPMEDhA5jwDzg8O3bssKGqFKRISKgDW_Ti2GmEG1c2benfV5ROd7vSuSVXU5wcIQ8cCmmUgidMv-O-EAC84FwYdUO-mnj4dDnuknWLPQYmgNfPFGleY3I9_cH8TeNENzFt1zHEYbQYKE4YjnnM1MdEQzywdFnQgNOww8HlO3LtMWR3f-mMrF4Xq_kbaz6W7_OXhmGlFfPCe2mMFJWFXiLW3MtSeqg7bQDQc9FBZfqu07xyuqqVLQ061SsL3OvOlDPy-L8909ptGjeYju0fsT0TyxN_IE4D
ContentType Journal Article
Copyright http://creativecommons.org/licenses/by/4.0
Copyright_xml – notice: http://creativecommons.org/licenses/by/4.0
DBID AKY
GOX
DOI 10.48550/arxiv.2001.11285
DatabaseName arXiv Computer Science
arXiv.org
DatabaseTitleList
Database_xml – sequence: 1
  dbid: GOX
  name: arXiv.org
  url: http://arxiv.org/find
  sourceTypes: Open Access Repository
DeliveryMethod fulltext_linktorsrc
ExternalDocumentID 2001_11285
GroupedDBID AKY
GOX
ID FETCH-LOGICAL-a675-f2ff488426c0d4aa91f434f09b7800af12b068dbb716e7695c38ae5d5c01f7b83
IEDL.DBID GOX
IngestDate Mon Jan 08 05:49:05 EST 2024
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed false
IsScholarly false
Language English
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-a675-f2ff488426c0d4aa91f434f09b7800af12b068dbb716e7695c38ae5d5c01f7b83
OpenAccessLink https://arxiv.org/abs/2001.11285
ParticipantIDs arxiv_primary_2001_11285
PublicationCentury 2000
PublicationDate 2020-01-30
PublicationDateYYYYMMDD 2020-01-30
PublicationDate_xml – month: 01
  year: 2020
  text: 2020-01-30
  day: 30
PublicationDecade 2020
PublicationYear 2020
Score 1.7570131
SecondaryResourceType preprint
Snippet Dialog 2019, Issue 18, Supplementary volume, Pp. 45-62 The paper describes the results of the first shared task on morphological analysis for the languages of...
SourceID arxiv
SourceType Open Access Repository
SubjectTerms Computer Science - Computation and Language
Title LowResourceEval-2019: a shared task on morphological analysis for low-resource languages
URI https://arxiv.org/abs/2001.11285
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwdV07T8MwED61nVgQCFB5ygNrhO3YTsKGEKVCPJYiZavs2BaIkqAmUH4-5zwEC6tjWcrZl-_z5e47gPMi4Vp4dCSmYx0JHnxOKBe6prqg96XSJNQOPzyq-bO4y2U-AjLUwuj19-tXpw9s6ouQ8BOKXFI5hjHnIWXr9invfk62Ulz9_N95yDHboT8gMduB7Z7dkatuO3Zh5Mo9yO-rzRAlR-K6wiPDskuiSf0Ssr9Jo-s3UpXkvcJ3Hr5FRPdqIQRZJVlVm2jdL0GGEGO9D4vZzeJ6HvUNDSKNvDzy3Hv0F8TEglqhdca8iIWnmUmQtmnPuKEqtcbgHcYlKpNFnGonrSwo84lJ4wOYlFXppkA8wq5TBZI7mwmPLA4dMxE0ll5YBBh7CNPWDMuPTrMidJtky9ZCR_8_OoYtHq6TNISkTmDSrD_dKWJuY85aw_8AZnOAWQ
link.rule.ids 228,230,783,888
linkProvider Cornell University
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=LowResourceEval-2019%3A+a+shared+task+on+morphological+analysis+for+low-resource+languages&rft.au=Klyachko%2C+Elena&rft.au=Sorokin%2C+Alexey&rft.au=Krizhanovskaya%2C+Natalia&rft.au=Krizhanovsky%2C+Andrew&rft.date=2020-01-30&rft_id=info:doi/10.48550%2Farxiv.2001.11285&rft.externalDocID=2001_11285