Terminology/Keyphrase Extraction for Creation of Book Indexes in Polish

The paper addresses the problem of automatic identification of phrases to be included in back-of-book indexes. We analyzed books in Polish and English published with subject indexes compiled by their authors. We checked what kinds of phrases are placed in those indexes and how often they actually oc...

Full description

Saved in:
Bibliographic Details
Published inLinking Theory and Practice of Digital Libraries Vol. 12866; pp. 49 - 54
Main Authors Marciniak, Małgorzata, Mykowiecka, Agnieszka, Rychlik, Piotr
Format Book Chapter
LanguageEnglish
Published Switzerland Springer International Publishing AG 2021
Springer International Publishing
SeriesLecture Notes in Computer Science
Subjects
Online AccessGet full text
ISBN3030863239
9783030863234
ISSN0302-9743
1611-3349
DOI10.1007/978-3-030-86324-1_5

Cover

Loading…
More Information
Summary:The paper addresses the problem of automatic identification of phrases to be included in back-of-book indexes. We analyzed books in Polish and English published with subject indexes compiled by their authors. We checked what kinds of phrases are placed in those indexes and how often they actually occur in the corresponding books. In the experiments, we use existing terminology and keyphrase extraction tools. For Polish, the first tool is better than the second one, but for English texts, the results are inconclusive.
ISBN:3030863239
9783030863234
ISSN:0302-9743
1611-3349
DOI:10.1007/978-3-030-86324-1_5