Terminology/Keyphrase Extraction for Creation of Book Indexes in Polish
The paper addresses the problem of automatic identification of phrases to be included in back-of-book indexes. We analyzed books in Polish and English published with subject indexes compiled by their authors. We checked what kinds of phrases are placed in those indexes and how often they actually oc...
Saved in:
Published in | Linking Theory and Practice of Digital Libraries Vol. 12866; pp. 49 - 54 |
---|---|
Main Authors | , , |
Format | Book Chapter |
Language | English |
Published |
Switzerland
Springer International Publishing AG
2021
Springer International Publishing |
Series | Lecture Notes in Computer Science |
Subjects | |
Online Access | Get full text |
ISBN | 3030863239 9783030863234 |
ISSN | 0302-9743 1611-3349 |
DOI | 10.1007/978-3-030-86324-1_5 |
Cover
Abstract | The paper addresses the problem of automatic identification of phrases to be included in back-of-book indexes. We analyzed books in Polish and English published with subject indexes compiled by their authors. We checked what kinds of phrases are placed in those indexes and how often they actually occur in the corresponding books. In the experiments, we use existing terminology and keyphrase extraction tools. For Polish, the first tool is better than the second one, but for English texts, the results are inconclusive. |
---|---|
AbstractList | The paper addresses the problem of automatic identification of phrases to be included in back-of-book indexes. We analyzed books in Polish and English published with subject indexes compiled by their authors. We checked what kinds of phrases are placed in those indexes and how often they actually occur in the corresponding books. In the experiments, we use existing terminology and keyphrase extraction tools. For Polish, the first tool is better than the second one, but for English texts, the results are inconclusive. |
Author | Marciniak, Małgorzata Mykowiecka, Agnieszka Rychlik, Piotr |
Author_xml | – sequence: 1 givenname: Małgorzata orcidid: 0000-0002-0953-758X surname: Marciniak fullname: Marciniak, Małgorzata – sequence: 2 givenname: Agnieszka orcidid: 0000-0002-8939-3255 surname: Mykowiecka fullname: Mykowiecka, Agnieszka – sequence: 3 givenname: Piotr orcidid: 0000-0002-3539-0999 surname: Rychlik fullname: Rychlik, Piotr email: rychlik@ipipan.waw.pl |
BookMark | eNo1kMFOwzAMhgMMxDb2BFz6AmFJ3CbNEaYxEJPgMM5R1rpboTQlKdL29mQb-GL7tz7L_kdk0LoWCbnl7I4zpqZa5RQoA0ZzCSKl3GRnZBJViNpR4udkyCXnFCDVF2T0PwA9IMNYC6pVCldkxIVUSmaCZddkEsIHY0wokUqRD8lihf6rbl3jNvvpC-67rbcBk_mu97boa9cmlfPJzKM9Nq5KHpz7TJ7bEncYkrpN3lxTh-0NuaxsE3Dyl8fk_XG-mj3R5evieXa_pJ1IWU_LLJW6wLIsdMYrKwXkRQZpJWHNqhgqsxJ1vkYmCo4sZVxX1iqrhEWJkMOY8NPe0Pm63aA363hPMJyZg20mGmTAxO_N0SMTbYuMODGdd98_GHqDB6jANj7ZFFvb9eiDkUpwYBGQRgL8AjoWbQw |
ContentType | Book Chapter |
Copyright | Springer Nature Switzerland AG 2021 |
Copyright_xml | – notice: Springer Nature Switzerland AG 2021 |
DBID | FFUUA |
DEWEY | 025.00285 |
DOI | 10.1007/978-3-030-86324-1_5 |
DatabaseName | ProQuest Ebook Central - Book Chapters - Demo use only |
DatabaseTitleList | |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Library & Information Science Computer Science |
EISBN | 9783030863241 3030863247 |
EISSN | 1611-3349 |
Editor | Hall, Mark Michael Kumpulainen, Sanna Brenn, Daniel Berget, Gerd |
Editor_xml | – sequence: 1 fullname: Kumpulainen, Sanna – sequence: 2 fullname: Hall, Mark Michael – sequence: 3 fullname: Brenn, Daniel – sequence: 4 fullname: Berget, Gerd |
EndPage | 54 |
ExternalDocumentID | EBC6721301_56_63 |
GroupedDBID | 38. AABBV AABLV ACWLQ AEDXK AEJLV AEKFX AELOD ALMA_UNASSIGNED_HOLDINGS BAHJK BBABE CZZ DBWEY FFUUA I4C IEZ OCUHQ ORHYB SBO TPJZQ TSXQS Z5O Z7R Z7S Z7U Z7V Z7X Z7Y Z7Z Z81 Z83 Z84 Z85 Z87 Z88 -DT -GH -~X 1SB 29L 2HA 2HV 5QI 875 AASHB ABMNI ACGFS ADCXD AEFIE EJD F5P FEDTE HVGLF LAS LDH P2P RIG RNI RSU SVGTG VI1 ~02 |
ID | FETCH-LOGICAL-p240t-d5469ceddc951fa6238c534f63b0ffff75a6e98be02c1e04019faa7a72ae6e383 |
ISBN | 3030863239 9783030863234 |
ISSN | 0302-9743 |
IngestDate | Tue Jul 29 20:40:25 EDT 2025 Tue Apr 22 23:46:50 EDT 2025 |
IsPeerReviewed | false |
IsScholarly | false |
LCCallNum | QA76.76.A65 |
Language | English |
LinkModel | OpenURL |
MergedId | FETCHMERGED-LOGICAL-p240t-d5469ceddc951fa6238c534f63b0ffff75a6e98be02c1e04019faa7a72ae6e383 |
OCLC | 1267765205 |
ORCID | 0000-0002-0953-758X 0000-0002-3539-0999 0000-0002-8939-3255 |
PQID | EBC6721301_56_63 |
PageCount | 6 |
ParticipantIDs | springer_books_10_1007_978_3_030_86324_1_5 proquest_ebookcentralchapters_6721301_56_63 |
PublicationCentury | 2000 |
PublicationDate | 2021 |
PublicationDateYYYYMMDD | 2021-01-01 |
PublicationDate_xml | – year: 2021 text: 2021 |
PublicationDecade | 2020 |
PublicationPlace | Switzerland |
PublicationPlace_xml | – name: Switzerland – name: Cham |
PublicationSeriesSubtitle | Information Systems and Applications, incl. Internet/Web, and HCI |
PublicationSeriesTitle | Lecture Notes in Computer Science |
PublicationSeriesTitleAlternate | Lect.Notes Computer |
PublicationSubtitle | 25th International Conference on Theory and Practice of Digital Libraries, TPDL 2021, Virtual Event, September 13-17, 2021, Proceedings |
PublicationTitle | Linking Theory and Practice of Digital Libraries |
PublicationYear | 2021 |
Publisher | Springer International Publishing AG Springer International Publishing |
Publisher_xml | – name: Springer International Publishing AG – name: Springer International Publishing |
RelatedPersons | Hartmanis, Juris Gao, Wen Bertino, Elisa Woeginger, Gerhard Goos, Gerhard Steffen, Bernhard Yung, Moti |
RelatedPersons_xml | – sequence: 1 givenname: Gerhard surname: Goos fullname: Goos, Gerhard – sequence: 2 givenname: Juris surname: Hartmanis fullname: Hartmanis, Juris – sequence: 3 givenname: Elisa surname: Bertino fullname: Bertino, Elisa – sequence: 4 givenname: Wen surname: Gao fullname: Gao, Wen – sequence: 5 givenname: Bernhard orcidid: 0000-0001-9619-1558 surname: Steffen fullname: Steffen, Bernhard – sequence: 6 givenname: Gerhard orcidid: 0000-0001-8816-2693 surname: Woeginger fullname: Woeginger, Gerhard – sequence: 7 givenname: Moti orcidid: 0000-0003-0848-0873 surname: Yung fullname: Yung, Moti |
SSID | ssj0002724628 ssj0002792 |
Score | 1.6190729 |
Snippet | The paper addresses the problem of automatic identification of phrases to be included in back-of-book indexes. We analyzed books in Polish and English... |
SourceID | springer proquest |
SourceType | Publisher |
StartPage | 49 |
SubjectTerms | Back-of-book index Extraction tools Polish |
Title | Terminology/Keyphrase Extraction for Creation of Book Indexes in Polish |
URI | http://ebookcentral.proquest.com/lib/SITE_ID/reader.action?docID=6721301&ppg=63 http://link.springer.com/10.1007/978-3-030-86324-1_5 |
Volume | 12866 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwnV3NjtMwELa65YI48C9gAeVAL1SBxk6c5MBhVboqS1khtIv2FjmOU0VFySrNCrZPg3gUnozxX5IGLksPUZM6TjqfNTMezzdG6BUDj014grtxHgYu2NvMZTM_c2nA8jiKsZeqYjqfTuny3D-5CC5Go1-9rKWrJn3Dd__klfwPqnANcJUs2Rsg23YKF-A74AtHQBiOA-d3P8yq2ct61wPDrjdZ_5ryJD3A98Va7gcyXdn5cBd7rnlRFmyjqTqTeTCJ8Lqqd5qlpttcb6rvheAbHXVdl3D_btP-_AWU5rdCdfC5qJp6b9yp7BrDgTn-KK5htIClnC5-NLXZlxz85MkcT45m857HqkgXH2TtRpUiphLztjrKJmUptu9WZrnjtGp0E7sjhVVQ_QgG9gYRDBvBHMRAuzDc3pSXyAI7lGATAjXUL1DrMDHSmlJoTU5lfUai66Ea7WxOtJ3Xtav_siD9pBHo15XP8l0vCQ7QQRj5Y3TraHGy-trG8XCIJb23tf6yIKNeudKvJPlE9pVjXfGp-wttGSxd6XjwxL1Jz2CdXrk_Z_fQHYmOI7kqILv7aCTKB-iuFb9jxP8QLXvgv22hdzroHYD-908Lu1PljurYwO4UpaNhf4TOjxdn86VrtupwL8ElbNws8GnMRZZx8NhzBj51xAPi55Sksxw-YcCoiKNUzDD3BBgOL84ZC1mImaCCROQxGpdVKZ4gRzCasyzmlMjikLkfxSkjjONQkCCCDp-iqZVKohIKTBYz1zLYJjTE4JiB_GgiW7-2gktk421i63SDwBOSgMATJfAEbnh2k8aH6HY3mJ-jcVNfiRfgoDbpSzNG_gDyFItP |
linkProvider | Library Specific Holdings |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Abook&rft.genre=bookitem&rft.title=Linking+Theory+and+Practice+of+Digital+Libraries&rft.au=Marciniak%2C+Ma%C5%82gorzata&rft.au=Mykowiecka%2C+Agnieszka&rft.au=Rychlik%2C+Piotr&rft.atitle=Terminology%2FKeyphrase+Extraction+for%C2%A0Creation+of+Book+Indexes+in+Polish&rft.series=Lecture+Notes+in+Computer+Science&rft.date=2021-01-01&rft.pub=Springer+International+Publishing&rft.isbn=9783030863234&rft.issn=0302-9743&rft.eissn=1611-3349&rft.spage=49&rft.epage=54&rft_id=info:doi/10.1007%2F978-3-030-86324-1_5 |
thumbnail_s | http://utb.summon.serialssolutions.com/2.0.0/image/custom?url=https%3A%2F%2Febookcentral.proquest.com%2Fcovers%2F6721301-l.jpg |