Thesaurus and subject heading lists as Linked Data

Bibliographic Details
Published in Transinformação Vol. 33
Main Authors Barbosa, Everton Rodrigues; Dutra, Moisés Lima; Godoy Viera, Angel Freddy; Macedo, Douglas Dyllon Jeronimo de
Format Journal Article
Language English
Published Pontificia Universidade Católica de Campinas 01.01.2021
ISSN 0103-3786
EISSN 2318-0889
DOI 10.1590/2318-0889202133e200077

Abstract Most libraries invest considerable effort in developing subject heading lists and thesauri, which are used to index and retrieve information. In the library field, however, these controlled vocabularies are associated with authority records and maintained as authority files. To become findable by search engines, these authority files should be modelled on semantic vocabularies. This research proposes an authority-record conversion process for publishing thesauri and subject headings as linked data using the Simple Knowledge Organization System (SKOS) data model. To this end, we carried out bibliographic and documentary research on the World Wide Web Consortium (W3C) recommendations and guidelines, which were used to produce a set of procedures and technologies that support the conversion proposal. The research provides evidence that controlled vocabularies are an important resource for improving information retrieval on the web. The proposed conversion process serves as a quick guide for integrating and reusing controlled vocabularies among users and systems in the linked data environment. Although the proposal was originally intended for libraries, it can be applied and tested in other types of institutions, such as documentation centres, museums, or cultural heritage archives. It can also be used in other linked open data projects.
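To make the idea of converting an authority record into SKOS more concrete, the following is a minimal sketch in Python. It is not the authors' procedure: the record layout (`preferred`, `alternates`, `broader`, `related`), the `slug` URI scheme, and the `http://example.org/vocab/` base are all assumptions made for illustration. Only the SKOS namespace and the properties `skos:prefLabel`, `skos:altLabel`, `skos:broader`, and `skos:related` come from the W3C SKOS vocabulary.

```python
# Illustrative only: a simplified authority record serialized as SKOS in Turtle.
SKOS_PREFIX = "@prefix skos: <http://www.w3.org/2004/02/skos/core#> ."


def slug(label):
    """Build a local name from a heading (hypothetical URI scheme)."""
    return "-".join(label.lower().split())


def literal(text, lang):
    """Return a language-tagged Turtle literal, escaping backslashes and quotes."""
    escaped = text.replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"@{lang}'


def record_to_turtle(record, base, lang="en"):
    """Map one authority record (a dict with assumed keys) to a skos:Concept."""
    subject = f"<{base}{slug(record['preferred'])}>"
    pairs = [
        ("a", "skos:Concept"),
        ("skos:prefLabel", literal(record["preferred"], lang)),
    ]
    # Non-preferred ("see from") forms become alternative labels.
    for alt in record.get("alternates", []):
        pairs.append(("skos:altLabel", literal(alt, lang)))
    # Hierarchical and associative references become links to other concepts.
    for term in record.get("broader", []):
        pairs.append(("skos:broader", f"<{base}{slug(term)}>"))
    for term in record.get("related", []):
        pairs.append(("skos:related", f"<{base}{slug(term)}>"))
    body = " ;\n    ".join(f"{pred} {obj}" for pred, obj in pairs)
    return f"{subject} {body} ."


example = {
    "preferred": "Linked data",
    "alternates": ["Linked open data"],
    "broader": ["Semantic Web"],
    "related": ["Controlled vocabularies"],
}
print(SKOS_PREFIX)
print(record_to_turtle(example, "http://example.org/vocab/"))
```

In a real conversion the input would come from an authority file format such as MARC 21 rather than a hand-written dictionary, and the URIs would follow the publishing institution's own naming policy.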
Author details
Barbosa, Everton Rodrigues (ORCID 0000-0002-1111-5861), Universidade Federal de Santa Catarina, Brasil
Dutra, Moisés Lima (ORCID 0000-0003-1000-5553), Universidade Federal de Santa Catarina, Brasil
Godoy Viera, Angel Freddy (ORCID 0000-0001-6657-4734), Universidade Federal de Santa Catarina, Brasil
Macedo, Douglas Dyllon Jeronimo de (ORCID 0000-0002-3237-4168), Universidade Federal de Santa Catarina, Brasil
Discipline Library & Information Science
Open Access Yes
Peer Reviewed Yes
Scholarly Yes
License http://creativecommons.org/licenses/by/4.0
Open Access Link https://doaj.org/article/09cf4b5fb03d4502b0f286f86aa9f170
Subjects Authority records; Controlled vocabularies; Semantic Web; Simple Knowledge Organization System