Thesaurus and subject heading lists as Linked Data
Abstract Most libraries put a lot of effort into developing subject headings or thesauri, which are used to index and retrieve information. Nevertheless, in the library field, controlled vocabularies are associated to authority records as authority files. In order to become findable by search engine...
Saved in:
Published in | Transinformação Vol. 33 |
---|---|
Main Authors | , , , |
Format | Journal Article |
Language | English |
Published |
Pontificia Universidade Católica de Campinas
01.01.2021
|
Subjects | |
Online Access | Get full text |
ISSN | 0103-3786 2318-0889 |
DOI | 10.1590/2318-0889202133e200077 |
Cover
Loading…
Abstract | Abstract Most libraries put a lot of effort into developing subject headings or thesauri, which are used to index and retrieve information. Nevertheless, in the library field, controlled vocabularies are associated to authority records as authority files. In order to become findable by search engines, these authority files should be modelled on semantic vocabularies. This research proposes an authority-record conversion process for publishing thesauri and subject headings as linked data, by using the Simple Knowledge Organization Systems data model. To this purpose, we undertook a bibliographic and documentary research on the World Wide Web Consortium recommendation guidelines, which were used to produce a set of procedures and technologies to support the conversion proposal. This research provides evidences that controlled vocabularies are an important resource for improving information retrieval on the web. The proposed conversion process works as a quick guide for controlled vocabulary integration and reuse among users and systems on the linked data environment. Although the proposal was originally intended for a library setting, it can be applied and tested in another type of institution, such as documentation centres, museums, or cultural heritage archives. It can also be used in other linked open data projects.
Resumo Grande parte das bibliotecas concentram esforços em desenvolver cabeçalhos de assuntos ou tesauros, os quais são usados para indexar e para recuperar informações. No entanto, no campo das bibliotecas, os vocabulários controlados são associados aos registros bibliográficos como arquivos de autoridade. Para se tornarem localizáveis pelos mecanismos de pesquisa, esses registros de autoridade devem ser modelados em vocabulários semânticos. Esta pesquisa propõe um processo de conversão de registros de autoridades para a publicação de tesauros e de cabeçalhos de assuntos como dados abertos conectados, utilizando o modelo de dados Simple Knowledge Organization Systems. Para tanto, realizou-se uma pesquisa bibliográfica e documental sobre as diretrizes e a recomendação do World Wide Web Consortium, as quais foram usadas para produzir um conjunto de procedimentos e de tecnologias para apoiar a proposta de conversão. Este trabalho fornece evidências de que os vocabulários controlados são um recurso importante para melhorar a recuperação de informações na web. O processo de conversão proposto funciona como um guia rápido para a integração e a reutilização dos vocabulários controlados entre usuários e sistemas no ambiente de dados abertos conectados. Embora a proposta tenha sido originalmente destinada à realidade das bibliotecas, pode ser aplicada e testada em instituições de natureza diversificada, como centros de documentação, museus ou arquivos. Ela também pode ser usada em outros projetos de dados abertos conectados. |
---|---|
AbstractList | Abstract Most libraries put a lot of effort into developing subject headings or thesauri, which are used to index and retrieve information. Nevertheless, in the library field, controlled vocabularies are associated to authority records as authority files. In order to become findable by search engines, these authority files should be modelled on semantic vocabularies. This research proposes an authority-record conversion process for publishing thesauri and subject headings as linked data, by using the Simple Knowledge Organization Systems data model. To this purpose, we undertook a bibliographic and documentary research on the World Wide Web Consortium recommendation guidelines, which were used to produce a set of procedures and technologies to support the conversion proposal. This research provides evidences that controlled vocabularies are an important resource for improving information retrieval on the web. The proposed conversion process works as a quick guide for controlled vocabulary integration and reuse among users and systems on the linked data environment. Although the proposal was originally intended for a library setting, it can be applied and tested in another type of institution, such as documentation centres, museums, or cultural heritage archives. It can also be used in other linked open data projects. Abstract Most libraries put a lot of effort into developing subject headings or thesauri, which are used to index and retrieve information. Nevertheless, in the library field, controlled vocabularies are associated to authority records as authority files. In order to become findable by search engines, these authority files should be modelled on semantic vocabularies. This research proposes an authority-record conversion process for publishing thesauri and subject headings as linked data, by using the Simple Knowledge Organization Systems data model. To this purpose, we undertook a bibliographic and documentary research on the World Wide Web Consortium recommendation guidelines, which were used to produce a set of procedures and technologies to support the conversion proposal. This research provides evidences that controlled vocabularies are an important resource for improving information retrieval on the web. The proposed conversion process works as a quick guide for controlled vocabulary integration and reuse among users and systems on the linked data environment. Although the proposal was originally intended for a library setting, it can be applied and tested in another type of institution, such as documentation centres, museums, or cultural heritage archives. It can also be used in other linked open data projects. Resumo Grande parte das bibliotecas concentram esforços em desenvolver cabeçalhos de assuntos ou tesauros, os quais são usados para indexar e para recuperar informações. No entanto, no campo das bibliotecas, os vocabulários controlados são associados aos registros bibliográficos como arquivos de autoridade. Para se tornarem localizáveis pelos mecanismos de pesquisa, esses registros de autoridade devem ser modelados em vocabulários semânticos. Esta pesquisa propõe um processo de conversão de registros de autoridades para a publicação de tesauros e de cabeçalhos de assuntos como dados abertos conectados, utilizando o modelo de dados Simple Knowledge Organization Systems. Para tanto, realizou-se uma pesquisa bibliográfica e documental sobre as diretrizes e a recomendação do World Wide Web Consortium, as quais foram usadas para produzir um conjunto de procedimentos e de tecnologias para apoiar a proposta de conversão. Este trabalho fornece evidências de que os vocabulários controlados são um recurso importante para melhorar a recuperação de informações na web. O processo de conversão proposto funciona como um guia rápido para a integração e a reutilização dos vocabulários controlados entre usuários e sistemas no ambiente de dados abertos conectados. Embora a proposta tenha sido originalmente destinada à realidade das bibliotecas, pode ser aplicada e testada em instituições de natureza diversificada, como centros de documentação, museus ou arquivos. Ela também pode ser usada em outros projetos de dados abertos conectados. |
Author | Barbosa, Everton Rodrigues Macedo, Douglas Dyllon Jeronimo de Dutra, Moisés Lima Godoy Viera, Angel Freddy |
Author_xml | – sequence: 1 givenname: Everton Rodrigues orcidid: 0000-0002-1111-5861 surname: Barbosa fullname: Barbosa, Everton Rodrigues organization: Universidade Federal de Santa Catarina, Brasil – sequence: 2 givenname: Moisés Lima orcidid: 0000-0003-1000-5553 surname: Dutra fullname: Dutra, Moisés Lima organization: Universidade Federal de Santa Catarina, Brasil – sequence: 3 givenname: Angel Freddy orcidid: 0000-0001-6657-4734 surname: Godoy Viera fullname: Godoy Viera, Angel Freddy organization: Universidade Federal de Santa Catarina, Brasil – sequence: 4 givenname: Douglas Dyllon Jeronimo de orcidid: 0000-0002-3237-4168 surname: Macedo fullname: Macedo, Douglas Dyllon Jeronimo de organization: Universidade Federal de Santa Catarina, Brasil |
BookMark | eNplkM1KAzEUhYNUsNa-gswLjN7kNpPMUupfoeCmrkMmuWlT64wkMwvf3qmVblxcDtwDH4fvmk3ariXGbjnccVnDvUCuS9C6FiA4IgkAUOqCTc_FhE2BA5aodHXF5jnHBoRQSkqOUyY2O8p2SEMubOuLPDR7cn2xI-tjuy0OMfdjk4t1bD_IF4-2tzfsMthDpvlfztj789Nm-Vqu315Wy4d16VDKvqzREedOoKxIu_EINUjtHfkghfIYtPOCax6acZ4kEtxrqMdhlSQZFM7Y6sT1nd2brxQ_bfo2nY3m99GlrbGpj-5ABmoXFo0MDaBfSBANBKGroCtr68AVjKzqxHKpyzlROPM4mKNIc_Rl_onEHyZCZik |
Cites_doi | 10.1016/j.is.2019.04.008 10.1108/02640470911004057 10.1016/j.ecoinf.2012.04.004 10.3145/epi.2008.ene.02 10.5771/0943-7444-2008-2-3-160 10.36311/1981-1640.2015.v9n2.01.p1 10.3233/SW-130128 10.1108/07419051111145118 |
ContentType | Journal Article |
DBID | AAYXX CITATION DOA |
DOI | 10.1590/2318-0889202133e200077 |
DatabaseName | CrossRef DOAJ Directory of Open Access Journals |
DatabaseTitle | CrossRef |
DatabaseTitleList | CrossRef |
Database_xml | – sequence: 1 dbid: DOA name: DOAJ Directory of Open Access Journals url: https://www.doaj.org/ sourceTypes: Open Website |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Library & Information Science |
EISSN | 2318-0889 |
ExternalDocumentID | oai_doaj_org_article_09cf4b5fb03d4502b0f286f86aa9f170 10_1590_2318_0889202133e200077 |
GroupedDBID | 91A AAYXX ACHQT ADBBV ALMA_UNASSIGNED_HOLDINGS APOWU AZFZN BCNDV CITATION GROUPED_DOAJ INF KQ8 OK1 5VS |
ID | FETCH-LOGICAL-c355t-93ce11c2356e8c6e8e38058dcedf527d3f8cd2181fb1035ee21d80977565e5f73 |
IEDL.DBID | DOA |
ISSN | 0103-3786 |
IngestDate | Wed Aug 27 01:31:01 EDT 2025 Tue Jul 01 01:35:07 EDT 2025 |
IsDoiOpenAccess | true |
IsOpenAccess | true |
IsPeerReviewed | true |
IsScholarly | true |
Language | English |
License | http://creativecommons.org/licenses/by/4.0 |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-c355t-93ce11c2356e8c6e8e38058dcedf527d3f8cd2181fb1035ee21d80977565e5f73 |
ORCID | 0000-0002-1111-5861 0000-0003-1000-5553 0000-0002-3237-4168 0000-0001-6657-4734 |
OpenAccessLink | https://doaj.org/article/09cf4b5fb03d4502b0f286f86aa9f170 |
ParticipantIDs | doaj_primary_oai_doaj_org_article_09cf4b5fb03d4502b0f286f86aa9f170 crossref_primary_10_1590_2318_0889202133e200077 |
ProviderPackageCode | CITATION AAYXX |
PublicationCentury | 2000 |
PublicationDate | 2021-01-01 |
PublicationDateYYYYMMDD | 2021-01-01 |
PublicationDate_xml | – month: 01 year: 2021 text: 2021-01-01 day: 01 |
PublicationDecade | 2020 |
PublicationTitle | Transinformação |
PublicationYear | 2021 |
Publisher | Pontificia Universidade Católica de Campinas |
Publisher_xml | – name: Pontificia Universidade Católica de Campinas |
References | (ref49) 2014 Rudic G. (ref33) 2009; 27 Zeng M. L (ref56) 2008; 35 Matthews B. (ref27) 2001 (ref45) 2008 (ref29) 2019 Ramalho R. A. S (ref32) 2015; 2 Colepícolo E. (ref7) 2006 Harpring P (ref17) 2015 Dunsire G. (ref11) 2011; 28 (ref2) 2019 Berners-Lee T (ref6) 2009 Laporte M. A. (ref21) 2012; 11 (ref13) 2018 (ref26) 2018 Heath T. (ref18) 2011 Bandholtz T. (ref4) 2010 Basharat A. (ref5) 2016 (ref42) 2004 (ref53) 2017 García-Torres A. (ref14) 2008; 17 Leroi M.-V. (ref22) 2010 (ref25) 2017 (ref48) 2013 Anibaldi S. (ref3) 2015; 6 (ref51) 2014 (ref1) 2018 Korn K. (ref20) 2011 (ref37) 2018 Díaz-Corona D. (ref9) 2019; 84 Pastor-Sánchez J (ref30) 2015; 9 Molli P. (ref28) 2016 (ref46) 2009 (ref8) 2018 van Assem M. (ref39) 2006 Harper C. A (ref16) 2006 (ref43) 2005 (ref23) 2008 (ref50) 2014 (ref54) 2017 (ref47) 2009 Isaac A. (ref19) 2015 Pastor-Sanchez J. A. (ref31) 2009; 14 (ref24) 1999 Scholz H (ref35) 2017 (ref44) 2005 (ref38) 2018 Summers E. (ref36) 2008 van Hooland S. (ref40) 2015 (ref12) 2019 Reitz J. M (ref34) 2004 (ref41) 2005 Dodebei V. L. D (ref10) 2002 (ref55) 2019 (ref52) 2016 Zoghlami K. (ref57) 2011 Haider S (ref15) 2020 |
References_xml | – year: 2019 ident: ref29 – volume: 84 start-page: 17 year: 2019 ident: ref9 article-title: Profiling of knowledge organisation systems for the annotation of Linked Data cultural resources publication-title: Information Systems doi: 10.1016/j.is.2019.04.008 – year: 2018 ident: ref13 – year: 2015 ident: ref40 – year: 2017 ident: ref53 – year: 2014 ident: ref50 – year: 2005 ident: ref43 – year: 2006 ident: ref16 – year: 2019 ident: ref2 – volume: 27 start-page: 950 issue: 6 year: 2009 ident: ref33 article-title: Conversion of bibliographic records to MARC 21 format publication-title: The Electronic Library doi: 10.1108/02640470911004057 – year: 2019 ident: ref12 – year: 2017 ident: ref25 – volume: 11 start-page: 34 year: 2012 ident: ref21 article-title: ThesauForm-Traits: a web based collaborative tool to develop a thesaurus for plant functional diversity research publication-title: Ecological Informatics doi: 10.1016/j.ecoinf.2012.04.004 – year: 2009 ident: ref6 – year: 2011 ident: ref57 – year: 2020 ident: ref15 article-title: Vocabulary control publication-title: Librarianship Studies & Information Technology – volume: 17 start-page: 8 issue: 1 year: 2008 ident: ref14 article-title: Reutilización de tesauros: el documentalista frente al reto de la Web semántica publication-title: El Profesional de la Información doi: 10.3145/epi.2008.ene.02 – year: 2008 ident: ref23 – year: 2006 ident: ref7 – volume: 35 start-page: 160 issue: 2-3 year: 2008 ident: ref56 article-title: Knowledge Organization Systems (KOS) publication-title: Knowledge Organization doi: 10.5771/0943-7444-2008-2-3-160 – year: 2018 ident: ref37 – year: 2018 ident: ref8 – year: 2005 ident: ref41 – year: 2009 ident: ref47 – year: 2015 ident: ref19 – volume-title: A method to convert Thesauri to SKOS year: 2006 ident: ref39 – year: 2018 ident: ref38 – year: 2010 ident: ref22 – year: 2016 ident: ref52 – year: 2004 ident: ref42 – year: 2002 ident: ref10 – year: 2018 ident: ref1 – year: 2011 ident: ref18 – year: 2005 ident: ref44 – year: 2008 ident: ref36 – year: 2014 ident: ref51 – year: 2018 ident: ref26 – year: 2017 ident: ref54 – year: 2004 ident: ref34 – volume-title: Semantic hadith: leveraging Linked Data opportunities for Islamic knowledge year: 2016 ident: ref5 – volume-title: iQvoc - Open Source SKOS (XL) Maintenance and Publishing Tool year: 2010 ident: ref4 – volume-title: Controlled vocabularies in Context year: 2015 ident: ref17 – year: 2019 ident: ref55 – year: 2014 ident: ref49 – year: 2011 ident: ref20 – year: 2009 ident: ref46 – volume: 9 start-page: 1 issue: 2 year: 2015 ident: ref30 article-title: Proposal To Represent the Unesco Thesaurus for the Semantic Web Applying ISO-25964 publication-title: Brazilian Journal of Information Science doi: 10.36311/1981-1640.2015.v9n2.01.p1 – year: 2017 ident: ref35 – volume: 6 start-page: 113 year: 2015 ident: ref3 article-title: Migrating bibliographic datasets to the semantic web: the AGRIS case publication-title: Semantic Web doi: 10.3233/SW-130128 – year: 2008 ident: ref45 – year: 2013 ident: ref48 – volume: 14 start-page: 1 issue: 4 year: 2009 ident: ref31 article-title: Advantages of thesaurus representation using the Simple Knowledge Organization System (SKOS) compared with proposed alternatives publication-title: Information Research – volume: 28 start-page: 1 issue: 3 year: 2011 ident: ref11 article-title: Standard library metadata models and structures for the semantic web publication-title: Library Hi Tech News doi: 10.1108/07419051111145118 – volume: 2 start-page: 66 issue: 1 year: 2015 ident: ref32 article-title: Análise do modelo de dados SKOS: Sistema de Organização do Conhecimento Simples para a Web publication-title: Informação & Tecnologia – year: 1999 ident: ref24 – year: 2001 ident: ref27 – year: 2016 ident: ref28 |
SSID | ssib022775513 ssib026972165 ssj0000651043 |
Score | 2.1487448 |
Snippet | Abstract Most libraries put a lot of effort into developing subject headings or thesauri, which are used to index and retrieve information. Nevertheless, in... |
SourceID | doaj crossref |
SourceType | Open Website Index Database |
SubjectTerms | Authority records Controlled vocabularies Semantic Web Simple Knowledge Organization System |
Title | Thesaurus and subject heading lists as Linked Data |
URI | https://doaj.org/article/09cf4b5fb03d4502b0f286f86aa9f170 |
Volume | 33 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwrV1JS8NAFB6kJy-iqFg35uA1dJbMkqNbKYqCYKG3MMsbPEiVJv3_vkliqXjw4iGBLITJ9wa-9w1vvkfIlXImmMpBEWxQRWm0LhwSU8EhMhYq53hXjPn0rGfz8mGhFlutvnJNWG8P3AM3YVVIpVfJMxlLxYRnSVidrHauStx0ah05b0tM4UwSwpjcuWRzrTuTGjVsEVYVm2Bek62ZbYXqH3Ua5C0rxvxgpy0T_45tpvtkb0gT6XU_vAOyA8tDIjCmjVuv1g1F_U-btc-LKPStr4On7xgxfNLQrC8h0jvXuiMyn96_3s6KoedBEZD526KSATgPQioNNuAB0jJlY4CYlDBRJhtipuXkOZMKQPBoGSZxmJiBSkYek9HyYwknhCIGUoTIHAdfcq599mKLXquAooUzGJPJ97_Wn721RZ0lAaJTZ3TqX-iMyU2GZPN2tqbubmDA6iFg9V8BO_2Pj5yR3Tyyfi3knIza1RouMDto_WU3EfD8-GK_AP4wsiM |
linkProvider | Directory of Open Access Journals |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Thesaurus+and+subject+heading+lists+as+Linked+Data&rft.jtitle=Transinforma%C3%A7%C3%A3o&rft.au=Everton+Rodrigues+Barbosa&rft.au=Mois%C3%A9s+Lima+Dutra&rft.au=Angel+Freddy+Godoy+Viera&rft.au=Douglas+Dyllon+Jeronimo+de+Macedo&rft.date=2021-01-01&rft.pub=Pontificia+Universidade+Cat%C3%B3lica+de+Campinas&rft.eissn=2318-0889&rft.volume=33&rft_id=info:doi/10.1590%2F2318-0889202133e200077&rft.externalDBID=DOA&rft.externalDocID=oai_doaj_org_article_09cf4b5fb03d4502b0f286f86aa9f170 |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=0103-3786&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=0103-3786&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=0103-3786&client=summon |