Resource Description Framework reification for trustworthiness in knowledge graphs [version 1; peer review: 1 approved, 1 not approved]

Knowledge graph (KG) publishes machine-readable representation of knowledge on the Web. Structured data in the knowledge graph is published using Resource Description Framework (RDF) where knowledge is represented as a triple (subject, predicate, object). Due to the presence of erroneous, outdated o...

Full description

Saved in:
Bibliographic Details
Published inF1000 research Vol. 10; p. 881
Main Authors Govindapillai, Sini, Soon, Lay-Ki, Haw, Su-Cheng
Format Journal Article
LanguageEnglish
Published 2021
Subjects
Online AccessGet full text
ISSN2046-1402
2046-1402
DOI10.12688/f1000research.72843.1

Cover

Loading…
Abstract Knowledge graph (KG) publishes machine-readable representation of knowledge on the Web. Structured data in the knowledge graph is published using Resource Description Framework (RDF) where knowledge is represented as a triple (subject, predicate, object). Due to the presence of erroneous, outdated or conflicting data in the knowledge graph, the quality of facts cannot be guaranteed. Therefore, the provenance of knowledge can assist in building up the trust of these knowledge graphs. In this paper, we have provided an analysis of popular, general knowledge graphs Wikidata and YAGO4 with regard to the representation of provenance and context data. Since RDF does not support metadata for providing provenance and contextualization, an alternate method, RDF reification is employed by most of the knowledge graphs. Trustworthiness of facts in knowledge graph can be enhanced by the addition of metadata like the source of information, location and time of the fact occurrence. Wikidata employs qualifiers to include metadata to facts, while YAGO4 collects metadata from Wikidata qualifiers. RDF reification increases the magnitude of data as several statements are required to represent a single fact. However, facts in Wikidata and YAGO4 can be fetched without using reification. Another limitation for applications that uses provenance data is that not all facts in these knowledge graphs are annotated with provenance data. Structured data in the knowledge graph is noisy. Therefore, the reliability of data in knowledge graphs can be increased by provenance data. To the best of our knowledge, this is the first paper that investigates the method and the extent of the addition of metadata of two prominent KGs, Wikidata and YAGO4.
AbstractList Knowledge graph (KG) publishes machine-readable representation of knowledge on the Web. Structured data in the knowledge graph is published using Resource Description Framework (RDF) where knowledge is represented as a triple (subject, predicate, object). Due to the presence of erroneous, outdated or conflicting data in the knowledge graph, the quality of facts cannot be guaranteed. Therefore, the provenance of knowledge can assist in building up the trust of these knowledge graphs. In this paper, we have provided an analysis of popular, general knowledge graphs Wikidata and YAGO4 with regard to the representation of provenance and context data. Since RDF does not support metadata for providing provenance and contextualization, an alternate method, RDF reification is employed by most of the knowledge graphs. Trustworthiness of facts in knowledge graph can be enhanced by the addition of metadata like the source of information, location and time of the fact occurrence. Wikidata employs qualifiers to include metadata to facts, while YAGO4 collects metadata from Wikidata qualifiers. RDF reification increases the magnitude of data as several statements are required to represent a single fact. However, facts in Wikidata and YAGO4 can be fetched without using reification. Another limitation for applications that uses provenance data is that not all facts in these knowledge graphs are annotated with provenance data. Structured data in the knowledge graph is noisy. Therefore, the reliability of data in knowledge graphs can be increased by provenance data. To the best of our knowledge, this is the first paper that investigates the method and the extent of the addition of metadata of two prominent KGs, Wikidata and YAGO4.
Knowledge graph (KG) publishes machine-readable representation of knowledge on the Web. Structured data in the knowledge graph is published using Resource Description Framework (RDF) where knowledge is represented as a triple (subject, predicate, object). Due to the presence of erroneous, outdated or conflicting data in the knowledge graph, the quality of facts cannot be guaranteed. Therefore, the provenance of knowledge can assist in building up the trust of these knowledge graphs. In this paper, we have provided an analysis of popular, general knowledge graphs Wikidata and YAGO4 with regard to the representation of provenance and context data. Since RDF does not support metadata for providing provenance and contextualization, an alternate method, RDF reification is employed by most of the knowledge graphs. Trustworthiness of facts in knowledge graph can be enhanced by the addition of metadata like the source of information, location and time of the fact occurrence. Wikidata employs qualifiers to include metadata to facts, while YAGO4 collects metadata from Wikidata qualifiers. RDF reification increases the magnitude of data as several statements are required to represent a single fact. However, facts in Wikidata and YAGO4 can be fetched without using reification. Another limitation for applications that uses provenance data is that not all facts in these knowledge graphs are annotated with provenance data. Structured data in the knowledge graph is noisy. Therefore, the reliability of data in knowledge graphs can be increased by provenance data. To the best of our knowledge, this is the first paper that investigates the method and the extent of the addition of metadata of two prominent KGs, Wikidata and YAGO4.Knowledge graph (KG) publishes machine-readable representation of knowledge on the Web. Structured data in the knowledge graph is published using Resource Description Framework (RDF) where knowledge is represented as a triple (subject, predicate, object). Due to the presence of erroneous, outdated or conflicting data in the knowledge graph, the quality of facts cannot be guaranteed. Therefore, the provenance of knowledge can assist in building up the trust of these knowledge graphs. In this paper, we have provided an analysis of popular, general knowledge graphs Wikidata and YAGO4 with regard to the representation of provenance and context data. Since RDF does not support metadata for providing provenance and contextualization, an alternate method, RDF reification is employed by most of the knowledge graphs. Trustworthiness of facts in knowledge graph can be enhanced by the addition of metadata like the source of information, location and time of the fact occurrence. Wikidata employs qualifiers to include metadata to facts, while YAGO4 collects metadata from Wikidata qualifiers. RDF reification increases the magnitude of data as several statements are required to represent a single fact. However, facts in Wikidata and YAGO4 can be fetched without using reification. Another limitation for applications that uses provenance data is that not all facts in these knowledge graphs are annotated with provenance data. Structured data in the knowledge graph is noisy. Therefore, the reliability of data in knowledge graphs can be increased by provenance data. To the best of our knowledge, this is the first paper that investigates the method and the extent of the addition of metadata of two prominent KGs, Wikidata and YAGO4.
Author Govindapillai, Sini
Soon, Lay-Ki
Haw, Su-Cheng
Author_xml – sequence: 1
  givenname: Sini
  orcidid: 0000-0002-0829-4870
  surname: Govindapillai
  fullname: Govindapillai, Sini
  organization: Faculty of Computing Informatics, Multimedia University, Cyberjaya, Selangor, 63100, Malaysia
– sequence: 2
  givenname: Lay-Ki
  orcidid: 0000-0002-8072-242X
  surname: Soon
  fullname: Soon, Lay-Ki
  organization: School of Information Technology, Monash University Malaysia, Bandar Sunway, Selangor, 47500, Malaysia
– sequence: 3
  givenname: Su-Cheng
  orcidid: 0000-0002-7190-0837
  surname: Haw
  fullname: Haw, Su-Cheng
  email: sucheng@mmu.edu.my
  organization: Faculty of Computing Informatics, Multimedia University, Cyberjaya, Selangor, 63100, Malaysia
BookMark eNqFkMFOGzEURa0KJCjlF5CXXTTB9sx47HRV0QYqIVVCsELIcpxn4mYynj7PJMoX8Ns4SYXKqitf2-e8J92P5KiNLRBywdmYC6nUpeeMMYQEFt1iXAtVFmP-gZwKVsoRL5k4-iefkPOUfmeBaV1IUZ-SlztIcUAH9Dskh6HrQ2zpFO0KNhGXFCH44Oz-1UekPQ6pzz_9IrSQEg0tXbZx08D8Gegz2m6R6OMaMO0E_pV2AJiHrANsJpRT23UY1zD_knMb-7f70ydy7G2T4PzveUYepj_ur25Gt7-uf159ux05UWo-UoorLUF7N1Nl5UVltahVbSvryrIqtCzmVTWHDHsripLpWS1zspKpWQGMFWfk82Fu3vtngNSbVUgOmsa2EIdkhMx91lpqnlF5QB3GlBC86TCsLG4NZ2ZfvnlXvtmXb3bi5CB664am3-4g80b9R34FJCyQfQ
Cites_doi 10.1145/2566486.2567973
10.3233/SW-160218
10.1016/j.artint.2012.06.001
10.1007/978-3-319-11964-9_4
10.3233/SW-170275
10.1007/s41019-020-00118-0
10.1145/1963192.1963296
10.1007/978-3-642-32873-2_10
10.3233/SW-180307
ContentType Journal Article
Copyright Copyright: © 2021 Govindapillai S et al.
Copyright_xml – notice: Copyright: © 2021 Govindapillai S et al.
DBID C-E
CH4
AAYXX
CITATION
7X8
DOI 10.12688/f1000research.72843.1
DatabaseName F1000Research
Faculty of 1000
CrossRef
MEDLINE - Academic
DatabaseTitle CrossRef
MEDLINE - Academic
DatabaseTitleList
MEDLINE - Academic
CrossRef
DeliveryMethod fulltext_linktorsrc
Discipline Medicine
Women's Studies
EISSN 2046-1402
ExternalDocumentID 10_12688_f1000research_72843_1
GrantInformation_xml – fundername: Multimedia University Internal Fund
  grantid: MMUI/180006
– fundername: Fundamental Research Grant Scheme (FRGS) by Malaysia Ministry of Higher Education
  grantid: FRGS/2/2013/ICT07/MMU/02/2
GroupedDBID 3V.
53G
5VS
7X7
88I
8FE
8FH
8FI
8FJ
ABUWG
ACGOD
ACPRK
ADACO
ADBBV
ADRAZ
AFKRA
AHMBA
ALMA_UNASSIGNED_HOLDINGS
AZQEC
BAWUL
BBAFP
BBNVY
BCNDV
BENPR
BHPHI
BPHCQ
BVXVI
C-E
CH4
DIK
DWQXO
FRP
FYUFA
GNUQQ
GROUPED_DOAJ
GX1
HCIFZ
HYE
KQ8
LK8
M2P
M48
M7P
OK1
PIMPY
PQEST
PQQKQ
PQUKI
PRINS
PROAC
RPM
AAFWJ
AAYXX
AFPKN
ALIPV
AOIJS
CCPQU
CITATION
HMCUK
M~E
PGMZT
PHGZM
PHGZT
UKHRP
W2D
7X8
PQGLB
PUEGO
ID FETCH-LOGICAL-c2491-881896e9fcb845f25a92787a5ac4453963d55dec24fa23409b76fa2a608b3e003
IEDL.DBID M48
ISSN 2046-1402
IngestDate Fri Sep 05 13:34:53 EDT 2025
Tue Jul 01 04:27:24 EDT 2025
Tue Oct 26 01:52:36 EDT 2021
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Keywords RDF reification
provenance data
YAGO
Wikidata
Knowledge Graph
Language English
License http://creativecommons.org/licenses/by/4.0/: This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c2491-881896e9fcb845f25a92787a5ac4453963d55dec24fa23409b76fa2a608b3e003
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
ObjectType-Review-3
content type line 23
ORCID 0000-0002-7190-0837
0000-0002-8072-242X
0000-0002-0829-4870
OpenAccessLink http://journals.scholarsportal.info/openUrl.xqy?doi=10.12688/f1000research.72843.1
PQID 2610079691
PQPubID 23479
ParticipantIDs proquest_miscellaneous_2610079691
crossref_primary_10_12688_f1000research_72843_1
faculty1000_research_10_12688_f1000research_72843_1
ProviderPackageCode CITATION
AAYXX
PublicationCentury 2000
PublicationDate 2021
2021-00-00
20210101
PublicationDateYYYYMMDD 2021-01-01
PublicationDate_xml – year: 2021
  text: 2021
PublicationDecade 2020
PublicationTitle F1000 research
PublicationYear 2021
References P Patel-Schneider (ref13) 2018
J Hoffart (ref15) 2013
H Paulheim (ref1) 2017
F Manola (ref7)
M Färber (ref12) 2017; 9
S Malyshev (ref5) 2018; 11137
F Erxleben (ref4) 2014; 8796
O Hartig (ref6) June, 2017; 1963
P Hayes (ref9)
V Nguyen (ref8) 2014
J Hoffart (ref14) 2011; 23
L Sikos (ref3) 2020; 5
O Hartig (ref10) 2017
M Bienvenu (ref2)
J Frey (ref11) 2019; 10
References_xml – volume: 1963
  year: June, 2017
  ident: ref6
  article-title: Foundations of RDF* and SPARQL* (An Alternative Approach to Statement-Level Metadata in RDF).
  publication-title: CEUR Workshop Proc.
– start-page: 759-769
  year: 2014
  ident: ref8
  article-title: Don’t like RDF reification? Making statements about statements using singleton property.
  publication-title: WWW 2014 - Proc. 23rd Int. Conf. World Wide Web.
  doi: 10.1145/2566486.2567973
– start-page: 489-508
  year: 2017
  ident: ref1
  article-title: Knowledge Graph Refinement: A Survey of Approaches and Evaluation Methods.
  publication-title: Semant. Web.
  doi: 10.3233/SW-160218
– start-page: 3161-3165
  year: 2013
  ident: ref15
  article-title: YAGO2: A spatially and temporally enhanced knowledge base from Wikipedia.
  publication-title: IJCAI Int. Jt. Conf. Artif. Intell.
  doi: 10.1016/j.artint.2012.06.001
– volume: 8796
  start-page: 50-65
  year: 2014
  ident: ref4
  article-title: Introducing wikidata to the linked data web.
  publication-title: Lect. Notes Comput. Sci. (including Subser. Lect. Notes Artif. Intell. Lect. Notes Bioinformatics).
  doi: 10.1007/978-3-319-11964-9_4
– volume: 11137
  start-page: 8-12
  year: 2018
  ident: ref5
  article-title: Getting the Most Out of Wikidata: Semantic Technology Usage in Wikipedia’s Knowledge Graph.
  publication-title: Proc. 17th Int. Semant. Web Conf. (ISWC 2018).
– volume: 9
  start-page: 77-129
  year: 2017
  ident: ref12
  article-title: Linked Data Quality of DBpedia, Freebase, OpenCyc, Wikidata, and YAGO.
  publication-title: Semant. Web.
  doi: 10.3233/SW-170275
– volume: 5
  start-page: 293-316
  year: 2020
  ident: ref3
  article-title: Provenance-Aware Knowledge Representation: A Survey of Data Models and Contextualized Knowledge Graphs.
  publication-title: Data Sci. Eng.
  doi: 10.1007/s41019-020-00118-0
– volume: 23
  start-page: 229-232
  year: 2011
  ident: ref14
  article-title: YAGO2: Exploring and Querying World Knowledge in Time , Space, Context, and Many Languages.
  publication-title: Time.
  doi: 10.1145/1963192.1963296
– year: 2018
  ident: ref13
  article-title: Contextualization via qualifiers.
  publication-title: CEUR Workshop Proc.
– ident: ref2
  article-title: Provenance for Web 2.0 Data.
  doi: 10.1007/978-3-642-32873-2_10
– ident: ref9
  article-title: Defining N-ary Relations on the Semantic Web.
– year: 2017
  ident: ref10
  article-title: RDF∗ and SPARQL∗: An alternative approach to annotate statements in RDF.
  publication-title: Int. Semant. Web Conf.
– ident: ref7
  article-title: RDF Primer.
  publication-title: W3C Recommendation 10 February 2004. [Online].
– volume: 10
  start-page: 205-229
  year: 2019
  ident: ref11
  article-title: Evaluation of metadata representations in RDF stores.
  publication-title: Semant. Web.
  doi: 10.3233/SW-180307
SSID ssj0000993627
Score 2.1683116
SecondaryResourceType review_article
Snippet Knowledge graph (KG) publishes machine-readable representation of knowledge on the Web. Structured data in the knowledge graph is published using Resource...
SourceID proquest
crossref
faculty1000
SourceType Aggregation Database
Index Database
Publisher
StartPage 881
Title Resource Description Framework reification for trustworthiness in knowledge graphs [version 1; peer review: 1 approved, 1 not approved]
URI http://dx.doi.org/10.12688/f1000research.72843.1
https://www.proquest.com/docview/2610079691
Volume 10
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3fS8MwED7mBqIPQ6fi_FEiCD51rk2Trg8iKptD2JDhYG8laRNUpNNVwf33XrJWHIr6UgpN-vBdcnff5XIHcKwSQVO0E27qUeGixROuVAoduZSh9aMiTaSt9jnk_XFwM2GTCpTtUgsA8x-pneknNZ49td5f5ue44c9sbQSODE6bIHVRHOe-FaLGpS1kRDW0Ttys9EHh8j8uPCLU2eYWtY_U0PVsgo_z-6-WbNa6FqYoxtwM_KbArVXqbUC9cCfJxUL-m1BRWQNWB8WBeQPqtkPlSU6KfMEtGJUBe4KUs1QZpFfmaJGZMslDVl4EHVpiL2XYREKbIE8eMvIZhyO23HW-DeNe9-6q7xaNFdwE2ZbndtBKR1xFOpGdgGmficjHjSuYSIIAZcRpyliqcLAWPkUGKEOOb4K3O5Iq1AM7UM2mmdoFQgOtBAsjJpkOVDuUPNRCdDyqdMRlJJpwWkIXPy_qZ8SGdxiw4yWwYwt27DWBfkE4_vz816yjUhIxbhBz6iEyNX3LY6SI6AdFPPL2_jFmH9Z8k7VigywHUEWQ1SG6Ha_SgZVwEjpQu-wOb0eOJe_4vJ54jl1hHwa32TA
linkProvider Scholars Portal
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Resource+Description+Framework+reification+for+trustworthiness+in+knowledge+graphs&rft.jtitle=F1000+research&rft.au=Govindapillai%2C+Sini&rft.au=Soon%2C+Lay-Ki&rft.au=Haw%2C+Su-Cheng&rft.date=2021&rft.issn=2046-1402&rft.eissn=2046-1402&rft.volume=10&rft.spage=881&rft_id=info:doi/10.12688%2Ff1000research.72843.1&rft.externalDBID=n%2Fa&rft.externalDocID=10_12688_f1000research_72843_1
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2046-1402&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2046-1402&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2046-1402&client=summon