Resource Description Framework reification for trustworthiness in knowledge graphs [version 1; peer review: 1 approved, 1 not approved]
Knowledge graph (KG) publishes machine-readable representation of knowledge on the Web. Structured data in the knowledge graph is published using Resource Description Framework (RDF) where knowledge is represented as a triple (subject, predicate, object). Due to the presence of erroneous, outdated o...
Saved in:
Published in | F1000 research Vol. 10; p. 881 |
---|---|
Main Authors | , , |
Format | Journal Article |
Language | English |
Published |
2021
|
Subjects | |
Online Access | Get full text |
ISSN | 2046-1402 2046-1402 |
DOI | 10.12688/f1000research.72843.1 |
Cover
Loading…
Abstract | Knowledge graph (KG) publishes machine-readable representation of knowledge on the Web. Structured data in the knowledge graph is published using Resource Description Framework (RDF) where knowledge is represented as a triple (subject, predicate, object). Due to the presence of erroneous, outdated or conflicting data in the knowledge graph, the quality of facts cannot be guaranteed. Therefore, the provenance of knowledge can assist in building up the trust of these knowledge graphs. In this paper, we have provided an analysis of popular, general knowledge graphs Wikidata and YAGO4 with regard to the representation of provenance and context data. Since RDF does not support metadata for providing provenance and contextualization, an alternate method, RDF reification is employed by most of the knowledge graphs. Trustworthiness of facts in knowledge graph can be enhanced by the addition of metadata like the source of information, location and time of the fact occurrence. Wikidata employs qualifiers to include metadata to facts, while YAGO4 collects metadata from Wikidata qualifiers. RDF reification increases the magnitude of data as several statements are required to represent a single fact. However, facts in Wikidata and YAGO4 can be fetched without using reification. Another limitation for applications that uses provenance data is that not all facts in these knowledge graphs are annotated with provenance data. Structured data in the knowledge graph is noisy. Therefore, the reliability of data in knowledge graphs can be increased by provenance data. To the best of our knowledge, this is the first paper that investigates the method and the extent of the addition of metadata of two prominent KGs, Wikidata and YAGO4. |
---|---|
AbstractList | Knowledge graph (KG) publishes machine-readable representation of knowledge on the Web. Structured data in the knowledge graph is published using Resource Description Framework (RDF) where knowledge is represented as a triple (subject, predicate, object). Due to the presence of erroneous, outdated or conflicting data in the knowledge graph, the quality of facts cannot be guaranteed. Therefore, the provenance of knowledge can assist in building up the trust of these knowledge graphs. In this paper, we have provided an analysis of popular, general knowledge graphs Wikidata and YAGO4 with regard to the representation of provenance and context data. Since RDF does not support metadata for providing provenance and contextualization, an alternate method, RDF reification is employed by most of the knowledge graphs. Trustworthiness of facts in knowledge graph can be enhanced by the addition of metadata like the source of information, location and time of the fact occurrence. Wikidata employs qualifiers to include metadata to facts, while YAGO4 collects metadata from Wikidata qualifiers. RDF reification increases the magnitude of data as several statements are required to represent a single fact. However, facts in Wikidata and YAGO4 can be fetched without using reification. Another limitation for applications that uses provenance data is that not all facts in these knowledge graphs are annotated with provenance data. Structured data in the knowledge graph is noisy. Therefore, the reliability of data in knowledge graphs can be increased by provenance data. To the best of our knowledge, this is the first paper that investigates the method and the extent of the addition of metadata of two prominent KGs, Wikidata and YAGO4. Knowledge graph (KG) publishes machine-readable representation of knowledge on the Web. Structured data in the knowledge graph is published using Resource Description Framework (RDF) where knowledge is represented as a triple (subject, predicate, object). Due to the presence of erroneous, outdated or conflicting data in the knowledge graph, the quality of facts cannot be guaranteed. Therefore, the provenance of knowledge can assist in building up the trust of these knowledge graphs. In this paper, we have provided an analysis of popular, general knowledge graphs Wikidata and YAGO4 with regard to the representation of provenance and context data. Since RDF does not support metadata for providing provenance and contextualization, an alternate method, RDF reification is employed by most of the knowledge graphs. Trustworthiness of facts in knowledge graph can be enhanced by the addition of metadata like the source of information, location and time of the fact occurrence. Wikidata employs qualifiers to include metadata to facts, while YAGO4 collects metadata from Wikidata qualifiers. RDF reification increases the magnitude of data as several statements are required to represent a single fact. However, facts in Wikidata and YAGO4 can be fetched without using reification. Another limitation for applications that uses provenance data is that not all facts in these knowledge graphs are annotated with provenance data. Structured data in the knowledge graph is noisy. Therefore, the reliability of data in knowledge graphs can be increased by provenance data. To the best of our knowledge, this is the first paper that investigates the method and the extent of the addition of metadata of two prominent KGs, Wikidata and YAGO4.Knowledge graph (KG) publishes machine-readable representation of knowledge on the Web. Structured data in the knowledge graph is published using Resource Description Framework (RDF) where knowledge is represented as a triple (subject, predicate, object). Due to the presence of erroneous, outdated or conflicting data in the knowledge graph, the quality of facts cannot be guaranteed. Therefore, the provenance of knowledge can assist in building up the trust of these knowledge graphs. In this paper, we have provided an analysis of popular, general knowledge graphs Wikidata and YAGO4 with regard to the representation of provenance and context data. Since RDF does not support metadata for providing provenance and contextualization, an alternate method, RDF reification is employed by most of the knowledge graphs. Trustworthiness of facts in knowledge graph can be enhanced by the addition of metadata like the source of information, location and time of the fact occurrence. Wikidata employs qualifiers to include metadata to facts, while YAGO4 collects metadata from Wikidata qualifiers. RDF reification increases the magnitude of data as several statements are required to represent a single fact. However, facts in Wikidata and YAGO4 can be fetched without using reification. Another limitation for applications that uses provenance data is that not all facts in these knowledge graphs are annotated with provenance data. Structured data in the knowledge graph is noisy. Therefore, the reliability of data in knowledge graphs can be increased by provenance data. To the best of our knowledge, this is the first paper that investigates the method and the extent of the addition of metadata of two prominent KGs, Wikidata and YAGO4. |
Author | Govindapillai, Sini Soon, Lay-Ki Haw, Su-Cheng |
Author_xml | – sequence: 1 givenname: Sini orcidid: 0000-0002-0829-4870 surname: Govindapillai fullname: Govindapillai, Sini organization: Faculty of Computing Informatics, Multimedia University, Cyberjaya, Selangor, 63100, Malaysia – sequence: 2 givenname: Lay-Ki orcidid: 0000-0002-8072-242X surname: Soon fullname: Soon, Lay-Ki organization: School of Information Technology, Monash University Malaysia, Bandar Sunway, Selangor, 47500, Malaysia – sequence: 3 givenname: Su-Cheng orcidid: 0000-0002-7190-0837 surname: Haw fullname: Haw, Su-Cheng email: sucheng@mmu.edu.my organization: Faculty of Computing Informatics, Multimedia University, Cyberjaya, Selangor, 63100, Malaysia |
BookMark | eNqFkMFOGzEURa0KJCjlF5CXXTTB9sx47HRV0QYqIVVCsELIcpxn4mYynj7PJMoX8Ns4SYXKqitf2-e8J92P5KiNLRBywdmYC6nUpeeMMYQEFt1iXAtVFmP-gZwKVsoRL5k4-iefkPOUfmeBaV1IUZ-SlztIcUAH9Dskh6HrQ2zpFO0KNhGXFCH44Oz-1UekPQ6pzz_9IrSQEg0tXbZx08D8Gegz2m6R6OMaMO0E_pV2AJiHrANsJpRT23UY1zD_knMb-7f70ydy7G2T4PzveUYepj_ur25Gt7-uf159ux05UWo-UoorLUF7N1Nl5UVltahVbSvryrIqtCzmVTWHDHsripLpWS1zspKpWQGMFWfk82Fu3vtngNSbVUgOmsa2EIdkhMx91lpqnlF5QB3GlBC86TCsLG4NZ2ZfvnlXvtmXb3bi5CB664am3-4g80b9R34FJCyQfQ |
Cites_doi | 10.1145/2566486.2567973 10.3233/SW-160218 10.1016/j.artint.2012.06.001 10.1007/978-3-319-11964-9_4 10.3233/SW-170275 10.1007/s41019-020-00118-0 10.1145/1963192.1963296 10.1007/978-3-642-32873-2_10 10.3233/SW-180307 |
ContentType | Journal Article |
Copyright | Copyright: © 2021 Govindapillai S et al. |
Copyright_xml | – notice: Copyright: © 2021 Govindapillai S et al. |
DBID | C-E CH4 AAYXX CITATION 7X8 |
DOI | 10.12688/f1000research.72843.1 |
DatabaseName | F1000Research Faculty of 1000 CrossRef MEDLINE - Academic |
DatabaseTitle | CrossRef MEDLINE - Academic |
DatabaseTitleList | MEDLINE - Academic CrossRef |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Medicine Women's Studies |
EISSN | 2046-1402 |
ExternalDocumentID | 10_12688_f1000research_72843_1 |
GrantInformation_xml | – fundername: Multimedia University Internal Fund grantid: MMUI/180006 – fundername: Fundamental Research Grant Scheme (FRGS) by Malaysia Ministry of Higher Education grantid: FRGS/2/2013/ICT07/MMU/02/2 |
GroupedDBID | 3V. 53G 5VS 7X7 88I 8FE 8FH 8FI 8FJ ABUWG ACGOD ACPRK ADACO ADBBV ADRAZ AFKRA AHMBA ALMA_UNASSIGNED_HOLDINGS AZQEC BAWUL BBAFP BBNVY BCNDV BENPR BHPHI BPHCQ BVXVI C-E CH4 DIK DWQXO FRP FYUFA GNUQQ GROUPED_DOAJ GX1 HCIFZ HYE KQ8 LK8 M2P M48 M7P OK1 PIMPY PQEST PQQKQ PQUKI PRINS PROAC RPM AAFWJ AAYXX AFPKN ALIPV AOIJS CCPQU CITATION HMCUK M~E PGMZT PHGZM PHGZT UKHRP W2D 7X8 PQGLB PUEGO |
ID | FETCH-LOGICAL-c2491-881896e9fcb845f25a92787a5ac4453963d55dec24fa23409b76fa2a608b3e003 |
IEDL.DBID | M48 |
ISSN | 2046-1402 |
IngestDate | Fri Sep 05 13:34:53 EDT 2025 Tue Jul 01 04:27:24 EDT 2025 Tue Oct 26 01:52:36 EDT 2021 |
IsDoiOpenAccess | true |
IsOpenAccess | true |
IsPeerReviewed | true |
IsScholarly | true |
Keywords | RDF reification provenance data YAGO Wikidata Knowledge Graph |
Language | English |
License | http://creativecommons.org/licenses/by/4.0/: This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-c2491-881896e9fcb845f25a92787a5ac4453963d55dec24fa23409b76fa2a608b3e003 |
Notes | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 ObjectType-Review-3 content type line 23 |
ORCID | 0000-0002-7190-0837 0000-0002-8072-242X 0000-0002-0829-4870 |
OpenAccessLink | http://journals.scholarsportal.info/openUrl.xqy?doi=10.12688/f1000research.72843.1 |
PQID | 2610079691 |
PQPubID | 23479 |
ParticipantIDs | proquest_miscellaneous_2610079691 crossref_primary_10_12688_f1000research_72843_1 faculty1000_research_10_12688_f1000research_72843_1 |
ProviderPackageCode | CITATION AAYXX |
PublicationCentury | 2000 |
PublicationDate | 2021 2021-00-00 20210101 |
PublicationDateYYYYMMDD | 2021-01-01 |
PublicationDate_xml | – year: 2021 text: 2021 |
PublicationDecade | 2020 |
PublicationTitle | F1000 research |
PublicationYear | 2021 |
References | P Patel-Schneider (ref13) 2018 J Hoffart (ref15) 2013 H Paulheim (ref1) 2017 F Manola (ref7) M Färber (ref12) 2017; 9 S Malyshev (ref5) 2018; 11137 F Erxleben (ref4) 2014; 8796 O Hartig (ref6) June, 2017; 1963 P Hayes (ref9) V Nguyen (ref8) 2014 J Hoffart (ref14) 2011; 23 L Sikos (ref3) 2020; 5 O Hartig (ref10) 2017 M Bienvenu (ref2) J Frey (ref11) 2019; 10 |
References_xml | – volume: 1963 year: June, 2017 ident: ref6 article-title: Foundations of RDF* and SPARQL* (An Alternative Approach to Statement-Level Metadata in RDF). publication-title: CEUR Workshop Proc. – start-page: 759-769 year: 2014 ident: ref8 article-title: Don’t like RDF reification? Making statements about statements using singleton property. publication-title: WWW 2014 - Proc. 23rd Int. Conf. World Wide Web. doi: 10.1145/2566486.2567973 – start-page: 489-508 year: 2017 ident: ref1 article-title: Knowledge Graph Refinement: A Survey of Approaches and Evaluation Methods. publication-title: Semant. Web. doi: 10.3233/SW-160218 – start-page: 3161-3165 year: 2013 ident: ref15 article-title: YAGO2: A spatially and temporally enhanced knowledge base from Wikipedia. publication-title: IJCAI Int. Jt. Conf. Artif. Intell. doi: 10.1016/j.artint.2012.06.001 – volume: 8796 start-page: 50-65 year: 2014 ident: ref4 article-title: Introducing wikidata to the linked data web. publication-title: Lect. Notes Comput. Sci. (including Subser. Lect. Notes Artif. Intell. Lect. Notes Bioinformatics). doi: 10.1007/978-3-319-11964-9_4 – volume: 11137 start-page: 8-12 year: 2018 ident: ref5 article-title: Getting the Most Out of Wikidata: Semantic Technology Usage in Wikipedia’s Knowledge Graph. publication-title: Proc. 17th Int. Semant. Web Conf. (ISWC 2018). – volume: 9 start-page: 77-129 year: 2017 ident: ref12 article-title: Linked Data Quality of DBpedia, Freebase, OpenCyc, Wikidata, and YAGO. publication-title: Semant. Web. doi: 10.3233/SW-170275 – volume: 5 start-page: 293-316 year: 2020 ident: ref3 article-title: Provenance-Aware Knowledge Representation: A Survey of Data Models and Contextualized Knowledge Graphs. publication-title: Data Sci. Eng. doi: 10.1007/s41019-020-00118-0 – volume: 23 start-page: 229-232 year: 2011 ident: ref14 article-title: YAGO2: Exploring and Querying World Knowledge in Time , Space, Context, and Many Languages. publication-title: Time. doi: 10.1145/1963192.1963296 – year: 2018 ident: ref13 article-title: Contextualization via qualifiers. publication-title: CEUR Workshop Proc. – ident: ref2 article-title: Provenance for Web 2.0 Data. doi: 10.1007/978-3-642-32873-2_10 – ident: ref9 article-title: Defining N-ary Relations on the Semantic Web. – year: 2017 ident: ref10 article-title: RDF∗ and SPARQL∗: An alternative approach to annotate statements in RDF. publication-title: Int. Semant. Web Conf. – ident: ref7 article-title: RDF Primer. publication-title: W3C Recommendation 10 February 2004. [Online]. – volume: 10 start-page: 205-229 year: 2019 ident: ref11 article-title: Evaluation of metadata representations in RDF stores. publication-title: Semant. Web. doi: 10.3233/SW-180307 |
SSID | ssj0000993627 |
Score | 2.1683116 |
SecondaryResourceType | review_article |
Snippet | Knowledge graph (KG) publishes machine-readable representation of knowledge on the Web. Structured data in the knowledge graph is published using Resource... |
SourceID | proquest crossref faculty1000 |
SourceType | Aggregation Database Index Database Publisher |
StartPage | 881 |
Title | Resource Description Framework reification for trustworthiness in knowledge graphs [version 1; peer review: 1 approved, 1 not approved] |
URI | http://dx.doi.org/10.12688/f1000research.72843.1 https://www.proquest.com/docview/2610079691 |
Volume | 10 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV3fS8MwED7mBqIPQ6fi_FEiCD51rk2Trg8iKptD2JDhYG8laRNUpNNVwf33XrJWHIr6UgpN-vBdcnff5XIHcKwSQVO0E27qUeGixROuVAoduZSh9aMiTaSt9jnk_XFwM2GTCpTtUgsA8x-pneknNZ49td5f5ue44c9sbQSODE6bIHVRHOe-FaLGpS1kRDW0Ttys9EHh8j8uPCLU2eYWtY_U0PVsgo_z-6-WbNa6FqYoxtwM_KbArVXqbUC9cCfJxUL-m1BRWQNWB8WBeQPqtkPlSU6KfMEtGJUBe4KUs1QZpFfmaJGZMslDVl4EHVpiL2XYREKbIE8eMvIZhyO23HW-DeNe9-6q7xaNFdwE2ZbndtBKR1xFOpGdgGmficjHjSuYSIIAZcRpyliqcLAWPkUGKEOOb4K3O5Iq1AM7UM2mmdoFQgOtBAsjJpkOVDuUPNRCdDyqdMRlJJpwWkIXPy_qZ8SGdxiw4yWwYwt27DWBfkE4_vz816yjUhIxbhBz6iEyNX3LY6SI6AdFPPL2_jFmH9Z8k7VigywHUEWQ1SG6Ha_SgZVwEjpQu-wOb0eOJe_4vJ54jl1hHwa32TA |
linkProvider | Scholars Portal |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Resource+Description+Framework+reification+for+trustworthiness+in+knowledge+graphs&rft.jtitle=F1000+research&rft.au=Govindapillai%2C+Sini&rft.au=Soon%2C+Lay-Ki&rft.au=Haw%2C+Su-Cheng&rft.date=2021&rft.issn=2046-1402&rft.eissn=2046-1402&rft.volume=10&rft.spage=881&rft_id=info:doi/10.12688%2Ff1000research.72843.1&rft.externalDBID=n%2Fa&rft.externalDocID=10_12688_f1000research_72843_1 |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=2046-1402&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=2046-1402&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=2046-1402&client=summon |