OpenIE-based approach for Knowledge Graph construction from text

•A Knowledge Graph construction approach is proposed.•The integration of entity linking systems improves the extraction performance.•The association of named entities with noun phrases preserves RDF data coherence.•Thematic roles are used to associate relation phrases with Semantic Web properties. T...

Full description

Saved in:
Bibliographic Details
Published inExpert systems with applications Vol. 113; pp. 339 - 355
Main Authors Martinez-Rodriguez, Jose L., Lopez-Arevalo, Ivan, Rios-Alvarado, Ana B.
Format Journal Article
LanguageEnglish
Published New York Elsevier Ltd 15.12.2018
Elsevier BV
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:•A Knowledge Graph construction approach is proposed.•The integration of entity linking systems improves the extraction performance.•The association of named entities with noun phrases preserves RDF data coherence.•Thematic roles are used to associate relation phrases with Semantic Web properties. Transforming unstructured text into a formal representation is an important goal of the Semantic Web in order to facilitate the integration and retrieval of information. The construction of Knowledge Graphs (KGs) pursues such an idea, where named entities (real world things) and their relations are extracted from text. In recent years, many approaches for the construction of KGs have been proposed by exploiting Discourse Analysis, Semantic Frames, or Machine Learning algorithms with existing Semantic Web data. Although such approaches are useful for processing taxonomies and connecting beliefs, they provide several linguistic descriptions, which lead to semantic data heterogeneity and thus, complicating data consumption. Moreover, Open Information Extraction (OpenIE) approaches have been slightly explored for the construction of KGs, which provide binary relations representing atomic units of information that could simplify the querying and representation of data. In this paper, we propose an approach to generate KGs using binary relations produced by an OpenIE approach. For such purpose, we present strategies for favoring the extraction and linking of named entities with KG individuals, and additionally, their association with grammatical units that lead to producing more coherent facts. We also provide decisions for selecting the extracted information elements for creating potentially useful RDF triples for the KG. Our results demonstrate that the integration of information extraction units with grammatical structures provides a better understanding of proposition-based representations provided by OpenIE for supporting the construction of KGs.
ISSN:0957-4174
1873-6793
DOI:10.1016/j.eswa.2018.07.017