A Minimal Rare Substructures-Based Model for Graph Database Indexing

Systems such as proteins, chemical compounds, and the Internet are stored as graph structures in graph databases. A basic, common problem in graph related applications is to find graph data that contains a query. It is not possible to scan the whole data in graph databases since subgraph isomorphism...

Full description

Saved in:
Bibliographic Details
Published inIntelligent Systems Design and Applications Vol. 557; pp. 250 - 259
Main Authors Azaouzi, Mehdi, Ben Romdhane, Lotfi
Format Book Chapter
LanguageEnglish
Published Switzerland Springer International Publishing AG 2017
Springer International Publishing
SeriesAdvances in Intelligent Systems and Computing
Subjects
Online AccessGet full text
ISBN9783319534794
3319534793
ISSN2194-5357
2194-5365
DOI10.1007/978-3-319-53480-0_25

Cover

Loading…
More Information
Summary:Systems such as proteins, chemical compounds, and the Internet are stored as graph structures in graph databases. A basic, common problem in graph related applications is to find graph data that contains a query. It is not possible to scan the whole data in graph databases since subgraph isomorphism testing is an NP-complete problem. In recent years, some effective graphs indexes have been proposed to first obtain a candidate answer set and then performing verification on each candidate by checking subgraph isomorphism. However, candidate verification is still inevitable and expensive when the size of the candidate answer set is large. In this paper, we propose a new Structural Graph Indexing, called GIRAS, based on RAre subGraphs (RGs) as the basic indexing feature. The idea is to have a single characteristic that can uniquely identify a graph in a database. Few substructures are ideal candidates since they are rare graphs, which means they occurs in only a small number of graphs in the database. Thus, in confronting a query using these indexes, the size of the candidate answer set is close to that of the exact answer set, and the number of subgraph isomorphism tests is small. Therefore, the time of the candidate verification step is reduced to a minimum.
ISBN:9783319534794
3319534793
ISSN:2194-5357
2194-5365
DOI:10.1007/978-3-319-53480-0_25