Preprint Article Version 1 This version is not peer-reviewed

Matching and Rewriting Rules in Object-Oriented Databases

Version 1 : Received: 6 August 2024 / Approved: 7 August 2024 / Online: 8 August 2024 (12:16:12 CEST)

How to cite: Bergami, G.; Fox, O. R.; Morgan, G. Matching and Rewriting Rules in Object-Oriented Databases. Preprints 2024, 2024080536. https://doi.org/10.20944/preprints202408.0536.v1 Bergami, G.; Fox, O. R.; Morgan, G. Matching and Rewriting Rules in Object-Oriented Databases. Preprints 2024, 2024080536. https://doi.org/10.20944/preprints202408.0536.v1

Abstract

Graph query languages such as Cypher are widely adopted to match and retrieve data in a graph representation, due to their ability of retrieving and transforming information. Despite the most natural way to match and transform information is through rewriting rules, those are scarcely or partially adopted in graph query languages. Their inability of doing so has a major impact on the subsequent way the information is structured, as it might then appear more natural to provide major constraint over the data representation so to consequently constraint the way the information should be represented. On the other hand, recent works are starting to move towards an opposite direction, as the provision of a truly general semistructured model (GSM) allows to both represent all the available data formats (Network-Based, Relational, and Semistructured) as well as support an holistic query language expressing all major queries in such languages. In this paper, we show that the usage of GSM enables the definition of a general rewriting mechanism which can be expressed in current graph query languages only at the cost of adhering the query to the specificity of the underlying data representation. We formalise the proposed query language in terms graph rewriting mechanisms described as a set of production rules $L\to R$ while providing restriction to the characterization of L, while extending it to support structural graph nesting operations, useful to aggregate similar information around an entry-point of interest. We discuss how GSM, by fully supporting index-based data representation, allows for a better physical model implementation leveraging the benefits of columnar database storages. Preliminary benchmarks shows the scalability of this proposed implementation in comparison with state-of-the-art implementations.

Keywords

direct acyclic graphs; generalised semistructured model; graph grammars; graph query languages; algorithms; operator algebras 

Subject

Computer Science and Mathematics, Information Systems

Comments (0)

We encourage comments and feedback from a broad range of readers. See criteria for comments and our Diversity statement.

Leave a public comment
Send a private comment to the author(s)
* All users must log in before leaving a comment
Views 0
Downloads 0
Comments 0


×
Alerts
Notify me about updates to this article or when a peer-reviewed version is published.
We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.