Datalog in Wonderland

ACM SIGMOD Record Pub Date : 2022-07-29 DOI:10.1145/3552490.3552492

Mahmoud Abo Khamis, RelationalAI, H. Ngo, R. Pichler, T. Wien, Dan Suciu

{"title":"Datalog in Wonderland","authors":"Mahmoud Abo Khamis, RelationalAI, H. Ngo, R. Pichler, T. Wien, Dan Suciu","doi":"10.1145/3552490.3552492","DOIUrl":null,"url":null,"abstract":"Modern data analytics applications, such as knowledge graph reasoning and machine learning, typically involve recursion through aggregation. Such computations pose great challenges to both system builders and theoreticians: first, to derive simple yet powerful abstractions for these computations; second, to define and study the semantics for the abstractions; third, to devise optimization techniques for these computations. In recent work we presented a generalization of Datalog called Datalog, which addresses these challenges. Datalog is a simple abstraction, which allows aggregates to be interleaved with recursion, and retains much of the simplicity and elegance of Datalog. We define its formal semantics based on an algebraic structure called Partially Ordered Pre-Semirings, and illustrate through several examples how Datalog can be used for a variety of applications. Finally, we describe a new optimization rule for Datalog, called the FGH-rule, then illustrate the FGH-rule on several examples, including a simple magic-set rewriting, generalized semi-naïve evaluation, and a bill-of-material example, and briefly discuss the implementation of the FGH-rule and present some experimental validation of its effectiveness.","PeriodicalId":346332,"journal":{"name":"ACM SIGMOD Record","volume":"38 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM SIGMOD Record","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3552490.3552492","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Modern data analytics applications, such as knowledge graph reasoning and machine learning, typically involve recursion through aggregation. Such computations pose great challenges to both system builders and theoreticians: first, to derive simple yet powerful abstractions for these computations; second, to define and study the semantics for the abstractions; third, to devise optimization techniques for these computations. In recent work we presented a generalization of Datalog called Datalog, which addresses these challenges. Datalog is a simple abstraction, which allows aggregates to be interleaved with recursion, and retains much of the simplicity and elegance of Datalog. We define its formal semantics based on an algebraic structure called Partially Ordered Pre-Semirings, and illustrate through several examples how Datalog can be used for a variety of applications. Finally, we describe a new optimization rule for Datalog, called the FGH-rule, then illustrate the FGH-rule on several examples, including a simple magic-set rewriting, generalized semi-naïve evaluation, and a bill-of-material example, and briefly discuss the implementation of the FGH-rule and present some experimental validation of its effectiveness.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

《漫游仙境

现代数据分析应用程序，如知识图推理和机器学习，通常涉及通过聚合的递归。这样的计算对系统构建者和理论家都提出了巨大的挑战:首先，为这些计算推导出简单而强大的抽象;第二，对抽象的语义进行定义和研究;第三，为这些计算设计优化技术。在最近的工作中，我们提出了Datalog的泛化，称为Datalog，它解决了这些挑战。Datalog是一个简单的抽象，它允许聚合与递归交织在一起，并保留了Datalog的许多简单性和优雅性。我们基于称为部分有序预半环的代数结构定义其形式语义，并通过几个示例说明如何将Datalog用于各种应用程序。最后，我们描述了一种新的Datalog优化规则，称为fgh规则，然后在几个例子上说明了fgh规则，包括一个简单的magic-set重写，广义semi-naïve评估和一个物料清单示例，并简要讨论了fgh规则的实现，并给出了一些实验验证其有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊