Bandit join: preliminary results

Proceedings of the Third International Workshop on Exploiting Artificial Intelligence Techniques for Data Management Pub Date : 2020-06-14 DOI:10.1145/3401071.3401655

Vahid Ghadakchi, Mian Xie, Arash Termehchy

引用次数: 4

Abstract

Join is arguably the most costly and frequently used operation in relational query processing. Join algorithms usually spend the majority of their time on scanning and attempting to join the parts of the base relations that do not satisfy the join condition and do not generate any results. This causes slow response time, particularly, in interactive and exploratory environments where users would like real-time performance. In this paper, we outline our vision on using online learning and adaptation to execute joins efficiently. In our approach, scan operators that precede a join, learn which parts of the relations are more likely to join during the query execution and produce more results faster by doing fewer I/O accesses. Our empirical studies using standard benchmarks indicate that this approach outperforms similar methods considerably.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

土匪加入:初步结果

连接无疑是关系查询处理中成本最高、使用最频繁的操作。连接算法通常将大部分时间花在扫描和尝试连接不满足连接条件且不生成任何结果的基本关系部分上。这会导致响应时间变慢，特别是在用户希望实时性能的交互式和探索性环境中。在本文中，我们概述了我们使用在线学习和适应来有效执行连接的愿景。在我们的方法中，扫描连接之前的操作符，了解在查询执行期间关系的哪些部分更有可能连接，并通过执行更少的I/O访问来更快地生成更多结果。我们使用标准基准进行的实证研究表明，这种方法的性能大大优于类似的方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Proceedings of the Third International Workshop on Exploiting Artificial Intelligence Techniques for Data Management

自引率

0.00%

发文量

期刊最新文献

Research challenges in deep reinforcement learning-based join query optimization Bandit join: preliminary results Automated tuning of query degree of parallelism via machine learning PartLy Best of both worlds: combining traditional and machine learning models for cardinality estimation