建立一个融合关系图优化框架

Yunkai Lou, Longbin Lai, Bingqing Lyu, Yufan Yang, Xiaoli Zhou, Wenyuan Yu, Ying Zhang, Jingren Zhou
{"title":"建立一个融合关系图优化框架","authors":"Yunkai Lou, Longbin Lai, Bingqing Lyu, Yufan Yang, Xiaoli Zhou, Wenyuan Yu, Ying Zhang, Jingren Zhou","doi":"arxiv-2408.13480","DOIUrl":null,"url":null,"abstract":"The recent ISO SQL:2023 standard adopts SQL/PGQ (Property Graph Queries),\nfacilitating graph-like querying within relational databases. This advancement,\nhowever, underscores a significant gap in how to effectively optimize SQL/PGQ\nqueries within relational database systems. To address this gap, we extend the\nfoundational SPJ(Select-Project-Join) queries to SPJM queries, which include an\nadditional matching operator for representing graph pattern matching in\nSQL/PGQ. Although SPJM queries can be converted to SPJ queries and optimized\nusing existing relational query optimizers, our analysis shows that such a\ngraph-agnostic method fails to benefit from graph-specific optimization\ntechniques found in the literature. To address this issue, we develop a\nconverged relational-graph optimization framework called RelGo for optimizing\nSPJM queries, leveraging joint efforts from both relational and graph query\noptimizations. Using DuckDB as the underlying relational execution engine, our\nexperiments show that RelGo can generate efficient execution plans for SPJM\nqueries. On well-established benchmarks, these plans exhibit an average speedup\nof 21.90$\\times$ compared to those produced by the graph-agnostic optimizer.","PeriodicalId":501123,"journal":{"name":"arXiv - CS - Databases","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Towards a Converged Relational-Graph Optimization Framework\",\"authors\":\"Yunkai Lou, Longbin Lai, Bingqing Lyu, Yufan Yang, Xiaoli Zhou, Wenyuan Yu, Ying Zhang, Jingren Zhou\",\"doi\":\"arxiv-2408.13480\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The recent ISO SQL:2023 standard adopts SQL/PGQ (Property Graph Queries),\\nfacilitating graph-like querying within relational databases. This advancement,\\nhowever, underscores a significant gap in how to effectively optimize SQL/PGQ\\nqueries within relational database systems. To address this gap, we extend the\\nfoundational SPJ(Select-Project-Join) queries to SPJM queries, which include an\\nadditional matching operator for representing graph pattern matching in\\nSQL/PGQ. Although SPJM queries can be converted to SPJ queries and optimized\\nusing existing relational query optimizers, our analysis shows that such a\\ngraph-agnostic method fails to benefit from graph-specific optimization\\ntechniques found in the literature. To address this issue, we develop a\\nconverged relational-graph optimization framework called RelGo for optimizing\\nSPJM queries, leveraging joint efforts from both relational and graph query\\noptimizations. Using DuckDB as the underlying relational execution engine, our\\nexperiments show that RelGo can generate efficient execution plans for SPJM\\nqueries. On well-established benchmarks, these plans exhibit an average speedup\\nof 21.90$\\\\times$ compared to those produced by the graph-agnostic optimizer.\",\"PeriodicalId\":501123,\"journal\":{\"name\":\"arXiv - CS - Databases\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-08-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Databases\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2408.13480\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Databases","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.13480","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

最近发布的 ISO SQL:2023 标准采用了 SQL/PGQ(属性图查询),以方便在关系数据库中进行图式查询。然而,这一进步凸显了如何在关系型数据库系统中有效优化 SQL/PGQ 查询的巨大差距。为了解决这个问题,我们将基本的 SPJ(Select-Project-Join)查询扩展为 SPJM 查询,SPJM 查询包括一个额外的匹配操作符,用于在SQL/PGQ 中表示图形模式匹配。虽然 SPJM 查询可以转换为 SPJ 查询,并使用现有的关系查询优化器进行优化,但我们的分析表明,这种与图无关的方法无法从文献中发现的图特定优化技术中获益。为了解决这个问题,我们开发了一个名为 RelGo 的关系-图优化融合框架,用于优化SPJM 查询,充分利用了关系和图查询优化的共同努力。使用 DuckDB 作为底层关系执行引擎,我们的实验表明 RelGo 可以为 SPJM 查询生成高效的执行计划。在成熟的基准测试中,这些计划与图无关优化器生成的计划相比,平均提速 21.90 倍。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Towards a Converged Relational-Graph Optimization Framework
The recent ISO SQL:2023 standard adopts SQL/PGQ (Property Graph Queries), facilitating graph-like querying within relational databases. This advancement, however, underscores a significant gap in how to effectively optimize SQL/PGQ queries within relational database systems. To address this gap, we extend the foundational SPJ(Select-Project-Join) queries to SPJM queries, which include an additional matching operator for representing graph pattern matching in SQL/PGQ. Although SPJM queries can be converted to SPJ queries and optimized using existing relational query optimizers, our analysis shows that such a graph-agnostic method fails to benefit from graph-specific optimization techniques found in the literature. To address this issue, we develop a converged relational-graph optimization framework called RelGo for optimizing SPJM queries, leveraging joint efforts from both relational and graph query optimizations. Using DuckDB as the underlying relational execution engine, our experiments show that RelGo can generate efficient execution plans for SPJM queries. On well-established benchmarks, these plans exhibit an average speedup of 21.90$\times$ compared to those produced by the graph-agnostic optimizer.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Development of Data Evaluation Benchmark for Data Wrangling Recommendation System Messy Code Makes Managing ML Pipelines Difficult? Just Let LLMs Rewrite the Code! Fast and Adaptive Bulk Loading of Multidimensional Points Matrix Profile for Anomaly Detection on Multidimensional Time Series Extending predictive process monitoring for collaborative processes
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1