Dynamic Subgraph Matching via Cost-Model-based Vertex Dominance Embeddings (Technical Report)

Yutong Ye, Xiang Lian, Nan Zhang, Mingsong Chen
{"title":"Dynamic Subgraph Matching via Cost-Model-based Vertex Dominance Embeddings (Technical Report)","authors":"Yutong Ye, Xiang Lian, Nan Zhang, Mingsong Chen","doi":"arxiv-2407.16660","DOIUrl":null,"url":null,"abstract":"In many real-world applications such as social network analysis, knowledge\ngraph discovery, biological network analytics, and so on, graph data management\nhas become increasingly important and has drawn much attention from the\ndatabase community. While many graphs (e.g., Twitter, Wikipedia, etc.) are\nusually involving over time, it is of great importance to study the dynamic\nsubgraph matching (DSM) problem, a fundamental yet challenging graph operator,\nwhich continuously monitors subgraph matching results over dynamic graphs with\na stream of edge updates. To efficiently tackle the DSM problem, we carefully\ndesign a novel vertex dominance embedding approach, which effectively encodes\nvertex labels that can be incrementally maintained upon graph updates. Inspire\nby low pruning power for high-degree vertices, we propose a new degree grouping\ntechnique over basic subgraph patterns in different degree groups (i.e., groups\nof star substructures), and devise degree-aware star substructure synopses\n(DAS^3) to effectively facilitate our designed vertex dominance and range\npruning strategies. We develop efficient algorithms to incrementally maintain\ndynamic graphs and answer DSM queries. Through extensive experiments, we\nconfirm the efficiency of our proposed approaches over both real and synthetic\ngraphs.","PeriodicalId":501123,"journal":{"name":"arXiv - CS - Databases","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Databases","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2407.16660","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In many real-world applications such as social network analysis, knowledge graph discovery, biological network analytics, and so on, graph data management has become increasingly important and has drawn much attention from the database community. While many graphs (e.g., Twitter, Wikipedia, etc.) are usually involving over time, it is of great importance to study the dynamic subgraph matching (DSM) problem, a fundamental yet challenging graph operator, which continuously monitors subgraph matching results over dynamic graphs with a stream of edge updates. To efficiently tackle the DSM problem, we carefully design a novel vertex dominance embedding approach, which effectively encodes vertex labels that can be incrementally maintained upon graph updates. Inspire by low pruning power for high-degree vertices, we propose a new degree grouping technique over basic subgraph patterns in different degree groups (i.e., groups of star substructures), and devise degree-aware star substructure synopses (DAS^3) to effectively facilitate our designed vertex dominance and range pruning strategies. We develop efficient algorithms to incrementally maintain dynamic graphs and answer DSM queries. Through extensive experiments, we confirm the efficiency of our proposed approaches over both real and synthetic graphs.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
通过基于成本模型的顶点支配嵌入实现动态子图匹配(技术报告)
在社交网络分析、知识图谱发现、生物网络分析等许多现实世界的应用中,图数据管理变得越来越重要,并引起了数据库界的广泛关注。许多图(如 Twitter、维基百科等)通常会随着时间的推移而发生变化,因此研究动态子图匹配(DSM)问题具有重要意义。为了高效地解决 DSM 问题,我们精心设计了一种新颖的顶点优势嵌入方法,它能有效地编码顶点标签,并在图更新时增量地维护这些标签。受高度顶点剪枝能力低的启发,我们在不同度组(即星形子结构组)的基本子图模式上提出了一种新的度分组技术,并设计了度感知星形子结构概要(DAS^3),以有效促进我们设计的顶点支配和范围运行策略。我们开发了增量维护动态图和回答 DSM 查询的高效算法。通过大量实验,我们证实了我们提出的方法在真实图和合成图上的效率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Development of Data Evaluation Benchmark for Data Wrangling Recommendation System Messy Code Makes Managing ML Pipelines Difficult? Just Let LLMs Rewrite the Code! Fast and Adaptive Bulk Loading of Multidimensional Points Matrix Profile for Anomaly Detection on Multidimensional Time Series Extending predictive process monitoring for collaborative processes
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1