Incremental Graph Computations: Doable and Undoable

IF 2.2 2区计算机科学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS ACM Transactions on Database Systems Pub Date : 2022-05-23 DOI:https://dl.acm.org/doi/full/10.1145/3500930

Wenfei Fan, Chao Tian

{"title":"Incremental Graph Computations: Doable and Undoable","authors":"Wenfei Fan, Chao Tian","doi":"https://dl.acm.org/doi/full/10.1145/3500930","DOIUrl":null,"url":null,"abstract":"The incremental problem for a class \\( {\\mathcal {Q}} \\) of graph queries aims to compute, given a query \\( Q \\in {\\mathcal {Q}} \\), graph G, answers Q(G) to Q in G and updates ΔG to G as input, changes ΔO to output Q(G) such that Q(G⊕ΔG) = Q(G)⊕ΔO. It is called bounded if its cost can be expressed as a polynomial function in the sizes of Q, ΔG and ΔO, which reduces the computations on possibly big G to small ΔG and ΔO. No matter how desirable, however, our first results are negative: For common graph queries such as traversal, connectivity, keyword search, pattern matching, and maximum cardinality matching, their incremental problems are unbounded. In light of the negative results, we propose two characterizations for the effectiveness of incremental graph computation: (a) localizable, if its cost is decided by small neighbors of nodes in ΔG instead of the entire G; and (b) bounded relative to a batch graph algorithm \\( {\\mathcal {T}} \\), if the cost is determined by the sizes of ΔG and changes to the affected area that is necessarily checked by any algorithms that incrementalize \\( {\\mathcal {T}} \\). We show that the incremental computations above are either localizable or relatively bounded by providing corresponding incremental algorithms. That is, we can either reduce the incremental computations on big graphs to small data, or incrementalize existing batch graph algorithms by minimizing unnecessary recomputation. Using real-life and synthetic data, we experimentally verify the effectiveness of our incremental algorithms.","PeriodicalId":50915,"journal":{"name":"ACM Transactions on Database Systems","volume":"7 2","pages":""},"PeriodicalIF":2.2000,"publicationDate":"2022-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Transactions on Database Systems","FirstCategoryId":"94","ListUrlMain":"https://doi.org/https://dl.acm.org/doi/full/10.1145/3500930","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

The incremental problem for a class \( {\mathcal {Q}} \) of graph queries aims to compute, given a query \( Q \in {\mathcal {Q}} \), graph G, answers Q(G) to Q in G and updates ΔG to G as input, changes ΔO to output Q(G) such that Q(G⊕ΔG) = Q(G)⊕ΔO. It is called bounded if its cost can be expressed as a polynomial function in the sizes of Q, ΔG and ΔO, which reduces the computations on possibly big G to small ΔG and ΔO. No matter how desirable, however, our first results are negative: For common graph queries such as traversal, connectivity, keyword search, pattern matching, and maximum cardinality matching, their incremental problems are unbounded.

In light of the negative results, we propose two characterizations for the effectiveness of incremental graph computation: (a) localizable, if its cost is decided by small neighbors of nodes in ΔG instead of the entire G; and (b) bounded relative to a batch graph algorithm \( {\mathcal {T}} \), if the cost is determined by the sizes of ΔG and changes to the affected area that is necessarily checked by any algorithms that incrementalize \( {\mathcal {T}} \). We show that the incremental computations above are either localizable or relatively bounded by providing corresponding incremental algorithms. That is, we can either reduce the incremental computations on big graphs to small data, or incrementalize existing batch graph algorithms by minimizing unnecessary recomputation. Using real-life and synthetic data, we experimentally verify the effectiveness of our incremental algorithms.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

增量图计算:可行和不可行的

图查询类\( {\mathcal {Q}} \)的增量问题旨在计算，给定查询\( Q \in {\mathcal {Q}} \)，图G，将G中的Q(G)回答为Q，并将ΔG更新为G作为输入，将ΔO更改为输出Q(G)，使得Q(G⊕ΔG) = Q(G)⊕ΔO。如果它的代价可以表示为大小为Q， ΔG和ΔO的多项式函数，则称为有界，这将可能的大G的计算减少到小ΔG和ΔO。然而，无论多么理想，我们的第一个结果都是否定的:对于常见的图查询，如遍历、连通性、关键字搜索、模式匹配和最大基数匹配，它们的增量问题是无界的。鉴于负面结果，我们提出了增量图计算有效性的两个特征:(a)可本地化，如果其成本由ΔG中节点的小邻居决定，而不是整个G;(b)相对于批处理图算法\( {\mathcal {T}} \)有界，如果成本由ΔG的大小和受影响区域的变化决定，则必须由任何对\( {\mathcal {T}} \)进行增量化的算法检查。通过提供相应的增量算法，我们证明了上述增量计算要么是可本地化的，要么是相对有界的。也就是说，我们可以将大图上的增量计算减少到小数据上，或者通过最小化不必要的重新计算来增量化现有的批处理图算法。使用真实数据和合成数据，我们通过实验验证了增量算法的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

ACM Transactions on Database Systems 工程技术-计算机：软件工程

CiteScore

5.60

自引率

0.00%

发文量

审稿时长

>12 weeks

期刊介绍： Heavily used in both academic and corporate R&D settings, ACM Transactions on Database Systems (TODS) is a key publication for computer scientists working in data abstraction, data modeling, and designing data management systems. Topics include storage and retrieval, transaction management, distributed and federated databases, semantics of data, intelligent databases, and operations and algorithms relating to these areas. In this rapidly changing field, TODS provides insights into the thoughts of the best minds in database R&D.

期刊最新文献

Automated Category Tree Construction: Hardness Bounds and Algorithms Database Repairing with Soft Functional Dependencies Sharing Queries with Nonequivalent User-Defined Aggregate Functions A family of centrality measures for graph data based on subgraphs GraphZeppelin: How to Find Connected Components (Even When Graphs Are Dense, Dynamic, and Massive)