{"title":"Incremental Graph Computations: Doable and Undoable","authors":"Wenfei Fan, Chao Tian","doi":"https://dl.acm.org/doi/full/10.1145/3500930","DOIUrl":null,"url":null,"abstract":"<p>The incremental problem for a class \\( {\\mathcal {Q}} \\) of graph queries aims to compute, given a query \\( Q \\in {\\mathcal {Q}} \\), graph <i>G</i>, answers <i>Q</i>(<i>G</i>) to <i>Q</i> in <i>G</i> and updates <i>ΔG</i> to <i>G</i> as input, changes <i>ΔO</i> to output <i>Q</i>(<i>G</i>) such that <i>Q</i>(<i>G</i>⊕<i>ΔG</i>) = <i>Q</i>(<i>G</i>)⊕<i>ΔO</i>. It is called <i>bounded</i> if its cost can be expressed as a polynomial function in the sizes of <i>Q</i>, <i>ΔG</i> and <i>ΔO</i>, which reduces the computations on possibly big <i>G</i> to small <i>ΔG</i> and <i>ΔO</i>. No matter how desirable, however, our first results are negative: For common graph queries such as traversal, connectivity, keyword search, pattern matching, and maximum cardinality matching, their incremental problems are unbounded. </p><p>In light of the negative results, we propose two characterizations for the effectiveness of incremental graph computation: (a) <i>localizable</i>, if its cost is decided by small neighbors of nodes in <i>ΔG</i> instead of the entire <i>G</i>; and (b) <i>bounded relative to</i> a batch graph algorithm \\( {\\mathcal {T}} \\), if the cost is determined by the sizes of <i>ΔG</i> and changes to the affected area that is necessarily checked by any algorithms that incrementalize \\( {\\mathcal {T}} \\). We show that the incremental computations above are either localizable or relatively bounded by providing corresponding incremental algorithms. That is, we can either reduce the incremental computations on big graphs to small data, or incrementalize existing batch graph algorithms by minimizing unnecessary recomputation. Using real-life and synthetic data, we experimentally verify the effectiveness of our incremental algorithms.</p>","PeriodicalId":50915,"journal":{"name":"ACM Transactions on Database Systems","volume":"7 2","pages":""},"PeriodicalIF":2.2000,"publicationDate":"2022-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACM Transactions on Database Systems","FirstCategoryId":"94","ListUrlMain":"https://doi.org/https://dl.acm.org/doi/full/10.1145/3500930","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
The incremental problem for a class \( {\mathcal {Q}} \) of graph queries aims to compute, given a query \( Q \in {\mathcal {Q}} \), graph G, answers Q(G) to Q in G and updates ΔG to G as input, changes ΔO to output Q(G) such that Q(G⊕ΔG) = Q(G)⊕ΔO. It is called bounded if its cost can be expressed as a polynomial function in the sizes of Q, ΔG and ΔO, which reduces the computations on possibly big G to small ΔG and ΔO. No matter how desirable, however, our first results are negative: For common graph queries such as traversal, connectivity, keyword search, pattern matching, and maximum cardinality matching, their incremental problems are unbounded.
In light of the negative results, we propose two characterizations for the effectiveness of incremental graph computation: (a) localizable, if its cost is decided by small neighbors of nodes in ΔG instead of the entire G; and (b) bounded relative to a batch graph algorithm \( {\mathcal {T}} \), if the cost is determined by the sizes of ΔG and changes to the affected area that is necessarily checked by any algorithms that incrementalize \( {\mathcal {T}} \). We show that the incremental computations above are either localizable or relatively bounded by providing corresponding incremental algorithms. That is, we can either reduce the incremental computations on big graphs to small data, or incrementalize existing batch graph algorithms by minimizing unnecessary recomputation. Using real-life and synthetic data, we experimentally verify the effectiveness of our incremental algorithms.
期刊介绍:
Heavily used in both academic and corporate R&D settings, ACM Transactions on Database Systems (TODS) is a key publication for computer scientists working in data abstraction, data modeling, and designing data management systems. Topics include storage and retrieval, transaction management, distributed and federated databases, semantics of data, intelligent databases, and operations and algorithms relating to these areas. In this rapidly changing field, TODS provides insights into the thoughts of the best minds in database R&D.