Efficient Algorithm for Budgeted Adaptive Influence Maximization: An Incremental RR-set Update Approach

Proceedings of the ACM on Management of Data Pub Date : 2023-11-13 DOI:10.1145/3617328

Qintian Guo, Chen Feng, Fangyuan Zhang, Sibo Wang

{"title":"Efficient Algorithm for Budgeted Adaptive Influence Maximization: An Incremental RR-set Update Approach","authors":"Qintian Guo, Chen Feng, Fangyuan Zhang, Sibo Wang","doi":"10.1145/3617328","DOIUrl":null,"url":null,"abstract":"Given a graph G, a cost associated with each node, and a budget B, the budgeted influence maximization (BIM) aims to find the optimal set S of seed nodes that maximizes the influence among all possible sets such that the total cost of nodes in S is no larger than B. Existing solutions mainly follow the non-adaptive idea, i.e., determining all the seeds before observing any actual diffusion. Due to the absence of actual diffusion information, they may result in unsatisfactory influence spread. Motivated by the limitation of existing solutions, in this paper, we make the first attempt to solve the BIM problem under the adaptive setting, where seed nodes are iteratively selected after observing the diffusion result of the previous seeds. We design the first practical algorithm which achieves an expected approximation guarantee by probabilistically adopting a cost-aware greedy idea or a single influential node. Further, we develop an optimized version to improve its practical performance in terms of influence spread. Besides, the scalability issues of the adaptive IM-related problems still remain open. It is because they usually involve multiple rounds (e.g., equal to the number of seeds) and in each round, they have to construct sufficient new reverse-reachable set (RR-set) samples such that the claimed approximation guarantee can actually hold. However, this incurs prohibitive computation, imposing limitations on real applications. To solve this dilemma, we propose an incremental update approach. Specifically, it maintains extra construction information when building RR-sets, and then it can quickly correct a problematic RR-set from the very step where it is first affected. As a result, we recycle the RR-sets at a small computational cost, while still providing correctness guarantee. Finally, extensive experiments on large-scale real graphs demonstrate the superiority of our algorithms over baselines in terms of both influence spread and running time.","PeriodicalId":498157,"journal":{"name":"Proceedings of the ACM on Management of Data","volume":"35 9","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the ACM on Management of Data","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3617328","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Given a graph G, a cost associated with each node, and a budget B, the budgeted influence maximization (BIM) aims to find the optimal set S of seed nodes that maximizes the influence among all possible sets such that the total cost of nodes in S is no larger than B. Existing solutions mainly follow the non-adaptive idea, i.e., determining all the seeds before observing any actual diffusion. Due to the absence of actual diffusion information, they may result in unsatisfactory influence spread. Motivated by the limitation of existing solutions, in this paper, we make the first attempt to solve the BIM problem under the adaptive setting, where seed nodes are iteratively selected after observing the diffusion result of the previous seeds. We design the first practical algorithm which achieves an expected approximation guarantee by probabilistically adopting a cost-aware greedy idea or a single influential node. Further, we develop an optimized version to improve its practical performance in terms of influence spread. Besides, the scalability issues of the adaptive IM-related problems still remain open. It is because they usually involve multiple rounds (e.g., equal to the number of seeds) and in each round, they have to construct sufficient new reverse-reachable set (RR-set) samples such that the claimed approximation guarantee can actually hold. However, this incurs prohibitive computation, imposing limitations on real applications. To solve this dilemma, we propose an incremental update approach. Specifically, it maintains extra construction information when building RR-sets, and then it can quickly correct a problematic RR-set from the very step where it is first affected. As a result, we recycle the RR-sets at a small computational cost, while still providing correctness guarantee. Finally, extensive experiments on large-scale real graphs demonstrate the superiority of our algorithms over baselines in terms of both influence spread and running time.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

预算自适应影响最大化的有效算法:一种增量rr集更新方法

给定一个图G，每个节点的成本和预算B，预算影响最大化(BIM)的目标是在所有可能的集合中找到影响最大的种子节点的最优集合S，使S中节点的总成本不大于B。现有的解决方案主要遵循非自适应思想，即在观察任何实际扩散之前确定所有的种子。由于缺乏实际的传播信息，可能导致影响传播不理想。由于现有解决方案的局限性，本文首次尝试在自适应设置下解决BIM问题，通过观察之前种子的扩散结果，迭代选择种子节点。我们设计了第一个实用的算法，该算法通过概率地采用成本感知贪婪思想或单个影响节点来实现期望的近似保证。进一步，我们开发了一个优化版本，以提高其在影响力传播方面的实际性能。此外，自适应im相关问题的可扩展性问题仍有待解决。这是因为它们通常涉及多轮(例如，等于种子的数量)，并且在每轮中，它们必须构造足够的新的逆可达集(RR-set)样本，以便声称的近似保证能够实际成立。然而，这导致了令人望而却步的计算，对实际应用程序施加了限制。为了解决这一困境，我们提出了一种增量更新方法。具体来说，它在构建rr集时维护额外的构造信息，然后它可以从第一次受到影响的步骤开始快速纠正有问题的rr集。因此，我们以很小的计算成本回收rr集，同时仍然提供正确性保证。最后，在大规模真实图上的大量实验表明，我们的算法在影响范围和运行时间方面都优于基线。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Proceedings of the ACM on Management of Data

自引率

0.00%

发文量

期刊最新文献

Verification of Unary Communicating Datalog Programs Postulates for Provenance: Instance-based provenance for first-order logic Tight Lower Bounds for Directed Cut Sparsification and Distributed Min-Cut Containment of Graph Queries Modulo Schema Bag Semantics Conjunctive Query Containment. Four Small Steps Towards Undecidability.