{"title":"Generic network sparsification via hybrid edge sampling","authors":"Zhen Su , Jürgen Kurths , Henning Meyerhenke","doi":"10.1016/j.jfranklin.2024.107404","DOIUrl":null,"url":null,"abstract":"<div><div>Network (or graph) sparsification benefits downstream graph mining tasks. Finding a sparsified subgraph <span><math><mover><mrow><mi>G</mi></mrow><mrow><mo>ˆ</mo></mrow></mover></math></span> similar to the original graph <span><math><mi>G</mi></math></span> is, however, challenging due to the requirement of preserving various (or at least representative) network properties. In this paper, we propose a general hybrid edge sampling scheme named LOGA, as the combination of the <u>Lo</u>cal-filtering-based Random Edge sampling (LRE) (Hamann et al., 2016) and the <u>Ga</u>me-theoretic Sparsification with Tolerance (GST) (Su et al., 2022). LOGA fully utilizes the advantages of GST — in preserving complex structural properties by preserving local node properties in expectation – and LRE – in preserving the connectivity of a given network. Specifically, we first prove the existence of multiple equilibria in GST. This insight leads us to propose LOGA and its variant LOGA<span><math><msup><mrow></mrow><mrow><mi>s</mi><mi>c</mi></mrow></msup></math></span> by refining GST. LOGA is obtained by regarding LRE as an empirically good initializer for GST, while LOGA<span><math><msup><mrow></mrow><mrow><mi>s</mi><mi>c</mi></mrow></msup></math></span> is obtained by further including a constrained update for GST. In this way, LOGA/LOGA<span><math><msup><mrow></mrow><mrow><mi>s</mi><mi>c</mi></mrow></msup></math></span> generalize the work on GST to graphs with weights and different densities, without increasing the asymptotic time complexity. Extensive experiments on 26 weighted and unweighted networks with different densities demonstrate that LOGA<span><math><msup><mrow></mrow><mrow><mi>s</mi><mi>c</mi></mrow></msup></math></span> performs best for all 26 instances, i.e., they preserve representative network properties better than state-of-the-art sampling methods alone.</div></div>","PeriodicalId":17283,"journal":{"name":"Journal of The Franklin Institute-engineering and Applied Mathematics","volume":"362 1","pages":"Article 107404"},"PeriodicalIF":3.7000,"publicationDate":"2024-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of The Franklin Institute-engineering and Applied Mathematics","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0016003224008251","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Network (or graph) sparsification benefits downstream graph mining tasks. Finding a sparsified subgraph similar to the original graph is, however, challenging due to the requirement of preserving various (or at least representative) network properties. In this paper, we propose a general hybrid edge sampling scheme named LOGA, as the combination of the Local-filtering-based Random Edge sampling (LRE) (Hamann et al., 2016) and the Game-theoretic Sparsification with Tolerance (GST) (Su et al., 2022). LOGA fully utilizes the advantages of GST — in preserving complex structural properties by preserving local node properties in expectation – and LRE – in preserving the connectivity of a given network. Specifically, we first prove the existence of multiple equilibria in GST. This insight leads us to propose LOGA and its variant LOGA by refining GST. LOGA is obtained by regarding LRE as an empirically good initializer for GST, while LOGA is obtained by further including a constrained update for GST. In this way, LOGA/LOGA generalize the work on GST to graphs with weights and different densities, without increasing the asymptotic time complexity. Extensive experiments on 26 weighted and unweighted networks with different densities demonstrate that LOGA performs best for all 26 instances, i.e., they preserve representative network properties better than state-of-the-art sampling methods alone.
期刊介绍:
The Journal of The Franklin Institute has an established reputation for publishing high-quality papers in the field of engineering and applied mathematics. Its current focus is on control systems, complex networks and dynamic systems, signal processing and communications and their applications. All submitted papers are peer-reviewed. The Journal will publish original research papers and research review papers of substance. Papers and special focus issues are judged upon possible lasting value, which has been and continues to be the strength of the Journal of The Franklin Institute.