{"title":"随机图中聚类问题的渐近边界","authors":"Eugene Lykhovyd, Sergiy Butenko, Pavlo Krokhmal","doi":"10.1002/net.22203","DOIUrl":null,"url":null,"abstract":"Graph clustering is an important problem in network analysis. This problem can be approached by first finding a large cluster subgraph (i.e., a subgraph in which every connected component is a complete graph), perhaps in a relaxed form (connected components may have missing edges), and then assigning each of the remaining vertices to one of the connected components of the cluster subgraph according to some optimization criteria. The more vertices can be included in the initial cluster subgraph (also referred to as independent union of clusters), the more “clusterable” the graph is. This paper proposes a framework for establishing asymptotic bounds on the cardinality of independent unions of clusters in Erdős-Rényi random graphs <math altimg=\"urn:x-wiley:net:media:net22203:net22203-math-0001\" display=\"inline\" location=\"graphic/net22203-math-0001.png\" overflow=\"scroll\">\n<semantics>\n<mrow>\n<mi>G</mi>\n<mo stretchy=\"false\">(</mo>\n<mi>n</mi>\n<mo>,</mo>\n<mi>p</mi>\n<mo stretchy=\"false\">)</mo>\n</mrow>\n$$ G\\left(n,p\\right) $$</annotation>\n</semantics></math> with constant <math altimg=\"urn:x-wiley:net:media:net22203:net22203-math-0002\" display=\"inline\" location=\"graphic/net22203-math-0002.png\" overflow=\"scroll\">\n<semantics>\n<mrow>\n<mi>p</mi>\n</mrow>\n$$ p $$</annotation>\n</semantics></math>, referred to as uniform random graphs. In particular, sufficient conditions ensuring <math altimg=\"urn:x-wiley:net:media:net22203:net22203-math-0003\" display=\"inline\" location=\"graphic/net22203-math-0003.png\" overflow=\"scroll\">\n<semantics>\n<mrow>\n<mi>O</mi>\n<mo stretchy=\"false\">(</mo>\n<mi>log</mi>\n<mi>n</mi>\n<mo stretchy=\"false\">)</mo>\n</mrow>\n$$ O\\left(\\log n\\right) $$</annotation>\n</semantics></math> (where <math altimg=\"urn:x-wiley:net:media:net22203:net22203-math-0004\" display=\"inline\" location=\"graphic/net22203-math-0004.png\" overflow=\"scroll\">\n<semantics>\n<mrow>\n<mi>n</mi>\n</mrow>\n$$ n $$</annotation>\n</semantics></math> is the number of nodes) upper bounds with probability 1 are developed and shown to be applicable for the maximum independent union of cliques as well as some clique relaxations. In addition, it is shown that every graph must have an independent union of cliques of cardinality at least <math altimg=\"urn:x-wiley:net:media:net22203:net22203-math-0005\" display=\"inline\" location=\"graphic/net22203-math-0005.png\" overflow=\"scroll\">\n<semantics>\n<mrow>\n<mi mathvariant=\"normal\">Ω</mi>\n<mo stretchy=\"false\">(</mo>\n<mi>log</mi>\n<mi>n</mi>\n<mo stretchy=\"false\">)</mo>\n</mrow>\n$$ \\Omega \\left(\\log n\\right) $$</annotation>\n</semantics></math>. Since this bound is asymptotically tight on uniform random graphs, this suggests that these graphs can be viewed as a “least clusterable” class of graphs.","PeriodicalId":54734,"journal":{"name":"Networks","volume":"286 1","pages":""},"PeriodicalIF":1.6000,"publicationDate":"2023-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Asymptotic bounds for clustering problems in random graphs\",\"authors\":\"Eugene Lykhovyd, Sergiy Butenko, Pavlo Krokhmal\",\"doi\":\"10.1002/net.22203\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Graph clustering is an important problem in network analysis. This problem can be approached by first finding a large cluster subgraph (i.e., a subgraph in which every connected component is a complete graph), perhaps in a relaxed form (connected components may have missing edges), and then assigning each of the remaining vertices to one of the connected components of the cluster subgraph according to some optimization criteria. The more vertices can be included in the initial cluster subgraph (also referred to as independent union of clusters), the more “clusterable” the graph is. This paper proposes a framework for establishing asymptotic bounds on the cardinality of independent unions of clusters in Erdős-Rényi random graphs <math altimg=\\\"urn:x-wiley:net:media:net22203:net22203-math-0001\\\" display=\\\"inline\\\" location=\\\"graphic/net22203-math-0001.png\\\" overflow=\\\"scroll\\\">\\n<semantics>\\n<mrow>\\n<mi>G</mi>\\n<mo stretchy=\\\"false\\\">(</mo>\\n<mi>n</mi>\\n<mo>,</mo>\\n<mi>p</mi>\\n<mo stretchy=\\\"false\\\">)</mo>\\n</mrow>\\n$$ G\\\\left(n,p\\\\right) $$</annotation>\\n</semantics></math> with constant <math altimg=\\\"urn:x-wiley:net:media:net22203:net22203-math-0002\\\" display=\\\"inline\\\" location=\\\"graphic/net22203-math-0002.png\\\" overflow=\\\"scroll\\\">\\n<semantics>\\n<mrow>\\n<mi>p</mi>\\n</mrow>\\n$$ p $$</annotation>\\n</semantics></math>, referred to as uniform random graphs. In particular, sufficient conditions ensuring <math altimg=\\\"urn:x-wiley:net:media:net22203:net22203-math-0003\\\" display=\\\"inline\\\" location=\\\"graphic/net22203-math-0003.png\\\" overflow=\\\"scroll\\\">\\n<semantics>\\n<mrow>\\n<mi>O</mi>\\n<mo stretchy=\\\"false\\\">(</mo>\\n<mi>log</mi>\\n<mi>n</mi>\\n<mo stretchy=\\\"false\\\">)</mo>\\n</mrow>\\n$$ O\\\\left(\\\\log n\\\\right) $$</annotation>\\n</semantics></math> (where <math altimg=\\\"urn:x-wiley:net:media:net22203:net22203-math-0004\\\" display=\\\"inline\\\" location=\\\"graphic/net22203-math-0004.png\\\" overflow=\\\"scroll\\\">\\n<semantics>\\n<mrow>\\n<mi>n</mi>\\n</mrow>\\n$$ n $$</annotation>\\n</semantics></math> is the number of nodes) upper bounds with probability 1 are developed and shown to be applicable for the maximum independent union of cliques as well as some clique relaxations. In addition, it is shown that every graph must have an independent union of cliques of cardinality at least <math altimg=\\\"urn:x-wiley:net:media:net22203:net22203-math-0005\\\" display=\\\"inline\\\" location=\\\"graphic/net22203-math-0005.png\\\" overflow=\\\"scroll\\\">\\n<semantics>\\n<mrow>\\n<mi mathvariant=\\\"normal\\\">Ω</mi>\\n<mo stretchy=\\\"false\\\">(</mo>\\n<mi>log</mi>\\n<mi>n</mi>\\n<mo stretchy=\\\"false\\\">)</mo>\\n</mrow>\\n$$ \\\\Omega \\\\left(\\\\log n\\\\right) $$</annotation>\\n</semantics></math>. Since this bound is asymptotically tight on uniform random graphs, this suggests that these graphs can be viewed as a “least clusterable” class of graphs.\",\"PeriodicalId\":54734,\"journal\":{\"name\":\"Networks\",\"volume\":\"286 1\",\"pages\":\"\"},\"PeriodicalIF\":1.6000,\"publicationDate\":\"2023-12-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Networks\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1002/net.22203\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Networks","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1002/net.22203","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
引用次数: 0
摘要
图聚类是网络分析中的一个重要问题。这个问题可以通过首先找到一个大的聚类子图(即,其中每个连接组件都是完整图的子图)来解决,可能是在一个宽松的形式(连接组件可能有缺失的边),然后根据一些优化标准将每个剩余的顶点分配给聚类子图的一个连接组件。初始聚类子图(也称为簇的独立并集)中包含的顶点越多,该图就越“可聚类”。本文提出了一个框架,用于建立具有常数p $$ p $$的Erdős-Rényi随机图G(n,p) $$ G\left(n,p\right) $$(称为一致随机图)中簇的独立并集基数的渐近界。特别是,给出了保证概率为1的O(logn) $$ O\left(\log n\right) $$(其中n $$ n $$为节点数)上界的充分条件,并证明了该条件适用于团的最大独立联合以及一些团松弛。此外,还证明了每个图必须具有至少Ω(logn) $$ \Omega \left(\log n\right) $$的基数团的独立并集。由于这个界在均匀随机图上是渐近紧的,这表明这些图可以被看作是“最小可聚类”的图类。
Asymptotic bounds for clustering problems in random graphs
Graph clustering is an important problem in network analysis. This problem can be approached by first finding a large cluster subgraph (i.e., a subgraph in which every connected component is a complete graph), perhaps in a relaxed form (connected components may have missing edges), and then assigning each of the remaining vertices to one of the connected components of the cluster subgraph according to some optimization criteria. The more vertices can be included in the initial cluster subgraph (also referred to as independent union of clusters), the more “clusterable” the graph is. This paper proposes a framework for establishing asymptotic bounds on the cardinality of independent unions of clusters in Erdős-Rényi random graphs with constant , referred to as uniform random graphs. In particular, sufficient conditions ensuring (where is the number of nodes) upper bounds with probability 1 are developed and shown to be applicable for the maximum independent union of cliques as well as some clique relaxations. In addition, it is shown that every graph must have an independent union of cliques of cardinality at least . Since this bound is asymptotically tight on uniform random graphs, this suggests that these graphs can be viewed as a “least clusterable” class of graphs.
期刊介绍:
Network problems are pervasive in our modern technological society, as witnessed by our reliance on physical networks that provide power, communication, and transportation. As well, a number of processes can be modeled using logical networks, as in the scheduling of interdependent tasks, the dating of archaeological artifacts, or the compilation of subroutines comprising a large computer program. Networks provide a common framework for posing and studying problems that often have wider applicability than their originating context.
The goal of this journal is to provide a central forum for the distribution of timely information about network problems, their design and mathematical analysis, as well as efficient algorithms for carrying out optimization on networks. The nonstandard modeling of diverse processes using networks and network concepts is also of interest. Consequently, the disciplines that are useful in studying networks are varied, including applied mathematics, operations research, computer science, discrete mathematics, and economics.
Networks publishes material on the analytic modeling of problems using networks, the mathematical analysis of network problems, the design of computationally efficient network algorithms, and innovative case studies of successful network applications. We do not typically publish works that fall in the realm of pure graph theory (without significant algorithmic and modeling contributions) or papers that deal with engineering aspects of network design. Since the audience for this journal is then necessarily broad, articles that impact multiple application areas or that creatively use new or existing methodologies are especially appropriate. We seek to publish original, well-written research papers that make a substantive contribution to the knowledge base. In addition, tutorial and survey articles are welcomed. All manuscripts are carefully refereed.