Synthetic Networks That Preserve Edge Connectivity

Lahari Anne, The-Anh Vu-Le, Minhyuk Park, Tandy Warnow, George Chacko
arXiv - CS - Social and Information Networks, published 2024-08-24. DOI: arxiv-2408.13647 (https://doi.org/arxiv-2408.13647)

Abstract

Since true communities within real-world networks are rarely known, synthetic networks with planted ground truths are valuable for evaluating the performance of community detection methods. Of the synthetic network generation tools available, Stochastic Block Models (SBMs) produce networks whose ground truth clusters closely approximate input parameters taken from real-world networks and clusterings. However, we show that SBMs can produce disconnected ground truth clusters, even when given parameters from clusterings where all clusters are connected. Here we describe the REalistic Cluster Connectivity Simulator (RECCS), a technique that modifies an SBM synthetic network to improve the fit to a given clustered real-world network with respect to edge connectivity within clusters, while maintaining the good fit with respect to other network and cluster statistics. Using real-world networks with up to 13.9 million nodes, we show that RECCS, applied to stochastic block models, yields synthetic networks that fit cluster edge connectivity better than unmodified SBMs, while providing roughly the same quality of fit for other network and clustering parameters.
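The observation that an SBM can yield disconnected ground truth clusters can be illustrated with a small sketch. The code below is not RECCS and is not taken from the paper: it is a minimal, standard-library-only illustration, and the function names (`sample_sbm`, `is_connected_within`, `disconnected_clusters`) and the two-probability parameterization (`p_in`, `p_out`) are assumptions made for this example. It checks only whether each cluster induces a connected subgraph, which is the weakest form of the edge connectivity requirement discussed above.

```python
import random
from collections import deque
from itertools import combinations


def sample_sbm(sizes, p_in, p_out, seed=0):
    """Sample a simple undirected SBM (hypothetical helper for illustration).

    Nodes in the same block are joined with probability p_in,
    nodes in different blocks with probability p_out.
    Returns (edges, clusters), where clusters is a list of node lists.
    """
    rng = random.Random(seed)
    clusters, label, start = [], {}, 0
    for cid, size in enumerate(sizes):
        nodes = list(range(start, start + size))
        clusters.append(nodes)
        for v in nodes:
            label[v] = cid
        start += size
    edges = []
    for u, v in combinations(range(start), 2):
        p = p_in if label[u] == label[v] else p_out
        if rng.random() < p:
            edges.append((u, v))
    return edges, clusters


def is_connected_within(nodes, edges):
    """Return True if the subgraph induced by `nodes` is connected."""
    members = set(nodes)
    if len(members) <= 1:
        return True
    # Keep only edges with both endpoints inside the cluster.
    adj = {v: [] for v in members}
    for u, v in edges:
        if u in members and v in members:
            adj[u].append(v)
            adj[v].append(u)
    # Breadth-first search from an arbitrary member.
    first = next(iter(members))
    seen = {first}
    queue = deque([first])
    while queue:
        x = queue.popleft()
        for y in adj[x]:
            if y not in seen:
                seen.add(y)
                queue.append(y)
    return len(seen) == len(members)


def disconnected_clusters(edges, clusters):
    """Indices of clusters whose induced subgraph is not connected."""
    return [i for i, nodes in enumerate(clusters)
            if not is_connected_within(nodes, edges)]
```

Calling `disconnected_clusters(*sample_sbm([50, 50], 0.03, 0.01))` with sparse within-cluster probabilities is one way to look for planted clusters that come out disconnected; whether any do depends on the seed and parameters. Measuring higher edge connectivity (minimum cut size per cluster) would need a min-cut routine, which this sketch does not include.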