基于图稀疏的社交网络社区结构检测

Partha Basuchowdhuri, Satyaki Sikdar, Sonu Sreshtha, S. Majumder
{"title":"基于图稀疏的社交网络社区结构检测","authors":"Partha Basuchowdhuri, Satyaki Sikdar, Sonu Sreshtha, S. Majumder","doi":"10.1145/2888451.2888479","DOIUrl":null,"url":null,"abstract":"Community structures are inherent in social networks and finding them is an interesting and well-studied problem. Finding community structures in social networks is similar to locating densely connected clusters of nodes in a graph. One of the popular methods for finding communities is to first find the inter-community edges and then removing them to reveal the communities. It is well-known that a network centrality measure named edge betweenness can be used to detect the inter-community edges. The edges with high edge betweenness are those that fall in a large number of shortest paths out of all possible pairs of shortest paths. Finding all-pair shortest paths is a computationally expensive task, especially for large-sized graphs. So we construct a t-spanner, a known graph sparsification technique, for finding edges with high betweenness and eventually find communities by removing such edges. Using the t-spanner, we then detect the inter-community edges in O(km) running time by building a distance oracle of size O(kn1+1/k), where t = 2k-1. Compared to the traditional community detection methods dependent on calculation of betweenness values, our algorithm runs much faster. Experiments show that our algorithm finds communities of quality comparable to the other state-of-the-art community detection algorithms.","PeriodicalId":136431,"journal":{"name":"Proceedings of the 3rd IKDD Conference on Data Science, 2016","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2016-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Detecting Community Structures in Social Networks by Graph Sparsification\",\"authors\":\"Partha Basuchowdhuri, Satyaki Sikdar, Sonu Sreshtha, S. Majumder\",\"doi\":\"10.1145/2888451.2888479\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Community structures are inherent in social networks and finding them is an interesting and well-studied problem. Finding community structures in social networks is similar to locating densely connected clusters of nodes in a graph. One of the popular methods for finding communities is to first find the inter-community edges and then removing them to reveal the communities. It is well-known that a network centrality measure named edge betweenness can be used to detect the inter-community edges. The edges with high edge betweenness are those that fall in a large number of shortest paths out of all possible pairs of shortest paths. Finding all-pair shortest paths is a computationally expensive task, especially for large-sized graphs. So we construct a t-spanner, a known graph sparsification technique, for finding edges with high betweenness and eventually find communities by removing such edges. Using the t-spanner, we then detect the inter-community edges in O(km) running time by building a distance oracle of size O(kn1+1/k), where t = 2k-1. Compared to the traditional community detection methods dependent on calculation of betweenness values, our algorithm runs much faster. Experiments show that our algorithm finds communities of quality comparable to the other state-of-the-art community detection algorithms.\",\"PeriodicalId\":136431,\"journal\":{\"name\":\"Proceedings of the 3rd IKDD Conference on Data Science, 2016\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-03-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 3rd IKDD Conference on Data Science, 2016\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/2888451.2888479\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 3rd IKDD Conference on Data Science, 2016","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2888451.2888479","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

摘要

社区结构是社会网络中固有的,找到它们是一个有趣且研究得很充分的问题。在社交网络中寻找社区结构类似于在图中定位密集连接的节点簇。一种常用的寻找群落的方法是首先找到群落间的边缘,然后将其移除以显示群落。众所周知,可以使用一种称为边缘之间的网络中心性度量来检测社区间的边缘。在所有可能的最短路径对中,处于大量最短路径中的边是具有高边间度的边。寻找全对最短路径是一项计算成本很高的任务,特别是对于大型图。因此,我们构造了一个t形扳手,一种已知的图稀疏化技术,用于寻找具有高中间度的边,并最终通过去除这些边来找到群落。然后,我们使用t-扳手,通过构建大小为O(kn1+1/k)的距离oracle,在O(km)运行时间内检测社区间边缘,其中t = 2k-1。与传统的依赖于计算中间值的社区检测方法相比,我们的算法运行速度更快。实验表明,我们的算法发现的社区质量与其他最先进的社区检测算法相当。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Detecting Community Structures in Social Networks by Graph Sparsification
Community structures are inherent in social networks and finding them is an interesting and well-studied problem. Finding community structures in social networks is similar to locating densely connected clusters of nodes in a graph. One of the popular methods for finding communities is to first find the inter-community edges and then removing them to reveal the communities. It is well-known that a network centrality measure named edge betweenness can be used to detect the inter-community edges. The edges with high edge betweenness are those that fall in a large number of shortest paths out of all possible pairs of shortest paths. Finding all-pair shortest paths is a computationally expensive task, especially for large-sized graphs. So we construct a t-spanner, a known graph sparsification technique, for finding edges with high betweenness and eventually find communities by removing such edges. Using the t-spanner, we then detect the inter-community edges in O(km) running time by building a distance oracle of size O(kn1+1/k), where t = 2k-1. Compared to the traditional community detection methods dependent on calculation of betweenness values, our algorithm runs much faster. Experiments show that our algorithm finds communities of quality comparable to the other state-of-the-art community detection algorithms.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
On the Dynamics of Username Changing Behavior on Twitter Smart filters for social retrieval Improving Urban Transportation through Social Media Analytics AMEO 2015: A dataset comprising AMCAT test scores, biodata details and employment outcomes of job seekers Learning from Gurus: Analysis and Modeling of Reopened Questions on Stack Overflow
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1