Zi-xuan Han, Lei-lei Shi, Lu Liu, Liang Jiang, Wan Tang, Xiao Chen, Jing-yu Yang, Ayodeji O. Ayorinde, Nick Antonopoulos
{"title":"H-Louvain: Hierarchical Louvain-based community detection in social media data streams","authors":"Zi-xuan Han, Lei-lei Shi, Lu Liu, Liang Jiang, Wan Tang, Xiao Chen, Jing-yu Yang, Ayodeji O. Ayorinde, Nick Antonopoulos","doi":"10.1007/s12083-024-01689-9","DOIUrl":null,"url":null,"abstract":"<p>The rise of online social networks has fundamentally transformed the traditional way of social interaction and information dissemination, leading to a growing interest in precise community detection and in-depth network structure analysis. However, the complexity of network structures and potential issues like singularity and subjectivity in information extraction affect the accuracy of community detection. To overcome these challenges, we propose a new community detection algorithm, known as the Hierarchical Louvain (H-Louvain) algorithm. It enhances the performance of community detection through a multi-level processing and information fusion strategy. Specifically, the algorithm integrates graph compression techniques with the Hyperlink-Induced Topic Search (HITS) algorithm for initial network hierarchical partitioning, simultaneously filtering out low-quality posts and users while retaining critical information. Furthermore, the proposed method enhances semantic representation by automatically determining an appropriate number of attribute vector dimensions and obtaining attribute weight information through the calculation of self-authority values and the \"minimum distance\" attribute of posts. Lastly, the method creates an initial user training set through network re-partitioning in hierarchical layers and improves the Louvain algorithm for community partitioning by estimating the comprehensive influence of nodes. Extensive experimentation has demonstrated that the H-Louvain algorithm outperforms state-of-the-art comparative algorithms in terms of accuracy and stability in community detection based on real-world Twitter datasets.</p>","PeriodicalId":49313,"journal":{"name":"Peer-To-Peer Networking and Applications","volume":"21 1","pages":""},"PeriodicalIF":3.3000,"publicationDate":"2024-05-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Peer-To-Peer Networking and Applications","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s12083-024-01689-9","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
The rise of online social networks has fundamentally transformed the traditional way of social interaction and information dissemination, leading to a growing interest in precise community detection and in-depth network structure analysis. However, the complexity of network structures and potential issues like singularity and subjectivity in information extraction affect the accuracy of community detection. To overcome these challenges, we propose a new community detection algorithm, known as the Hierarchical Louvain (H-Louvain) algorithm. It enhances the performance of community detection through a multi-level processing and information fusion strategy. Specifically, the algorithm integrates graph compression techniques with the Hyperlink-Induced Topic Search (HITS) algorithm for initial network hierarchical partitioning, simultaneously filtering out low-quality posts and users while retaining critical information. Furthermore, the proposed method enhances semantic representation by automatically determining an appropriate number of attribute vector dimensions and obtaining attribute weight information through the calculation of self-authority values and the "minimum distance" attribute of posts. Lastly, the method creates an initial user training set through network re-partitioning in hierarchical layers and improves the Louvain algorithm for community partitioning by estimating the comprehensive influence of nodes. Extensive experimentation has demonstrated that the H-Louvain algorithm outperforms state-of-the-art comparative algorithms in terms of accuracy and stability in community detection based on real-world Twitter datasets.
期刊介绍:
The aim of the Peer-to-Peer Networking and Applications journal is to disseminate state-of-the-art research and development results in this rapidly growing research area, to facilitate the deployment of P2P networking and applications, and to bring together the academic and industry communities, with the goal of fostering interaction to promote further research interests and activities, thus enabling new P2P applications and services. The journal not only addresses research topics related to networking and communications theory, but also considers the standardization, economic, and engineering aspects of P2P technologies, and their impacts on software engineering, computer engineering, networked communication, and security.
The journal serves as a forum for tackling the technical problems arising from both file sharing and media streaming applications. It also includes state-of-the-art technologies in the P2P security domain.
Peer-to-Peer Networking and Applications publishes regular papers, tutorials and review papers, case studies, and correspondence from the research, development, and standardization communities. Papers addressing system, application, and service issues are encouraged.