Enhanced Community Detection via Convolutional Neural Network: A Modified Approach Based on MRFasGCN Algorithm

IF 3.4 3区计算机科学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS IEEE Access Pub Date : 2024-10-04 DOI:10.1109/ACCESS.2024.3474303

Puneet Kumar;Dalwinder Singh;Mamoona Humayun;Ali Alqazzaz;Arun Malik;Ibrahim Alrashdi;Isha Batra;Ghadah Naif Alwakid

{"title":"Enhanced Community Detection via Convolutional Neural Network: A Modified Approach Based on MRFasGCN Algorithm","authors":"Puneet Kumar;Dalwinder Singh;Mamoona Humayun;Ali Alqazzaz;Arun Malik;Ibrahim Alrashdi;Isha Batra;Ghadah Naif Alwakid","doi":"10.1109/ACCESS.2024.3474303","DOIUrl":null,"url":null,"abstract":"Community detection is a very important research topic in the field of Social Network Analysis. Lots of researchers are working in this field due to its applications in various fields like medicine, social, Business, Marketing, and research. Researchers are proposing new algorithms to detect the communities having better performance as compared to the existing techniques. Initially, Newman and Girvan proposed traditional algorithms for community detection from social networks in 2004, but with the growth of social networks, Convolutional Neural Network (CNN) based algorithms are proposed by different researchers in recent years due to the inefficiency of traditional methods. After reviewing the state-of-the-art algorithms based on CNN learned that MRFasGCN is having the best performance compared to any other state-of-the-art algorithms for large data sets. In this algorithm, researchers have integrated the technique of Graph Convolutional Neural Network (GCN) with the statistical model Markov Random Field (MRF) to get better results and after implementing it on large datasets comparison is done on its results with other state-of-the-art algorithms and got to know its performance is far better than any other algorithm. While MRFasGCN is performing well on social networks and provides ground truth communities, there is a possibility available for improvement due to the sparsity problem. This paper proposes a new algorithm called Modified MRFasGCN. Two modifications are done to the existing algorithm, 1. In pre-processing, rather than passing the adjacency matrix with the normalized adjacency matrix, it will pass the reconstructed adjacency matrix and normalized reconstructed adjacency matrix, which resolves the sparsity problem, 2. GCN layer output will be fed to the MRF layer and refined results will be passed to the Adam optimizer without subtraction. Our experimental analysis shows that the modified algorithm provides better ground truth communities than the MRFasGCN and solves the problem of sparsity as passes a reconstructed adjacency matrix. In this paper, the proposed algorithm is executed on different datasets having different sizes like CORA (2708 Nodes), Flicker (80513 Nodes) and DBLP (317080 Nodes) and compared on different Community Detection metrics like accuracy, NMI, F1 Score, and execution time with other algorithms. After Experiments NMI value for MRFasGCN on DBLP data set is 0.662 while for Modified MRFasGCN it is 0.672, Modified MRFasGCN algorithm provides significant improvement of 2.9% in performance as compared to MRFasGCN. F1-Score of proposed algorithm is 0.511 on DBLP dataset which is better than MRFasGCN.","PeriodicalId":13079,"journal":{"name":"IEEE Access","volume":"12 ","pages":"146733-146748"},"PeriodicalIF":3.4000,"publicationDate":"2024-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10705291","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Access","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10705291/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

Community detection is a very important research topic in the field of Social Network Analysis. Lots of researchers are working in this field due to its applications in various fields like medicine, social, Business, Marketing, and research. Researchers are proposing new algorithms to detect the communities having better performance as compared to the existing techniques. Initially, Newman and Girvan proposed traditional algorithms for community detection from social networks in 2004, but with the growth of social networks, Convolutional Neural Network (CNN) based algorithms are proposed by different researchers in recent years due to the inefficiency of traditional methods. After reviewing the state-of-the-art algorithms based on CNN learned that MRFasGCN is having the best performance compared to any other state-of-the-art algorithms for large data sets. In this algorithm, researchers have integrated the technique of Graph Convolutional Neural Network (GCN) with the statistical model Markov Random Field (MRF) to get better results and after implementing it on large datasets comparison is done on its results with other state-of-the-art algorithms and got to know its performance is far better than any other algorithm. While MRFasGCN is performing well on social networks and provides ground truth communities, there is a possibility available for improvement due to the sparsity problem. This paper proposes a new algorithm called Modified MRFasGCN. Two modifications are done to the existing algorithm, 1. In pre-processing, rather than passing the adjacency matrix with the normalized adjacency matrix, it will pass the reconstructed adjacency matrix and normalized reconstructed adjacency matrix, which resolves the sparsity problem, 2. GCN layer output will be fed to the MRF layer and refined results will be passed to the Adam optimizer without subtraction. Our experimental analysis shows that the modified algorithm provides better ground truth communities than the MRFasGCN and solves the problem of sparsity as passes a reconstructed adjacency matrix. In this paper, the proposed algorithm is executed on different datasets having different sizes like CORA (2708 Nodes), Flicker (80513 Nodes) and DBLP (317080 Nodes) and compared on different Community Detection metrics like accuracy, NMI, F1 Score, and execution time with other algorithms. After Experiments NMI value for MRFasGCN on DBLP data set is 0.662 while for Modified MRFasGCN it is 0.672, Modified MRFasGCN algorithm provides significant improvement of 2.9% in performance as compared to MRFasGCN. F1-Score of proposed algorithm is 0.511 on DBLP dataset which is better than MRFasGCN.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

通过卷积神经网络增强群落检测：基于 MRFasGCN 算法的改进方法

社群检测是社交网络分析领域一个非常重要的研究课题。由于其在医学、社会、商业、营销和研究等各个领域的应用，许多研究人员都在从事这一领域的研究。与现有技术相比，研究人员提出了性能更好的新算法来检测社群。最初，Newman 和 Girvan 于 2004 年提出了从社交网络中检测社群的传统算法，但随着社交网络的发展，由于传统方法效率低下，近年来不同的研究人员提出了基于卷积神经网络（CNN）的算法。在回顾了基于 CNN 的最先进算法后，我们发现 MRFasGCN 在大型数据集方面比其他任何最先进算法的性能都要好。在该算法中，研究人员将图卷积神经网络（GCN）技术与马尔可夫随机场（MRF）统计模型进行了整合，以获得更好的结果，并在大型数据集上实施后，将其结果与其他最先进的算法进行了比较，发现其性能远远优于其他算法。虽然 MRFasGCN 在社交网络上表现出色，并提供了地面实况社区，但由于稀疏性问题，仍有改进的可能。本文提出了一种名为修正 MRFasGCN 的新算法。对现有算法做了两处修改：1.在预处理中，不传递邻接矩阵和归一化邻接矩阵，而是传递重构邻接矩阵和归一化重构邻接矩阵，从而解决了稀疏性问题；2.将 GCN 层输出馈送至 MRF 层，并将精炼结果传递至 Adam 优化器，而不做减法处理。我们的实验分析表明，修改后的算法比 MRFasGCN 提供了更好的地面真实群落，并通过重建的邻接矩阵解决了稀疏性问题。本文在 CORA（2708 个节点）、Flicker（80513 个节点）和 DBLP（317080 个节点）等不同规模的数据集上执行了所提出的算法，并就准确率、NMI、F1 分数和执行时间等不同的社区检测指标与其他算法进行了比较。在 DBLP 数据集上进行实验后，MRFasGCN 的 NMI 值为 0.662，而修改后的 MRFasGCN 的 NMI 值为 0.672，与 MRFasGCN 相比，修改后的 MRFasGCN 算法的性能显著提高了 2.9%。在 DBLP 数据集上，建议算法的 F1 分数为 0.511，优于 MRFasGCN。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE Access COMPUTER SCIENCE, INFORMATION SYSTEMSENGIN-ENGINEERING, ELECTRICAL & ELECTRONIC

CiteScore

9.80

自引率

7.70%

发文量

6673

审稿时长

6 weeks

期刊介绍： IEEE Access® is a multidisciplinary, open access (OA), applications-oriented, all-electronic archival journal that continuously presents the results of original research or development across all of IEEE''s fields of interest. IEEE Access will publish articles that are of high interest to readers, original, technically correct, and clearly presented. Supported by author publication charges (APC), its hallmarks are a rapid peer review and publication process with open access to all readers. Unlike IEEE''s traditional Transactions or Journals, reviews are "binary", in that reviewers will either Accept or Reject an article in the form it is submitted in order to achieve rapid turnaround. Especially encouraged are submissions on: Multidisciplinary topics, or applications-oriented articles and negative results that do not fit within the scope of IEEE''s traditional journals. Practical articles discussing new experiments or measurement techniques, interesting solutions to engineering. Development of new or improved fabrication or manufacturing techniques. Reviews or survey articles of new or evolving fields oriented to assist others in understanding the new area.