{"title":"带离群值的随机块模型中群落数量的估计","authors":"Jingsong Xiao, Fei Ye, Weidong Ma, Ying Yang","doi":"10.1093/comnet/cnac042","DOIUrl":null,"url":null,"abstract":"\n The stochastic block model (SBM) is a popular model for community detecting problems. Many community detecting approaches have been proposed, and most of them assume that the number of communities is given previously. However, in practice, the number of communities is often unknown. Plenty of approaches were proposed to estimate the number of communities, but most of them were computationally intensive. Moreover, when outliers exist, there are no approaches to consistently estimate the number of communities. In this article, we propose a fast method based on the eigenvalues of the regularized and normalized adjacency matrix to estimate the number of communities under the SBM with outliers. We show that our method can consistently estimate the number of communities when outliers exist. Moreover, we extend our method to the degree-corrected SBM. We show that our approach is comparable to the other existing approaches in simulations. We also illustrate our approach on four real-world networks.","PeriodicalId":2,"journal":{"name":"ACS Applied Bio Materials","volume":null,"pages":null},"PeriodicalIF":4.6000,"publicationDate":"2022-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Estimating the number of communities in the stochastic block model with outliers\",\"authors\":\"Jingsong Xiao, Fei Ye, Weidong Ma, Ying Yang\",\"doi\":\"10.1093/comnet/cnac042\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\n The stochastic block model (SBM) is a popular model for community detecting problems. Many community detecting approaches have been proposed, and most of them assume that the number of communities is given previously. However, in practice, the number of communities is often unknown. Plenty of approaches were proposed to estimate the number of communities, but most of them were computationally intensive. Moreover, when outliers exist, there are no approaches to consistently estimate the number of communities. In this article, we propose a fast method based on the eigenvalues of the regularized and normalized adjacency matrix to estimate the number of communities under the SBM with outliers. We show that our method can consistently estimate the number of communities when outliers exist. Moreover, we extend our method to the degree-corrected SBM. We show that our approach is comparable to the other existing approaches in simulations. We also illustrate our approach on four real-world networks.\",\"PeriodicalId\":2,\"journal\":{\"name\":\"ACS Applied Bio Materials\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":4.6000,\"publicationDate\":\"2022-08-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ACS Applied Bio Materials\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://doi.org/10.1093/comnet/cnac042\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MATERIALS SCIENCE, BIOMATERIALS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Applied Bio Materials","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1093/comnet/cnac042","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATERIALS SCIENCE, BIOMATERIALS","Score":null,"Total":0}
Estimating the number of communities in the stochastic block model with outliers
The stochastic block model (SBM) is a popular model for community detecting problems. Many community detecting approaches have been proposed, and most of them assume that the number of communities is given previously. However, in practice, the number of communities is often unknown. Plenty of approaches were proposed to estimate the number of communities, but most of them were computationally intensive. Moreover, when outliers exist, there are no approaches to consistently estimate the number of communities. In this article, we propose a fast method based on the eigenvalues of the regularized and normalized adjacency matrix to estimate the number of communities under the SBM with outliers. We show that our method can consistently estimate the number of communities when outliers exist. Moreover, we extend our method to the degree-corrected SBM. We show that our approach is comparable to the other existing approaches in simulations. We also illustrate our approach on four real-world networks.