{"title":"博客主体影响力与用户主体影响力评价方法研究","authors":"Chunhui Deng, Peifu Zhou, Huifang Deng","doi":"10.1145/3424978.3425078","DOIUrl":null,"url":null,"abstract":"The large group of users and massive blogs information in microblog bring serious problem of information overload. It constitutes a great challenge in finding the most valuable and influential blogs or users for a given subject. To address this, we first constructed an improved text clustering algorithm called DSCURE based on CURE (clustering using representatives) to generate clusters on different subjects. The improvements include representative points' selection which considers both density and scatterness of the points and text distance calculation which combines Vector Space Model and Latent Dirichlet Allocation model. The experimental results show that this algorithm outperforms other algorithms and reaches the stable state earlier. Second, we proposed a blog's subject influence evaluation model which mainly considers the subject relevance, the content quality and the timeliness of a blog. Experimental results demonstrate that this model is effective and reasonable in identifying influential blogs. Based on this model, we further put forward a user's subject influence evaluation model - QualityRank which considers the characteristics of the user's personal attributes, the user's blog features and the network structure. Experimental results show that QualityRank outperforms other models we referred to.","PeriodicalId":178822,"journal":{"name":"Proceedings of the 4th International Conference on Computer Science and Application Engineering","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Evaluation Method Study of Blog's Subject Influence and User's Subject Influence\",\"authors\":\"Chunhui Deng, Peifu Zhou, Huifang Deng\",\"doi\":\"10.1145/3424978.3425078\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The large group of users and massive blogs information in microblog bring serious problem of information overload. It constitutes a great challenge in finding the most valuable and influential blogs or users for a given subject. To address this, we first constructed an improved text clustering algorithm called DSCURE based on CURE (clustering using representatives) to generate clusters on different subjects. The improvements include representative points' selection which considers both density and scatterness of the points and text distance calculation which combines Vector Space Model and Latent Dirichlet Allocation model. The experimental results show that this algorithm outperforms other algorithms and reaches the stable state earlier. Second, we proposed a blog's subject influence evaluation model which mainly considers the subject relevance, the content quality and the timeliness of a blog. Experimental results demonstrate that this model is effective and reasonable in identifying influential blogs. Based on this model, we further put forward a user's subject influence evaluation model - QualityRank which considers the characteristics of the user's personal attributes, the user's blog features and the network structure. Experimental results show that QualityRank outperforms other models we referred to.\",\"PeriodicalId\":178822,\"journal\":{\"name\":\"Proceedings of the 4th International Conference on Computer Science and Application Engineering\",\"volume\":\"20 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-10-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 4th International Conference on Computer Science and Application Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3424978.3425078\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 4th International Conference on Computer Science and Application Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3424978.3425078","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Evaluation Method Study of Blog's Subject Influence and User's Subject Influence
The large group of users and massive blogs information in microblog bring serious problem of information overload. It constitutes a great challenge in finding the most valuable and influential blogs or users for a given subject. To address this, we first constructed an improved text clustering algorithm called DSCURE based on CURE (clustering using representatives) to generate clusters on different subjects. The improvements include representative points' selection which considers both density and scatterness of the points and text distance calculation which combines Vector Space Model and Latent Dirichlet Allocation model. The experimental results show that this algorithm outperforms other algorithms and reaches the stable state earlier. Second, we proposed a blog's subject influence evaluation model which mainly considers the subject relevance, the content quality and the timeliness of a blog. Experimental results demonstrate that this model is effective and reasonable in identifying influential blogs. Based on this model, we further put forward a user's subject influence evaluation model - QualityRank which considers the characteristics of the user's personal attributes, the user's blog features and the network structure. Experimental results show that QualityRank outperforms other models we referred to.