{"title":"基于AdaBoost的微博垃圾评论识别方法","authors":"Ling Huang, Xueming Li","doi":"10.3724/SP.J.1087.2013.03563","DOIUrl":null,"url":null,"abstract":"In view of the existence of a lot of spam comments in microblog,a new method based on AdaBoost was proposed to identify spam comments. This method firstly extracted feature vectors which consisted of eight feature values to represent the comments,then trained several weak classifiers which were better than random prediction on these features via AdaBoost algorithm,and finally combined these weighted weak classifiers to build a strong classifier with a high precision. The experimental results on comment data sets extracted from the popular Sina microblogs indicate that the selected eight features are effective for the method,and it has a high recognition rate in the identification of spam comments in microblog.","PeriodicalId":61778,"journal":{"name":"计算机应用","volume":"33 1","pages":"3563-3566"},"PeriodicalIF":0.0000,"publicationDate":"2013-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Identification method of spam comments in microblog based on AdaBoost\",\"authors\":\"Ling Huang, Xueming Li\",\"doi\":\"10.3724/SP.J.1087.2013.03563\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In view of the existence of a lot of spam comments in microblog,a new method based on AdaBoost was proposed to identify spam comments. This method firstly extracted feature vectors which consisted of eight feature values to represent the comments,then trained several weak classifiers which were better than random prediction on these features via AdaBoost algorithm,and finally combined these weighted weak classifiers to build a strong classifier with a high precision. The experimental results on comment data sets extracted from the popular Sina microblogs indicate that the selected eight features are effective for the method,and it has a high recognition rate in the identification of spam comments in microblog.\",\"PeriodicalId\":61778,\"journal\":{\"name\":\"计算机应用\",\"volume\":\"33 1\",\"pages\":\"3563-3566\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-12-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"计算机应用\",\"FirstCategoryId\":\"1093\",\"ListUrlMain\":\"https://doi.org/10.3724/SP.J.1087.2013.03563\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"计算机应用","FirstCategoryId":"1093","ListUrlMain":"https://doi.org/10.3724/SP.J.1087.2013.03563","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Identification method of spam comments in microblog based on AdaBoost
In view of the existence of a lot of spam comments in microblog,a new method based on AdaBoost was proposed to identify spam comments. This method firstly extracted feature vectors which consisted of eight feature values to represent the comments,then trained several weak classifiers which were better than random prediction on these features via AdaBoost algorithm,and finally combined these weighted weak classifiers to build a strong classifier with a high precision. The experimental results on comment data sets extracted from the popular Sina microblogs indicate that the selected eight features are effective for the method,and it has a high recognition rate in the identification of spam comments in microblog.