{"title":"Research on Improvement of Text Processing and Clustering Algorithms in Public Opinion Early Warning System","authors":"Kongyu Yang, Ruijie Miao","doi":"10.1109/ICSAI.2018.8599424","DOIUrl":null,"url":null,"abstract":"In order to provide the necessary data for Public opinion monitoring and trend warning, this paper did some researches on text processing and clustering algorithms based on hot topics of the Weibo. Data that get from Weibo were classification data which contain two properties. To adapt this feature and meet the requirement of public opinion trends warning, hamming distance was used to do text similarity computing. By improving the traditional K-means algorithm, a new K-mode algorithm which is used to text clustering on hot topics was achieved. Simulation and results analysis indicated the text processing method was accurate and suitable to the microblog public opinion early warning.","PeriodicalId":375852,"journal":{"name":"2018 5th International Conference on Systems and Informatics (ICSAI)","volume":"12 4","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 5th International Conference on Systems and Informatics (ICSAI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSAI.2018.8599424","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
In order to provide the necessary data for Public opinion monitoring and trend warning, this paper did some researches on text processing and clustering algorithms based on hot topics of the Weibo. Data that get from Weibo were classification data which contain two properties. To adapt this feature and meet the requirement of public opinion trends warning, hamming distance was used to do text similarity computing. By improving the traditional K-means algorithm, a new K-mode algorithm which is used to text clustering on hot topics was achieved. Simulation and results analysis indicated the text processing method was accurate and suitable to the microblog public opinion early warning.