{"title":"基于规则的自扩展词典中文情感挖掘系统——以TripAdvisor为例","authors":"Jung-Bin Li, Li-Bing Yang","doi":"10.1109/ICEBE.2017.45","DOIUrl":null,"url":null,"abstract":"With the wide adoption of social networks, people are accustomed to post their ideas and thinking via these platforms. Tweets or comments online usually come with individual sentiment, which are time consuming to be analyzed by human labor. This study encapsulates a prototype Chinese sentiment mining system and takes a global hotel reviewing website TripAdvisor as the evaluation sample. The proposed sentiment mining model is compared with logistic regression and support vector machine models based on their performances. This proposed model outperforms LR and SVM in all datasets in terms of classification accuracy and F-measure. An additional module embedded in proposed system enables expansion of novel or undefined terms to the dictionary referred (NTUSD). With this Word2Vec-based module, the system further improves accuracy while reduces both type I and type II error for at least 5%.","PeriodicalId":347774,"journal":{"name":"2017 IEEE 14th International Conference on e-Business Engineering (ICEBE)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"A Rule-Based Chinese Sentiment Mining System with Self-Expanding Dictionary - Taking TripAdvisor as an Example\",\"authors\":\"Jung-Bin Li, Li-Bing Yang\",\"doi\":\"10.1109/ICEBE.2017.45\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the wide adoption of social networks, people are accustomed to post their ideas and thinking via these platforms. Tweets or comments online usually come with individual sentiment, which are time consuming to be analyzed by human labor. This study encapsulates a prototype Chinese sentiment mining system and takes a global hotel reviewing website TripAdvisor as the evaluation sample. The proposed sentiment mining model is compared with logistic regression and support vector machine models based on their performances. This proposed model outperforms LR and SVM in all datasets in terms of classification accuracy and F-measure. An additional module embedded in proposed system enables expansion of novel or undefined terms to the dictionary referred (NTUSD). With this Word2Vec-based module, the system further improves accuracy while reduces both type I and type II error for at least 5%.\",\"PeriodicalId\":347774,\"journal\":{\"name\":\"2017 IEEE 14th International Conference on e-Business Engineering (ICEBE)\",\"volume\":\"10 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 IEEE 14th International Conference on e-Business Engineering (ICEBE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICEBE.2017.45\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE 14th International Conference on e-Business Engineering (ICEBE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICEBE.2017.45","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Rule-Based Chinese Sentiment Mining System with Self-Expanding Dictionary - Taking TripAdvisor as an Example
With the wide adoption of social networks, people are accustomed to post their ideas and thinking via these platforms. Tweets or comments online usually come with individual sentiment, which are time consuming to be analyzed by human labor. This study encapsulates a prototype Chinese sentiment mining system and takes a global hotel reviewing website TripAdvisor as the evaluation sample. The proposed sentiment mining model is compared with logistic regression and support vector machine models based on their performances. This proposed model outperforms LR and SVM in all datasets in terms of classification accuracy and F-measure. An additional module embedded in proposed system enables expansion of novel or undefined terms to the dictionary referred (NTUSD). With this Word2Vec-based module, the system further improves accuracy while reduces both type I and type II error for at least 5%.