A Rule-Based Chinese Sentiment Mining System with Self-Expanding Dictionary - Taking TripAdvisor as an Example

Jung-Bin Li, Li-Bing Yang
{"title":"A Rule-Based Chinese Sentiment Mining System with Self-Expanding Dictionary - Taking TripAdvisor as an Example","authors":"Jung-Bin Li, Li-Bing Yang","doi":"10.1109/ICEBE.2017.45","DOIUrl":null,"url":null,"abstract":"With the wide adoption of social networks, people are accustomed to post their ideas and thinking via these platforms. Tweets or comments online usually come with individual sentiment, which are time consuming to be analyzed by human labor. This study encapsulates a prototype Chinese sentiment mining system and takes a global hotel reviewing website TripAdvisor as the evaluation sample. The proposed sentiment mining model is compared with logistic regression and support vector machine models based on their performances. This proposed model outperforms LR and SVM in all datasets in terms of classification accuracy and F-measure. An additional module embedded in proposed system enables expansion of novel or undefined terms to the dictionary referred (NTUSD). With this Word2Vec-based module, the system further improves accuracy while reduces both type I and type II error for at least 5%.","PeriodicalId":347774,"journal":{"name":"2017 IEEE 14th International Conference on e-Business Engineering (ICEBE)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE 14th International Conference on e-Business Engineering (ICEBE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICEBE.2017.45","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

With the wide adoption of social networks, people are accustomed to post their ideas and thinking via these platforms. Tweets or comments online usually come with individual sentiment, which are time consuming to be analyzed by human labor. This study encapsulates a prototype Chinese sentiment mining system and takes a global hotel reviewing website TripAdvisor as the evaluation sample. The proposed sentiment mining model is compared with logistic regression and support vector machine models based on their performances. This proposed model outperforms LR and SVM in all datasets in terms of classification accuracy and F-measure. An additional module embedded in proposed system enables expansion of novel or undefined terms to the dictionary referred (NTUSD). With this Word2Vec-based module, the system further improves accuracy while reduces both type I and type II error for at least 5%.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于规则的自扩展词典中文情感挖掘系统——以TripAdvisor为例
随着社交网络的广泛采用,人们习惯于通过这些平台发布自己的想法和想法。网上的推文或评论通常都带有个人情绪,需要耗费大量的时间来进行人工分析。本研究封装了一个中文情感挖掘系统原型,并以全球酒店点评网站TripAdvisor作为评价样本。将所提出的情感挖掘模型与逻辑回归和支持向量机模型的性能进行了比较。该模型在所有数据集的分类精度和F-measure方面都优于LR和SVM。一个额外的模块嵌入到拟议的系统中,可以将新的或未定义的术语扩展到所引用的字典(NTUSD)。使用这个基于word2vec的模块,系统进一步提高了精度,同时将I类和II类错误减少了至少5%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
The Integrated System Optimization Based on the Boiler Combustion and Denitration with Denitration Operating Cost Consideration Chinese Questions Classification in the Law Domain Dust Removal with Boundary and Spatial Constraint for Videos Captured in Car Indexing for Large Scale Data Querying Based on Spark SQL Finding K-Most Influential Users in Social Networks for Information Diffusion Based on Network Structure and Different User Behavioral Patterns
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1