Implementing Rule-based and Naive Bayes Algorithm on Incremental Sentiment Analysis System for Indonesian Online Transportation Services Review

Cut Fiarni, Herastia Maharani, Enriko Irawan
{"title":"Implementing Rule-based and Naive Bayes Algorithm on Incremental Sentiment Analysis System for Indonesian Online Transportation Services Review","authors":"Cut Fiarni, Herastia Maharani, Enriko Irawan","doi":"10.1109/ICITEED.2018.8534912","DOIUrl":null,"url":null,"abstract":"The emerging trend on smartphone application and service use on a daily basis, has also increased the volume of online opinion regarding various topics on the internet. In Indonesia, one of the most popular topics to share, post and comment is online-based transportation service (TNCs). These comments could lead to valuable knowledge that would be tremendous assets for supporting critical business intelligence applications. The knowledge gained from social media can potentially lead to the development of novel services that are better tailored to users’ needs and also meet the objectives of businesses offering them. The problem to build an effective Indonesian sentiment analysis system is that there is still no availability of the corpus, complete with each word characteristic, whether it subjective, adjective, adverb, noun, etc. Another problem is because their cultural heritage, or for politeness reason, Indonesian people often used negation in their sentence. So instead of saying “ugly”, they say “not good”, or instead of saying expensive, they said “not cheap”, which could lead to miss-classify of the sentiment. Thus, this research focus on building model that has the ability to classify TNC element target on its sentiment class, by considering negation form sentences and then implement it in the proposed sentiment analysis system. Another important feature is system’s ability to learning new keywords for TNC elements and sentiment. This proposed approach would use rule based algorithm to classify target object, the polarity of sentiment and negation from online opinion. And used Naive Bayes algorithm for the incremental feature. Result from this study show that the proposed system is able to classify user opinion with 90% precision and 70% recall. This concludes that from evaluation results, the proposed algorithm performs well to automatically analyze sentiment.","PeriodicalId":142523,"journal":{"name":"2018 10th International Conference on Information Technology and Electrical Engineering (ICITEE)","volume":"48 7","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 10th International Conference on Information Technology and Electrical Engineering (ICITEE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICITEED.2018.8534912","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

The emerging trend on smartphone application and service use on a daily basis, has also increased the volume of online opinion regarding various topics on the internet. In Indonesia, one of the most popular topics to share, post and comment is online-based transportation service (TNCs). These comments could lead to valuable knowledge that would be tremendous assets for supporting critical business intelligence applications. The knowledge gained from social media can potentially lead to the development of novel services that are better tailored to users’ needs and also meet the objectives of businesses offering them. The problem to build an effective Indonesian sentiment analysis system is that there is still no availability of the corpus, complete with each word characteristic, whether it subjective, adjective, adverb, noun, etc. Another problem is because their cultural heritage, or for politeness reason, Indonesian people often used negation in their sentence. So instead of saying “ugly”, they say “not good”, or instead of saying expensive, they said “not cheap”, which could lead to miss-classify of the sentiment. Thus, this research focus on building model that has the ability to classify TNC element target on its sentiment class, by considering negation form sentences and then implement it in the proposed sentiment analysis system. Another important feature is system’s ability to learning new keywords for TNC elements and sentiment. This proposed approach would use rule based algorithm to classify target object, the polarity of sentiment and negation from online opinion. And used Naive Bayes algorithm for the incremental feature. Result from this study show that the proposed system is able to classify user opinion with 90% precision and 70% recall. This concludes that from evaluation results, the proposed algorithm performs well to automatically analyze sentiment.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于规则和朴素贝叶斯算法的印尼在线交通服务评论增量情感分析系统
智能手机应用程序和服务的日常使用的新兴趋势也增加了关于互联网上各种话题的在线意见量。在印尼,最受欢迎的话题之一是在线交通服务(TNCs)。这些评论可能导致有价值的知识,这些知识将成为支持关键业务智能应用程序的巨大资产。从社交媒体中获得的知识可能会导致新的服务的发展,这些服务更适合用户的需求,也符合提供这些服务的企业的目标。建立一个有效的印尼语情感分析系统的问题是,目前仍然没有可用的语料库,无论是主词、形容词、副词、名词等,都没有完整的每个词的特征。另一个问题是由于他们的文化传统,或者出于礼貌的原因,印尼人经常在句子中使用否定。所以他们不说“丑”,而是说“不好”,或者不说“贵”,而是说“不便宜”,这可能会导致对情绪的错误分类。因此,本研究的重点是通过考虑否定形式句,构建能够在情感类上对TNC元素目标进行分类的模型,并将其实现到所提出的情感分析系统中。另一个重要的特征是系统能够学习TNC元素和情感的新关键词。该方法将使用基于规则的算法对目标对象、情感极性和在线意见的否定性进行分类。并对增量特征采用朴素贝叶斯算法。研究结果表明,该系统能够以90%的准确率和70%的召回率对用户意见进行分类。从评价结果来看,本文提出的算法在情感自动分析方面表现良好。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Implementing Rule-based and Naive Bayes Algorithm on Incremental Sentiment Analysis System for Indonesian Online Transportation Services Review Wideband Circularly Polarized Printed Slot Antenna With A Pair of Slant Line Slots A new model for measuring the complexity of SQL commands VoBiRo - Vocational Bipedal Robot Platform, Kinematic and Locomotion Control Maximum Allowable Intermittent Renewable Energy Source Penetration in Java-Bali Power System
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1