Coastal Sentiment Review Using Naïve Bayes with Feature Selection Genetic Algorithm

O. Somantri, R. Maharrani, Santi Purwaningrum
{"title":"Coastal Sentiment Review Using Naïve Bayes with Feature Selection Genetic Algorithm","authors":"O. Somantri, R. Maharrani, Santi Purwaningrum","doi":"10.15294/sji.v10i3.43988","DOIUrl":null,"url":null,"abstract":"Purpose: The tourism potential in the maritime sector can be Indonesia's mainstay at this time, especially in enjoying the charm of the natural beauty of the coast as people know Indonesia is an archipelagic country. The purpose of this study is to find the best model by applying the feature selection genetic algorithm (GA) and Information Gain (IG) to get the best Naïve Bayes (NB) model and the best features to produce the best level of sentiment classification accuracy.Methods: The stages of the research were carried out by going through the process of searching, pre-processing, analyzing research data using the Naïve Bayes model and optimizing genetic algorithms, validating data, and model evaluation.Result: The experimental results show that the best model is naïve Bayes based on information gain and the genetic algorithm yields an accuracy rate of 86.34%.Novelty: The main contribution to this research is proposing a new model of the best NB optimization model by applying an optimization algorithm in the search for feature selection to increase sentiment classification accuracy.","PeriodicalId":30781,"journal":{"name":"Scientific Journal of Informatics","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scientific Journal of Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.15294/sji.v10i3.43988","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Purpose: The tourism potential in the maritime sector can be Indonesia's mainstay at this time, especially in enjoying the charm of the natural beauty of the coast as people know Indonesia is an archipelagic country. The purpose of this study is to find the best model by applying the feature selection genetic algorithm (GA) and Information Gain (IG) to get the best Naïve Bayes (NB) model and the best features to produce the best level of sentiment classification accuracy.Methods: The stages of the research were carried out by going through the process of searching, pre-processing, analyzing research data using the Naïve Bayes model and optimizing genetic algorithms, validating data, and model evaluation.Result: The experimental results show that the best model is naïve Bayes based on information gain and the genetic algorithm yields an accuracy rate of 86.34%.Novelty: The main contribution to this research is proposing a new model of the best NB optimization model by applying an optimization algorithm in the search for feature selection to increase sentiment classification accuracy.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于特征选择遗传算法的朴素贝叶斯海岸情感评价
目的:海事部门的旅游潜力可能是印尼目前的支柱,尤其是在享受海岸自然美景的魅力方面,因为人们知道印尼是一个群岛国家。本研究的目的是通过应用特征选择遗传算法(GA)和信息增益(IG)来找到最佳模型,以获得最佳的朴素贝叶斯(NB)模型和产生最佳情绪分类精度水平的最佳特征。方法:研究阶段包括搜索、预处理、使用朴素贝叶斯模型分析研究数据以及优化遗传算法、验证数据和模型评估。结果:实验结果表明,最佳模型是基于信息增益的朴素贝叶斯,遗传算法的准确率为86.34%。新颖性:本研究的主要贡献是提出了一种新的最佳NB优化模型,将优化算法应用于特征选择的搜索中,以提高情感分类的准确率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
13
审稿时长
24 weeks
期刊最新文献
A Comparative Study of Random Forest and Double Random Forest Models from View Points of Their Interpretability Comparative Analysis of LSTM Neural Network and SVM for USD Exchange Rate Prediction: A Study on Different Training Data Scenarios Knowledge Discovery from Confusion Matrix of Pruned CART in Imbalanced Microarray Data Ovarian Cancer Classification Comparison of Discriminant Analysis and Support Vector Machine on Mixed Categorical and Continuous Independent Variables for COVID-19 Patients Data The Comparison of K-Nearest Neighbors and Random Forest Algorithm to Recognize Indonesian Sign Language in a Real-Time
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1