Rating Prediction of Product Reviews of Bangla Language using Machine Learning Algorithms

Md. Iqbal Hossain, Maqsudur Rahman, M. T. Ahmed, Md. Saifur Rahman, A. Z. M. T. Islam
{"title":"Rating Prediction of Product Reviews of Bangla Language using Machine Learning Algorithms","authors":"Md. Iqbal Hossain, Maqsudur Rahman, M. T. Ahmed, Md. Saifur Rahman, A. Z. M. T. Islam","doi":"10.1109/AIMS52415.2021.9466022","DOIUrl":null,"url":null,"abstract":"The only way to provide feedback about a product is through reviews. When a new shopper proceeds to an online shop to purchase a product but does not have adequate time to study the reviews provided by other shoppers to get an opinion about the product, the shopper determines whether to buy the product or not on the number rating. Through reviews, shoppers can acquaint everyone about the product's quality and the manufacturer can enhance their products and business by interpreting that review. However, manufacturers demand a number review more than a text review for business analysis. This paper represents a machine learning based model for predicting the number rating from written text for Bangla product review. This study performs on a dataset collected manually from Daraz.com.bd, a Bangladeshi leading e-commerce shop. We have implemented Support Vector Machine (SVM), Random Forest, XGBoost, and Logistic Regression with Term Frequency-Inverse Document Frequency (TF-IDF) Vectorizer on our collected dataset and record all the performance metrics like accuracy, precision, recall and f1-score. From these above four algorithms, SVM showed more outstanding results than others in terms of performance metrics. SVM achieved 90% accuracy on the applied dataset. The other SVM performance metrics are 0.90, 0.92, and 0.91 for precision, recall and f1-score respectively.","PeriodicalId":299121,"journal":{"name":"2021 International Conference on Artificial Intelligence and Mechatronics Systems (AIMS)","volume":"473 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Artificial Intelligence and Mechatronics Systems (AIMS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AIMS52415.2021.9466022","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

The only way to provide feedback about a product is through reviews. When a new shopper proceeds to an online shop to purchase a product but does not have adequate time to study the reviews provided by other shoppers to get an opinion about the product, the shopper determines whether to buy the product or not on the number rating. Through reviews, shoppers can acquaint everyone about the product's quality and the manufacturer can enhance their products and business by interpreting that review. However, manufacturers demand a number review more than a text review for business analysis. This paper represents a machine learning based model for predicting the number rating from written text for Bangla product review. This study performs on a dataset collected manually from Daraz.com.bd, a Bangladeshi leading e-commerce shop. We have implemented Support Vector Machine (SVM), Random Forest, XGBoost, and Logistic Regression with Term Frequency-Inverse Document Frequency (TF-IDF) Vectorizer on our collected dataset and record all the performance metrics like accuracy, precision, recall and f1-score. From these above four algorithms, SVM showed more outstanding results than others in terms of performance metrics. SVM achieved 90% accuracy on the applied dataset. The other SVM performance metrics are 0.90, 0.92, and 0.91 for precision, recall and f1-score respectively.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于机器学习算法的孟加拉语产品评论评级预测
提供产品反馈的唯一途径就是通过评论。当一个新购物者进入网上商店购买产品,但没有足够的时间研究其他购物者提供的评论以获得对产品的意见时,购物者根据数字评级决定是否购买该产品。通过评论,购物者可以让每个人都了解产品的质量,制造商可以通过解释评论来提高他们的产品和业务。然而,对于业务分析,制造商更需要数字审查而不是文本审查。本文提出了一种基于机器学习的模型,用于从孟加拉产品评论的书面文本中预测数字评级。这项研究的数据集是从Daraz.com.bd手工收集的,这是一家孟加拉国领先的电子商务商店。我们在收集的数据集上实现了支持向量机(SVM)、随机森林(Random Forest)、XGBoost和具有Term Frequency- inverse Document Frequency (TF-IDF)矢量器的逻辑回归,并记录了所有的性能指标,如准确性、精密度、召回率和f1-score。在以上四种算法中,SVM在性能指标方面表现出比其他算法更突出的结果。SVM在应用数据集上的准确率达到90%。其他支持向量机性能指标分别为0.90,0.92和0.91的精度,召回率和f1-score。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
A Feasibility Study of M2M/IoT Numbering Model in Indonesia Classification of sensorimotor cortex signals based on the task durations: an fNIRS-BCI study A genetic algorithm with an elitism replacement method for solving the nonfunctional web service composition under fuzzy QoS parameters The Effect of Wave Stirring Mechanism in Improving Heating Uniformity in Microwave Chamber For Fishing Industry A Survey of Emotion Recognition using Physiological Signal in Wearable Devices
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1