基于方面的孟加拉语娱乐评论情感分析

N. Sultana, R. Sultana, Risul Islam Rasel, M. M. Hoque
{"title":"基于方面的孟加拉语娱乐评论情感分析","authors":"N. Sultana, R. Sultana, Risul Islam Rasel, M. M. Hoque","doi":"10.1109/ICCIT57492.2022.10055705","DOIUrl":null,"url":null,"abstract":"Low-resource natural language processing is getting more attention nowadays. Aspect-Based Sentiment Analysis (ABSA) from a high-resource language such as English becomes unchallenging because of sufficient datasets and experimentation tools. However, Aspect-Based Sentiment Analysis from low-resource languages such as Bangla is quite hard. So, many researchers are investing their time and knowledge in low-resource natural language processing. In this paper, we are proposing a Bangla Aspect-Based Sentiment Analysis model using Bangla natural language processing. We have collected 4012 Bangla text comments related to cricket, drama, movie, and music from YouTube. We have applied some very prominent supervised machine learning techniques such as Support Vector Classifier (SVC), Random Forest (RF), and Linear Regression (LR). We have achieved more than 75% accuracy in classifying positive, negative, and neutral sentiments and 80% accuracy in extracting aspects from Bangla texts. Finally, we used publicly available datasets to test our proposed model's generalizability. Furthermore, we find that our proposed approach surpasses earlier related research.","PeriodicalId":255498,"journal":{"name":"2022 25th International Conference on Computer and Information Technology (ICCIT)","volume":"44 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Aspect-Based Sentiment Analysis of Bangla Comments on Entertainment Domain\",\"authors\":\"N. Sultana, R. Sultana, Risul Islam Rasel, M. M. Hoque\",\"doi\":\"10.1109/ICCIT57492.2022.10055705\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Low-resource natural language processing is getting more attention nowadays. Aspect-Based Sentiment Analysis (ABSA) from a high-resource language such as English becomes unchallenging because of sufficient datasets and experimentation tools. However, Aspect-Based Sentiment Analysis from low-resource languages such as Bangla is quite hard. So, many researchers are investing their time and knowledge in low-resource natural language processing. In this paper, we are proposing a Bangla Aspect-Based Sentiment Analysis model using Bangla natural language processing. We have collected 4012 Bangla text comments related to cricket, drama, movie, and music from YouTube. We have applied some very prominent supervised machine learning techniques such as Support Vector Classifier (SVC), Random Forest (RF), and Linear Regression (LR). We have achieved more than 75% accuracy in classifying positive, negative, and neutral sentiments and 80% accuracy in extracting aspects from Bangla texts. Finally, we used publicly available datasets to test our proposed model's generalizability. Furthermore, we find that our proposed approach surpasses earlier related research.\",\"PeriodicalId\":255498,\"journal\":{\"name\":\"2022 25th International Conference on Computer and Information Technology (ICCIT)\",\"volume\":\"44 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 25th International Conference on Computer and Information Technology (ICCIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCIT57492.2022.10055705\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 25th International Conference on Computer and Information Technology (ICCIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCIT57492.2022.10055705","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

低资源的自然语言处理越来越受到人们的关注。由于有足够的数据集和实验工具,来自高资源语言(如英语)的基于方面的情感分析(ABSA)变得没有挑战性。然而,对于像孟加拉语这样的低资源语言,基于方面的情感分析是相当困难的。因此,许多研究人员将他们的时间和知识投入到低资源的自然语言处理中。在本文中,我们提出了一个使用孟加拉语自然语言处理的基于孟加拉语方面的情感分析模型。我们从YouTube上收集了4012条与板球、戏剧、电影和音乐相关的孟加拉文评论。我们应用了一些非常突出的监督机器学习技术,如支持向量分类器(SVC)、随机森林(RF)和线性回归(LR)。我们对正面、负面和中性情绪的分类准确率达到75%以上,从孟加拉语文本中提取方面的准确率达到80%以上。最后,我们使用公开可用的数据集来测试我们提出的模型的泛化性。此外,我们发现我们所提出的方法超越了先前的相关研究。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Aspect-Based Sentiment Analysis of Bangla Comments on Entertainment Domain
Low-resource natural language processing is getting more attention nowadays. Aspect-Based Sentiment Analysis (ABSA) from a high-resource language such as English becomes unchallenging because of sufficient datasets and experimentation tools. However, Aspect-Based Sentiment Analysis from low-resource languages such as Bangla is quite hard. So, many researchers are investing their time and knowledge in low-resource natural language processing. In this paper, we are proposing a Bangla Aspect-Based Sentiment Analysis model using Bangla natural language processing. We have collected 4012 Bangla text comments related to cricket, drama, movie, and music from YouTube. We have applied some very prominent supervised machine learning techniques such as Support Vector Classifier (SVC), Random Forest (RF), and Linear Regression (LR). We have achieved more than 75% accuracy in classifying positive, negative, and neutral sentiments and 80% accuracy in extracting aspects from Bangla texts. Finally, we used publicly available datasets to test our proposed model's generalizability. Furthermore, we find that our proposed approach surpasses earlier related research.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
SlotFinder: A Spatio-temporal based Car Parking System Land Cover and Land Use Detection using Semi-Supervised Learning Comparative Analysis of Process Scheduling Algorithm using AI models Throughput Optimization of IEEE 802.15.4e TSCH-Based Scheduling: A Deep Neural Network (DNN) Scheme Towards Developing a Voice-Over-Guided System for Visually Impaired People to Learn Writing the Alphabets
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1