哪种配置效果最好?监督阿拉伯语推特情感分析的实验研究

Talaat Khalil, Amal Halaby, Muhammad Hammad, S. El-Beltagy
{"title":"哪种配置效果最好?监督阿拉伯语推特情感分析的实验研究","authors":"Talaat Khalil, Amal Halaby, Muhammad Hammad, S. El-Beltagy","doi":"10.1109/ACLING.2015.19","DOIUrl":null,"url":null,"abstract":"Arabic Twitter Sentiment Analysis has been gaining a lot of attention lately with supervised approaches being exploited widely. However, to date, there has not been an experimental study that examines how different configurations of the Bag of Words model, text representation scheme, can affect various supervised machine learning methods. The goal of the presented work is to do exactly that. Specifically, this work examines which configurations work best for each of three machine learning approaches that have shown good results when applied on the task of sentiment analysis, namely: Support Vector Machines, Compliment Naïve Bayes, and Multinomial Naïve Bayes. Experimenting with different datasets has shown that each of these classifiers has a Bag of Words configuration in conjunction with which, it consistently performs best. It also showed that some features are dataset dependent.","PeriodicalId":404268,"journal":{"name":"2015 First International Conference on Arabic Computational Linguistics (ACLing)","volume":"117 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"17","resultStr":"{\"title\":\"Which Configuration Works Best? An Experimental Study on Supervised Arabic Twitter Sentiment Analysis\",\"authors\":\"Talaat Khalil, Amal Halaby, Muhammad Hammad, S. El-Beltagy\",\"doi\":\"10.1109/ACLING.2015.19\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Arabic Twitter Sentiment Analysis has been gaining a lot of attention lately with supervised approaches being exploited widely. However, to date, there has not been an experimental study that examines how different configurations of the Bag of Words model, text representation scheme, can affect various supervised machine learning methods. The goal of the presented work is to do exactly that. Specifically, this work examines which configurations work best for each of three machine learning approaches that have shown good results when applied on the task of sentiment analysis, namely: Support Vector Machines, Compliment Naïve Bayes, and Multinomial Naïve Bayes. Experimenting with different datasets has shown that each of these classifiers has a Bag of Words configuration in conjunction with which, it consistently performs best. It also showed that some features are dataset dependent.\",\"PeriodicalId\":404268,\"journal\":{\"name\":\"2015 First International Conference on Arabic Computational Linguistics (ACLing)\",\"volume\":\"117 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-04-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"17\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 First International Conference on Arabic Computational Linguistics (ACLing)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ACLING.2015.19\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 First International Conference on Arabic Computational Linguistics (ACLing)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ACLING.2015.19","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 17

摘要

阿拉伯语推特情感分析最近获得了很多关注,有监督的方法被广泛利用。然而,到目前为止,还没有一项实验研究来检验词袋模型(文本表示方案)的不同配置如何影响各种监督机器学习方法。本文的目标就是做到这一点。具体来说,这项工作检查了哪种配置最适合三种机器学习方法,这些方法在应用于情感分析任务时显示出良好的结果,即:支持向量机,恭维Naïve贝叶斯和多项式Naïve贝叶斯。对不同数据集的实验表明,这些分类器中的每一个都有一个词袋配置,与之相结合,它始终表现最好。它还表明,一些特征是数据集相关的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Which Configuration Works Best? An Experimental Study on Supervised Arabic Twitter Sentiment Analysis
Arabic Twitter Sentiment Analysis has been gaining a lot of attention lately with supervised approaches being exploited widely. However, to date, there has not been an experimental study that examines how different configurations of the Bag of Words model, text representation scheme, can affect various supervised machine learning methods. The goal of the presented work is to do exactly that. Specifically, this work examines which configurations work best for each of three machine learning approaches that have shown good results when applied on the task of sentiment analysis, namely: Support Vector Machines, Compliment Naïve Bayes, and Multinomial Naïve Bayes. Experimenting with different datasets has shown that each of these classifiers has a Bag of Words configuration in conjunction with which, it consistently performs best. It also showed that some features are dataset dependent.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Which Configuration Works Best? An Experimental Study on Supervised Arabic Twitter Sentiment Analysis Increasing the Accuracy of Opinion Mining in Arabic Tunisian Arabic aeb Wordnet: Current State and Future Extensions A Named Entities Recognition System for Modern Standard Arabic using Rule-Based Approach Transducers Cascades for an Automatic Recognition of Arabic Named Entities in Order to Establish Links to Free Resources
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1