Which Configuration Works Best? An Experimental Study on Supervised Arabic Twitter Sentiment Analysis

2015 First International Conference on Arabic Computational Linguistics (ACLing). Publication date: 2015-04-17. DOI: 10.1109/ACLING.2015.19

Talaat Khalil, Amal Halaby, Muhammad Hammad, S. El-Beltagy


Citations: 17

Abstract

Arabic Twitter sentiment analysis has recently attracted considerable attention, and supervised approaches are widely used. To date, however, no experimental study has examined how different configurations of the Bag of Words model, a text representation scheme, affect various supervised machine learning methods. This work aims to do exactly that. Specifically, it examines which configurations work best for each of three machine learning approaches that have shown good results on sentiment analysis: Support Vector Machines, Complement Naïve Bayes, and Multinomial Naïve Bayes. Experiments on different datasets show that each of these classifiers has a Bag of Words configuration with which it consistently performs best. They also show that some features are dataset dependent.
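To make the idea of a "Bag of Words configuration" concrete, the following is a minimal sketch in plain Python. The abstract does not list which options the study varied, so the two dimensions used here (n-gram range and term weighting) and the names `ngrams`, `bag_of_words`, and `configs` are illustrative assumptions, not the authors' setup. The toy English tokens stand in for preprocessed tweets.

```python
import math
from collections import Counter
from itertools import product


def ngrams(tokens, n_min, n_max):
    """Return all n-grams with n_min <= n <= n_max as space-joined strings."""
    return [
        " ".join(tokens[i:i + n])
        for n in range(n_min, n_max + 1)
        for i in range(len(tokens) - n + 1)
    ]


def bag_of_words(docs, ngram_range=(1, 1), weighting="count"):
    """Turn tokenized documents into sparse {term: weight} dictionaries."""
    counts = [Counter(ngrams(doc, *ngram_range)) for doc in docs]
    if weighting == "count":
        return [dict(c) for c in counts]
    if weighting == "binary":
        # Presence/absence only: repeated terms are not counted twice.
        return [{term: 1 for term in c} for c in counts]
    if weighting == "tfidf":
        n_docs = len(docs)
        doc_freq = Counter(term for c in counts for term in c)
        # Smoothed idf; this is one common variant among several.
        idf = {t: math.log((1 + n_docs) / (1 + df)) + 1
               for t, df in doc_freq.items()}
        return [{t: tf * idf[t] for t, tf in c.items()} for c in counts]
    raise ValueError(f"unknown weighting: {weighting}")


# Hypothetical toy data, not taken from the paper's datasets.
docs = [["good", "good", "film"], ["bad", "film"]]

# A small, hypothetical configuration grid: n-gram range x weighting.
configs = list(product([(1, 1), (1, 2)], ["count", "binary", "tfidf"]))
for ngram_range, weighting in configs:
    vectors = bag_of_words(docs, ngram_range, weighting)
    print(ngram_range, weighting, len(vectors[0]), "features in doc 0")
```

In an experiment of the kind the abstract describes, each configuration's feature vectors would be fed to each classifier in turn and the resulting scores compared per dataset; the classifiers themselves are omitted from this sketch.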
