Sentiment Classification at Discourse Segment Level: Experiments on multi-domain Arabic corpus

J. Lang. Technol. Comput. Linguistics Pub Date : 2015-07-01 DOI:10.21248/jlcl.30.2015.193

Amine Bayoudhi, Hatem Ghorbel, Houssem Koubaa, Lamia Hadrich Belguith

{"title":"Sentiment Classification at Discourse Segment Level: Experiments on multi-domain Arabic corpus","authors":"Amine Bayoudhi, Hatem Ghorbel, Houssem Koubaa, Lamia Hadrich Belguith","doi":"10.21248/jlcl.30.2015.193","DOIUrl":null,"url":null,"abstract":"Sentiment classification aims to determine whether the semantic orientation of a text is positive, negative or neutral. It can be tackled at several levels of granularity: expression or phrase level, sentence level, and document level. In the scope of this research, we are interested in the sentence and sub-sentential level classification which can provide very useful trends for information retrieval and extraction applications, Question Answering systems and summarization tasks. In the context of our work, we address the problem of Arabic sentiment classification at sub-sentential level by (i) building a high coverage sentiment lexicon with semi-automatic approach; (ii) creating a large multi-domain annotated sentiment corpus segmented into discourse segments in order to evaluate our sentiment approach; and (iii) applying a lexicon-based approach with an aggregation model taking into account advanced linguistic phenomena such as negation and intensification. The results that we obtained are considered good and close to state of the art results in English language.","PeriodicalId":402489,"journal":{"name":"J. Lang. Technol. Comput. Linguistics","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"J. Lang. Technol. Comput. Linguistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.21248/jlcl.30.2015.193","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 9

Abstract

Sentiment classification aims to determine whether the semantic orientation of a text is positive, negative or neutral. It can be tackled at several levels of granularity: expression or phrase level, sentence level, and document level. In the scope of this research, we are interested in the sentence and sub-sentential level classification which can provide very useful trends for information retrieval and extraction applications, Question Answering systems and summarization tasks. In the context of our work, we address the problem of Arabic sentiment classification at sub-sentential level by (i) building a high coverage sentiment lexicon with semi-automatic approach; (ii) creating a large multi-domain annotated sentiment corpus segmented into discourse segments in order to evaluate our sentiment approach; and (iii) applying a lexicon-based approach with an aggregation model taking into account advanced linguistic phenomena such as negation and intensification. The results that we obtained are considered good and close to state of the art results in English language.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

语段层面的情感分类:多领域阿拉伯语语料库实验

情感分类的目的是确定文本的语义取向是积极的、消极的还是中性的。它可以在几个粒度级别上进行处理:表达式或短语级别、句子级别和文档级别。在本研究的范围内，我们对句子和子句子级别的分类感兴趣，这可以为信息检索和提取应用、问答系统和摘要任务提供非常有用的趋势。在我们的工作背景下，我们通过以下方法解决了亚句子级别的阿拉伯语情感分类问题:(i)使用半自动方法构建高覆盖率的情感词典;(ii)创建一个大型的多领域标注情感语料库，将其分割成话语段，以评估我们的情感方法;(三)运用基于词典的方法和考虑到高级语言现象(如否定和强化)的聚合模型。我们获得的结果被认为是良好的，接近最先进的英语语言结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

J. Lang. Technol. Comput. Linguistics

自引率

0.00%

发文量