{"title":"An Enhanced Twitter Sentiment Analysis Model using Negation Scope Identification Methods","authors":"Monir Yahya Ali Salmony, Arman Rasool Faridi","doi":"10.1109/INDIACom51348.2021.00155","DOIUrl":null,"url":null,"abstract":"Sentiment Analysis (SA), which is also known as Opinion Mining, is a hot-fastest growing research area, making it challenging to follow all its activities. It intends to study peoples' thoughts, feelings, and attitudes about topics, events, issues, entities, individuals, and attributes in social media (e.g., social networking sites, forums, blogs, etc.) expressed by either text comments or tweets. Twitter is one of the world's largest online microblogging platforms that allows its users to freely post texts called tweets. It offers a wealth of information, therefore utilizing SA to analyze this information into positive or negative will assist organizations' and customers' decision-making that will have a significant impact on daily life. SA draws the attention of scientific research in the Natural Language Processing community due to the text structure challenges that may contain negation. Negation is a widespread linguistic structure that changes the text meaning to the opposite and affects text polarity. Therefore, it needs to be considered in sentiment analysis systems. In this paper, supervised machine learning models have been used as a baseline to categorize the sentiment of a Twitter dataset using (Bag of Words) and (Term Frequency Inverse Document Frequency) feature representation methods. Then we applied negation scope identification methods to find negated tokens and investigate how embedding these tokens can raise SA classifiers' accuracy. The results of the sentiment classification task show an improvement once considering these tokens.","PeriodicalId":415594,"journal":{"name":"2021 8th International Conference on Computing for Sustainable Global Development (INDIACom)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 8th International Conference on Computing for Sustainable Global Development (INDIACom)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INDIACom51348.2021.00155","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
Sentiment Analysis (SA), which is also known as Opinion Mining, is a hot-fastest growing research area, making it challenging to follow all its activities. It intends to study peoples' thoughts, feelings, and attitudes about topics, events, issues, entities, individuals, and attributes in social media (e.g., social networking sites, forums, blogs, etc.) expressed by either text comments or tweets. Twitter is one of the world's largest online microblogging platforms that allows its users to freely post texts called tweets. It offers a wealth of information, therefore utilizing SA to analyze this information into positive or negative will assist organizations' and customers' decision-making that will have a significant impact on daily life. SA draws the attention of scientific research in the Natural Language Processing community due to the text structure challenges that may contain negation. Negation is a widespread linguistic structure that changes the text meaning to the opposite and affects text polarity. Therefore, it needs to be considered in sentiment analysis systems. In this paper, supervised machine learning models have been used as a baseline to categorize the sentiment of a Twitter dataset using (Bag of Words) and (Term Frequency Inverse Document Frequency) feature representation methods. Then we applied negation scope identification methods to find negated tokens and investigate how embedding these tokens can raise SA classifiers' accuracy. The results of the sentiment classification task show an improvement once considering these tokens.
情感分析(SA),也被称为意见挖掘,是一个发展最快的热门研究领域,这使得跟踪其所有活动具有挑战性。它旨在研究人们对社交媒体(如社交网站、论坛、博客等)中的话题、事件、问题、实体、个人和属性的想法、感受和态度,这些想法、感受和态度可以通过文本评论或tweet来表达。Twitter是世界上最大的在线微博平台之一,它允许用户自由发布被称为tweet的文本。它提供了丰富的信息,因此利用情景分析将这些信息分析为积极或消极将有助于组织和客户的决策,这将对日常生活产生重大影响。由于可能包含否定的文本结构挑战,SA引起了自然语言处理界的科学研究的关注。否定是一种广泛存在的语言结构,它使语篇意义向相反的方向转变,影响语篇极性。因此,在情感分析系统中需要考虑它。在本文中,使用监督机器学习模型作为基线,使用(Bag of Words)和(Term Frequency Inverse Document Frequency)特征表示方法对Twitter数据集的情感进行分类。然后,我们应用否定范围识别方法来寻找否定令牌,并研究如何嵌入这些令牌来提高SA分类器的准确率。考虑到这些标记,情感分类任务的结果显示出改进。