Sentiment Classification of Tweets with Explicit Word Negations and Emoji Using Deep Learning

Mdurvwa Usiju Ijairi, M. Abdullahi, Ibrahim Hayatu Hassan
{"title":"Sentiment Classification of Tweets with Explicit Word Negations and Emoji Using Deep Learning","authors":"Mdurvwa Usiju Ijairi, M. Abdullahi, Ibrahim Hayatu Hassan","doi":"10.15282/ijsecs.9.2.2023.3.0114","DOIUrl":null,"url":null,"abstract":"The widespread use of social media platforms such as Twitter, Instagram, Facebook, and LinkedIn have had a huge impact on daily human interactions and decision-making. Owing to Twitter's widespread acceptance, users can express their opinions/sentiments on nearly any issue, ranging from public opinion, a product/service, to even a specific group of people. Sharing these opinions/sentiments results in a massive production of user content known as tweets, which can be assessed to generate new knowledge. Corporate insights, government policy formation, decision-making, and brand identity monitoring all benefit from analyzing the opinions/sentiments expressed in these tweets. Even though several techniques have been created to analyze user sentiments from tweets, social media engagements include negation words and emoji elements that, if not properly pre-processed, would result in misclassification. The majority of available pre-processing techniques rely on clean data and machine learning algorithms to annotate sentiment in unlabeled texts. In this study, we propose a text pre-processing approach that takes into consideration negation words and emoji characteristics in text data by translating these features into single contextual words in tweets to minimize context loss. The proposed preprocessor was evaluated on benchmark Twitter datasets using four deep learning algorithms: Long Short-Term Memory (LSTM), Recurrent Neural Network (RNN), and Artificial Neural Network (ANN). The results showed that LSTM performed better than the approaches already discussed in the literature, with an accuracy of 96.36%, 88.41%, and 95.39%. The findings also suggest that pre-processing information like emoji and explicit word negations aids in the preservation of sentimental information. This appears to be the first study to classify sentiments in tweets while accounting for both explicit word negation conversion and emoji translation.","PeriodicalId":31240,"journal":{"name":"International Journal of Software Engineering and Computer Systems","volume":"131 4 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Software Engineering and Computer Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.15282/ijsecs.9.2.2023.3.0114","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The widespread use of social media platforms such as Twitter, Instagram, Facebook, and LinkedIn have had a huge impact on daily human interactions and decision-making. Owing to Twitter's widespread acceptance, users can express their opinions/sentiments on nearly any issue, ranging from public opinion, a product/service, to even a specific group of people. Sharing these opinions/sentiments results in a massive production of user content known as tweets, which can be assessed to generate new knowledge. Corporate insights, government policy formation, decision-making, and brand identity monitoring all benefit from analyzing the opinions/sentiments expressed in these tweets. Even though several techniques have been created to analyze user sentiments from tweets, social media engagements include negation words and emoji elements that, if not properly pre-processed, would result in misclassification. The majority of available pre-processing techniques rely on clean data and machine learning algorithms to annotate sentiment in unlabeled texts. In this study, we propose a text pre-processing approach that takes into consideration negation words and emoji characteristics in text data by translating these features into single contextual words in tweets to minimize context loss. The proposed preprocessor was evaluated on benchmark Twitter datasets using four deep learning algorithms: Long Short-Term Memory (LSTM), Recurrent Neural Network (RNN), and Artificial Neural Network (ANN). The results showed that LSTM performed better than the approaches already discussed in the literature, with an accuracy of 96.36%, 88.41%, and 95.39%. The findings also suggest that pre-processing information like emoji and explicit word negations aids in the preservation of sentimental information. This appears to be the first study to classify sentiments in tweets while accounting for both explicit word negation conversion and emoji translation.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于深度学习的带有明确词否定和表情符号的推文情感分类
Twitter、Instagram、Facebook和LinkedIn等社交媒体平台的广泛使用,对日常人际互动和决策产生了巨大影响。由于Twitter的广泛接受,用户可以对几乎任何问题表达他们的意见/情绪,从公众意见,产品/服务,甚至是特定的人群。分享这些观点/情绪会产生大量的用户内容,这些内容被称为推文,可以通过评估来产生新的知识。企业洞察、政府政策形成、决策和品牌识别监控都受益于分析这些推文中表达的观点/情绪。尽管已经开发了几种技术来分析推文中的用户情绪,但社交媒体互动包括否定词和表情符号元素,如果不进行适当的预处理,就会导致错误分类。大多数可用的预处理技术依赖于干净的数据和机器学习算法来注释未标记文本中的情感。在本研究中,我们提出了一种文本预处理方法,该方法将文本数据中的否定词和表情符号特征转化为tweet中的单个上下文词,以最大限度地减少上下文丢失。使用长短期记忆(LSTM)、递归神经网络(RNN)和人工神经网络(ANN)四种深度学习算法在基准Twitter数据集上对所提出的预处理器进行了评估。结果表明,LSTM的准确率分别为96.36%、88.41%和95.39%,优于文献中讨论的方法。研究结果还表明,预处理表情符号和明确的单词否定等信息有助于保存情感信息。这似乎是第一个对推文中的情绪进行分类的研究,同时考虑到明确的单词否定转换和表情符号翻译。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
14
期刊最新文献
The Mobile Augmented Reality Application for Improving Learning of Electronic Component Module in TVET A Systematic Mapping on Android-based Platform for Smart Inventory System Sentiment Classification of Tweets with Explicit Word Negations and Emoji Using Deep Learning Protocol Efficiency Using Multiple Level Encoding in Quantum Secure Direct Communication Protocol SECURING IOT HEALTHCARE APPLICATIONS AND BLOCKCHAIN: ADDRESSING SECURITY ATTACKS
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1