A Deviation based Ensemble Algorithm for Sarcasm Detection in Online Comments

Anurita Bose, Deepanjali Pandit, Nidhi Prakash, Ashwini M. Joshi
{"title":"A Deviation based Ensemble Algorithm for Sarcasm Detection in Online Comments","authors":"Anurita Bose, Deepanjali Pandit, Nidhi Prakash, Ashwini M. Joshi","doi":"10.1109/ICECCT56650.2023.10179724","DOIUrl":null,"url":null,"abstract":"Sarcasm refers to the use of irony to mock or convey contempt and involves the use of words that mean the opposite of what someone truly intends to convey. Online forums which enable users to express sarcasm as a sentiment tend to induce misunderstandings between different parties and obscure the users' true intentions. This leads to ambiguity being one of the prime challenges in detecting sarcasm. Another challenge in sarcasm detection is the rapidly growing size of language vocabularies with the addition of new slang words every day. Additionally, usage of emojis in online text can greatly influence the polarity of a sentence by inducing a sarcastic tone. These setbacks make sarcasm a particularly demanding sentiment to determine. In this paper, the statistical significance of various deep learning models for the purpose of detecting sarcasm in online comments containing emojis is explored. For the task of binary classification, GRU achieves an accuracy score of 73.44% with an F1-score of 73.96%. The proposed ensemble-based approach yields an accuracy score of 74.41% for the combination of LSTM and GRU, which is comparable to the accuracy achieved with conventional ensemble techniques such as max-voting and averaging. Twenty-six different hybrid combinations of deep learning models were explored and the most optimal performing ones were identified. CNN and Global Average Pooling 1D are two other architectures that were explored.","PeriodicalId":180790,"journal":{"name":"2023 Fifth International Conference on Electrical, Computer and Communication Technologies (ICECCT)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 Fifth International Conference on Electrical, Computer and Communication Technologies (ICECCT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICECCT56650.2023.10179724","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Sarcasm refers to the use of irony to mock or convey contempt and involves the use of words that mean the opposite of what someone truly intends to convey. Online forums which enable users to express sarcasm as a sentiment tend to induce misunderstandings between different parties and obscure the users' true intentions. This leads to ambiguity being one of the prime challenges in detecting sarcasm. Another challenge in sarcasm detection is the rapidly growing size of language vocabularies with the addition of new slang words every day. Additionally, usage of emojis in online text can greatly influence the polarity of a sentence by inducing a sarcastic tone. These setbacks make sarcasm a particularly demanding sentiment to determine. In this paper, the statistical significance of various deep learning models for the purpose of detecting sarcasm in online comments containing emojis is explored. For the task of binary classification, GRU achieves an accuracy score of 73.44% with an F1-score of 73.96%. The proposed ensemble-based approach yields an accuracy score of 74.41% for the combination of LSTM and GRU, which is comparable to the accuracy achieved with conventional ensemble techniques such as max-voting and averaging. Twenty-six different hybrid combinations of deep learning models were explored and the most optimal performing ones were identified. CNN and Global Average Pooling 1D are two other architectures that were explored.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于偏差的在线评论讽刺检测集成算法
讽刺指的是用讽刺的方式来嘲笑或表达蔑视,包括使用与某人真正想表达的意思相反的词语。允许用户将讽刺作为一种情感表达的网络论坛,容易引起各方之间的误解,模糊用户的真实意图。这导致歧义成为检测讽刺的主要挑战之一。讽刺检测的另一个挑战是语言词汇量的快速增长,每天都有新的俚语词汇增加。此外,在网络文本中使用表情符号可以通过诱导讽刺语气来极大地影响句子的极性。这些挫折使讽刺成为一种特别需要判断的情绪。本文探讨了各种深度学习模型用于检测包含表情符号的在线评论中的讽刺的统计意义。对于二值分类任务,GRU的准确率得分为73.44%,f1得分为73.96%。基于集成的LSTM和GRU组合方法的准确率为74.41%,与传统集成技术(如max-voting和average)的准确率相当。探索了26种不同的深度学习模型混合组合,并确定了性能最优的模型。CNN和Global Average Pooling 1D是我们探索的另外两种架构。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Model of Markovian Queue with Catastrophe, Restoration and Balking Nibble Based Two Bit Invert Coding Technique for Serial Network on Chip Links Hesitant Triangular Fuzzy Dombi Operators and Its Applications Fuel Cost Optimization of Coal-Fired Power Plants using Coal Blending Proportions An Efficient Classification for Light Motor Vehicles using CatBoost Algorithm
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1