Babita Sonare, J. Dewan, Sudeep D. Thepade, Vedang Dadape, Tejas Gadge, Aditya Gavali
Detecting Sarcasm in Reddit Comments: A Comparative Analysis
2023 4th International Conference for Emerging Technology (INCET), published 2023-05-26
DOI: 10.1109/INCET57972.2023.10170613 (https://doi.org/10.1109/INCET57972.2023.10170613)
Abstract
Sarcasm is the use of ironic or cutting remarks to mock or show disdain for something. Many people use it frequently on social media sites such as Reddit and Twitter. This study investigates the effectiveness of deep learning and machine learning algorithms in detecting sarcasm, using the SARC dataset of 1.3 million Reddit comments with almost equal numbers of sarcastic and neutral comments. We compare several well-known classification methods, including Logistic Regression, Naïve Bayes, a Decision Tree Classifier, and Convolutional Neural Networks (CNN). The best-performing model, designed as a fusion of CNN and Long Short-Term Memory (LSTM) networks, reached an accuracy of 73.2% and outperformed the alternative classification algorithms. These findings suggest how machine learning techniques could be used in the future to identify sarcasm on social networking websites.
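As a rough illustration of one of the baselines named in the abstract, the sketch below implements a small multinomial Naïve Bayes text classifier with add-one smoothing and scores it by accuracy, the metric the abstract reports. It is not the authors' implementation: the toy comments, the whitespace tokenizer, and the names `NaiveBayes`, `tokenize`, and `accuracy` are assumptions made for this example, and none of the data comes from SARC.

```python
import math
from collections import Counter


def tokenize(text):
    # Hypothetical tokenizer: lowercase and split on whitespace.
    return text.lower().split()


class NaiveBayes:
    """Multinomial Naive Bayes with add-one (Laplace) smoothing."""

    def fit(self, texts, labels):
        self.classes = sorted(set(labels))
        self.class_counts = Counter(labels)
        # Per-class word frequencies.
        self.word_counts = {c: Counter() for c in self.classes}
        for text, label in zip(texts, labels):
            self.word_counts[label].update(tokenize(text))
        self.vocab = set()
        for c in self.classes:
            self.vocab.update(self.word_counts[c])
        self.totals = {c: sum(self.word_counts[c].values()) for c in self.classes}
        return self

    def predict(self, text):
        n_docs = sum(self.class_counts.values())
        vocab_size = len(self.vocab)
        best_class, best_score = None, -math.inf
        for c in self.classes:
            # Log prior plus the sum of smoothed log likelihoods.
            score = math.log(self.class_counts[c] / n_docs)
            for tok in tokenize(text):
                if tok not in self.vocab:
                    continue  # ignore words never seen in training
                count = self.word_counts[c][tok]
                score += math.log((count + 1) / (self.totals[c] + vocab_size))
            if score > best_score:
                best_class, best_score = c, score
        return best_class


def accuracy(model, examples):
    # Fraction of examples whose predicted label matches the true label.
    correct = sum(model.predict(text) == label for text, label in examples)
    return correct / len(examples)


# Invented toy data: 1 = sarcastic, 0 = neutral.
train = [
    ("oh great another monday", 1),
    ("yeah right that will totally work", 1),
    ("wow what a surprise", 1),
    ("the meeting starts at noon", 0),
    ("i fixed the bug in the parser", 0),
    ("this recipe uses two eggs", 0),
]
held_out = [
    ("oh great what a surprise", 1),
    ("the parser starts at noon", 0),
]

model = NaiveBayes().fit([t for t, _ in train], [y for _, y in train])
print("held-out accuracy:", accuracy(model, held_out))
```

On a corpus the size of SARC, the same accuracy computation would be applied to a held-out split for each model being compared. The toy score here says nothing about real-world performance.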