Novitasari Arlim, Siti Kania Kushadiani, S. Riyanto, Rodiah Rodiah, Rini Arianty, Maukar Maukar, Shidiq Al Hakim, A. Siagian
Proceedings of the 2022 International Conference on Computer, Control, Informatics and Its Applications, published 2022-11-22. DOI: 10.1145/3575882.3575908 (https://doi.org/10.1145/3575882.3575908)
Sarcasm Detection in Indonesian Tweets Using Hyperbole Features
Because sarcasm conveys the opposite of what is said or written, it is very hard to detect. Detecting sarcasm is therefore an important task in the Natural Language Processing (NLP) field. In this study, we use interjections, intensifiers, capital letters, elongated words, and punctuation marks as hyperbole features to detect sarcasm in Indonesian tweets. Specifically, these hyperbole features are used by Support Vector Machine (SVM), Random Forest (RF), and RF+Bagging classifiers to label the Indonesian tweets in our testing data as sarcasm or not-sarcasm. English tweets obtained from Kaggle and SemEval serve as our training data, while Indonesian tweets obtained from Drone Emprit serve as the testing data. Our experimental results show that the model with hyperbole features classifies more of the tweets in the testing data as sarcasm than the model without them. This observation indicates that hyperbole features could contribute well to detecting sarcasm.
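As an illustration of the five hyperbole features named above, the following is a minimal sketch of how they might be counted for a single tweet. It is not the authors' implementation: the word lists `INTERJECTIONS` and `INTENSIFIERS`, the regular expressions, and the function name `hyperbole_features` are all assumptions made for this example, and the abstract does not specify the lexicons or thresholds actually used.

```python
import re

# Hypothetical word lists for illustration only; the lexicons used in the
# study are not given in the abstract.
INTERJECTIONS = {"wow", "yay", "oh", "ugh", "wah", "aduh"}
INTENSIFIERS = {"very", "so", "really", "totally", "banget", "sangat"}

# Assumed patterns: a character repeated three or more times marks an
# elongated word; two or more consecutive "!" or "?" mark a punctuation run.
ELONGATED = re.compile(r"(\w)\1{2,}")
PUNCT_RUN = re.compile(r"[!?]{2,}")
TOKEN = re.compile(r"[^\W\d_]+")  # alphabetic tokens only


def hyperbole_features(tweet: str) -> dict:
    """Count simple surface cues of hyperbole in one tweet."""
    tokens = TOKEN.findall(tweet)
    lowered = [t.lower() for t in tokens]
    return {
        "interjections": sum(t in INTERJECTIONS for t in lowered),
        "intensifiers": sum(t in INTENSIFIERS for t in lowered),
        # Words of two or more letters written entirely in capitals.
        "all_caps_words": sum(len(t) > 1 and t.isupper() for t in tokens),
        "elongated_words": sum(bool(ELONGATED.search(t)) for t in lowered),
        "punctuation_runs": len(PUNCT_RUN.findall(tweet)),
    }


if __name__ == "__main__":
    print(hyperbole_features("WOW this is sooo GREAT!!! really??"))
    print(hyperbole_features("Wah HEBAT bangeeet ya!!!"))
```

Counts like these could be appended to a text representation before training a classifier such as SVM or RF. Note that this sketch does not normalize elongated words, so "bangeeet" is counted as elongated but not matched against the intensifier list.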