Abdul Gafar Manuel Meque, Fazlourrahman Balouchzahi, Alexander Gelbukh, Grigori Sidorov
{"title":"PoliGuilt: Two level guilt detection from social media texts","authors":"Abdul Gafar Manuel Meque, Fazlourrahman Balouchzahi, Alexander Gelbukh, Grigori Sidorov","doi":"10.1016/j.eswa.2025.127187","DOIUrl":null,"url":null,"abstract":"<div><div>Guilt, a multifaceted emotion stemming from the realization of causing harm, intertwines with various aspects of human psychology and social interaction. This paper delves into the nature of guilt by developing an annotated dataset of 3,304 posts. Guilt detection is approached as a two-level classification task: first, distinguishing between guilt and non-guilt, and then categorizing guilt into the types “Anticipatory”, “Reactive”, and “Existential” based on psychological frameworks. Exploratory analyses are conducted to examine the contributions of post titles, self-text, and their combination as inputs to guilt detection algorithms. Various learning approaches were employed, including traditional machine learning, deep learning models, and transformers, to ensure quality and efficacy. The findings indicate that while simple methods using only unigrams can distinguish between texts expressing guilt and those that do not, they struggle with fine-grained categorization of guilt types. Additionally, deep learning models and transformers, especially when utilizing contextual information from longer texts and a combination of titles and self-texts, show greater success in capturing the context of the text. Notably, the RoBERTa-base model achieved average F1 scores of 0.7599 for binary classification and 0.7394 for multiclass classification, outperforming all other experiments when combining the title and self-text.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"277 ","pages":"Article 127187"},"PeriodicalIF":7.5000,"publicationDate":"2025-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425008097","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Guilt, a multifaceted emotion stemming from the realization of causing harm, intertwines with various aspects of human psychology and social interaction. This paper delves into the nature of guilt by developing an annotated dataset of 3,304 posts. Guilt detection is approached as a two-level classification task: first, distinguishing between guilt and non-guilt, and then categorizing guilt into the types “Anticipatory”, “Reactive”, and “Existential” based on psychological frameworks. Exploratory analyses are conducted to examine the contributions of post titles, self-text, and their combination as inputs to guilt detection algorithms. Various learning approaches were employed, including traditional machine learning, deep learning models, and transformers, to ensure quality and efficacy. The findings indicate that while simple methods using only unigrams can distinguish between texts expressing guilt and those that do not, they struggle with fine-grained categorization of guilt types. Additionally, deep learning models and transformers, especially when utilizing contextual information from longer texts and a combination of titles and self-texts, show greater success in capturing the context of the text. Notably, the RoBERTa-base model achieved average F1 scores of 0.7599 for binary classification and 0.7394 for multiclass classification, outperforming all other experiments when combining the title and self-text.
期刊介绍:
Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.