CL-UZH at SemEval-2023 Task 10: Sexism Detection through Incremental Fine-Tuning and Multi-Task Learning with Label Descriptions

International Workshop on Semantic Evaluation Pub Date : 2023-06-06 DOI:10.48550/arXiv.2306.03907

Janis Goldzycher

引用次数: 0

Abstract

The widespread popularity of social media has led to an increase in hateful, abusive, and sexist language, motivating methods for the automatic detection of such phenomena. The goal of the SemEval shared task Towards Explainable Detection of Online Sexism (EDOS 2023) is to detect sexism in English social media posts (subtask A), and to categorize such posts into four coarse-grained sexism categories (subtask B), and eleven fine-grained subcategories (subtask C). In this paper, we present our submitted systems for all three subtasks, based on a multi-task model that has been fine-tuned on a range of related tasks and datasets before being fine-tuned on the specific EDOS subtasks. We implement multi-task learning by formulating each task as binary pairwise text classification, where the dataset and label descriptions are given along with the input text. The results show clear improvements over a fine-tuned DeBERTa-V3 serving as a baseline leading to F1-scores of 85.9% in subtask A (rank 13/84), 64.8% in subtask B (rank 19/69), and 44.9% in subtask C (26/63).

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于标签描述的增量微调和多任务学习的性别歧视检测

社交媒体的广泛普及导致了仇恨、辱骂和性别歧视语言的增加，这激发了自动检测此类现象的方法。SemEval共享任务“面向可解释的在线性别歧视检测”(EDOS 2023)的目标是检测英语社交媒体帖子中的性别歧视(子任务A)，并将这些帖子分为四个粗粒度的性别歧视类别(子任务B)和11个细粒度的子类别(子任务C)。在本文中，我们展示了我们为所有三个子任务提交的系统。基于一个多任务模型，该模型在对特定的EDOS子任务进行微调之前，已经对一系列相关的任务和数据集进行了微调。我们通过将每个任务表述为二元文本分类来实现多任务学习，其中数据集和标签描述与输入文本一起给出。结果显示，与经过微调的DeBERTa-V3作为基线相比，在子任务a(排名13/84)、子任务B(排名19/69)和子任务C(排名26/63)上的f1得分分别提高了85.9%、64.8%和44.9%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

International Workshop on Semantic Evaluation

自引率

0.00%

发文量