From Translation to Generative LLMs: Classification of Code-Mixed Affective Tasks

IF 9.8 2区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE IEEE Transactions on Affective Computing Pub Date : 2025-03-24 DOI:10.1109/TAFFC.2025.3553399

Anjali Yadav;Tanya Garg;Matej Klemen;Matej Ulčar;Basant Agarwal;M. Robnik-Šikonja

{"title":"From Translation to Generative LLMs: Classification of Code-Mixed Affective Tasks","authors":"Anjali Yadav;Tanya Garg;Matej Klemen;Matej Ulčar;Basant Agarwal;M. Robnik-Šikonja","doi":"10.1109/TAFFC.2025.3553399","DOIUrl":null,"url":null,"abstract":"Code-mixed (CM) discourse combines multiple languages in a single text. It is commonly used in informal discourse in countries with several official languages, but also in many other countries in combination with English or neighboring languages. With the recent rise of large transformer language models dominating NLP tasks, we explored their effectiveness in CM contexts. We developed four new bilingual pre-trained masked language models for Hinglish and English-Slovene languages, tailored to handle informal language. We then evaluated monolingual, bilingual, few-lingual, massively multilingual, and larger generative models across multiple languages using two affective tasks involving CM texts: sentiment analysis and offensive speech prediction in social media posts. We compared these models with two translation baselines, one obtained with a neural machine translation tool and the other produced by large generative models. The experiments conducted in five languages: French, Hindi, Russian, Slovene, and Tamil, reveal that fine-tuned bilingual models and multilingual models designed for social media texts outperform others, with massively multilingual and monolingual models following, while larger generative models lag. For the affective tasks studied, models generally performed better on CM data than on non-CM data. The monolingual models with translated datasets rarely compete with multilingual models trained on CM datasets.","PeriodicalId":13131,"journal":{"name":"IEEE Transactions on Affective Computing","volume":"16 3","pages":"2090-2101"},"PeriodicalIF":9.8000,"publicationDate":"2025-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Affective Computing","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10938193/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Code-mixed (CM) discourse combines multiple languages in a single text. It is commonly used in informal discourse in countries with several official languages, but also in many other countries in combination with English or neighboring languages. With the recent rise of large transformer language models dominating NLP tasks, we explored their effectiveness in CM contexts. We developed four new bilingual pre-trained masked language models for Hinglish and English-Slovene languages, tailored to handle informal language. We then evaluated monolingual, bilingual, few-lingual, massively multilingual, and larger generative models across multiple languages using two affective tasks involving CM texts: sentiment analysis and offensive speech prediction in social media posts. We compared these models with two translation baselines, one obtained with a neural machine translation tool and the other produced by large generative models. The experiments conducted in five languages: French, Hindi, Russian, Slovene, and Tamil, reveal that fine-tuned bilingual models and multilingual models designed for social media texts outperform others, with massively multilingual and monolingual models following, while larger generative models lag. For the affective tasks studied, models generally performed better on CM data than on non-CM data. The monolingual models with translated datasets rarely compete with multilingual models trained on CM datasets.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

从翻译到生成式llm：语码混合情感任务的分类

语码混合语篇将多种语言组合在一个文本中。在有几种官方语言的国家，它通常用于非正式话语中，但在许多其他国家，它也与英语或邻近语言结合使用。随着最近主导NLP任务的大型转换语言模型的兴起，我们探索了它们在CM上下文中的有效性。我们为印度英语和英语-斯洛文尼亚语开发了四种新的双语预训练掩码语言模型，专门用于处理非正式语言。然后，我们使用涉及CM文本的两个情感任务（情感分析和社交媒体帖子中的攻击性言论预测），评估了单语、双语、少语、大量多语和跨多种语言的更大生成模型。我们将这些模型与两个翻译基线进行比较，一个是由神经机器翻译工具获得的，另一个是由大型生成模型产生的。在法语、印地语、俄语、斯洛文尼亚语和泰米尔语五种语言中进行的实验表明，为社交媒体文本设计的微调双语模型和多语言模型优于其他模型，大量多语言和单语言模型紧随其后，而较大的生成模型滞后。对于所研究的情感任务，模型在CM数据上的表现通常优于非CM数据。使用翻译数据集的单语言模型很少与使用CM数据集训练的多语言模型竞争。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE Transactions on Affective Computing COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE-COMPUTER SCIENCE, CYBERNETICS

CiteScore

15.00

自引率

6.20%

发文量

174

期刊介绍： The IEEE Transactions on Affective Computing is an international and interdisciplinary journal. Its primary goal is to share research findings on the development of systems capable of recognizing, interpreting, and simulating human emotions and related affective phenomena. The journal publishes original research on the underlying principles and theories that explain how and why affective factors shape human-technology interactions. It also focuses on how techniques for sensing and simulating affect can enhance our understanding of human emotions and processes. Additionally, the journal explores the design, implementation, and evaluation of systems that prioritize the consideration of affect in their usability. We also welcome surveys of existing work that provide new perspectives on the historical and future directions of this field.