{"title":"Enhancing cross-lingual text classification through linguistic and interpretability-guided attack strategies","authors":"Abdelmounaim Kerkri , Mohamed Amine Madani , Aya Qeraouch , Kaoutar Zouin","doi":"10.1016/j.is.2025.102526","DOIUrl":null,"url":null,"abstract":"<div><div>While adversarial attacks on natural language processing systems have been extensively studied in English, their impact on morphologically complex languages remains poorly understood. We investigate how text classification systems respond to adversarial attacks across Arabic, English, and French — languages chosen for their distinct linguistic properties. Building on the DeepWordBug framework, we develop multilingual attack strategies that combine random perturbations with targeted modifications guided by model interpretability. We also introduce novel attack methods that exploit language-specific features like orthographic variations and syntactic patterns. Testing these approaches on a diverse dataset of news articles (9,030 Arabic, 14,501 English) and movie reviews (200,000 French), we find that interpretability-guided attacks are particularly effective, achieving misclassification rates of 58%–62% across languages. Language-specific perturbations also proved potent, degrading model performance to F1-scores between 0.38 and 0.63. However, incorporating adversarial examples during training markedly improved model robustness, with F1-scores recovering to above 0.82 across all test conditions. Beyond the immediate findings, this work reveals how adversarial vulnerability manifests differently across languages with varying morphological complexity, offering key insights for building more resilient multilingual NLP systems.</div></div>","PeriodicalId":50363,"journal":{"name":"Information Systems","volume":"131 ","pages":"Article 102526"},"PeriodicalIF":3.0000,"publicationDate":"2025-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0306437925000110","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
While adversarial attacks on natural language processing systems have been extensively studied in English, their impact on morphologically complex languages remains poorly understood. We investigate how text classification systems respond to adversarial attacks across Arabic, English, and French — languages chosen for their distinct linguistic properties. Building on the DeepWordBug framework, we develop multilingual attack strategies that combine random perturbations with targeted modifications guided by model interpretability. We also introduce novel attack methods that exploit language-specific features like orthographic variations and syntactic patterns. Testing these approaches on a diverse dataset of news articles (9,030 Arabic, 14,501 English) and movie reviews (200,000 French), we find that interpretability-guided attacks are particularly effective, achieving misclassification rates of 58%–62% across languages. Language-specific perturbations also proved potent, degrading model performance to F1-scores between 0.38 and 0.63. However, incorporating adversarial examples during training markedly improved model robustness, with F1-scores recovering to above 0.82 across all test conditions. Beyond the immediate findings, this work reveals how adversarial vulnerability manifests differently across languages with varying morphological complexity, offering key insights for building more resilient multilingual NLP systems.
期刊介绍:
Information systems are the software and hardware systems that support data-intensive applications. The journal Information Systems publishes articles concerning the design and implementation of languages, data models, process models, algorithms, software and hardware for information systems.
Subject areas include data management issues as presented in the principal international database conferences (e.g., ACM SIGMOD/PODS, VLDB, ICDE and ICDT/EDBT) as well as data-related issues from the fields of data mining/machine learning, information retrieval coordinated with structured data, internet and cloud data management, business process management, web semantics, visual and audio information systems, scientific computing, and data science. Implementation papers having to do with massively parallel data management, fault tolerance in practice, and special purpose hardware for data-intensive systems are also welcome. Manuscripts from application domains, such as urban informatics, social and natural science, and Internet of Things, are also welcome. All papers should highlight innovative solutions to data management problems such as new data models, performance enhancements, and show how those innovations contribute to the goals of the application.