Expert Systems最新文献_第6页

A novel transformer attention-based approach for sarcasm detection 基于变压器注意力的讽刺检测新方法

IF 3 4区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Expert Systems

Pub Date : 2024-07-23 DOI: 10.1111/exsy.13686

Shumaila Khan, Iqbal Qasim, Wahab Khan, Khursheed Aurangzeb, Javed Ali Khan, Muhammad Shahid Anwar

Sarcasm detection is challenging in natural language processing (NLP) due to its implicit nature, particularly in low-resource languages. Despite limited linguistic resources, researchers have focused on detecting sarcasm on social media platforms, leading to the development of specialized algorithms and models tailored for Urdu text. Researchers have significantly improved sarcasm detection accuracy by analysing patterns and linguistic cues unique to the language, thereby advancing NLP capabilities in low-resource languages and facilitating better communication within diverse online communities. This work introduces UrduSarcasmNet, a novel architecture using cascaded group multi-head attention, which is an innovative deep-learning approach that employs cascaded group multi-head attention techniques to enhance effectiveness. By employing a series of attention heads in a cascading manner, our model captures both local and global contexts, facilitating a more comprehensive understanding of the text. Adding a group attention mechanism enables simultaneous consideration of various sub-topics within the content, thereby enriching the model's effectiveness. The proposed UrduSarcasmNet approach is validated with the Urdu-sarcastic-tweets-dataset (UST) dataset, which has been curated for this purpose. Our experimental results on the UST dataset show that the proposed UrduSarcasmNet framework outperforms the simple-attention mechanism and other state-of-the-art models. This research significantly enhances natural language processing (NLP) and provides valuable insights for improving sarcasm recognition tools in low-resource languages like Urdu.

在自然语言处理（NLP）中，由于讽刺的隐含性质，尤其是在低资源语言中，讽刺检测具有挑战性。尽管语言资源有限，但研究人员一直专注于检测社交媒体平台上的讽刺，从而开发出了专门针对乌尔都语文本的算法和模型。研究人员通过分析乌尔都语特有的模式和语言线索，大大提高了讽刺语言检测的准确性，从而推动了低资源语言的 NLP 能力，促进了不同网络社区内更好的交流。这项工作介绍了使用级联组多头注意力的新型架构 UrduSarcasmNet，这是一种创新的深度学习方法，采用了级联组多头注意力技术来提高效率。通过以级联方式使用一系列注意力头，我们的模型可以捕捉局部和全局上下文，从而促进对文本更全面的理解。通过添加群体注意机制，可以同时考虑内容中的各种子主题，从而丰富了模型的有效性。建议的 UrduSarcasmNet 方法已通过为此目的策划的 Urdu-sarcastic-tweets 数据集（UST）进行了验证。我们在 UST 数据集上的实验结果表明，所提出的 UrduSarcasmNet 框架优于简单关注机制和其他最先进的模型。这项研究大大提高了自然语言处理（NLP）能力，并为改进乌尔都语等低资源语言的讽刺语言识别工具提供了宝贵的见解。

{"title":"A novel transformer attention-based approach for sarcasm detection","authors":"Shumaila Khan, Iqbal Qasim, Wahab Khan, Khursheed Aurangzeb, Javed Ali Khan, Muhammad Shahid Anwar","doi":"10.1111/exsy.13686","DOIUrl":"10.1111/exsy.13686","url":null,"abstract":"Sarcasm detection is challenging in natural language processing (NLP) due to its implicit nature, particularly in low-resource languages. Despite limited linguistic resources, researchers have focused on detecting sarcasm on social media platforms, leading to the development of specialized algorithms and models tailored for Urdu text. Researchers have significantly improved sarcasm detection accuracy by analysing patterns and linguistic cues unique to the language, thereby advancing NLP capabilities in low-resource languages and facilitating better communication within diverse online communities. This work introduces UrduSarcasmNet, a novel architecture using cascaded group multi-head attention, which is an innovative deep-learning approach that employs cascaded group multi-head attention techniques to enhance effectiveness. By employing a series of attention heads in a cascading manner, our model captures both local and global contexts, facilitating a more comprehensive understanding of the text. Adding a group attention mechanism enables simultaneous consideration of various sub-topics within the content, thereby enriching the model's effectiveness. The proposed UrduSarcasmNet approach is validated with the Urdu-sarcastic-tweets-dataset (UST) dataset, which has been curated for this purpose. Our experimental results on the UST dataset show that the proposed UrduSarcasmNet framework outperforms the simple-attention mechanism and other state-of-the-art models. This research significantly enhances natural language processing (NLP) and provides valuable insights for improving sarcasm recognition tools in low-resource languages like Urdu.","PeriodicalId":51053,"journal":{"name":"Expert Systems","volume":"42 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2024-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141783666","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Optimizing fraud detection in financial transactions with machine learning and imbalance mitigation 利用机器学习和失衡缓解优化金融交易中的欺诈检测

IF 3.3 4区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Expert Systems

Pub Date : 2024-07-23 DOI: 10.1111/exsy.13682

Ezaz Mohammed Al‐dahasi, Rama Khaled Alsheikh, Fakhri Alam Khan, Gwanggil Jeon

The rapid advancement of the Internet and digital payments has transformed the landscape of financial transactions, leading to both technological progress and an alarming rise in cybercrime. This study addresses the critical issue of financial fraud detection in the era of digital payments, focusing on enhancing operational risk frameworks to mitigate the increasing threats. The objective is to improve the predictive performance of fraud detection systems using machine learning techniques. The methodology involves a comprehensive data preprocessing and model creation process, including one‐hot encoding, feature selection, sampling, standardization, and tokenization. Six machine learning models are employed for fraud detection, and their hyperparameters are optimized. Evaluation metrics such as accuracy, precision, recall, and F1‐score are used to assess model performance. Results reveal that XGBoost and Random Forest outperform other models, achieving a balance between false positives and false negatives. The study meets the requirements for fraud detection systems, ensuring accuracy, scalability, adaptability, and explainability. This paper provides valuable insights into the efficacy of machine learning models for financial fraud detection and emphasizes the importance of striking a balance between false positives and false negatives.

互联网和数字支付的快速发展改变了金融交易的格局，既带来了技术进步，也导致网络犯罪的惊人增长。本研究探讨了数字支付时代金融欺诈检测的关键问题，重点是加强操作风险框架，以减轻日益增长的威胁。目的是利用机器学习技术提高欺诈检测系统的预测性能。该方法涉及全面的数据预处理和模型创建过程，包括单次编码、特征选择、采样、标准化和标记化。欺诈检测采用了六个机器学习模型，并对其超参数进行了优化。准确率、精确度、召回率和 F1 分数等评价指标用于评估模型性能。结果显示，XGBoost 和随机森林的表现优于其他模型，在误报和误报之间取得了平衡。这项研究符合欺诈检测系统的要求，确保了准确性、可扩展性、适应性和可解释性。本文就机器学习模型在金融欺诈检测中的功效提供了宝贵的见解，并强调了在误报和误报之间取得平衡的重要性。

{"title":"Optimizing fraud detection in financial transactions with machine learning and imbalance mitigation","authors":"Ezaz Mohammed Al‐dahasi, Rama Khaled Alsheikh, Fakhri Alam Khan, Gwanggil Jeon","doi":"10.1111/exsy.13682","DOIUrl":"https://doi.org/10.1111/exsy.13682","url":null,"abstract":"The rapid advancement of the Internet and digital payments has transformed the landscape of financial transactions, leading to both technological progress and an alarming rise in cybercrime. This study addresses the critical issue of financial fraud detection in the era of digital payments, focusing on enhancing operational risk frameworks to mitigate the increasing threats. The objective is to improve the predictive performance of fraud detection systems using machine learning techniques. The methodology involves a comprehensive data preprocessing and model creation process, including one‐hot encoding, feature selection, sampling, standardization, and tokenization. Six machine learning models are employed for fraud detection, and their hyperparameters are optimized. Evaluation metrics such as accuracy, precision, recall, and F1‐score are used to assess model performance. Results reveal that XGBoost and Random Forest outperform other models, achieving a balance between false positives and false negatives. The study meets the requirements for fraud detection systems, ensuring accuracy, scalability, adaptability, and explainability. This paper provides valuable insights into the efficacy of machine learning models for financial fraud detection and emphasizes the importance of striking a balance between false positives and false negatives.","PeriodicalId":51053,"journal":{"name":"Expert Systems","volume":"71 1","pages":""},"PeriodicalIF":3.3,"publicationDate":"2024-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141783668","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

TMaD: Three‐tier malware detection using multi‐view feature for secure convergence ICT environments TMaD：利用多视角特征进行三层恶意软件检测，确保融合信息和通信技术环境的安全

IF 3.3 4区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Expert Systems

Pub Date : 2024-07-19 DOI: 10.1111/exsy.13684

Jueun Jeon, Byeonghui Jeong, Seungyeon Baek, Young‐Sik Jeong

As digital transformation accelerates, data generated in a convergence information and communication technology (ICT) environment must be secured. This data includes confidential information such as personal and financial information, so attackers spread malware in convergence ICT environments to steal this information. To protect convergence ICT environments from diverse cyber threats, deep learning models have been utilized for malware detection. However, accurately detecting rapidly generated variants and obfuscated malware is challenging. This study proposes a three‐tier malware detection (TMaD) scheme that utilizes a cloud‐fog‐edge collaborative architecture to analyse multi‐view features of executable files and detect malware. TMaD performs signature‐based malware detection at the edge device tier, then sends executables detected as unknown or benign to the fog tier. The fog tier conducts static analysis on non‐obfuscated executables and those transferred from the previous tier to detect variant malware. Subsequently, TMaD sends executables detected as benign in the fog tier to the cloud tier, where dynamic analysis is performed on obfuscated executables and those detected as benign to identify obfuscated malware. An evaluation of TMaD's detection performance resulted in an accuracy of 94.78%, a recall of 0.9794, a precision of 0.9535, and an f1‐score of 0.9663. This performance demonstrates that TMaD, by analysing executables across several tiers and minimizing false negatives, exhibits superior detection performance compared to existing malware detection models.

随着数字化转型的加速，在融合信息和通信技术（ICT）环境中生成的数据必须得到保护。这些数据包括个人和财务信息等机密信息，因此攻击者会在融合 ICT 环境中传播恶意软件，以窃取这些信息。为了保护融合 ICT 环境免受各种网络威胁，深度学习模型已被用于恶意软件检测。然而，准确检测快速生成的变种和混淆的恶意软件具有挑战性。本研究提出了一种三层恶意软件检测（TMaD）方案，利用云-雾-边协同架构分析可执行文件的多视图特征并检测恶意软件。TMaD 在边缘设备层执行基于签名的恶意软件检测，然后将检测到的未知或良性可执行文件发送到雾层。雾层对未经混淆处理的可执行文件和从上一层传输过来的可执行文件进行静态分析，以检测变种恶意软件。随后，TMaD 将在雾层中检测到的良性可执行文件发送到云层，在云层中对经过混淆处理的可执行文件和检测到的良性可执行文件进行动态分析，以识别经过混淆处理的恶意软件。对 TMaD 检测性能的评估结果是：准确率 94.78%，召回率 0.9794，精确度 0.9535，f1 分数 0.9663。这一性能表明，TMaD 通过分析多个层级的可执行文件并最大限度地减少误判，与现有的恶意软件检测模型相比，具有更出色的检测性能。

{"title":"TMaD: Three‐tier malware detection using multi‐view feature for secure convergence ICT environments","authors":"Jueun Jeon, Byeonghui Jeong, Seungyeon Baek, Young‐Sik Jeong","doi":"10.1111/exsy.13684","DOIUrl":"https://doi.org/10.1111/exsy.13684","url":null,"abstract":"As digital transformation accelerates, data generated in a convergence information and communication technology (ICT) environment must be secured. This data includes confidential information such as personal and financial information, so attackers spread malware in convergence ICT environments to steal this information. To protect convergence ICT environments from diverse cyber threats, deep learning models have been utilized for malware detection. However, accurately detecting rapidly generated variants and obfuscated malware is challenging. This study proposes a three‐tier malware detection (TMaD) scheme that utilizes a cloud‐fog‐edge collaborative architecture to analyse multi‐view features of executable files and detect malware. TMaD performs signature‐based malware detection at the edge device tier, then sends executables detected as unknown or benign to the fog tier. The fog tier conducts static analysis on non‐obfuscated executables and those transferred from the previous tier to detect variant malware. Subsequently, TMaD sends executables detected as benign in the fog tier to the cloud tier, where dynamic analysis is performed on obfuscated executables and those detected as benign to identify obfuscated malware. An evaluation of TMaD's detection performance resulted in an accuracy of 94.78%, a recall of 0.9794, a precision of 0.9535, and an f1‐score of 0.9663. This performance demonstrates that TMaD, by analysing executables across several tiers and minimizing false negatives, exhibits superior detection performance compared to existing malware detection models.","PeriodicalId":51053,"journal":{"name":"Expert Systems","volume":"12 1","pages":""},"PeriodicalIF":3.3,"publicationDate":"2024-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141742191","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

The study of engagement at work from the artificial intelligence perspective: A systematic review 从人工智能角度研究工作参与度：系统回顾

IF 3 4区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Expert Systems

Pub Date : 2024-07-16 DOI: 10.1111/exsy.13673

Claudia García-Navarro, Manuel Pulido-Martos, Cristina Pérez-Lozano

Engagement has been defined as an attitude toward work, as a positive, satisfying, work-related state of mind characterized by high levels of vigour, dedication, and absorption. Both its definition and its assessment have been controversial; however, new methods for its assessment, including artificial intelligence (AI), have been introduced in recent years. Therefore, this research aims to determine the state of the art of AI in the study of engagement. To this end, we conducted a systematic review in accordance with PRISMA to analyse the publications to date on the use of AI for the analysis of engagement. The search, carried out in six databases, was filtered, and 15 papers were finally analysed. The results show that AI has been used mainly to assess and predict engagement levels, as well as to understand the relationships between engagement and other variables. The most commonly used AI techniques are machine learning (ML) and natural language processing (NLP), and all publications use structured and unstructured data, mainly from self-report instruments, social networks, and datasets. The accuracy of the models varies from 22% to 87%, and its main benefit has been to help both managers and HR staff understand employee engagement, although it has also contributed to research. Most of the articles have been published since 2015, and the geography has been global, with publications predominantly in India and the US. In conclusion, this study highlights the state of the art in AI for the study of engagement and concludes that the number of publications is increasing, indicating that this is possibly a new field or area of research in which important advances can be made in the study of engagement through new and novel techniques.

敬业度被定义为一种工作态度，是一种积极的、令人满意的、与工作相关的精神状态，其特点是精力充沛、全心投入和全身心投入。对其定义和评估一直存在争议；不过，近年来，包括人工智能（AI）在内的新评估方法已经问世。因此，本研究旨在确定人工智能在参与度研究中的应用现状。为此，我们按照 PRISMA 标准进行了一次系统性回顾，分析了迄今为止有关使用人工智能分析参与度的出版物。在六个数据库中进行的搜索经过筛选，最终分析了 15 篇论文。结果显示，人工智能主要用于评估和预测参与度水平，以及了解参与度与其他变量之间的关系。最常用的人工智能技术是机器学习（ML）和自然语言处理（NLP），所有论文都使用了结构化和非结构化数据，主要来自自我报告工具、社交网络和数据集。模型的准确率从 22% 到 87% 不等，其主要益处是帮助管理人员和人力资源部门了解员工敬业度，但也有助于研究工作。大多数文章都是 2015 年以来发表的，发表地域遍及全球，主要集中在印度和美国。总之，本研究强调了人工智能在敬业度研究方面的技术水平，并得出结论认为，发表文章的数量在不断增加，这表明这可能是一个新的研究领域或领域，通过新颖的技术可以在敬业度研究方面取得重要进展。

{"title":"The study of engagement at work from the artificial intelligence perspective: A systematic review","authors":"Claudia García-Navarro, Manuel Pulido-Martos, Cristina Pérez-Lozano","doi":"10.1111/exsy.13673","DOIUrl":"10.1111/exsy.13673","url":null,"abstract":"Engagement has been defined as an attitude toward work, as a positive, satisfying, work-related state of mind characterized by high levels of vigour, dedication, and absorption. Both its definition and its assessment have been controversial; however, new methods for its assessment, including artificial intelligence (AI), have been introduced in recent years. Therefore, this research aims to determine the state of the art of AI in the study of engagement. To this end, we conducted a systematic review in accordance with PRISMA to analyse the publications to date on the use of AI for the analysis of engagement. The search, carried out in six databases, was filtered, and 15 papers were finally analysed. The results show that AI has been used mainly to assess and predict engagement levels, as well as to understand the relationships between engagement and other variables. The most commonly used AI techniques are machine learning (ML) and natural language processing (NLP), and all publications use structured and unstructured data, mainly from self-report instruments, social networks, and datasets. The accuracy of the models varies from 22% to 87%, and its main benefit has been to help both managers and HR staff understand employee engagement, although it has also contributed to research. Most of the articles have been published since 2015, and the geography has been global, with publications predominantly in India and the US. In conclusion, this study highlights the state of the art in AI for the study of engagement and concludes that the number of publications is increasing, indicating that this is possibly a new field or area of research in which important advances can be made in the study of engagement through new and novel techniques.","PeriodicalId":51053,"journal":{"name":"Expert Systems","volume":"41 11","pages":""},"PeriodicalIF":3.0,"publicationDate":"2024-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/exsy.13673","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141742192","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A generative adversarial network‐based client‐level handwriting forgery attack in federated learning scenario 联合学习场景中基于生成式对抗网络的客户端级笔迹伪造攻击

IF 3.3 4区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Expert Systems

Pub Date : 2024-07-12 DOI: 10.1111/exsy.13676

Lei Shi, Han Wu, Xu Ding, Hao Xu, Sinan Pan

Federated learning (FL), celebrated for its privacy‐preserving features, has been revealed by recent studies to harbour security vulnerabilities that jeopardize client privacy, particularly through data reconstruction attacks that enable adversaries to recover original client data. This study introduces a client‐level handwriting forgery attack method for FL based on generative adversarial networks (GANs), which reveals security vulnerabilities existing in FL systems. It should be stressed that this research is purely for academic purposes, aiming to raise concerns about privacy protection and data security, and does not encourage illegal activities. Our novel methodology assumes an adversarial scenario wherein adversaries intercept a fraction of parameter updates via victim clients’ wireless communication channels, then use this information to train GAN for data recovery. Finally, the purpose of handwriting imitation is achieved. To rigorously assess and validate our methodology, experiments were conducted using a bespoke Chinese digit dataset, facilitating in‐depth analysis and robust verification of results. Our experimental findings demonstrated enhanced data recovery effectiveness, a client‐level attack and greater versatility compared to prior art. Notably, our method maintained high attack performance even with a streamlined GAN design, yielding increased precision and significantly faster execution times compared to standard methods. Specifically, our experimental numerical results revealed a substantial boost in reconstruction accuracy by 16.7%, coupled with a 51.9% decrease in computational time compared to the latest similar techniques. Furthermore, tests on a simplified version of our GAN exhibited an average 10% enhancement in accuracy, alongside a remarkable 70% reduction in time consumption. By surmounting the limitations of previous work, this study fills crucial gaps and affirms the effectiveness of our approach in achieving high‐accuracy client‐level data reconstruction within the FL context, thereby stimulating further exploration into FL security measures.

联合学习（Federated Learning，FL）因其保护隐私的特性而备受赞誉，但最近的研究却揭示了它存在着危害客户端隐私的安全漏洞，特别是通过数据重构攻击，使对手能够恢复原始客户端数据。本研究介绍了一种基于生成式对抗网络（GANs）的 FL 客户端级笔迹伪造攻击方法，揭示了 FL 系统中存在的安全漏洞。需要强调的是，本研究纯粹出于学术目的，旨在引起人们对隐私保护和数据安全的关注，并不鼓励非法活动。我们的新方法假设了一种对抗场景，即对抗者通过受害者客户端的无线通信信道截获一部分参数更新，然后利用这些信息训练 GAN 进行数据恢复。最后，笔迹模仿的目的就达到了。为了严格评估和验证我们的方法，我们使用定制的中文数字数据集进行了实验，以便对结果进行深入分析和稳健验证。我们的实验结果表明，与现有技术相比，我们的方法提高了数据恢复的有效性、客户端级别的攻击和更大的通用性。值得注意的是，即使采用精简的 GAN 设计，我们的方法仍能保持较高的攻击性能，与标准方法相比，精度更高，执行时间更短。具体来说，我们的数值实验结果表明，与最新的类似技术相比，重建精度大幅提高了 16.7%，计算时间减少了 51.9%。此外，对我们的 GAN 简化版进行的测试表明，精度平均提高了 10%，同时耗时显著减少了 70%。通过克服以往工作的局限性，本研究填补了重要空白，并肯定了我们的方法在 FL 环境下实现高精度客户端级数据重建的有效性，从而激发了对 FL 安全措施的进一步探索。

{"title":"A generative adversarial network‐based client‐level handwriting forgery attack in federated learning scenario","authors":"Lei Shi, Han Wu, Xu Ding, Hao Xu, Sinan Pan","doi":"10.1111/exsy.13676","DOIUrl":"https://doi.org/10.1111/exsy.13676","url":null,"abstract":"Federated learning (FL), celebrated for its privacy‐preserving features, has been revealed by recent studies to harbour security vulnerabilities that jeopardize client privacy, particularly through data reconstruction attacks that enable adversaries to recover original client data. This study introduces a client‐level handwriting forgery attack method for FL based on generative adversarial networks (GANs), which reveals security vulnerabilities existing in FL systems. It should be stressed that this research is purely for academic purposes, aiming to raise concerns about privacy protection and data security, and does not encourage illegal activities. Our novel methodology assumes an adversarial scenario wherein adversaries intercept a fraction of parameter updates via victim clients’ wireless communication channels, then use this information to train GAN for data recovery. Finally, the purpose of handwriting imitation is achieved. To rigorously assess and validate our methodology, experiments were conducted using a bespoke Chinese digit dataset, facilitating in‐depth analysis and robust verification of results. Our experimental findings demonstrated enhanced data recovery effectiveness, a client‐level attack and greater versatility compared to prior art. Notably, our method maintained high attack performance even with a streamlined GAN design, yielding increased precision and significantly faster execution times compared to standard methods. Specifically, our experimental numerical results revealed a substantial boost in reconstruction accuracy by 16.7%, coupled with a 51.9% decrease in computational time compared to the latest similar techniques. Furthermore, tests on a simplified version of our GAN exhibited an average 10% enhancement in accuracy, alongside a remarkable 70% reduction in time consumption. By surmounting the limitations of previous work, this study fills crucial gaps and affirms the effectiveness of our approach in achieving high‐accuracy client‐level data reconstruction within the FL context, thereby stimulating further exploration into FL security measures.","PeriodicalId":51053,"journal":{"name":"Expert Systems","volume":"20 1","pages":""},"PeriodicalIF":3.3,"publicationDate":"2024-07-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141613269","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Multimodal dynamic fusion framework: Multilevel feature fusion guided by prompts 多模态动态融合框架：由提示引导的多级特征融合

IF 3 4区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Expert Systems

Pub Date : 2024-07-11 DOI: 10.1111/exsy.13668

Lei Pan, Huan-Qing Wu

With the progressive augmentation of parameters in multimodal models, to optimize computational efficiency, some studies have adopted the approach of fine-tuning the unimodal pre-training model to achieve multimodal fusion tasks. However, these methods tend to rely solely on simplistic or singular fusion strategies, thereby neglecting more flexible fusion approaches. Moreover, existing methods prioritize the integration of modality features containing highly semantic information, often overlooking the influence of fusing low-level features on the outcomes. Therefore, this study introduces an innovative approach named multilevel feature fusion guided by prompts (MFF-GP), a multimodal dynamic fusion framework. It guides the dynamic neural network by prompt vectors to dynamically select the suitable fusion network for each hierarchical feature of the unimodal pre-training model. This method improves the interactions between multiple modalities and promotes a more efficient fusion of features across them. Extensive experiments on the UPMC Food 101, SNLI-VE and MM-IMDB datasets demonstrate that with only a few trainable parameters, MFF-GP achieves significant accuracy improvements compared to a newly designed PMF based on fine-tuning—specifically, an accuracy improvement of 2.15% on the UPMC Food 101 dataset and 0.82% on the SNLI-VE dataset. Further study of the results reveals that increasing the diversity of interactions between distinct modalities is critical and delivers significant performance improvements. Furthermore, for certain multimodal tasks, focusing on the low-level features is beneficial for modality integration. Our implementation is available at: https://github.com/whq2024/MFF-GP.

随着多模态模型参数的逐步增加，为了优化计算效率，一些研究采用了微调单模态预训练模型的方法来实现多模态融合任务。然而，这些方法往往只依赖于简单或单一的融合策略，从而忽略了更灵活的融合方法。此外，现有方法优先考虑包含高语义信息的模态特征的融合，往往忽略了低层次特征融合对结果的影响。因此，本研究引入了一种名为 "提示引导的多级特征融合"（MFF-GP）的创新方法，这是一种多模态动态融合框架。它通过提示向量引导动态神经网络，为单模态预训练模型的每个层次特征动态选择合适的融合网络。这种方法改善了多种模态之间的交互，促进了跨模态特征的更有效融合。在 UPMC Food 101、SNLI-VE 和 MM-IMDB 数据集上进行的大量实验表明，与基于微调的新设计 PMF 相比，MFF-GP 只需几个可训练参数就能显著提高准确率，具体来说，在 UPMC Food 101 数据集上提高了 2.15%，在 SNLI-VE 数据集上提高了 0.82%。对结果的进一步研究表明，增加不同模态之间交互的多样性至关重要，并能显著提高性能。此外，对于某些多模态任务，关注低层次特征有利于模态整合。我们的实施方案可在以下网址获取：https://github.com/whq2024/MFF-GP。

{"title":"Multimodal dynamic fusion framework: Multilevel feature fusion guided by prompts","authors":"Lei Pan, Huan-Qing Wu","doi":"10.1111/exsy.13668","DOIUrl":"10.1111/exsy.13668","url":null,"abstract":"With the progressive augmentation of parameters in multimodal models, to optimize computational efficiency, some studies have adopted the approach of fine-tuning the unimodal pre-training model to achieve multimodal fusion tasks. However, these methods tend to rely solely on simplistic or singular fusion strategies, thereby neglecting more flexible fusion approaches. Moreover, existing methods prioritize the integration of modality features containing highly semantic information, often overlooking the influence of fusing low-level features on the outcomes. Therefore, this study introduces an innovative approach named multilevel feature fusion guided by prompts (MFF-GP), a multimodal dynamic fusion framework. It guides the dynamic neural network by prompt vectors to dynamically select the suitable fusion network for each hierarchical feature of the unimodal pre-training model. This method improves the interactions between multiple modalities and promotes a more efficient fusion of features across them. Extensive experiments on the UPMC Food 101, SNLI-VE and MM-IMDB datasets demonstrate that with only a few trainable parameters, MFF-GP achieves significant accuracy improvements compared to a newly designed PMF based on fine-tuning—specifically, an accuracy improvement of 2.15% on the UPMC Food 101 dataset and 0.82% on the SNLI-VE dataset. Further study of the results reveals that increasing the diversity of interactions between distinct modalities is critical and delivers significant performance improvements. Furthermore, for certain multimodal tasks, focusing on the low-level features is beneficial for modality integration. Our implementation is available at: https://github.com/whq2024/MFF-GP.","PeriodicalId":51053,"journal":{"name":"Expert Systems","volume":"41 11","pages":""},"PeriodicalIF":3.0,"publicationDate":"2024-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141613581","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

What distinguishes conspiracy from critical narratives? A computational analysis of oppositional discourse 阴谋论与批判性叙事有何区别？对反对派话语的计算分析

IF 3 4区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Expert Systems

Pub Date : 2024-07-09 DOI: 10.1111/exsy.13671

Damir Korenčić, Berta Chulvi, Xavier Bonet Casals, Alejandro Toselli, Mariona Taulé, Paolo Rosso

The current prevalence of conspiracy theories on the internet is a significant issue, tackled by many computational approaches. However, these approaches fail to recognize the relevance of distinguishing between texts which contain a conspiracy theory and texts which are simply critical and oppose mainstream narratives. Furthermore, little attention is usually paid to the role of inter-group conflict in oppositional narratives. We contribute by proposing a novel topic-agnostic annotation scheme that differentiates between conspiracies and critical texts, and that defines span-level categories of inter-group conflict. We also contribute with the multilingual XAI-DisInfodemics corpus (English and Spanish), which contains a high-quality annotation of Telegram messages related to COVID-19 (5000 messages per language). We also demonstrate the feasibility of an NLP-based automatization by performing a range of experiments that yield strong baseline solutions. Finally, we perform an analysis which demonstrates that the promotion of intergroup conflict and the presence of violence and anger are key aspects to distinguish between the two types of oppositional narratives, that is, conspiracy versus critical.

当前互联网上盛行的阴谋论是一个重大问题，许多计算方法都在解决这一问题。然而，这些方法没有认识到区分包含阴谋论的文本与单纯批判和反对主流叙事的文本的相关性。此外，人们通常很少关注群体间冲突在反对叙事中的作用。我们提出了一种新颖的主题区分注释方案，可区分阴谋论和批判性文本，并定义了跨度级别的群体间冲突类别。我们还提供了多语言 XAI-DisInfodemics 语料库（英语和西班牙语），其中包含与 COVID-19 相关的 Telegram 消息的高质量注释（每种语言 5000 条消息）。我们还通过一系列实验证明了基于 NLP 的自动化的可行性，这些实验产生了强大的基线解决方案。最后，我们进行了一项分析，证明促进群体间冲突以及暴力和愤怒的存在是区分阴谋与批判这两种对立叙事的关键因素。

{"title":"What distinguishes conspiracy from critical narratives? A computational analysis of oppositional discourse","authors":"Damir Korenčić, Berta Chulvi, Xavier Bonet Casals, Alejandro Toselli, Mariona Taulé, Paolo Rosso","doi":"10.1111/exsy.13671","DOIUrl":"10.1111/exsy.13671","url":null,"abstract":"The current prevalence of conspiracy theories on the internet is a significant issue, tackled by many computational approaches. However, these approaches fail to recognize the relevance of distinguishing between texts which contain a conspiracy theory and texts which are simply critical and oppose mainstream narratives. Furthermore, little attention is usually paid to the role of inter-group conflict in oppositional narratives. We contribute by proposing a novel topic-agnostic annotation scheme that differentiates between conspiracies and critical texts, and that defines span-level categories of inter-group conflict. We also contribute with the multilingual XAI-DisInfodemics corpus (English and Spanish), which contains a high-quality annotation of Telegram messages related to COVID-19 (5000 messages per language). We also demonstrate the feasibility of an NLP-based automatization by performing a range of experiments that yield strong baseline solutions. Finally, we perform an analysis which demonstrates that the promotion of intergroup conflict and the presence of violence and anger are key aspects to distinguish between the two types of oppositional narratives, that is, conspiracy versus critical.","PeriodicalId":51053,"journal":{"name":"Expert Systems","volume":"41 11","pages":""},"PeriodicalIF":3.0,"publicationDate":"2024-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141586682","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

An EEMD-LSTM, SVR, and BP decomposition ensemble model for steel future prices forecasting 用于钢铁未来价格预测的 EEMD-LSTM、SVR 和 BP 分解集合模型

IF 3 4区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Expert Systems

Pub Date : 2024-07-08 DOI: 10.1111/exsy.13672

Sen Wu, Wei Wang, Yanan Song, Shuaiqi Liu

The forecasting of steel futures prices is important for the steel futures market, even for the steel industry. We propose a decomposition ensemble model that incorporates the Ensemble Empirical Mode Decomposition (EEMD), Long Short-Term Memory (LSTM), Support Vector Regression (SVR), and Back Propagation (BP) neural network to forecast steel futures prices. The forecasting procedures are as follows: (1) The price data are initially decomposed into several relatively independent Intrinsic Mode Functions (IMFs) and a residue using EEMD. (2) The IMFs are then reconstructed as components representing short-term, medium-term, and long-term frequencies via fine-to-coarse. (3) LSTM, SVR, and BP neural network are utilized to forecast the short-term, medium-term, and long-term reconstructed components, respectively. (4) The prediction results for each component are simply added to the final prediction results. The accuracy of the proposed model is compared with several benchmark models by experiments and evaluated by some prediction evaluation indexes. The experimental results show that our model outperforms other models in terms of forecast accuracy, confirming its strong predictive capabilities. This study provides some suggestions for investment and decision making by participants in the steel futures market. It may promote the smooth operation of the steel futures market and shed some light on the operation of the steel industry.

钢材期货价格预测对于钢材期货市场乃至钢铁行业都非常重要。我们提出了一种分解集合模型，该模型融合了集合经验模式分解（EEMD）、长短期记忆（LSTM）、支持向量回归（SVR）和反向传播（BP）神经网络，用于预测钢材期货价格。预测程序如下(1) 首先使用 EEMD 将价格数据分解为几个相对独立的本征模式函数（IMF）和一个残差。(2) 然后通过从细到粗的方法将 IMF 重构为代表短期、中期和长期频率的成分。(3) 利用 LSTM、SVR 和 BP 神经网络分别预测重建的短期、中期和长期分量。(4) 将各分量的预测结果简单相加，得出最终预测结果。通过实验将所提出模型的准确性与几个基准模型进行比较，并通过一些预测评价指标进行评估。实验结果表明，我们的模型在预测准确率方面优于其他模型，证实了其强大的预测能力。本研究为钢铁期货市场参与者的投资和决策提供了一些建议。它可以促进钢材期货市场的平稳运行，并对钢铁行业的运行起到一定的启示作用。

{"title":"An EEMD-LSTM, SVR, and BP decomposition ensemble model for steel future prices forecasting","authors":"Sen Wu, Wei Wang, Yanan Song, Shuaiqi Liu","doi":"10.1111/exsy.13672","DOIUrl":"10.1111/exsy.13672","url":null,"abstract":"The forecasting of steel futures prices is important for the steel futures market, even for the steel industry. We propose a decomposition ensemble model that incorporates the Ensemble Empirical Mode Decomposition (EEMD), Long Short-Term Memory (LSTM), Support Vector Regression (SVR), and Back Propagation (BP) neural network to forecast steel futures prices. The forecasting procedures are as follows: (1) The price data are initially decomposed into several relatively independent Intrinsic Mode Functions (IMFs) and a residue using EEMD. (2) The IMFs are then reconstructed as components representing short-term, medium-term, and long-term frequencies via fine-to-coarse. (3) LSTM, SVR, and BP neural network are utilized to forecast the short-term, medium-term, and long-term reconstructed components, respectively. (4) The prediction results for each component are simply added to the final prediction results. The accuracy of the proposed model is compared with several benchmark models by experiments and evaluated by some prediction evaluation indexes. The experimental results show that our model outperforms other models in terms of forecast accuracy, confirming its strong predictive capabilities. This study provides some suggestions for investment and decision making by participants in the steel futures market. It may promote the smooth operation of the steel futures market and shed some light on the operation of the steel industry.","PeriodicalId":51053,"journal":{"name":"Expert Systems","volume":"41 11","pages":""},"PeriodicalIF":3.0,"publicationDate":"2024-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141570209","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Adaptive weighted feature fusion for multiscale atrous convolution-based 1DCNN with dilated LSTM-aided fake news detection using regional language text information 利用区域语言文本信息，为基于无差别卷积的 1DCNN 和扩张 LSTM 的多尺度自适应加权特征融合辅助假新闻检测

IF 3 4区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Expert Systems

Pub Date : 2024-07-04 DOI: 10.1111/exsy.13665

V Rathinapriya, J. Kalaivani

The people in the world rely on social media for gathering news, and it is mainly because of the development of technology. The approaches employed in natural language processing are still deficient in judgement factors, and these techniques frequently rely upon political or social circumstances. Numerous low-level communities in the area are curious after experiencing the negative effects caused by the spread of false information in different sectors. Low-resource languages are still distracted, because these techniques are extensively employed in the English language. This work aims to provide an analysis of regional language fake news and develop a referral system with advanced techniques to identify fake news in Hindi and Tamil. This proposed model includes (a) Regional Language Text Collection; (b) Text preprocessing; (c) Feature Extraction; (d) Weighted Stacked Feature Fusion; and (e) Fake News Detection. The text data is collected from the standard datasets. The collected text data is preprocessed and given into the feature extraction, which is done by using bidirectional encoder representations from transformers (BERT), transformer networks, and seq2seq network for extracting the three sets of language text features. These extracted feature sets are inserted into the weighted stacked feature fusion model, where the three sets of extracted features are integrated with the optimized weights that are acquired through the enhanced osprey optimization algorithm (EOOA). Finally, these resultant features are given to multi-scale atrous convolution-based one-dimensional convolutional neural network with dilated long short-term memory (MACNN-DLSTM) for detecting the fake news. Throughout the result analysis, the experimentation is conducted based on the standard Tamil and Hindi datasets. Moreover, the developed model shows 92% for Hindi datasets and 96% for Tamil datasets which shows effective performance regarding accuracy measures. The experimental analysis is carried out by comparing with the conventional algorithms and detection techniques to showcase the efficiency of the developed regional language-based fake news detection model.

全世界的人们都依赖社交媒体来收集新闻，这主要是因为技术的发展。自然语言处理所采用的方法在判断因素方面仍然存在缺陷，这些技术经常依赖于政治或社会环境。在经历了不同领域虚假信息传播所造成的负面影响后，该地区的众多低水平社区感到好奇。由于这些技术在英语中被广泛使用，低资源语言仍然被分散注意力。这项工作旨在提供对地区语言虚假新闻的分析，并利用先进技术开发一个转介系统，以识别印地语和泰米尔语的虚假新闻。该建议模型包括：（a）区域语言文本收集；（b）文本预处理；（c）特征提取；（d）加权堆叠特征融合；以及（e）假新闻检测。文本数据收集自标准数据集。收集到的文本数据经过预处理后进行特征提取，提取时使用变压器双向编码器表示法（BERT）、变压器网络和 seq2seq 网络提取三组语言文本特征。这些提取的特征集被插入加权堆叠特征融合模型，在该模型中，三组提取的特征与通过增强型鱼鹰优化算法（EOOA）获得的优化权重相融合。最后，这些结果特征被赋予基于多尺度阿特罗斯卷积的一维卷积神经网络（MACNN-DLSTM），用于检测假新闻。在整个结果分析过程中，实验是基于标准泰米尔语和印地语数据集进行的。此外，所开发的模型在印地语数据集上的准确率为 92%，在泰米尔语数据集上的准确率为 96%，显示了在准确度测量方面的有效性能。实验分析通过与传统算法和检测技术的比较来展示所开发的基于区域语言的假新闻检测模型的效率。

{"title":"Adaptive weighted feature fusion for multiscale atrous convolution-based 1DCNN with dilated LSTM-aided fake news detection using regional language text information","authors":"V Rathinapriya, J. Kalaivani","doi":"10.1111/exsy.13665","DOIUrl":"10.1111/exsy.13665","url":null,"abstract":"The people in the world rely on social media for gathering news, and it is mainly because of the development of technology. The approaches employed in natural language processing are still deficient in judgement factors, and these techniques frequently rely upon political or social circumstances. Numerous low-level communities in the area are curious after experiencing the negative effects caused by the spread of false information in different sectors. Low-resource languages are still distracted, because these techniques are extensively employed in the English language. This work aims to provide an analysis of regional language fake news and develop a referral system with advanced techniques to identify fake news in Hindi and Tamil. This proposed model includes (a) Regional Language Text Collection; (b) Text preprocessing; (c) Feature Extraction; (d) Weighted Stacked Feature Fusion; and (e) Fake News Detection. The text data is collected from the standard datasets. The collected text data is preprocessed and given into the feature extraction, which is done by using bidirectional encoder representations from transformers (BERT), transformer networks, and seq2seq network for extracting the three sets of language text features. These extracted feature sets are inserted into the weighted stacked feature fusion model, where the three sets of extracted features are integrated with the optimized weights that are acquired through the enhanced osprey optimization algorithm (EOOA). Finally, these resultant features are given to multi-scale atrous convolution-based one-dimensional convolutional neural network with dilated long short-term memory (MACNN-DLSTM) for detecting the fake news. Throughout the result analysis, the experimentation is conducted based on the standard Tamil and Hindi datasets. Moreover, the developed model shows 92% for Hindi datasets and 96% for Tamil datasets which shows effective performance regarding accuracy measures. The experimental analysis is carried out by comparing with the conventional algorithms and detection techniques to showcase the efficiency of the developed regional language-based fake news detection model.","PeriodicalId":51053,"journal":{"name":"Expert Systems","volume":"41 11","pages":""},"PeriodicalIF":3.0,"publicationDate":"2024-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141551442","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

KDBI special issue: Explainability feature selection framework application for LSTM multivariate time‐series forecast self optimization KDBI 特刊：可解释性特征选择框架在 LSTM 多变量时间序列预测自我优化中的应用

IF 3.3 4区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Expert Systems

Pub Date : 2024-07-04 DOI: 10.1111/exsy.13674

Eduardo M. Rodrigues, Yassine Baghoussi, João Mendes‐Moreira

Deep learning models are widely used in multivariate time series forecasting, yet, they have high computational costs. One way to reduce this cost is by reducing data dimensionality, which involves removing unimportant or low importance information with the proper method. This work presents a study on an explainability feature selection framework composed of four methods (IMV‐LSTM Tensor, LIME‐LSTM, Average SHAP‐LSTM, and Instance SHAP‐LSTM) aimed at using the LSTM black‐box model complexity to its favour, with the end goal of improving the error metrics and reducing the computational cost on a forecast task. To test the framework, three datasets with a total of 101 multivariate time series were used, with the explainability methods outperforming the baseline methods in most of the data, be it in error metrics or computation time for the LSTM model training.

深度学习模型被广泛应用于多元时间序列预测，但其计算成本较高。降低成本的方法之一是降低数据维度，这就需要用适当的方法去除不重要或低重要性的信息。本研究介绍了由四种方法（IMV-LSTM Tensor、LIME-LSTM、Average SHAP-LSTM、Instance SHAP-LSTM）组成的可解释性特征选择框架，该框架旨在利用 LSTM 黑盒模型的复杂性，以改善误差指标和降低预测任务的计算成本为最终目标。为了测试该框架，我们使用了三个数据集，共包含 101 个多元时间序列，在大多数数据中，可解释性方法都优于基线方法，无论是误差指标还是 LSTM 模型训练的计算时间。

引用次数: 0