首页 > 最新文献

International Journal of Intelligent Systems最新文献

英文 中文
Tasks-Embedded Reparameterization: A Novel Framework for Task-Specific Transfer Enhancement With Multitask Prompt Learning 任务内嵌再参数化:一种基于多任务提示学习的任务特定迁移增强新框架
IF 3.7 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-10-17 DOI: 10.1155/int/1688391
Jingjing Liu, Yishuai Song, Rui Jiang, Yi Feng, Mo Tao, Yinlin Li

Current​ fine-tuning techniques for large pretrained language models (LLMs) face significant challenges, particularly regarding the high computational costs associated with adapting billions of parameters and their limitations in effectively addressing diverse language understanding tasks. These methods often result in an inability to manage inter-task dependencies effectively, leading to underutilization of inter-task information. To address these issues, we propose tasks-embedded reparameterization (TER), a novel parameter-efficient fine-tuning framework that exploits multitask learning to enhance task-specific capabilities. The TER model integrates prompt tuning and multitask reparameterization, merging task-specific experts and hidden states of target tasks in a unified model framework. Furthermore, it employs a dynamic, task-oriented gating mechanism to optimize the prompts output by the model. This method dynamically adjusts the parameters according to the differing requirements of the task, ensuring that the model optimally adjusts the parameters according to the specific requirements of the task, so that the task can find a suitable balance between different tasks and improve knowledge sharing and task adaptability. Experimental evaluations using the SuperGLUE benchmark demonstrate that TER consistently outperforms existing parameter-efficient fine-tuning techniques in both performance and computational efficiency, offering a promising solution for task-specific language understanding in both research and industry.

目前针对大型预训练语言模型(llm)的微调技术面临着重大挑战,特别是与适应数十亿个参数相关的高计算成本以及有效解决各种语言理解任务的局限性。这些方法通常导致无法有效地管理任务间依赖关系,从而导致任务间信息的利用不足。为了解决这些问题,我们提出了任务嵌入再参数化(TER),这是一种新的参数高效微调框架,利用多任务学习来增强特定任务的能力。TER模型集成了即时调优和多任务重参数化,在统一的模型框架中合并了特定任务的专家和目标任务的隐藏状态。此外,它采用了一种动态的、面向任务的门控机制来优化模型的提示输出。该方法根据任务的不同要求动态调整参数,保证模型根据任务的具体要求对参数进行最优调整,使任务在不同任务之间找到合适的平衡点,提高知识共享和任务适应性。使用SuperGLUE基准的实验评估表明,TER在性能和计算效率方面始终优于现有的参数高效微调技术,为研究和工业中特定任务的语言理解提供了一个有前途的解决方案。
{"title":"Tasks-Embedded Reparameterization: A Novel Framework for Task-Specific Transfer Enhancement With Multitask Prompt Learning","authors":"Jingjing Liu,&nbsp;Yishuai Song,&nbsp;Rui Jiang,&nbsp;Yi Feng,&nbsp;Mo Tao,&nbsp;Yinlin Li","doi":"10.1155/int/1688391","DOIUrl":"https://doi.org/10.1155/int/1688391","url":null,"abstract":"<p>Current​ fine-tuning techniques for large pretrained language models (LLMs) face significant challenges, particularly regarding the high computational costs associated with adapting billions of parameters and their limitations in effectively addressing diverse language understanding tasks. These methods often result in an inability to manage inter-task dependencies effectively, leading to underutilization of inter-task information. To address these issues, we propose tasks-embedded reparameterization (TER), a novel parameter-efficient fine-tuning framework that exploits multitask learning to enhance task-specific capabilities. The TER model integrates prompt tuning and multitask reparameterization, merging task-specific experts and hidden states of target tasks in a unified model framework. Furthermore, it employs a dynamic, task-oriented gating mechanism to optimize the prompts output by the model. This method dynamically adjusts the parameters according to the differing requirements of the task, ensuring that the model optimally adjusts the parameters according to the specific requirements of the task, so that the task can find a suitable balance between different tasks and improve knowledge sharing and task adaptability. Experimental evaluations using the SuperGLUE benchmark demonstrate that TER consistently outperforms existing parameter-efficient fine-tuning techniques in both performance and computational efficiency, offering a promising solution for task-specific language understanding in both research and industry.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":"2025 1","pages":""},"PeriodicalIF":3.7,"publicationDate":"2025-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/int/1688391","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145317591","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhanced Financial Fraud Detection via SISAE-METADES: A Supervised Deep Representation and Dynamic Ensemble Approach 基于siae - metades的增强金融欺诈检测:一种监督深度表示和动态集成方法
IF 3.7 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-10-16 DOI: 10.1155/int/8869784
Chang Wang, Sheng Fang, Fangsu Zhao, Zongmei Mu

Detecting financial reporting fraud is vital for preserving market integrity and protecting investors from substantial losses. Yet, the challenges of high dimensionality and noisy financial data often undermine the effectiveness of existing financial fraud detection systems. To address these issues, this study proposes SISAE-METADES, a novel framework that integrates a supervised input-enhanced stacked autoencoder (SISAE) with a meta-learning–based dynamic ensemble selection (METADES) strategy. Unlike conventional stacked autoencoders, SISAE concatenates the original input at each encoding stage and incorporates label supervision, thereby learning task-relevant and class-discriminative representations. These enriched deep features improve both the diversity and competence of base classifiers and enable METADES to achieve more reliable local competence estimation. We validate the proposed framework using financial statement data from Chinese A-share listed companies (2005–2023), covering 71 indicators. Experimental results show that SISAE-METADES significantly outperforms standalone SISAE, traditional METADES, and several state-of-the-art baselines. In particular, it achieves substantial improvements in accuracy, recall, and F1-score, underscoring the robustness and effectiveness of combining supervised deep representation learning with dynamic ensemble selection for financial fraud detection. These findings highlight the framework’s practical significance in reducing investor losses, strengthening market confidence, and promoting the stability of the financial system.

检测财务报告欺诈对于维护市场诚信和保护投资者免受重大损失至关重要。然而,高维和嘈杂的金融数据的挑战往往会破坏现有金融欺诈检测系统的有效性。为了解决这些问题,本研究提出了SISAE-METADES,这是一个将监督输入增强堆叠自编码器(SISAE)与基于元学习的动态集成选择(METADES)策略集成在一起的新框架。与传统的堆叠式自编码器不同,SISAE在每个编码阶段将原始输入连接起来,并结合标签监督,从而学习任务相关和类别区分表示。这些丰富的深度特征提高了基分类器的多样性和能力,使METADES能够实现更可靠的局部能力估计。我们利用中国a股上市公司2005-2023年的财务报表数据,涵盖71个指标,对提出的框架进行了验证。实验结果表明,SISAE-METADES显著优于独立的SISAE、传统的METADES和几种最先进的基线。特别是,它在准确性、召回率和f1分数方面取得了实质性的改进,强调了将监督深度表示学习与动态集成选择相结合用于金融欺诈检测的鲁棒性和有效性。这些发现突出了该框架在减少投资者损失、增强市场信心和促进金融体系稳定方面的现实意义。
{"title":"Enhanced Financial Fraud Detection via SISAE-METADES: A Supervised Deep Representation and Dynamic Ensemble Approach","authors":"Chang Wang,&nbsp;Sheng Fang,&nbsp;Fangsu Zhao,&nbsp;Zongmei Mu","doi":"10.1155/int/8869784","DOIUrl":"https://doi.org/10.1155/int/8869784","url":null,"abstract":"<p>Detecting financial reporting fraud is vital for preserving market integrity and protecting investors from substantial losses. Yet, the challenges of high dimensionality and noisy financial data often undermine the effectiveness of existing financial fraud detection systems. To address these issues, this study proposes SISAE-METADES, a novel framework that integrates a supervised input-enhanced stacked autoencoder (SISAE) with a meta-learning–based dynamic ensemble selection (METADES) strategy. Unlike conventional stacked autoencoders, SISAE concatenates the original input at each encoding stage and incorporates label supervision, thereby learning task-relevant and class-discriminative representations. These enriched deep features improve both the diversity and competence of base classifiers and enable METADES to achieve more reliable local competence estimation. We validate the proposed framework using financial statement data from Chinese A-share listed companies (2005–2023), covering 71 indicators. Experimental results show that SISAE-METADES significantly outperforms standalone SISAE, traditional METADES, and several state-of-the-art baselines. In particular, it achieves substantial improvements in accuracy, recall, and F1-score, underscoring the robustness and effectiveness of combining supervised deep representation learning with dynamic ensemble selection for financial fraud detection. These findings highlight the framework’s practical significance in reducing investor losses, strengthening market confidence, and promoting the stability of the financial system.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":"2025 1","pages":""},"PeriodicalIF":3.7,"publicationDate":"2025-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/int/8869784","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145317273","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hybrid Model for Visual Sentiment Classification Using Content-Based Image Retrieval and Multi-Input Convolutional Neural Network 基于内容图像检索和多输入卷积神经网络的视觉情感分类混合模型
IF 3.7 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-10-16 DOI: 10.1155/int/5581601
Israa K. Salman Al-Tameemi, Mohammad-Reza Feizi-Derakhshi, Zari Farhadi, Amir-Reza Feizi-Derakhshi

With the exponential growth of multimedia content, visual sentiment classification has emerged as a significant research area. However, it poses unique challenges due to the complexity and subjective nature of the visual information. This can be attributed to the significant presence of semantically ambiguous images within the current benchmark datasets, which enhances the performance of sentiment analysis but ignores the differences between various annotators. Moreover, most current methods concentrate on improving local emotional representations that focus on object extraction procedures rather than utilizing robust features that can effectively indicate the relevance of objects within an image through color information. Motivated by these observations, this paper addresses the need for efficient algorithms for labeling and classifying sentiment from visual images by introducing a novel hybrid model, which combines content-based image retrieval (CBIR) and a multi-input convolutional neural network (CNN). The CBIR model extracts color features from all dataset images, creating a numerical representation for each. It compares a query image to dataset images’ features to find similar features. This process continues until the images are grouped according to color similarity, which allows accurate sentimental categories based on similar features and feelings. Then, a multi-input CNN model is utilized to extract and efficiently incorporate high-level contextual visual information. This model comprises 70 layers, with six branches, each containing 11 layers. It seeks to facilitate the fusion of complementary information by incorporating multiple input categories that differ according to the color features extracted by the CBIR technique. This feature enables the model to understand the target and generate more precise predictions fully. The proposed model demonstrates significant improvements over existing algorithms, as evidenced by evaluations of six benchmark datasets of varying sizes. Also, it outperforms the state of the art in sentiment classification accuracy, getting 87.88%, 84.62%, 84.1%, 83.7%, 80.7%, and 91.2% accuracy for the EmotionROI, ArtPhoto, Twitter I, Twitter II, Abstract, and FI datasets, respectively. Furthermore, the model is evaluated on two newly collected large datasets, which confirm its scalability and robustness in handling large-scale sentiment classification tasks, and thus achieves a significant accuracy of 85.21% and 83.72% with the BGETTY and Twitter datasets, respectively. This paper contributes to the advancement of visual sentiment classification by offering a comprehensive solution for analyzing sentiment from images and laying the foundation for further research.

随着多媒体内容呈指数级增长,视觉情感分类已成为一个重要的研究领域。然而,由于视觉信息的复杂性和主观性,它提出了独特的挑战。这可以归因于当前基准数据集中存在语义模糊的图像,这增强了情感分析的性能,但忽略了不同注释器之间的差异。此外,目前的大多数方法都集中在改进局部情感表征上,这些表征关注的是对象提取过程,而不是利用鲁棒特征,通过颜色信息有效地指示图像中对象的相关性。基于这些观察结果,本文通过引入一种结合基于内容的图像检索(CBIR)和多输入卷积神经网络(CNN)的新型混合模型,解决了对视觉图像情感标记和分类的高效算法的需求。CBIR模型从所有数据集图像中提取颜色特征,为每个图像创建一个数字表示。它将查询图像与数据集图像的特征进行比较,以找到相似的特征。这个过程一直持续到图像根据颜色相似度分组,这样就可以根据相似的特征和感觉进行准确的情感分类。然后,利用多输入CNN模型提取并高效融合高级上下文视觉信息。该模型包括70层,6个分支,每个分支包含11层。它寻求通过合并根据CBIR技术提取的颜色特征不同的多个输入类别来促进互补信息的融合。这一特性使模型能够充分理解目标并生成更精确的预测。通过对六个不同大小的基准数据集的评估,证明了所提出的模型比现有算法有了显著的改进。此外,它在情感分类准确率方面也优于目前的技术水平,在EmotionROI、ArtPhoto、Twitter I、Twitter II、Abstract和FI数据集上分别获得87.88%、84.62%、84.1%、83.7%、80.7%和91.2%的准确率。此外,在两个新收集的大型数据集上对该模型进行了评估,验证了该模型在处理大规模情感分类任务方面的可扩展性和鲁棒性,在BGETTY和Twitter数据集上分别达到了85.21%和83.72%的显著准确率。本文为图像情感分析提供了一个全面的解决方案,为进一步的研究奠定了基础,有助于视觉情感分类的发展。
{"title":"Hybrid Model for Visual Sentiment Classification Using Content-Based Image Retrieval and Multi-Input Convolutional Neural Network","authors":"Israa K. Salman Al-Tameemi,&nbsp;Mohammad-Reza Feizi-Derakhshi,&nbsp;Zari Farhadi,&nbsp;Amir-Reza Feizi-Derakhshi","doi":"10.1155/int/5581601","DOIUrl":"https://doi.org/10.1155/int/5581601","url":null,"abstract":"<p>With the exponential growth of multimedia content, visual sentiment classification has emerged as a significant research area. However, it poses unique challenges due to the complexity and subjective nature of the visual information. This can be attributed to the significant presence of semantically ambiguous images within the current benchmark datasets, which enhances the performance of sentiment analysis but ignores the differences between various annotators. Moreover, most current methods concentrate on improving local emotional representations that focus on object extraction procedures rather than utilizing robust features that can effectively indicate the relevance of objects within an image through color information. Motivated by these observations, this paper addresses the need for efficient algorithms for labeling and classifying sentiment from visual images by introducing a novel hybrid model, which combines content-based image retrieval (CBIR) and a multi-input convolutional neural network (CNN). The CBIR model extracts color features from all dataset images, creating a numerical representation for each. It compares a query image to dataset images’ features to find similar features. This process continues until the images are grouped according to color similarity, which allows accurate sentimental categories based on similar features and feelings. Then, a multi-input CNN model is utilized to extract and efficiently incorporate high-level contextual visual information. This model comprises 70 layers, with six branches, each containing 11 layers. It seeks to facilitate the fusion of complementary information by incorporating multiple input categories that differ according to the color features extracted by the CBIR technique. This feature enables the model to understand the target and generate more precise predictions fully. The proposed model demonstrates significant improvements over existing algorithms, as evidenced by evaluations of six benchmark datasets of varying sizes. Also, it outperforms the state of the art in sentiment classification accuracy, getting 87.88%, 84.62%, 84.1%, 83.7%, 80.7%, and 91.2% accuracy for the EmotionROI, ArtPhoto, Twitter I, Twitter II, Abstract, and FI datasets, respectively. Furthermore, the model is evaluated on two newly collected large datasets, which confirm its scalability and robustness in handling large-scale sentiment classification tasks, and thus achieves a significant accuracy of 85.21% and 83.72% with the BGETTY and Twitter datasets, respectively. This paper contributes to the advancement of visual sentiment classification by offering a comprehensive solution for analyzing sentiment from images and laying the foundation for further research.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":"2025 1","pages":""},"PeriodicalIF":3.7,"publicationDate":"2025-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/int/5581601","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145317290","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Novel Clustering-Forecast Method With Nonlinear Logo Information Filtering Networks 一种新的非线性标志信息过滤网络聚类预测方法
IF 3.7 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-10-16 DOI: 10.1155/int/6410414
Qingyang Liu, Ramin Yahyapour

In this paper, we introduced a novel methodology to build a classification-forecast model used for financial risk forewarning. For the first step, we utilize the K–S test, Mann–Whitney U test, and Pearson’s correlation to select variables. Then, we employ CRITIC and fuzzy comprehensive evaluation (FCE) methods to score the risk of listed companies. Following this, self-organizing maps (SOM) clustering is utilized to segment the samples into five distinct risk levels. For the second step, we utilized triangulated maximally filtered graph (TMFG) and maximally filtered clique forest (MFCF) to minimize the number of indicators based on the dependent relationships between variables. These are then combined with Gaussian Markov random field (GMRF) and Copula algorithms to address nonlinear situations, forming what we refer to as the LoGo model. To further enhance the accuracy of LoGo models, we utilize the square Mahalanobis distance to compute the log-likelihoods as part matrix. The results reveal that the enhanced LoGo model with part matrix improves average accuracy by 7% compared with the original models without part matrix, albeit with a tenfold increase in execution time. MFCF demonstrates superior performance over TMFG in linear situations, achieving a 40% higher accuracy. However, under nonlinear circumstances, TMFG only requires half the execution time of MFCF, yet achieves a slightly higher average accuracy. Furthermore, compared with the widely used CNN models, the enhanced LoGo models show superior performance as they achieved closed accuracy in a shorter time.

本文介绍了一种新的方法来建立用于金融风险预警的分类预测模型。第一步,我们利用K-S检验、Mann-Whitney U检验和Pearson相关性来选择变量。然后,采用critical和模糊综合评价(FCE)方法对上市公司风险进行评分。在此之后,利用自组织图(SOM)聚类将样本划分为五个不同的风险水平。第二步,我们利用三角化最大过滤图(TMFG)和最大过滤团森林(MFCF)基于变量之间的依赖关系来最小化指标的数量。然后将它们与高斯马尔可夫随机场(GMRF)和Copula算法结合起来处理非线性情况,形成我们所说的LoGo模型。为了进一步提高LoGo模型的准确性,我们利用马氏距离的平方来计算对数似然作为部分矩阵。结果表明,与没有部分矩阵的原始模型相比,带有部分矩阵的增强LoGo模型的平均准确率提高了7%,尽管执行时间增加了10倍。在线性情况下,MFCF表现出优于TMFG的性能,实现了40%以上的精度。然而,在非线性情况下,TMFG只需要MFCF一半的执行时间,但平均精度略高。此外,与广泛使用的CNN模型相比,增强的LoGo模型在更短的时间内实现了封闭精度,表现出更优越的性能。
{"title":"A Novel Clustering-Forecast Method With Nonlinear Logo Information Filtering Networks","authors":"Qingyang Liu,&nbsp;Ramin Yahyapour","doi":"10.1155/int/6410414","DOIUrl":"https://doi.org/10.1155/int/6410414","url":null,"abstract":"<p>In this paper, we introduced a novel methodology to build a classification-forecast model used for financial risk forewarning. For the first step, we utilize the K–S test, Mann–Whitney <i>U</i> test, and Pearson’s correlation to select variables. Then, we employ CRITIC and fuzzy comprehensive evaluation (FCE) methods to score the risk of listed companies. Following this, self-organizing maps (SOM) clustering is utilized to segment the samples into five distinct risk levels. For the second step, we utilized triangulated maximally filtered graph (TMFG) and maximally filtered clique forest (MFCF) to minimize the number of indicators based on the dependent relationships between variables. These are then combined with Gaussian Markov random field (GMRF) and Copula algorithms to address nonlinear situations, forming what we refer to as the LoGo model. To further enhance the accuracy of LoGo models, we utilize the square Mahalanobis distance to compute the log-likelihoods as part matrix. The results reveal that the enhanced LoGo model with part matrix improves average accuracy by 7% compared with the original models without part matrix, albeit with a tenfold increase in execution time. MFCF demonstrates superior performance over TMFG in linear situations, achieving a 40% higher accuracy. However, under nonlinear circumstances, TMFG only requires half the execution time of MFCF, yet achieves a slightly higher average accuracy. Furthermore, compared with the widely used CNN models, the enhanced LoGo models show superior performance as they achieved closed accuracy in a shorter time.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":"2025 1","pages":""},"PeriodicalIF":3.7,"publicationDate":"2025-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/int/6410414","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145317291","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reasoning-Guided LLM Translation Optimization: A Framework Using Multidimensional Postediting Feedback 推理引导的法学硕士翻译优化:一个使用多维后编辑反馈的框架
IF 3.7 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-10-15 DOI: 10.1155/int/9971702
Yan Huang, Xiaogang Zang, Chenyang Ji, Zhuo Chen

While Large Language Models (LLMs) demonstrate strong translation capabilities, optimizing their output towards human-level refinement necessitates reasoning-guided approaches that move beyond simple generation. This paper introduces Multidimensional Feedback and Postedit Thought (MFPE), a novel framework specifically designed for reasoning-guided LLM translation optimization. MFPE operationalizes this guidance by leveraging multidimensional postediting feedback, which acts as explicit reasoning signals to the LLM. This feedback mechanism simulates the human postediting process, where errors are systematically identified and corrected. Generated by a dedicated optimization model trained on a synthetic dataset (using GLM-4 and inspired by multidimensional quality metrics (MQM), this feedback provides fine-grained error details including spans, categories, and quantities from initial LLM translations. We conduct experiments across four language pairs: Chinese-English, German-English, English-Chinese, and English-German. The results show that fine-tuning with structured, reasoning-like feedback significantly enhances translation quality and outperforms standard bilingual fine-tuning approaches. Our findings highlight the effectiveness of simulating postediting reasoning through structured feedback, offering a promising direction for harnessing and improving the inferential capabilities of LLMs for complex tasks like high-quality machine translation.

虽然大型语言模型(llm)展示了强大的翻译能力,但将其输出优化到人类水平的细化需要推理引导的方法,而不仅仅是简单的生成。本文介绍了一种专门为推理引导的法学硕士翻译优化设计的新框架——多维反馈和post - dit思想(MFPE)。MFPE通过利用多维后期编辑反馈来实现该指导,该反馈作为向LLM发出的明确推理信号。这种反馈机制模拟了人类的后期编辑过程,在这个过程中,错误被系统地识别和纠正。该反馈由在合成数据集(使用GLM-4)上训练的专用优化模型生成,并受到多维质量度量(MQM)的启发,提供细粒度的错误细节,包括初始LLM翻译的范围、类别和数量。我们进行了四种语言对的实验:汉语-英语、德语-英语、英语-汉语和英语-德语。结果表明,使用结构化的、类似推理的反馈进行微调可以显著提高翻译质量,并且优于标准的双语微调方法。我们的研究结果强调了通过结构化反馈模拟编辑后推理的有效性,为利用和提高法学硕士在高质量机器翻译等复杂任务中的推理能力提供了一个有希望的方向。
{"title":"Reasoning-Guided LLM Translation Optimization: A Framework Using Multidimensional Postediting Feedback","authors":"Yan Huang,&nbsp;Xiaogang Zang,&nbsp;Chenyang Ji,&nbsp;Zhuo Chen","doi":"10.1155/int/9971702","DOIUrl":"https://doi.org/10.1155/int/9971702","url":null,"abstract":"<p>While Large Language Models (LLMs) demonstrate strong translation capabilities, optimizing their output towards human-level refinement necessitates reasoning-guided approaches that move beyond simple generation. This paper introduces Multidimensional Feedback and Postedit Thought (MFPE), a novel framework specifically designed for reasoning-guided LLM translation optimization. MFPE operationalizes this guidance by leveraging multidimensional postediting feedback, which acts as explicit reasoning signals to the LLM. This feedback mechanism simulates the human postediting process, where errors are systematically identified and corrected. Generated by a dedicated optimization model trained on a synthetic dataset (using GLM-4 and inspired by multidimensional quality metrics (MQM), this feedback provides fine-grained error details including spans, categories, and quantities from initial LLM translations. We conduct experiments across four language pairs: Chinese-English, German-English, English-Chinese, and English-German. The results show that fine-tuning with structured, reasoning-like feedback significantly enhances translation quality and outperforms standard bilingual fine-tuning approaches. Our findings highlight the effectiveness of simulating postediting reasoning through structured feedback, offering a promising direction for harnessing and improving the inferential capabilities of LLMs for complex tasks like high-quality machine translation.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":"2025 1","pages":""},"PeriodicalIF":3.7,"publicationDate":"2025-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/int/9971702","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145317037","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Neural Lyapunov Control for Caputo Fractional-Order Systems 分数阶Caputo系统的神经Lyapunov控制
IF 3.7 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-10-15 DOI: 10.1155/int/3639257
Xiaoya Gao, Guoqing Jiang, Ran Huang, Cong Wu

This article presents a novel neural network–based approach for designing effective control policies for Caputo-type nonlinear fractional-order systems. The proposed approach iteratively refines the neural network to generate a control policy that stabilizes the system within a predefined neighborhood around the zero equilibrium. Stability of the controlled system is guaranteed by rigorously formulated theorems and empirically verified using a neural Lyapunov function. The effectiveness of the proposed methodology is demonstrated through simulations on two classical Caputo fractional-order systems, showcasing its capability to ensure stability and its potential applicability to a broader range of fractional-order nonlinear systems.

本文提出了一种新的基于神经网络的方法来设计卡普托型非线性分数阶系统的有效控制策略。该方法对神经网络进行迭代优化,生成控制策略,使系统稳定在零平衡点附近的预定义邻域内。被控系统的稳定性由严格表述的定理保证,并使用神经李雅普诺夫函数进行经验验证。通过对两个经典Caputo分数阶系统的仿真,证明了该方法的有效性,证明了其确保稳定性的能力以及对更广泛分数阶非线性系统的潜在适用性。
{"title":"Neural Lyapunov Control for Caputo Fractional-Order Systems","authors":"Xiaoya Gao,&nbsp;Guoqing Jiang,&nbsp;Ran Huang,&nbsp;Cong Wu","doi":"10.1155/int/3639257","DOIUrl":"https://doi.org/10.1155/int/3639257","url":null,"abstract":"<p>This article presents a novel neural network–based approach for designing effective control policies for Caputo-type nonlinear fractional-order systems. The proposed approach iteratively refines the neural network to generate a control policy that stabilizes the system within a predefined neighborhood around the zero equilibrium. Stability of the controlled system is guaranteed by rigorously formulated theorems and empirically verified using a neural Lyapunov function. The effectiveness of the proposed methodology is demonstrated through simulations on two classical Caputo fractional-order systems, showcasing its capability to ensure stability and its potential applicability to a broader range of fractional-order nonlinear systems.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":"2025 1","pages":""},"PeriodicalIF":3.7,"publicationDate":"2025-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/int/3639257","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145317038","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Augmented Slime Mold Algorithm Based on Spiral Sensing Search Mechanism and Its Engineering Application for Photovoltaic Cell Parameter Identification Problem 基于螺旋传感搜索机制的增强型黏菌算法及其在光伏电池参数识别问题中的工程应用
IF 3.7 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-10-13 DOI: 10.1155/int/9642959
Qian Qian, Hongyu Li, Anbo Wang, Jiawen Pan, Miao Song, Yong Feng, Yingna Li

The slime mold algorithm (SMA) is a metaheuristic optimization algorithm that simulates the foraging behavior of slime molds. Compared to other optimization algorithms, SMA has fewer parameters, faster convergence speed, and stronger optimization capabilities. However, the standard SMA uses two randomly selected individuals to guide the search direction of the population, which results in excessive randomness during the search process. This can lead to the loss of valuable information and waste computational resources. To overcome these limitations, this study proposes an enhanced slime mold algorithm (S2SMA) based on a spiral sensing search mechanism. The main contributions of this study are as follows: Firstly, a fitness–distance balanced oscillation search mechanism is introduced to solve the issue of lack of guidance in the individual oscillatory search phase in the original SMA, thus enhancing the global exploration ability of the algorithm. Secondly, the spiral sensing search mechanism is introduced, reshaping the random redistribution behavior in SMA. This aims to fully utilize the effective information in the existing population, improve search efficiency, and enhance population diversity. Finally, the computational logic of SMA is restructured based on the existing parameters, improving the algorithm’s performance while avoiding additional computational overhead. To validate the effectiveness of the proposed S2SMA, experiments were conducted on 71 test instances from the IEEE CEC2017 and IEEE CEC2021 benchmark sets, as well as three engineering problems. The algorithm was compared with classical algorithms, high-performance algorithms, and advanced SMA variants. Experimental results show that S2SMA outperforms the classical algorithms, high-performance algorithms, and other SMA variants in terms of both performance and robustness, demonstrating its potential application in engineering optimization.

黏菌算法是一种模拟黏菌觅食行为的元启发式优化算法。与其他优化算法相比,SMA具有参数少、收敛速度快、优化能力强等优点。然而,标准SMA使用随机选择的两个个体来指导总体的搜索方向,导致搜索过程中的随机性过大。这可能导致宝贵信息的丢失和计算资源的浪费。为了克服这些局限性,本研究提出了一种基于螺旋传感搜索机制的增强型黏菌算法(S2SMA)。本研究的主要贡献如下:首先,引入了适应度-距离平衡振荡搜索机制,解决了原始SMA在单个振荡搜索阶段缺乏引导的问题,增强了算法的全局搜索能力;其次,引入螺旋传感搜索机制,重塑SMA中的随机再分配行为;这样做的目的是充分利用现有种群中的有效信息,提高搜索效率,增强种群多样性。最后,基于现有参数重构SMA的计算逻辑,在避免额外计算开销的同时提高了算法的性能。为了验证所提出的S2SMA的有效性,在IEEE CEC2017和IEEE CEC2021基准集的71个测试实例以及3个工程问题上进行了实验。将该算法与经典算法、高性能算法和先进SMA变体进行了比较。实验结果表明,S2SMA在性能和鲁棒性方面均优于经典算法、高性能算法和其他SMA变体,显示了其在工程优化中的潜在应用。
{"title":"An Augmented Slime Mold Algorithm Based on Spiral Sensing Search Mechanism and Its Engineering Application for Photovoltaic Cell Parameter Identification Problem","authors":"Qian Qian,&nbsp;Hongyu Li,&nbsp;Anbo Wang,&nbsp;Jiawen Pan,&nbsp;Miao Song,&nbsp;Yong Feng,&nbsp;Yingna Li","doi":"10.1155/int/9642959","DOIUrl":"https://doi.org/10.1155/int/9642959","url":null,"abstract":"<p>The slime mold algorithm (SMA) is a metaheuristic optimization algorithm that simulates the foraging behavior of slime molds. Compared to other optimization algorithms, SMA has fewer parameters, faster convergence speed, and stronger optimization capabilities. However, the standard SMA uses two randomly selected individuals to guide the search direction of the population, which results in excessive randomness during the search process. This can lead to the loss of valuable information and waste computational resources. To overcome these limitations, this study proposes an enhanced slime mold algorithm (S2SMA) based on a spiral sensing search mechanism. The main contributions of this study are as follows: Firstly, a fitness–distance balanced oscillation search mechanism is introduced to solve the issue of lack of guidance in the individual oscillatory search phase in the original SMA, thus enhancing the global exploration ability of the algorithm. Secondly, the spiral sensing search mechanism is introduced, reshaping the random redistribution behavior in SMA. This aims to fully utilize the effective information in the existing population, improve search efficiency, and enhance population diversity. Finally, the computational logic of SMA is restructured based on the existing parameters, improving the algorithm’s performance while avoiding additional computational overhead. To validate the effectiveness of the proposed S2SMA, experiments were conducted on 71 test instances from the IEEE CEC2017 and IEEE CEC2021 benchmark sets, as well as three engineering problems. The algorithm was compared with classical algorithms, high-performance algorithms, and advanced SMA variants. Experimental results show that S2SMA outperforms the classical algorithms, high-performance algorithms, and other SMA variants in terms of both performance and robustness, demonstrating its potential application in engineering optimization.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":"2025 1","pages":""},"PeriodicalIF":3.7,"publicationDate":"2025-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/int/9642959","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145316970","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
VQ-Rice: Integrating Variational Quantum Models for Intelligent Rice Disease Classification VQ-Rice:整合变分量子模型的水稻病害智能分类
IF 3.7 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-10-08 DOI: 10.1155/int/9911441
Daya Shankar Verma, Jitendra K. Mishra, Ankit Kumar, Abdul Khader Jilani Saudagar, Shambhu Mahato

This study presents a novel hybrid quantum-classical framework for rice disease diagnosis, leveraging variational quantum circuits (VQCs) to address the limitations of traditional and deep learning models in precision agriculture. The proposed Quantum Variational Rice Disease Network (QVRDN) integrates quantum feature encoding, variational quantum processing, and adaptive optimization to achieve superior classification accuracy, efficiency, and robustness. Using a curated dataset of 3000 annotated rice leaf images spanning major disease categories, the QVRDN framework applies dimensionality reduction and quantum angle encoding to transform the image features into quantum states, which are then processed by parameterized quantum circuits for disease classification. Experimental results demonstrate that QVRDN outperforms classical models, including SVM, random forest, CNN, and ResNet50-achieving, the highest accuracy of 97.8%, faster inference times, and greater resilience to noise and limited data. The compact design of the framework enables edge deployment without GPU dependency, making it suitable for resource-constrained agricultural environments. By demonstrating the feasibility and advantages of quantum machine learning in crop health monitoring, this study establishes a foundation for quantum-enhanced, data-efficient agricultural diagnostics and paves the way for future advances in intelligent, field-ready quantum geoinformatics systems.

本研究提出了一种用于水稻病害诊断的新型混合量子-经典框架,利用变分量子电路(vqc)来解决传统和深度学习模型在精准农业中的局限性。提出的量子变分水稻病害网络(QVRDN)集成了量子特征编码、变分量子处理和自适应优化,实现了较好的分类精度、效率和鲁棒性。QVRDN框架利用3000张标注水稻叶片图像的数据集,跨越主要疾病类别,采用降维和量子角编码将图像特征转换为量子态,然后通过参数化量子电路进行处理,用于疾病分类。实验结果表明,QVRDN优于SVM、随机森林、CNN和resnet50等经典模型,准确率高达97.8%,推理速度更快,对噪声和有限数据的适应能力更强。框架的紧凑设计使边缘部署不依赖GPU,使其适合资源受限的农业环境。通过展示量子机器学习在作物健康监测中的可行性和优势,本研究为量子增强、数据高效的农业诊断奠定了基础,并为智能、现场就绪的量子地理信息系统的未来发展铺平了道路。
{"title":"VQ-Rice: Integrating Variational Quantum Models for Intelligent Rice Disease Classification","authors":"Daya Shankar Verma,&nbsp;Jitendra K. Mishra,&nbsp;Ankit Kumar,&nbsp;Abdul Khader Jilani Saudagar,&nbsp;Shambhu Mahato","doi":"10.1155/int/9911441","DOIUrl":"https://doi.org/10.1155/int/9911441","url":null,"abstract":"<p>This study presents a novel hybrid quantum-classical framework for rice disease diagnosis, leveraging variational quantum circuits (VQCs) to address the limitations of traditional and deep learning models in precision agriculture. The proposed Quantum Variational Rice Disease Network (QVRDN) integrates quantum feature encoding, variational quantum processing, and adaptive optimization to achieve superior classification accuracy, efficiency, and robustness. Using a curated dataset of 3000 annotated rice leaf images spanning major disease categories, the QVRDN framework applies dimensionality reduction and quantum angle encoding to transform the image features into quantum states, which are then processed by parameterized quantum circuits for disease classification. Experimental results demonstrate that QVRDN outperforms classical models, including SVM, random forest, CNN, and ResNet50-achieving, the highest accuracy of 97.8%, faster inference times, and greater resilience to noise and limited data. The compact design of the framework enables edge deployment without GPU dependency, making it suitable for resource-constrained agricultural environments. By demonstrating the feasibility and advantages of quantum machine learning in crop health monitoring, this study establishes a foundation for quantum-enhanced, data-efficient agricultural diagnostics and paves the way for future advances in intelligent, field-ready quantum geoinformatics systems.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":"2025 1","pages":""},"PeriodicalIF":3.7,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/int/9911441","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145271953","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Airflow Field Prediction for Quadrotor UAVs Based on Spatiotemporal Prediction Network 基于时空预测网络的四旋翼无人机气流场预测
IF 3.7 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-10-07 DOI: 10.1155/int/3828807
Qiwei Guo, Zhijian Fan, Yu Tang, Mingwei Fang, Jiajun Zhuang, Xiaobing Chen, Chaojun Hou, Yong He

To address the limitations of traditional computational fluid dynamics (CFD) simulations, such as high computational cost, long processing times, and limited scalability, this study identifies the inefficiencies of existing data-driven prediction methods, which often lack spatial–temporal coordination mechanisms and fail to capture fine-grained dynamic features of UAV airflow fields. We propose a novel deep learning model, VAN-ConvLSTM, for rapid and accurate prediction of UAV downwash airflow. Unlike conventional ConvLSTM-based frameworks, which struggle with modeling long-range dependencies and detailed spatial variations, our model introduces a visual attention unit (VAN) to enhance spatiotemporal sensitivity. The model architecture combines a convolutional encoder for spatial feature extraction, a VAN module for attention-guided temporal modeling, and a ConvLSTM decoder for sequence generation. This synergistic design improves both the accuracy and interpretability of airflow prediction. Experimental results show that the VAN-ConvLSTM model achieves an SSIM score of 0.96, demonstrating high consistency with CFD simulations. Compared to baseline methods, our model reduces error while improving stability and spatial fidelity. Ablation studies further validate the individual contributions of VAN and ConvLSTM modules. The results, verified through three representative case studies, confirm that VAN-ConvLSTM outperforms state-of-the-art approaches across multiple evaluation metrics, while offering significantly enhanced computational efficiency. This demonstrates its strong potential as a reliable and scalable alternative to traditional CFD methods in rotor airflow prediction scenarios.

针对传统计算流体动力学(CFD)模拟计算成本高、处理时间长、可扩展性有限等局限性,本研究发现现有数据驱动预测方法效率低下,往往缺乏时空协调机制,无法捕捉无人机气流场的细粒度动态特征。为了快速准确地预测无人机下洗气流,我们提出了一种新的深度学习模型VAN-ConvLSTM。与传统的基于convlstm的框架不同,该模型引入了视觉注意单元(VAN)来增强时空敏感性。该模型架构结合了用于空间特征提取的卷积编码器、用于注意力引导时间建模的VAN模块和用于序列生成的ConvLSTM解码器。这种协同设计提高了气流预测的准确性和可解释性。实验结果表明,VAN-ConvLSTM模型的SSIM得分为0.96,与CFD模拟结果具有较高的一致性。与基线方法相比,我们的模型减少了误差,同时提高了稳定性和空间保真度。消融研究进一步验证了VAN和ConvLSTM模块各自的贡献。通过三个代表性案例研究验证的结果证实,VAN-ConvLSTM在多个评估指标上优于最先进的方法,同时显著提高了计算效率。这表明了它作为传统CFD方法在转子气流预测场景中可靠和可扩展的替代方案的强大潜力。
{"title":"Airflow Field Prediction for Quadrotor UAVs Based on Spatiotemporal Prediction Network","authors":"Qiwei Guo,&nbsp;Zhijian Fan,&nbsp;Yu Tang,&nbsp;Mingwei Fang,&nbsp;Jiajun Zhuang,&nbsp;Xiaobing Chen,&nbsp;Chaojun Hou,&nbsp;Yong He","doi":"10.1155/int/3828807","DOIUrl":"https://doi.org/10.1155/int/3828807","url":null,"abstract":"<p>To address the limitations of traditional computational fluid dynamics (CFD) simulations, such as high computational cost, long processing times, and limited scalability, this study identifies the inefficiencies of existing data-driven prediction methods, which often lack spatial–temporal coordination mechanisms and fail to capture fine-grained dynamic features of UAV airflow fields. We propose a novel deep learning model, VAN-ConvLSTM, for rapid and accurate prediction of UAV downwash airflow. Unlike conventional ConvLSTM-based frameworks, which struggle with modeling long-range dependencies and detailed spatial variations, our model introduces a visual attention unit (VAN) to enhance spatiotemporal sensitivity. The model architecture combines a convolutional encoder for spatial feature extraction, a VAN module for attention-guided temporal modeling, and a ConvLSTM decoder for sequence generation. This synergistic design improves both the accuracy and interpretability of airflow prediction. Experimental results show that the VAN-ConvLSTM model achieves an SSIM score of 0.96, demonstrating high consistency with CFD simulations. Compared to baseline methods, our model reduces error while improving stability and spatial fidelity. Ablation studies further validate the individual contributions of VAN and ConvLSTM modules. The results, verified through three representative case studies, confirm that VAN-ConvLSTM outperforms state-of-the-art approaches across multiple evaluation metrics, while offering significantly enhanced computational efficiency. This demonstrates its strong potential as a reliable and scalable alternative to traditional CFD methods in rotor airflow prediction scenarios.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":"2025 1","pages":""},"PeriodicalIF":3.7,"publicationDate":"2025-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/int/3828807","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145272084","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Hybrid Deep Learning Model for E-Commerce Recommendations: Sentiment Analysis With Autoencoders and Generative Adversarial Networks 电子商务推荐的混合深度学习模型:使用自动编码器和生成对抗网络的情感分析
IF 3.7 2区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-10-02 DOI: 10.1155/int/3852068
Mohammad Yarjanli, Neda Mahdinasab

In e-commerce, customer reviews wield significant influence over business strategies. This study proposes a robust sentiment analysis (SA) model tailored to e-commerce recommendations. It aims to address the key limitations of existing methods, including challenges in generalizability, feature extraction, class imbalance, and hyperparameter tuning. Our process uses an autoencoder (AE) to extract key features from the input sentence. We employ DistilBERT for word embedding, which performs faster than the standard BERT model (bidirectional encoder representations from transformers). The proposed architecture integrates an AE with a transductive long short-term memory (TLSTM) unit, which is trained with a modified generative adversarial network (GAN). TLSTM leverages transductive learning to emphasize training samples that closely resemble those in the test distribution, enhancing the flexibility and predictive accuracy of the model. Within the GAN, the generator is designed to exclude gradients from dominant batch instances, encouraging greater output diversity and generalization. Once the AE is trained, its compressed feature representations are fed into a multilayer perceptron (MLP) classifier. To tackle class imbalance issues during classification, we implement a reinforcement learning (RL) mechanism. This strategy prioritizes the minority class by applying a reward mechanism to balance the classification outcomes. Moreover, we use the Bayesian optimization hyperband (BOHB) algorithm to fine-tune the hyperparameters of the model. Experimental results on the AIV, AA, and Yelp datasets demonstrate superior performance, with F-measure scores of 91.603%, 89.504%, and 90.397%, respectively. These outcomes validate the robustness of the model and its potential to significantly enhance recommendation quality in dynamic e-commerce environments.

在电子商务中,顾客评价对商业策略有着重要的影响。本研究提出了一个针对电子商务推荐的稳健情感分析(SA)模型。它旨在解决现有方法的主要局限性,包括泛化性、特征提取、类不平衡和超参数调优方面的挑战。我们的过程使用自动编码器(AE)从输入句子中提取关键特征。我们使用蒸馏器进行词嵌入,它比标准的BERT模型(来自转换器的双向编码器表示)执行得更快。该架构集成了AE和转导长短期记忆(TLSTM)单元,该单元使用改进的生成对抗网络(GAN)进行训练。TLSTM利用转换学习来强调训练样本与测试分布中的样本非常相似,从而增强了模型的灵活性和预测准确性。在GAN中,生成器被设计为从主导批处理实例中排除梯度,从而鼓励更大的输出多样性和泛化。一旦AE被训练,它的压缩特征表示被输入到多层感知器(MLP)分类器中。为了解决分类过程中的类不平衡问题,我们实现了一种强化学习(RL)机制。该策略通过应用奖励机制来平衡分类结果,从而优先考虑少数类。此外,我们使用贝叶斯优化超带(BOHB)算法对模型的超参数进行微调。在AIV、AA和Yelp数据集上的实验结果显示了优异的性能,F-measure得分分别为91.603%、89.504%和90.397%。这些结果验证了模型的鲁棒性及其在动态电子商务环境中显著提高推荐质量的潜力。
{"title":"A Hybrid Deep Learning Model for E-Commerce Recommendations: Sentiment Analysis With Autoencoders and Generative Adversarial Networks","authors":"Mohammad Yarjanli,&nbsp;Neda Mahdinasab","doi":"10.1155/int/3852068","DOIUrl":"https://doi.org/10.1155/int/3852068","url":null,"abstract":"<p>In e-commerce, customer reviews wield significant influence over business strategies. This study proposes a robust sentiment analysis (SA) model tailored to e-commerce recommendations. It aims to address the key limitations of existing methods, including challenges in generalizability, feature extraction, class imbalance, and hyperparameter tuning. Our process uses an autoencoder (AE) to extract key features from the input sentence. We employ DistilBERT for word embedding, which performs faster than the standard BERT model (bidirectional encoder representations from transformers). The proposed architecture integrates an AE with a transductive long short-term memory (TLSTM) unit, which is trained with a modified generative adversarial network (GAN). TLSTM leverages transductive learning to emphasize training samples that closely resemble those in the test distribution, enhancing the flexibility and predictive accuracy of the model. Within the GAN, the generator is designed to exclude gradients from dominant batch instances, encouraging greater output diversity and generalization. Once the AE is trained, its compressed feature representations are fed into a multilayer perceptron (MLP) classifier. To tackle class imbalance issues during classification, we implement a reinforcement learning (RL) mechanism. This strategy prioritizes the minority class by applying a reward mechanism to balance the classification outcomes. Moreover, we use the Bayesian optimization hyperband (BOHB) algorithm to fine-tune the hyperparameters of the model. Experimental results on the AIV, AA, and Yelp datasets demonstrate superior performance, with F-measure scores of 91.603%, 89.504%, and 90.397%, respectively. These outcomes validate the robustness of the model and its potential to significantly enhance recommendation quality in dynamic e-commerce environments.</p>","PeriodicalId":14089,"journal":{"name":"International Journal of Intelligent Systems","volume":"2025 1","pages":""},"PeriodicalIF":3.7,"publicationDate":"2025-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1155/int/3852068","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145223880","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
International Journal of Intelligent Systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1