首页 > 最新文献

Expert Systems with Applications最新文献

英文 中文
EduYOLO: A classroom behavior recognition framework based on high-resolution feature attention fusion 基于高分辨率特征注意力融合的课堂行为识别框架
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-28 DOI: 10.1016/j.eswa.2026.131370
Jun Yu , Shengzhao Li , Huijie Liu , Qi Liu , Chang Tan , Zhiyuan Cheng , Jinze Wu
Process-oriented evaluation of classroom instruction is vital for assessing student learning quality and teacher instructional effectiveness. In recent years, object detection-based methods have been widely applied to classroom behavior recognition, yet they struggle with the unique challenges of real-world classrooms: small student objects due to distant cameras, frequent occlusions, and subtle, fine-grained behaviors like “Gaze” and “Turn”. To address these issues, this paper proposes EduYOLO, a novel classroom Behavior Recognition Framework Based on High-Resolution Feature Attention Fusion (HRFAF) module, which is architected around three dedicated components: a Key Region Perception Backbone that enhances the representation of crucial action regions, a Fine-Grained Action Modeling Neck that captures intricate behavioral patterns, and a High-Resolution Prediction Head that significantly improves small object detection. This holistic design synergistically strengthens the capability of model to perceive local details and complex postures. Furthermore, we design the FM-IoU loss function for bounding box regression, integrating focal weighting and multi-point distance constraints to enhance localization stability. Extensive experiments conducted on the self-constructed CSCB-Dataset and SCB-Data3 demonstrate that the proposed EduYOLO achieves superior detection accuracy and generalization performance compared with existing methods, confirming its effectiveness and robustness for real-world classroom behavior recognition tasks. To support reproducible research, our code is available at: https://github.com/datadance/EduYolo.
以过程为导向的课堂教学评价是评价学生学习质量和教师教学效果的重要手段。近年来,基于对象检测的方法已被广泛应用于课堂行为识别,但它们面临着现实世界课堂的独特挑战:由于远距离摄像机,学生对象很小,频繁遮挡,以及“凝视”和“转向”等微妙的细粒度行为。为了解决这些问题,本文提出了EduYOLO,一个基于高分辨率特征注意融合(HRFAF)模块的新型课堂行为识别框架,该框架围绕三个专用组件构建:增强关键动作区域表示的关键区域感知骨干,捕获复杂行为模式的细粒度动作建模颈,以及显著提高小目标检测的高分辨率预测头。这种整体设计协同增强了模型感知局部细节和复杂姿态的能力。此外,我们设计了FM-IoU损失函数用于边界盒回归,结合焦点加权和多点距离约束来增强定位稳定性。在自建的CSCB-Dataset和SCB-Data3上进行的大量实验表明,与现有方法相比,本文提出的EduYOLO具有更好的检测精度和泛化性能,验证了其在现实课堂行为识别任务中的有效性和鲁棒性。为了支持可重复的研究,我们的代码可在:https://github.com/datadance/EduYolo。
{"title":"EduYOLO: A classroom behavior recognition framework based on high-resolution feature attention fusion","authors":"Jun Yu ,&nbsp;Shengzhao Li ,&nbsp;Huijie Liu ,&nbsp;Qi Liu ,&nbsp;Chang Tan ,&nbsp;Zhiyuan Cheng ,&nbsp;Jinze Wu","doi":"10.1016/j.eswa.2026.131370","DOIUrl":"10.1016/j.eswa.2026.131370","url":null,"abstract":"<div><div>Process-oriented evaluation of classroom instruction is vital for assessing student learning quality and teacher instructional effectiveness. In recent years, object detection-based methods have been widely applied to classroom behavior recognition, yet they struggle with the unique challenges of real-world classrooms: small student objects due to distant cameras, frequent occlusions, and subtle, fine-grained behaviors like “Gaze” and “Turn”. To address these issues, this paper proposes EduYOLO, a novel classroom Behavior Recognition Framework Based on High-Resolution Feature Attention Fusion (HRFAF) module, which is architected around three dedicated components: a Key Region Perception Backbone that enhances the representation of crucial action regions, a Fine-Grained Action Modeling Neck that captures intricate behavioral patterns, and a High-Resolution Prediction Head that significantly improves small object detection. This holistic design synergistically strengthens the capability of model to perceive local details and complex postures. Furthermore, we design the FM-IoU loss function for bounding box regression, integrating focal weighting and multi-point distance constraints to enhance localization stability. Extensive experiments conducted on the self-constructed CSCB-Dataset and SCB-Data3 demonstrate that the proposed EduYOLO achieves superior detection accuracy and generalization performance compared with existing methods, confirming its effectiveness and robustness for real-world classroom behavior recognition tasks. To support reproducible research, our code is available at: <span><span>https://github.com/datadance/EduYolo</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"311 ","pages":"Article 131370"},"PeriodicalIF":7.5,"publicationDate":"2026-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146081702","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adaptive compressed domain video encryption 自适应压缩域视频加密
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-28 DOI: 10.1016/j.eswa.2026.131360
mohammad ghasempour , yuan yuan , hadi amirpour , hongjie he , christian timmerer
With the ever-increasing amount of digital video content, efficient encryption is crucial to protect visual content across diverse platforms. Existing methods often incur excessive bitrate overhead due to content variability. Furthermore, since most videos are already compressed, encryption in the compressed domain is essential to avoid processing overhead and re-compression quality loss. However, achieving both format compliance and compression efficiency while ensuring that the decoded content remains unrecognizable is challenging in the compressed domain, since only limited information is available without full decoding. This paper proposes an adaptive compressed domain video encryption (ACDC) method that dynamically adjusts the encryption strategy according to content characteristics. Two tunable parameters derived from the bitstream information enable adaptation to various application requirements. An adaptive syntax integrity method is employed to produce format-compliant bitstreams without full decoding. Experimental results show that ACDC reduces bitrate overhead by 48.2% and achieves a 31-fold speedup in encryption time compared to the latest state of the art, while producing visually unrecognizable outputs.
随着数字视频内容的不断增加,有效的加密对于保护不同平台上的视频内容至关重要。由于内容的可变性,现有的方法经常导致过多的比特率开销。此外,由于大多数视频已经被压缩,因此在压缩域中进行加密对于避免处理开销和重新压缩质量损失至关重要。然而,在确保解码后的内容无法识别的同时,实现格式遵从性和压缩效率在压缩领域是具有挑战性的,因为没有完全解码,只有有限的信息可用。提出了一种根据内容特征动态调整加密策略的自适应压缩域视频加密方法。从比特流信息派生的两个可调参数能够适应各种应用需求。采用自适应语法完整性方法产生符合格式的比特流,无需完全解码。实验结果表明,与最新技术相比,ACDC减少了48.2%的比特率开销,并在加密时间上实现了31倍的加速,同时产生视觉上无法识别的输出。
{"title":"Adaptive compressed domain video encryption","authors":"mohammad ghasempour ,&nbsp;yuan yuan ,&nbsp;hadi amirpour ,&nbsp;hongjie he ,&nbsp;christian timmerer","doi":"10.1016/j.eswa.2026.131360","DOIUrl":"10.1016/j.eswa.2026.131360","url":null,"abstract":"<div><div>With the ever-increasing amount of digital video content, efficient encryption is crucial to protect visual content across diverse platforms. Existing methods often incur excessive bitrate overhead due to content variability. Furthermore, since most videos are already compressed, encryption in the compressed domain is essential to avoid processing overhead and re-compression quality loss. However, achieving both format compliance and compression efficiency while ensuring that the decoded content remains unrecognizable is challenging in the compressed domain, since only limited information is available without full decoding. This paper proposes an adaptive compressed domain video encryption (ACDC) method that dynamically adjusts the encryption strategy according to content characteristics. Two tunable parameters derived from the bitstream information enable adaptation to various application requirements. An adaptive syntax integrity method is employed to produce format-compliant bitstreams without full decoding. Experimental results show that ACDC reduces bitrate overhead by 48.2% and achieves a 31-fold speedup in encryption time compared to the latest state of the art, while producing visually unrecognizable outputs.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"311 ","pages":"Article 131360"},"PeriodicalIF":7.5,"publicationDate":"2026-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146081694","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dual-space intervention for mitigating bias in robust visual question answering 双空间干预在稳健视觉问答中的缓解偏差
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-27 DOI: 10.1016/j.eswa.2026.131346
Runmin Wang , Xingdong Song , Zukun Wan , Han Xu , Congzhen Yu , Tianming Ma , Yajun Ding , Shengyou Qian
Visual Question Answering (VQA) evaluates the visual-textual reasoning capabilities of intelligent agents. However, existing methods are often susceptible to various biases. In particular, language bias leads models to rely on spurious question-answer correlations as shortcut solutions, while distribution bias caused by dataset imbalance encourages models to overfit head classes and overlook tail classes. To address these long-standing challenges, we propose a Dual-Space Intervention (DSI) approach that tackles these two biases from a unified yet complementary perspective. Two key innovations are included in our work: (1) In the input space, we adopt an adaptive question shuffling strategy to alleviate language bias by adjusting perturbation strength according to question bias, ensuring models develop a deeper understanding of the problem context, rather than relying on spurious word-answer correlations; (2) In the output space, we propose a novel label rebalancing mechanism that moderates head-class dominance based on long-tailed statistics, improving robustness to distribution bias. This approach reduces the disproportionately high variance in head logits relative to tail logits, improving tail class recognition accuracy. Extensive experiments on four benchmarks (VQA-CP v1, VQA-CP v2, VQA-CE, and SLAKE-CP) demonstrate our method’s superiority, with VQA-CP v1 and SLAKE-CP achieving state-of-the-art performance at 63.14% and 37.61% respectively. The code will be released at https://github.com/songxdr3/DSI.
视觉问答(VQA)评估智能代理的视觉文本推理能力。然而,现有的方法往往容易受到各种偏差的影响。特别是,语言偏差导致模型依赖虚假的问答相关性作为捷径解决方案,而由数据集不平衡引起的分布偏差则导致模型过度拟合头部类而忽略尾部类。为了解决这些长期存在的挑战,我们提出了一种双空间干预(DSI)方法,从统一但互补的角度解决这两种偏见。我们的工作包括两个关键创新:(1)在输入空间中,我们采用自适应问题洗牌策略,通过根据问题偏差调整扰动强度来减轻语言偏差,确保模型对问题上下文有更深入的理解,而不是依赖虚假的词-答案相关性;(2)在输出空间中,我们提出了一种新的标签再平衡机制,该机制调节了基于长尾统计的头类优势,提高了对分布偏差的鲁棒性。该方法减少了头部logits相对于尾部logits的不成比例的高方差,提高了尾部分类识别的准确性。在四个基准测试(VQA-CP v1、VQA-CP v2、VQA-CE和slack - cp)上进行的大量实验证明了我们的方法的优越性,VQA-CP v1和slack - cp的性能分别达到了63.14%和37.61%。代码将在https://github.com/songxdr3/DSI上发布。
{"title":"Dual-space intervention for mitigating bias in robust visual question answering","authors":"Runmin Wang ,&nbsp;Xingdong Song ,&nbsp;Zukun Wan ,&nbsp;Han Xu ,&nbsp;Congzhen Yu ,&nbsp;Tianming Ma ,&nbsp;Yajun Ding ,&nbsp;Shengyou Qian","doi":"10.1016/j.eswa.2026.131346","DOIUrl":"10.1016/j.eswa.2026.131346","url":null,"abstract":"<div><div>Visual Question Answering (VQA) evaluates the visual-textual reasoning capabilities of intelligent agents. However, existing methods are often susceptible to various biases. In particular, language bias leads models to rely on spurious question-answer correlations as shortcut solutions, while distribution bias caused by dataset imbalance encourages models to overfit head classes and overlook tail classes. To address these long-standing challenges, we propose a Dual-Space Intervention (DSI) approach that tackles these two biases from a unified yet complementary perspective. Two key innovations are included in our work: (1) In the input space, we adopt an adaptive question shuffling strategy to alleviate language bias by adjusting perturbation strength according to question bias, ensuring models develop a deeper understanding of the problem context, rather than relying on spurious word-answer correlations; (2) In the output space, we propose a novel label rebalancing mechanism that moderates head-class dominance based on long-tailed statistics, improving robustness to distribution bias. This approach reduces the disproportionately high variance in head logits relative to tail logits, improving tail class recognition accuracy. Extensive experiments on four benchmarks (VQA-CP v1, VQA-CP v2, VQA-CE, and SLAKE-CP) demonstrate our method’s superiority, with VQA-CP v1 and SLAKE-CP achieving state-of-the-art performance at 63.14% and 37.61% respectively. The code will be released at <span><span>https://github.com/songxdr3/DSI</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"311 ","pages":"Article 131346"},"PeriodicalIF":7.5,"publicationDate":"2026-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146081699","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PromptMed: Prompt-driven semi-supervised medical image classification with class-balanced consistency and contrastive learning PromptMed:基于类平衡一致性和对比学习的提示驱动的半监督医学图像分类
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-27 DOI: 10.1016/j.eswa.2026.131345
Shuai Wang , Ruina Mao
Existing pre-trained foundation models have demonstrated strong generalization and transfer capabilities across diverse domains. However, directly fine-tuning all parameters of pre-trained models for medical image classification requires massive labeled data, making it inefficient and resource-intensive. To address this, we aim to leverage semi-supervised learning (SSL) techniques to reduce the need for massive annotations for efficient fine-tuning. In this context, we propose PromptMed, a parameter-efficient framework for semi-supervised medical image classification, which consists of three key components: Prompt Noise Injection (PNI), Class-Balanced Prompt Adaptation (CBPA), and Contrastive Feature Consistency (CFC). Specifically, we introduce PNI to enhance the robustness of prompt representations and enable effective prompt-based consistency training. PNI applies Gaussian noise of varying strengths to prompt tokens, serving as a form of representation-level augmentation. To mitigate class imbalance, we design a CBPA mechanism that dynamically assigns higher noise to minority classes based on recent class distributions, encouraging better representation learning for hard categories. Additionally, to promote feature consistency, especially for minority and visually similar classes, we incorporate a CFC on the vision branch features. These three components work synergistically to enable PromptMed to achieve robust, balanced, and highly discriminative medical image classification with significantly reduced trainable parameters. Extensive experiments on multiple medical image datasets demonstrate that our approach achieves state-of-the-art performance while significantly reducing the number of trainable parameters.
现有的预训练基础模型已经证明了在不同领域的强大泛化和迁移能力。然而,直接微调预训练模型的所有参数进行医学图像分类需要大量的标记数据,效率低下且资源密集。为了解决这个问题,我们的目标是利用半监督学习(SSL)技术来减少对大量注释的需求,从而实现高效的微调。在此背景下,我们提出了一种参数高效的半监督医学图像分类框架PromptMed,它由三个关键部分组成:提示噪声注入(PNI)、类别平衡提示适应(CBPA)和对比特征一致性(CFC)。具体来说,我们引入PNI来增强提示表示的鲁棒性,并实现有效的基于提示的一致性训练。PNI将不同强度的高斯噪声应用于提示符号,作为表示级增强的一种形式。为了缓解类不平衡,我们设计了一个CBPA机制,该机制基于最近的类分布动态地为少数类分配更高的噪声,鼓励对硬类别进行更好的表征学习。此外,为了促进特征一致性,特别是对于少数类和视觉上相似的类,我们在视觉分支特征上合并了CFC。这三个组件协同工作,使PromptMed能够实现鲁棒、平衡和高度判别的医学图像分类,显著减少可训练参数。在多个医学图像数据集上进行的大量实验表明,我们的方法在显著减少可训练参数数量的同时实现了最先进的性能。
{"title":"PromptMed: Prompt-driven semi-supervised medical image classification with class-balanced consistency and contrastive learning","authors":"Shuai Wang ,&nbsp;Ruina Mao","doi":"10.1016/j.eswa.2026.131345","DOIUrl":"10.1016/j.eswa.2026.131345","url":null,"abstract":"<div><div>Existing pre-trained foundation models have demonstrated strong generalization and transfer capabilities across diverse domains. However, directly fine-tuning all parameters of pre-trained models for medical image classification requires massive labeled data, making it inefficient and resource-intensive. To address this, we aim to leverage semi-supervised learning (SSL) techniques to reduce the need for massive annotations for efficient fine-tuning. In this context, we propose PromptMed, a parameter-efficient framework for semi-supervised medical image classification, which consists of three key components: Prompt Noise Injection (PNI), Class-Balanced Prompt Adaptation (CBPA), and Contrastive Feature Consistency (CFC). Specifically, we introduce PNI to enhance the robustness of prompt representations and enable effective prompt-based consistency training. PNI applies Gaussian noise of varying strengths to prompt tokens, serving as a form of representation-level augmentation. To mitigate class imbalance, we design a CBPA mechanism that dynamically assigns higher noise to minority classes based on recent class distributions, encouraging better representation learning for hard categories. Additionally, to promote feature consistency, especially for minority and visually similar classes, we incorporate a CFC on the vision branch features. These three components work synergistically to enable PromptMed to achieve robust, balanced, and highly discriminative medical image classification with significantly reduced trainable parameters. Extensive experiments on multiple medical image datasets demonstrate that our approach achieves state-of-the-art performance while significantly reducing the number of trainable parameters.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"310 ","pages":"Article 131345"},"PeriodicalIF":7.5,"publicationDate":"2026-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146080879","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reinforcement learning-driven service allocation via potential game modeling in aerial edge computing 航空边缘计算中基于潜在博弈建模的强化学习驱动服务分配
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-27 DOI: 10.1016/j.eswa.2026.131339
Xi Liu , Jun Liu
Aerial Edge Computing has recently received significant research attention due to its remarkable potential for dynamically deploying computing power. We address the problem of service scheduling in aerial edge computing, in which uncrewed aerial vehicles (UAVs) are deployed to mission areas to provide sensor data collection and analysis services. Two types of sensing tasks are considered: single-zone service and multiple-zone service. The first category refers to UAVs that remain in a single zone. The second category refers to a UAV traversing several areas to collect sensing data to meet user requirements. The objective is to maximize the overall utility of the UAVs. The service scheduling problem is formulated as an ordinal potential game to achieve a stable system state. A distributed algorithm based on reinforcement learning is proposed. An improved search-state formulation is introduced to accelerate convergence and enhance search efficiency. The proposed scheduling algorithm is demonstrated to achieve a Nash equilibrium in where no UAV can improve its utility by unilaterally deviating. Additionally, the approximation performance of the proposed scheduling algorithm and the game’s price of anarchy are presented. The results indicate that the proposed algorithm provides higher utility to UAVs and adapts effectively to diverse distribution environments.
空中边缘计算由于其动态部署计算能力的巨大潜力,最近受到了重大的研究关注。我们解决了空中边缘计算中的服务调度问题,其中无人驾驶飞行器(uav)部署到任务区域,提供传感器数据收集和分析服务。考虑了两种类型的传感任务:单区域服务和多区域服务。第一类是指停留在单一区域的无人机。第二类是指无人机穿越多个区域收集传感数据以满足用户需求。目标是使无人机的整体效用最大化。将服务调度问题表述为一个有序的潜在博弈,以达到系统状态的稳定。提出了一种基于强化学习的分布式算法。为了加快收敛速度和提高搜索效率,引入了一种改进的搜索状态公式。结果表明,所提出的调度算法能够达到纳什均衡状态,在这种状态下,任何无人机都不能通过单方面偏离来提高其效用。此外,给出了所提调度算法的近似性能和无政府状态下的博弈代价。结果表明,该算法对无人机具有较高的实用性,能有效适应不同的分布环境。
{"title":"Reinforcement learning-driven service allocation via potential game modeling in aerial edge computing","authors":"Xi Liu ,&nbsp;Jun Liu","doi":"10.1016/j.eswa.2026.131339","DOIUrl":"10.1016/j.eswa.2026.131339","url":null,"abstract":"<div><div>Aerial Edge Computing has recently received significant research attention due to its remarkable potential for dynamically deploying computing power. We address the problem of service scheduling in aerial edge computing, in which uncrewed aerial vehicles (UAVs) are deployed to mission areas to provide sensor data collection and analysis services. Two types of sensing tasks are considered: single-zone service and multiple-zone service. The first category refers to UAVs that remain in a single zone. The second category refers to a UAV traversing several areas to collect sensing data to meet user requirements. The objective is to maximize the overall utility of the UAVs. The service scheduling problem is formulated as an ordinal potential game to achieve a stable system state. A distributed algorithm based on reinforcement learning is proposed. An improved search-state formulation is introduced to accelerate convergence and enhance search efficiency. The proposed scheduling algorithm is demonstrated to achieve a Nash equilibrium in where no UAV can improve its utility by unilaterally deviating. Additionally, the approximation performance of the proposed scheduling algorithm and the game’s price of anarchy are presented. The results indicate that the proposed algorithm provides higher utility to UAVs and adapts effectively to diverse distribution environments.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"310 ","pages":"Article 131339"},"PeriodicalIF":7.5,"publicationDate":"2026-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146080971","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Regression test optimization for software of the cellular network base stations: A language-based approach 蜂窝网络基站软件回归测试优化:一种基于语言的方法
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-27 DOI: 10.1016/j.eswa.2026.131225
Sebastian Zarębski , Krzysztof Rusek , Piotr Chołda
This paper introduces Linear Model of Latent Dirichlet Allocation (LMLDA), a novel methodology for software test optimization that directly addresses the gap between computationally-prohibitive large language models (LLMs) and semantically-shallow heuristics. Our primary contribution is a lightweight, interpretable, and cost-efficient model specifically designed for high-stakes industrial Continuous Integration and Continuous Development (CI/CD) environments where security and traceability are essential. The novelty of LMLDA lies in its integration of Latent Dirichlet Allocation (LDA) for the semantic analysis of code modifications and test content, with a classifier based on logistic regression concepts for the training phase, yet offering prediction capabilities that align with the computational simplicity of linear regression. This approach uniquely predicts the probability of test failure based on semantic interactions, enabling precise, bug-centric prioritization rather than relying on indirect diversity proxies. A large-scale industrial case study at NOKIA demonstrates LMLDA’s practical effectiveness, achieving an average 64% reduction in test suite size while maintaining 88% precision in bug detection and accelerating critical bug discovery by an average of 8 h per cycle.
本文介绍了潜在狄利克雷分配线性模型(LMLDA),这是一种用于软件测试优化的新方法,直接解决了计算禁止的大型语言模型(llm)和语义浅层启发式之间的差距。我们的主要贡献是一个轻量级的、可解释的、具有成本效益的模型,专门为高风险的工业持续集成和持续开发(CI/CD)环境设计,其中安全性和可追溯性是必不可少的。LMLDA的新颖之处在于它将用于代码修改和测试内容的语义分析的潜在狄利克雷分配(LDA)与训练阶段基于逻辑回归概念的分类器集成在一起,同时提供与线性回归的计算简单性相一致的预测能力。这种方法基于语义交互唯一地预测了测试失败的概率,实现了精确的、以bug为中心的优先级,而不是依赖于间接的多样性代理。诺基亚的一项大规模工业案例研究证明了LMLDA的实际有效性,在测试套件大小平均减少64%的同时,在bug检测方面保持88%的精度,并将关键bug发现速度平均提高8小时。
{"title":"Regression test optimization for software of the cellular network base stations: A language-based approach","authors":"Sebastian Zarębski ,&nbsp;Krzysztof Rusek ,&nbsp;Piotr Chołda","doi":"10.1016/j.eswa.2026.131225","DOIUrl":"10.1016/j.eswa.2026.131225","url":null,"abstract":"<div><div>This paper introduces Linear Model of Latent Dirichlet Allocation (LMLDA), a novel methodology for software test optimization that directly addresses the gap between computationally-prohibitive large language models (LLMs) and semantically-shallow heuristics. Our primary contribution is a lightweight, interpretable, and cost-efficient model specifically designed for high-stakes industrial Continuous Integration and Continuous Development (CI/CD) environments where security and traceability are essential. The novelty of LMLDA lies in its integration of Latent Dirichlet Allocation (LDA) for the semantic analysis of code modifications and test content, with a classifier based on logistic regression concepts for the training phase, yet offering prediction capabilities that align with the computational simplicity of linear regression. This approach uniquely predicts the probability of test failure based on semantic interactions, enabling precise, bug-centric prioritization rather than relying on indirect diversity proxies. A large-scale industrial case study at NOKIA demonstrates LMLDA’s practical effectiveness, achieving an average 64% reduction in test suite size while maintaining 88% precision in bug detection and accelerating critical bug discovery by an average of 8 h per cycle.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"311 ","pages":"Article 131225"},"PeriodicalIF":7.5,"publicationDate":"2026-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146081703","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Quantum modeling of the dynamic ride-sharing problem: Development of quantum benders decomposition methods 动态拼车问题的量子建模:量子弯曲子分解方法的发展
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-27 DOI: 10.1016/j.eswa.2026.131320
Erfan Amani Bani, Kourosh Eshghi
Mathematical modeling and the subsequent development of optimization algorithms for problems have been the core focus of operations research scientists. However, the challenges of solving complex models promptly have always sparked numerous innovations in this field. Quantum computing has been proposed as an alternative to binary computing for several decades. In recent years, operations researchers have paid special attention to applying and integrating this logic with optimization. Specifically, many quantum-based optimization algorithms have been developed; however, little attention has been given to modeling optimization problems using quantum variables. In this paper, a practical problem, the dynamic ride-sharing problem, is redefined and then modeled with the help of quantum variables. Based on quantum variables, the resulting model is fully compatible with quantum algorithms. Subsequently, quantum algorithms based on Benders’ decomposition have been developed. Despite the limitations of access to quantum computing hardware, from a theoretical perspective in terms of computational complexity and solving a simple example, the performance of the algorithms has been demonstrated.
数学建模和问题优化算法的后续发展一直是运筹学科学家关注的核心问题。然而,快速解决复杂模型的挑战一直激发了该领域的许多创新。几十年来,量子计算一直被提议作为二进制计算的替代方案。近年来,运筹学研究人员特别关注将这一逻辑与最优化相结合并加以应用。具体来说,已经开发了许多基于量子的优化算法;然而,很少有人关注使用量子变量的建模优化问题。本文对一个实际问题——动态拼车问题进行了重新定义,并用量子变量对其进行了建模。基于量子变量,得到的模型完全兼容量子算法。随后,基于Benders分解的量子算法得到了发展。尽管获得量子计算硬件的限制,但从计算复杂性和解决一个简单示例的理论角度来看,算法的性能已经得到了证明。
{"title":"Quantum modeling of the dynamic ride-sharing problem: Development of quantum benders decomposition methods","authors":"Erfan Amani Bani,&nbsp;Kourosh Eshghi","doi":"10.1016/j.eswa.2026.131320","DOIUrl":"10.1016/j.eswa.2026.131320","url":null,"abstract":"<div><div>Mathematical modeling and the subsequent development of optimization algorithms for problems have been the core focus of operations research scientists. However, the challenges of solving complex models promptly have always sparked numerous innovations in this field. Quantum computing has been proposed as an alternative to binary computing for several decades. In recent years, operations researchers have paid special attention to applying and integrating this logic with optimization. Specifically, many quantum-based optimization algorithms have been developed; however, little attention has been given to modeling optimization problems using quantum variables. In this paper, a practical problem, the dynamic ride-sharing problem, is redefined and then modeled with the help of quantum variables. Based on quantum variables, the resulting model is fully compatible with quantum algorithms. Subsequently, quantum algorithms based on Benders’ decomposition have been developed. Despite the limitations of access to quantum computing hardware, from a theoretical perspective in terms of computational complexity and solving a simple example, the performance of the algorithms has been demonstrated.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"310 ","pages":"Article 131320"},"PeriodicalIF":7.5,"publicationDate":"2026-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146080874","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DP-HM2F: Data-driven LoRA with dual-projection representation for heterogeneous multimodal federated fine-tuning DP-HM2F:具有双投影表示的数据驱动LoRA,用于异构多模态联邦微调
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-27 DOI: 10.1016/j.eswa.2026.131287
Yu Yang , Suxia Zhu , Guanglu Sun , Zian He , Xinyu Liu , Kai Zhou , Xiaojuan Cui
Federated learning (FL) enables privacy-preserving fine-tuning of multimodal large language models (MLLMs) on edge devices; however, the limited computational resources of edge clients, coupled with inherent modality and data heterogeneity across clients, pose major challenges for federated multimodal fine-tuning and lead to performance degradation. To tackle these issues, we propose DP-HM2F, a data-driven LoRA framework with a dual-projection representation mechanism for heterogeneous multimodal federated fine-tuning. Specifically, DP-HM2F establishes a dual-projection architecture that exploits a global feature pool and client-specific local feature pools, where the global pool encodes privacy-agnostic shared representations and each edge client dynamically maintains a local pool to refine heterogeneous multimodal representations. The architecture enables projection-based retrieval between the global and local pools to improve representation alignment, while introducing additional computational overhead on resource-constrained devices. To mitigate this limitation, DP-HM2F integrates a data-driven LoRA module that adaptively scales the number of trainable parameters based on local data, thereby alleviating computational constraints across heterogeneous clients. Furthermore, to address semantic conflicts induced by high-dimensional representation spaces during federated aggregation, we introduce a positive-vector collaborative optimization strategy to alleviate conflicting client updates. Extensive experimental results demonstrate that DP-HM2F, with only 7.05% of trainable parameters (a 0.3% reduction compared with conventional LoRA-based methods), achieves a performance improvement of 4.1 points under heterogeneous multimodal settings.
联邦学习(FL)能够在边缘设备上对多模态大型语言模型(mllm)进行隐私保护微调;然而,边缘客户端的计算资源有限,再加上客户端的固有模态和数据异构性,给联邦多模态微调带来了重大挑战,并导致性能下降。为了解决这些问题,我们提出了DP-HM2F,这是一个数据驱动的LoRA框架,具有用于异构多模态联邦微调的双投影表示机制。具体来说,DP-HM2F建立了一个双投影架构,利用全局特征池和特定于客户端的本地特征池,其中全局特征池编码与隐私无关的共享表示,每个边缘客户端动态维护一个本地池来优化异构多模态表示。该体系结构支持全局池和本地池之间基于投影的检索,以改进表示对齐,同时在资源受限的设备上引入额外的计算开销。为了缓解这一限制,DP-HM2F集成了一个数据驱动的LoRA模块,该模块可以根据本地数据自适应地扩展可训练参数的数量,从而减轻跨异构客户端的计算限制。此外,为了解决联邦聚合过程中由高维表示空间引起的语义冲突,我们引入了一种正向量协同优化策略来缓解客户机更新冲突。大量的实验结果表明,DP-HM2F仅使用7.05%的可训练参数(与传统基于lora的方法相比减少了0.3%),在异构多模式设置下实现了4.1点的性能提升。
{"title":"DP-HM2F: Data-driven LoRA with dual-projection representation for heterogeneous multimodal federated fine-tuning","authors":"Yu Yang ,&nbsp;Suxia Zhu ,&nbsp;Guanglu Sun ,&nbsp;Zian He ,&nbsp;Xinyu Liu ,&nbsp;Kai Zhou ,&nbsp;Xiaojuan Cui","doi":"10.1016/j.eswa.2026.131287","DOIUrl":"10.1016/j.eswa.2026.131287","url":null,"abstract":"<div><div>Federated learning (FL) enables privacy-preserving fine-tuning of multimodal large language models (MLLMs) on edge devices; however, the limited computational resources of edge clients, coupled with inherent modality and data heterogeneity across clients, pose major challenges for federated multimodal fine-tuning and lead to performance degradation. To tackle these issues, we propose DP-HM2F, a data-driven LoRA framework with a dual-projection representation mechanism for heterogeneous multimodal federated fine-tuning. Specifically, DP-HM2F establishes a dual-projection architecture that exploits a global feature pool and client-specific local feature pools, where the global pool encodes privacy-agnostic shared representations and each edge client dynamically maintains a local pool to refine heterogeneous multimodal representations. The architecture enables projection-based retrieval between the global and local pools to improve representation alignment, while introducing additional computational overhead on resource-constrained devices. To mitigate this limitation, DP-HM2F integrates a data-driven LoRA module that adaptively scales the number of trainable parameters based on local data, thereby alleviating computational constraints across heterogeneous clients. Furthermore, to address semantic conflicts induced by high-dimensional representation spaces during federated aggregation, we introduce a positive-vector collaborative optimization strategy to alleviate conflicting client updates. Extensive experimental results demonstrate that DP-HM2F, with only 7.05% of trainable parameters (a 0.3% reduction compared with conventional LoRA-based methods), achieves a performance improvement of 4.1 points under heterogeneous multimodal settings.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"311 ","pages":"Article 131287"},"PeriodicalIF":7.5,"publicationDate":"2026-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146081695","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
LLM-augmented causal-knowledge heterogeneous graph framework for interpretable reasoning and collaborative knowledge fusion in automotive chip production 基于llm的汽车芯片生产中可解释推理与协同知识融合的因果知识异构图框架
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-27 DOI: 10.1016/j.eswa.2026.131343
Shuangxue Liu , Hongbin Xie , Yuzhen Lei , Jiaxing Zhao , Xuan Song
Automotive chip production involves complex interdependencies across design, manufacturing, and supply-chain processes, posing significant challenges for interpretable and consistent reasoning. To address these challenges, this paper proposes a Causal-Knowledge Heterogeneous Graph (C-KHG) framework that integrates a domain knowledge graph with a text-grounded causal event graph, capturing linguistically asserted cause-and-effect relations extracted from expert-authored technical documents. Unlike statistical causal discovery or interventional causal modeling, the proposed causal event graph focuses on causally informed semantic reasoning, emphasizing directional consistency and interpretability aligned with domain expert knowledge. Built upon the unified heterogeneous graph, we design a three-stage reasoning pipeline consisting of intent classification, graph-based adaptive retrieval, and large language model (LLM) answer generation. To evaluate its effectiveness, experiments were conducted on three representative tasks: hybrid knowledge-causal reasoning, value-chain question answering, and pure causal reasoning. Specifically, we address three types of tasks: (1) hybrid knowledge-causal reasoning, which jointly involves entity-level knowledge retrieval and cause-and-effect analysis; (2) value-chain question answering, which focuses on structured domain knowledge across the automotive chip lifecycle; and (3) pure causal reasoning, which concentrates exclusively on cause-and-effect relations without requiring explicit entity attributes. Instead of relying on direct prompt-based inference, we construct the causal knowledge graph as an explicit intermediate structured layer, efficiently bootstrapped by LLMs, which serves as a persistent and updatable domain memory. This design improves reasoning stability and directional consistency while facilitating knowledge maintenance and iterative updates without model retraining. Experimental results on automotive chip value-chain question answering tasks demonstrate that the proposed framework consistently improves reasoning accuracy, causal directionality, and interpretability compared with vanilla LLMs and conventional knowledge-graph-based retrieval methods. In particular, for the causal-knowledge fusion task, the cosine similarity of GLM4-9B improved from 9.63 to 21.75. These findings highlight the effectiveness of structured graph-based reasoning scaffolds as intermediate representations for enhancing LLM-based reasoning in complex industrial domains. Code and data are made available on https://github.com/shuangxueliu/C-KHG.
汽车芯片生产涉及设计、制造和供应链流程之间复杂的相互依赖关系,对可解释和一致的推理提出了重大挑战。为了解决这些挑战,本文提出了一个因果知识异构图(C-KHG)框架,该框架将领域知识图与基于文本的因果事件图集成在一起,捕获从专家撰写的技术文档中提取的语言断言的因果关系。与统计因果发现或介入因果建模不同,所提出的因果事件图侧重于因果知情的语义推理,强调与领域专家知识一致的方向一致性和可解释性。在统一异构图的基础上,设计了意图分类、基于图的自适应检索和大语言模型(LLM)答案生成三阶段推理管道。为了评估其有效性,在三个代表性任务上进行了实验:混合知识-因果推理、价值链问答和纯因果推理。具体来说,我们解决了三种类型的任务:(1)混合知识-因果推理,它共同涉及实体级知识检索和因果分析;(2)价值链问答,重点关注汽车芯片生命周期的结构化领域知识;(3)纯粹因果推理,它只关注因果关系,不要求明确的实体属性。我们不再依赖直接的基于提示的推理,而是将因果知识图构建为一个明确的中间结构化层,由llm有效地引导,作为一个持久和可更新的领域存储器。这种设计提高了推理的稳定性和方向一致性,同时便于知识维护和迭代更新,无需模型再训练。在汽车芯片价值链问答任务上的实验结果表明,与传统的基于知识图的检索方法和传统的llm方法相比,所提出的框架在推理精度、因果方向性和可解释性方面都有显著提高。特别是在因果知识融合任务中,GLM4-9B的余弦相似度从9.63提高到21.75。这些发现强调了基于结构化图的推理脚手架作为增强复杂工业领域中基于法学硕士的推理的中间表示的有效性。代码和数据可在https://github.com/shuangxueliu/C-KHG上获得。
{"title":"LLM-augmented causal-knowledge heterogeneous graph framework for interpretable reasoning and collaborative knowledge fusion in automotive chip production","authors":"Shuangxue Liu ,&nbsp;Hongbin Xie ,&nbsp;Yuzhen Lei ,&nbsp;Jiaxing Zhao ,&nbsp;Xuan Song","doi":"10.1016/j.eswa.2026.131343","DOIUrl":"10.1016/j.eswa.2026.131343","url":null,"abstract":"<div><div>Automotive chip production involves complex interdependencies across design, manufacturing, and supply-chain processes, posing significant challenges for interpretable and consistent reasoning. To address these challenges, this paper proposes a Causal-Knowledge Heterogeneous Graph (C-KHG) framework that integrates a domain knowledge graph with a text-grounded causal event graph, capturing linguistically asserted cause-and-effect relations extracted from expert-authored technical documents. Unlike statistical causal discovery or interventional causal modeling, the proposed causal event graph focuses on causally informed semantic reasoning, emphasizing directional consistency and interpretability aligned with domain expert knowledge. Built upon the unified heterogeneous graph, we design a three-stage reasoning pipeline consisting of intent classification, graph-based adaptive retrieval, and large language model (LLM) answer generation. To evaluate its effectiveness, experiments were conducted on three representative tasks: hybrid knowledge-causal reasoning, value-chain question answering, and pure causal reasoning. Specifically, we address three types of tasks: (1) hybrid knowledge-causal reasoning, which jointly involves entity-level knowledge retrieval and cause-and-effect analysis; (2) value-chain question answering, which focuses on structured domain knowledge across the automotive chip lifecycle; and (3) pure causal reasoning, which concentrates exclusively on cause-and-effect relations without requiring explicit entity attributes. Instead of relying on direct prompt-based inference, we construct the causal knowledge graph as an explicit intermediate structured layer, efficiently bootstrapped by LLMs, which serves as a persistent and updatable domain memory. This design improves reasoning stability and directional consistency while facilitating knowledge maintenance and iterative updates without model retraining. Experimental results on automotive chip value-chain question answering tasks demonstrate that the proposed framework consistently improves reasoning accuracy, causal directionality, and interpretability compared with vanilla LLMs and conventional knowledge-graph-based retrieval methods. In particular, for the causal-knowledge fusion task, the cosine similarity of GLM4-9B improved from 9.63 to 21.75. These findings highlight the effectiveness of structured graph-based reasoning scaffolds as intermediate representations for enhancing LLM-based reasoning in complex industrial domains. Code and data are made available on <span><span>https://github.com/shuangxueliu/C-KHG</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"311 ","pages":"Article 131343"},"PeriodicalIF":7.5,"publicationDate":"2026-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146070881","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A quality prediction method for injection molding products based on multi-stage feature decoupling and fusion 一种基于多阶段特征解耦融合的注塑产品质量预测方法
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-27 DOI: 10.1016/j.eswa.2026.131372
Xianhao Zhang, Hongfei Zhan
Injection molding is an efficient method for the mass production of plastic products, but product quality is susceptible to variations in process conditions and parameters. To improve the real-time performance and accuracy of quality control, deep learning-based data-driven prediction methods have become a research focus. Nevertheless, existing injection molding quality prediction methods tend to prematurely couple variable channels and still face limitations in information fusion and model efficiency. Therefore, this paper combines multi-source data and proposes a quality prediction method based on Multi-Stage Feature Decoupling and Fusion (MSFDF). To address the issue of premature coupling of multivariate features in injection molding, a Temporal and Channel Decoupling Based Multi-Scale Feature Extraction Module (TC-DMFE) is designed to extract multi-scale features while maintaining feature independence. In addition, to address the issue of inadequate integration of multi-scale information during the injection molding process, a Channel-wise Multi-scale Feature Fusion Module (CMFF) is proposed, which fully integrates multi-scale features through a channel by channel fusion strategy and enhances the model’s comprehensive understanding of injection molding process variables under multi-scale variation patterns. On this basis, a Deep Feature Guided Channel Attention Recoupling Module (DCAR) is further constructed to learn inter-channel dependencies and apply channel weighting to achieve more effective variable recoupling. The model proposed in this paper effectively reduces training time while maintaining prediction accuracy and possesses the ability to quickly adapt to injection molding production scenarios.
注射成型是塑料制品大批量生产的一种有效方法,但产品质量容易受到工艺条件和参数变化的影响。为了提高质量控制的实时性和准确性,基于深度学习的数据驱动预测方法已成为研究热点。然而,现有的注塑质量预测方法往往过早地耦合可变通道,在信息融合和模型效率方面仍然存在局限性。为此,本文结合多源数据,提出了一种基于多阶段特征解耦与融合(MSFDF)的质量预测方法。为了解决注射成型过程中多尺度特征过早耦合的问题,设计了一种基于时间和通道解耦的多尺度特征提取模块(TC-DMFE),在提取多尺度特征的同时保持特征独立性。此外,针对注射成型过程中多尺度信息集成不足的问题,提出了一种基于通道的多尺度特征融合模块(CMFF),该模块通过通道对通道的融合策略充分集成了多尺度特征,增强了模型对多尺度变化模式下注射成型过程变量的综合理解。在此基础上,进一步构建深度特征引导的信道注意重耦模块(DCAR),学习信道间依赖关系,并应用信道加权实现更有效的变量重耦。本文提出的模型在保持预测精度的同时有效地减少了训练时间,并具有快速适应注塑生产场景的能力。
{"title":"A quality prediction method for injection molding products based on multi-stage feature decoupling and fusion","authors":"Xianhao Zhang,&nbsp;Hongfei Zhan","doi":"10.1016/j.eswa.2026.131372","DOIUrl":"10.1016/j.eswa.2026.131372","url":null,"abstract":"<div><div>Injection molding is an efficient method for the mass production of plastic products, but product quality is susceptible to variations in process conditions and parameters. To improve the real-time performance and accuracy of quality control, deep learning-based data-driven prediction methods have become a research focus. Nevertheless, existing injection molding quality prediction methods tend to prematurely couple variable channels and still face limitations in information fusion and model efficiency. Therefore, this paper combines multi-source data and proposes a quality prediction method based on Multi-Stage Feature Decoupling and Fusion (MSFDF). To address the issue of premature coupling of multivariate features in injection molding, a Temporal and Channel Decoupling Based Multi-Scale Feature Extraction Module (TC-DMFE) is designed to extract multi-scale features while maintaining feature independence. In addition, to address the issue of inadequate integration of multi-scale information during the injection molding process, a Channel-wise Multi-scale Feature Fusion Module (CMFF) is proposed, which fully integrates multi-scale features through a channel by channel fusion strategy and enhances the model’s comprehensive understanding of injection molding process variables under multi-scale variation patterns. On this basis, a Deep Feature Guided Channel Attention Recoupling Module (DCAR) is further constructed to learn inter-channel dependencies and apply channel weighting to achieve more effective variable recoupling. The model proposed in this paper effectively reduces training time while maintaining prediction accuracy and possesses the ability to quickly adapt to injection molding production scenarios.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"310 ","pages":"Article 131372"},"PeriodicalIF":7.5,"publicationDate":"2026-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146080788","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Expert Systems with Applications
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1