首页 > 最新文献

Expert Systems with Applications最新文献

英文 中文
Quantum modeling of the dynamic ride-sharing problem: Development of quantum benders decomposition methods 动态拼车问题的量子建模:量子弯曲子分解方法的发展
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-27 DOI: 10.1016/j.eswa.2026.131320
Erfan Amani Bani, Kourosh Eshghi
Mathematical modeling and the subsequent development of optimization algorithms for problems have been the core focus of operations research scientists. However, the challenges of solving complex models promptly have always sparked numerous innovations in this field. Quantum computing has been proposed as an alternative to binary computing for several decades. In recent years, operations researchers have paid special attention to applying and integrating this logic with optimization. Specifically, many quantum-based optimization algorithms have been developed; however, little attention has been given to modeling optimization problems using quantum variables. In this paper, a practical problem, the dynamic ride-sharing problem, is redefined and then modeled with the help of quantum variables. Based on quantum variables, the resulting model is fully compatible with quantum algorithms. Subsequently, quantum algorithms based on Benders’ decomposition have been developed. Despite the limitations of access to quantum computing hardware, from a theoretical perspective in terms of computational complexity and solving a simple example, the performance of the algorithms has been demonstrated.
数学建模和问题优化算法的后续发展一直是运筹学科学家关注的核心问题。然而,快速解决复杂模型的挑战一直激发了该领域的许多创新。几十年来,量子计算一直被提议作为二进制计算的替代方案。近年来,运筹学研究人员特别关注将这一逻辑与最优化相结合并加以应用。具体来说,已经开发了许多基于量子的优化算法;然而,很少有人关注使用量子变量的建模优化问题。本文对一个实际问题——动态拼车问题进行了重新定义,并用量子变量对其进行了建模。基于量子变量,得到的模型完全兼容量子算法。随后,基于Benders分解的量子算法得到了发展。尽管获得量子计算硬件的限制,但从计算复杂性和解决一个简单示例的理论角度来看,算法的性能已经得到了证明。
{"title":"Quantum modeling of the dynamic ride-sharing problem: Development of quantum benders decomposition methods","authors":"Erfan Amani Bani,&nbsp;Kourosh Eshghi","doi":"10.1016/j.eswa.2026.131320","DOIUrl":"10.1016/j.eswa.2026.131320","url":null,"abstract":"<div><div>Mathematical modeling and the subsequent development of optimization algorithms for problems have been the core focus of operations research scientists. However, the challenges of solving complex models promptly have always sparked numerous innovations in this field. Quantum computing has been proposed as an alternative to binary computing for several decades. In recent years, operations researchers have paid special attention to applying and integrating this logic with optimization. Specifically, many quantum-based optimization algorithms have been developed; however, little attention has been given to modeling optimization problems using quantum variables. In this paper, a practical problem, the dynamic ride-sharing problem, is redefined and then modeled with the help of quantum variables. Based on quantum variables, the resulting model is fully compatible with quantum algorithms. Subsequently, quantum algorithms based on Benders’ decomposition have been developed. Despite the limitations of access to quantum computing hardware, from a theoretical perspective in terms of computational complexity and solving a simple example, the performance of the algorithms has been demonstrated.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"310 ","pages":"Article 131320"},"PeriodicalIF":7.5,"publicationDate":"2026-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146080874","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DP-HM2F: Data-driven LoRA with dual-projection representation for heterogeneous multimodal federated fine-tuning DP-HM2F:具有双投影表示的数据驱动LoRA,用于异构多模态联邦微调
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-27 DOI: 10.1016/j.eswa.2026.131287
Yu Yang , Suxia Zhu , Guanglu Sun , Zian He , Xinyu Liu , Kai Zhou , Xiaojuan Cui
Federated learning (FL) enables privacy-preserving fine-tuning of multimodal large language models (MLLMs) on edge devices; however, the limited computational resources of edge clients, coupled with inherent modality and data heterogeneity across clients, pose major challenges for federated multimodal fine-tuning and lead to performance degradation. To tackle these issues, we propose DP-HM2F, a data-driven LoRA framework with a dual-projection representation mechanism for heterogeneous multimodal federated fine-tuning. Specifically, DP-HM2F establishes a dual-projection architecture that exploits a global feature pool and client-specific local feature pools, where the global pool encodes privacy-agnostic shared representations and each edge client dynamically maintains a local pool to refine heterogeneous multimodal representations. The architecture enables projection-based retrieval between the global and local pools to improve representation alignment, while introducing additional computational overhead on resource-constrained devices. To mitigate this limitation, DP-HM2F integrates a data-driven LoRA module that adaptively scales the number of trainable parameters based on local data, thereby alleviating computational constraints across heterogeneous clients. Furthermore, to address semantic conflicts induced by high-dimensional representation spaces during federated aggregation, we introduce a positive-vector collaborative optimization strategy to alleviate conflicting client updates. Extensive experimental results demonstrate that DP-HM2F, with only 7.05% of trainable parameters (a 0.3% reduction compared with conventional LoRA-based methods), achieves a performance improvement of 4.1 points under heterogeneous multimodal settings.
联邦学习(FL)能够在边缘设备上对多模态大型语言模型(mllm)进行隐私保护微调;然而,边缘客户端的计算资源有限,再加上客户端的固有模态和数据异构性,给联邦多模态微调带来了重大挑战,并导致性能下降。为了解决这些问题,我们提出了DP-HM2F,这是一个数据驱动的LoRA框架,具有用于异构多模态联邦微调的双投影表示机制。具体来说,DP-HM2F建立了一个双投影架构,利用全局特征池和特定于客户端的本地特征池,其中全局特征池编码与隐私无关的共享表示,每个边缘客户端动态维护一个本地池来优化异构多模态表示。该体系结构支持全局池和本地池之间基于投影的检索,以改进表示对齐,同时在资源受限的设备上引入额外的计算开销。为了缓解这一限制,DP-HM2F集成了一个数据驱动的LoRA模块,该模块可以根据本地数据自适应地扩展可训练参数的数量,从而减轻跨异构客户端的计算限制。此外,为了解决联邦聚合过程中由高维表示空间引起的语义冲突,我们引入了一种正向量协同优化策略来缓解客户机更新冲突。大量的实验结果表明,DP-HM2F仅使用7.05%的可训练参数(与传统基于lora的方法相比减少了0.3%),在异构多模式设置下实现了4.1点的性能提升。
{"title":"DP-HM2F: Data-driven LoRA with dual-projection representation for heterogeneous multimodal federated fine-tuning","authors":"Yu Yang ,&nbsp;Suxia Zhu ,&nbsp;Guanglu Sun ,&nbsp;Zian He ,&nbsp;Xinyu Liu ,&nbsp;Kai Zhou ,&nbsp;Xiaojuan Cui","doi":"10.1016/j.eswa.2026.131287","DOIUrl":"10.1016/j.eswa.2026.131287","url":null,"abstract":"<div><div>Federated learning (FL) enables privacy-preserving fine-tuning of multimodal large language models (MLLMs) on edge devices; however, the limited computational resources of edge clients, coupled with inherent modality and data heterogeneity across clients, pose major challenges for federated multimodal fine-tuning and lead to performance degradation. To tackle these issues, we propose DP-HM2F, a data-driven LoRA framework with a dual-projection representation mechanism for heterogeneous multimodal federated fine-tuning. Specifically, DP-HM2F establishes a dual-projection architecture that exploits a global feature pool and client-specific local feature pools, where the global pool encodes privacy-agnostic shared representations and each edge client dynamically maintains a local pool to refine heterogeneous multimodal representations. The architecture enables projection-based retrieval between the global and local pools to improve representation alignment, while introducing additional computational overhead on resource-constrained devices. To mitigate this limitation, DP-HM2F integrates a data-driven LoRA module that adaptively scales the number of trainable parameters based on local data, thereby alleviating computational constraints across heterogeneous clients. Furthermore, to address semantic conflicts induced by high-dimensional representation spaces during federated aggregation, we introduce a positive-vector collaborative optimization strategy to alleviate conflicting client updates. Extensive experimental results demonstrate that DP-HM2F, with only 7.05% of trainable parameters (a 0.3% reduction compared with conventional LoRA-based methods), achieves a performance improvement of 4.1 points under heterogeneous multimodal settings.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"311 ","pages":"Article 131287"},"PeriodicalIF":7.5,"publicationDate":"2026-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146081695","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
LLM-augmented causal-knowledge heterogeneous graph framework for interpretable reasoning and collaborative knowledge fusion in automotive chip production 基于llm的汽车芯片生产中可解释推理与协同知识融合的因果知识异构图框架
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-27 DOI: 10.1016/j.eswa.2026.131343
Shuangxue Liu , Hongbin Xie , Yuzhen Lei , Jiaxing Zhao , Xuan Song
Automotive chip production involves complex interdependencies across design, manufacturing, and supply-chain processes, posing significant challenges for interpretable and consistent reasoning. To address these challenges, this paper proposes a Causal-Knowledge Heterogeneous Graph (C-KHG) framework that integrates a domain knowledge graph with a text-grounded causal event graph, capturing linguistically asserted cause-and-effect relations extracted from expert-authored technical documents. Unlike statistical causal discovery or interventional causal modeling, the proposed causal event graph focuses on causally informed semantic reasoning, emphasizing directional consistency and interpretability aligned with domain expert knowledge. Built upon the unified heterogeneous graph, we design a three-stage reasoning pipeline consisting of intent classification, graph-based adaptive retrieval, and large language model (LLM) answer generation. To evaluate its effectiveness, experiments were conducted on three representative tasks: hybrid knowledge-causal reasoning, value-chain question answering, and pure causal reasoning. Specifically, we address three types of tasks: (1) hybrid knowledge-causal reasoning, which jointly involves entity-level knowledge retrieval and cause-and-effect analysis; (2) value-chain question answering, which focuses on structured domain knowledge across the automotive chip lifecycle; and (3) pure causal reasoning, which concentrates exclusively on cause-and-effect relations without requiring explicit entity attributes. Instead of relying on direct prompt-based inference, we construct the causal knowledge graph as an explicit intermediate structured layer, efficiently bootstrapped by LLMs, which serves as a persistent and updatable domain memory. This design improves reasoning stability and directional consistency while facilitating knowledge maintenance and iterative updates without model retraining. Experimental results on automotive chip value-chain question answering tasks demonstrate that the proposed framework consistently improves reasoning accuracy, causal directionality, and interpretability compared with vanilla LLMs and conventional knowledge-graph-based retrieval methods. In particular, for the causal-knowledge fusion task, the cosine similarity of GLM4-9B improved from 9.63 to 21.75. These findings highlight the effectiveness of structured graph-based reasoning scaffolds as intermediate representations for enhancing LLM-based reasoning in complex industrial domains. Code and data are made available on https://github.com/shuangxueliu/C-KHG.
汽车芯片生产涉及设计、制造和供应链流程之间复杂的相互依赖关系,对可解释和一致的推理提出了重大挑战。为了解决这些挑战,本文提出了一个因果知识异构图(C-KHG)框架,该框架将领域知识图与基于文本的因果事件图集成在一起,捕获从专家撰写的技术文档中提取的语言断言的因果关系。与统计因果发现或介入因果建模不同,所提出的因果事件图侧重于因果知情的语义推理,强调与领域专家知识一致的方向一致性和可解释性。在统一异构图的基础上,设计了意图分类、基于图的自适应检索和大语言模型(LLM)答案生成三阶段推理管道。为了评估其有效性,在三个代表性任务上进行了实验:混合知识-因果推理、价值链问答和纯因果推理。具体来说,我们解决了三种类型的任务:(1)混合知识-因果推理,它共同涉及实体级知识检索和因果分析;(2)价值链问答,重点关注汽车芯片生命周期的结构化领域知识;(3)纯粹因果推理,它只关注因果关系,不要求明确的实体属性。我们不再依赖直接的基于提示的推理,而是将因果知识图构建为一个明确的中间结构化层,由llm有效地引导,作为一个持久和可更新的领域存储器。这种设计提高了推理的稳定性和方向一致性,同时便于知识维护和迭代更新,无需模型再训练。在汽车芯片价值链问答任务上的实验结果表明,与传统的基于知识图的检索方法和传统的llm方法相比,所提出的框架在推理精度、因果方向性和可解释性方面都有显著提高。特别是在因果知识融合任务中,GLM4-9B的余弦相似度从9.63提高到21.75。这些发现强调了基于结构化图的推理脚手架作为增强复杂工业领域中基于法学硕士的推理的中间表示的有效性。代码和数据可在https://github.com/shuangxueliu/C-KHG上获得。
{"title":"LLM-augmented causal-knowledge heterogeneous graph framework for interpretable reasoning and collaborative knowledge fusion in automotive chip production","authors":"Shuangxue Liu ,&nbsp;Hongbin Xie ,&nbsp;Yuzhen Lei ,&nbsp;Jiaxing Zhao ,&nbsp;Xuan Song","doi":"10.1016/j.eswa.2026.131343","DOIUrl":"10.1016/j.eswa.2026.131343","url":null,"abstract":"<div><div>Automotive chip production involves complex interdependencies across design, manufacturing, and supply-chain processes, posing significant challenges for interpretable and consistent reasoning. To address these challenges, this paper proposes a Causal-Knowledge Heterogeneous Graph (C-KHG) framework that integrates a domain knowledge graph with a text-grounded causal event graph, capturing linguistically asserted cause-and-effect relations extracted from expert-authored technical documents. Unlike statistical causal discovery or interventional causal modeling, the proposed causal event graph focuses on causally informed semantic reasoning, emphasizing directional consistency and interpretability aligned with domain expert knowledge. Built upon the unified heterogeneous graph, we design a three-stage reasoning pipeline consisting of intent classification, graph-based adaptive retrieval, and large language model (LLM) answer generation. To evaluate its effectiveness, experiments were conducted on three representative tasks: hybrid knowledge-causal reasoning, value-chain question answering, and pure causal reasoning. Specifically, we address three types of tasks: (1) hybrid knowledge-causal reasoning, which jointly involves entity-level knowledge retrieval and cause-and-effect analysis; (2) value-chain question answering, which focuses on structured domain knowledge across the automotive chip lifecycle; and (3) pure causal reasoning, which concentrates exclusively on cause-and-effect relations without requiring explicit entity attributes. Instead of relying on direct prompt-based inference, we construct the causal knowledge graph as an explicit intermediate structured layer, efficiently bootstrapped by LLMs, which serves as a persistent and updatable domain memory. This design improves reasoning stability and directional consistency while facilitating knowledge maintenance and iterative updates without model retraining. Experimental results on automotive chip value-chain question answering tasks demonstrate that the proposed framework consistently improves reasoning accuracy, causal directionality, and interpretability compared with vanilla LLMs and conventional knowledge-graph-based retrieval methods. In particular, for the causal-knowledge fusion task, the cosine similarity of GLM4-9B improved from 9.63 to 21.75. These findings highlight the effectiveness of structured graph-based reasoning scaffolds as intermediate representations for enhancing LLM-based reasoning in complex industrial domains. Code and data are made available on <span><span>https://github.com/shuangxueliu/C-KHG</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"311 ","pages":"Article 131343"},"PeriodicalIF":7.5,"publicationDate":"2026-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146070881","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A quality prediction method for injection molding products based on multi-stage feature decoupling and fusion 一种基于多阶段特征解耦融合的注塑产品质量预测方法
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-27 DOI: 10.1016/j.eswa.2026.131372
Xianhao Zhang, Hongfei Zhan
Injection molding is an efficient method for the mass production of plastic products, but product quality is susceptible to variations in process conditions and parameters. To improve the real-time performance and accuracy of quality control, deep learning-based data-driven prediction methods have become a research focus. Nevertheless, existing injection molding quality prediction methods tend to prematurely couple variable channels and still face limitations in information fusion and model efficiency. Therefore, this paper combines multi-source data and proposes a quality prediction method based on Multi-Stage Feature Decoupling and Fusion (MSFDF). To address the issue of premature coupling of multivariate features in injection molding, a Temporal and Channel Decoupling Based Multi-Scale Feature Extraction Module (TC-DMFE) is designed to extract multi-scale features while maintaining feature independence. In addition, to address the issue of inadequate integration of multi-scale information during the injection molding process, a Channel-wise Multi-scale Feature Fusion Module (CMFF) is proposed, which fully integrates multi-scale features through a channel by channel fusion strategy and enhances the model’s comprehensive understanding of injection molding process variables under multi-scale variation patterns. On this basis, a Deep Feature Guided Channel Attention Recoupling Module (DCAR) is further constructed to learn inter-channel dependencies and apply channel weighting to achieve more effective variable recoupling. The model proposed in this paper effectively reduces training time while maintaining prediction accuracy and possesses the ability to quickly adapt to injection molding production scenarios.
注射成型是塑料制品大批量生产的一种有效方法,但产品质量容易受到工艺条件和参数变化的影响。为了提高质量控制的实时性和准确性,基于深度学习的数据驱动预测方法已成为研究热点。然而,现有的注塑质量预测方法往往过早地耦合可变通道,在信息融合和模型效率方面仍然存在局限性。为此,本文结合多源数据,提出了一种基于多阶段特征解耦与融合(MSFDF)的质量预测方法。为了解决注射成型过程中多尺度特征过早耦合的问题,设计了一种基于时间和通道解耦的多尺度特征提取模块(TC-DMFE),在提取多尺度特征的同时保持特征独立性。此外,针对注射成型过程中多尺度信息集成不足的问题,提出了一种基于通道的多尺度特征融合模块(CMFF),该模块通过通道对通道的融合策略充分集成了多尺度特征,增强了模型对多尺度变化模式下注射成型过程变量的综合理解。在此基础上,进一步构建深度特征引导的信道注意重耦模块(DCAR),学习信道间依赖关系,并应用信道加权实现更有效的变量重耦。本文提出的模型在保持预测精度的同时有效地减少了训练时间,并具有快速适应注塑生产场景的能力。
{"title":"A quality prediction method for injection molding products based on multi-stage feature decoupling and fusion","authors":"Xianhao Zhang,&nbsp;Hongfei Zhan","doi":"10.1016/j.eswa.2026.131372","DOIUrl":"10.1016/j.eswa.2026.131372","url":null,"abstract":"<div><div>Injection molding is an efficient method for the mass production of plastic products, but product quality is susceptible to variations in process conditions and parameters. To improve the real-time performance and accuracy of quality control, deep learning-based data-driven prediction methods have become a research focus. Nevertheless, existing injection molding quality prediction methods tend to prematurely couple variable channels and still face limitations in information fusion and model efficiency. Therefore, this paper combines multi-source data and proposes a quality prediction method based on Multi-Stage Feature Decoupling and Fusion (MSFDF). To address the issue of premature coupling of multivariate features in injection molding, a Temporal and Channel Decoupling Based Multi-Scale Feature Extraction Module (TC-DMFE) is designed to extract multi-scale features while maintaining feature independence. In addition, to address the issue of inadequate integration of multi-scale information during the injection molding process, a Channel-wise Multi-scale Feature Fusion Module (CMFF) is proposed, which fully integrates multi-scale features through a channel by channel fusion strategy and enhances the model’s comprehensive understanding of injection molding process variables under multi-scale variation patterns. On this basis, a Deep Feature Guided Channel Attention Recoupling Module (DCAR) is further constructed to learn inter-channel dependencies and apply channel weighting to achieve more effective variable recoupling. The model proposed in this paper effectively reduces training time while maintaining prediction accuracy and possesses the ability to quickly adapt to injection molding production scenarios.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"310 ","pages":"Article 131372"},"PeriodicalIF":7.5,"publicationDate":"2026-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146080788","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MSCPSO: A multi-strategy cooperative particle swarm optimization algorithm for UAV path planning 无人机路径规划的多策略协同粒子群优化算法
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-27 DOI: 10.1016/j.eswa.2025.131034
Jun Guan , Shuanghui Ye , Wenjun Yi
Particle swarm optimization (PSO) is widely applied to various practical problems due to its strong optimization capability and flexibility. However, when tackling complex optimization tasks, it suffers from shortcomings such as premature convergence and an imbalance between global exploration and local exploitation. To address these issues, this study proposes a multi-strategy cooperative particle swarm optimization algorithm (MSCPSO). MSCPSO divides the population into leaders and followers based on fitness and integrates diverse learning strategies to enhance performance. First, a nonlinear adaptive inertia weight is proposed to dynamically adjust inertia according to particle roles, effectively balancing exploration and exploitation. Second, a weighted learning strategy is introduced, which assigns weights based on leader fitness values to guide particles more efficiently toward promising solution regions. Third, a fitness-distance balance mechanism is designed to maintain population diversity in the early stage, accelerate convergence in the later stage, and reduce the probability of falling into local optima. Finally, in the later iterations of the algorithm, a terminal replacement mechanism is designed to replace the worst global particle, reducing population diversity to accelerate convergence. Comparative experiments on CEC2014, CEC2017, and CEC2022 test suites against seven heuristic algorithms, eleven PSO variants, and eight state-of-the-art algorithms show that multi-strategy cooperation significantly enhances PSO performance. MSCPSO outperforms most compared algorithms. Finally, MSCPSO is applied to 3D UAV path planning in complex environments. Across 12 scenarios of varying complexity, MSCPSO demonstrates the ability to generate more feasible and efficient paths in most cases.
粒子群算法以其强大的优化能力和灵活性被广泛应用于各种实际问题。然而,在处理复杂的优化任务时,它存在过早收敛和全局勘探与局部开采不平衡等缺点。针对这些问题,本研究提出了一种多策略协同粒子群优化算法(MSCPSO)。MSCPSO基于适应度将群体划分为领导者和追随者,并整合多种学习策略以提高绩效。首先,提出一种非线性自适应惯性权值,根据粒子的作用动态调整惯性,有效平衡勘探和开采;其次,引入了一种加权学习策略,该策略根据领导者适应度值分配权重,以更有效地引导粒子走向有希望的解区域;第三,设计适应度-距离平衡机制,保持种群早期多样性,加快后期收敛,降低陷入局部最优的概率。最后,在算法的后期迭代中,设计了一种终端替换机制,替换最差的全局粒子,减少种群多样性,加速收敛。在CEC2014、CEC2017和CEC2022测试套件上对7种启发式算法、11种PSO变体和8种最先进算法的对比实验表明,多策略协作显著提高了PSO的性能。MSCPSO优于大多数比较算法。最后,将MSCPSO应用于复杂环境下的无人机路径规划。在12个不同复杂性的场景中,MSCPSO展示了在大多数情况下生成更可行和有效的路径的能力。
{"title":"MSCPSO: A multi-strategy cooperative particle swarm optimization algorithm for UAV path planning","authors":"Jun Guan ,&nbsp;Shuanghui Ye ,&nbsp;Wenjun Yi","doi":"10.1016/j.eswa.2025.131034","DOIUrl":"10.1016/j.eswa.2025.131034","url":null,"abstract":"<div><div>Particle swarm optimization (PSO) is widely applied to various practical problems due to its strong optimization capability and flexibility. However, when tackling complex optimization tasks, it suffers from shortcomings such as premature convergence and an imbalance between global exploration and local exploitation. To address these issues, this study proposes a multi-strategy cooperative particle swarm optimization algorithm (MSCPSO). MSCPSO divides the population into leaders and followers based on fitness and integrates diverse learning strategies to enhance performance. First, a nonlinear adaptive inertia weight is proposed to dynamically adjust inertia according to particle roles, effectively balancing exploration and exploitation. Second, a weighted learning strategy is introduced, which assigns weights based on leader fitness values to guide particles more efficiently toward promising solution regions. Third, a fitness-distance balance mechanism is designed to maintain population diversity in the early stage, accelerate convergence in the later stage, and reduce the probability of falling into local optima. Finally, in the later iterations of the algorithm, a terminal replacement mechanism is designed to replace the worst global particle, reducing population diversity to accelerate convergence. Comparative experiments on CEC2014, CEC2017, and CEC2022 test suites against seven heuristic algorithms, eleven PSO variants, and eight state-of-the-art algorithms show that multi-strategy cooperation significantly enhances PSO performance. MSCPSO outperforms most compared algorithms. Finally, MSCPSO is applied to 3D UAV path planning in complex environments. Across 12 scenarios of varying complexity, MSCPSO demonstrates the ability to generate more feasible and efficient paths in most cases.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"310 ","pages":"Article 131034"},"PeriodicalIF":7.5,"publicationDate":"2026-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146080873","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dual-observer-based event-triggered state synchronization for discrete-time output-coupled neural networks under Round-Robin protocol 轮询协议下离散时间输出耦合神经网络的双观测器事件触发状态同步
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-27 DOI: 10.1016/j.eswa.2026.131155
Zhihong Liang , Huaguang Zhang , Juan Zhang , Qiongwen Zhang
Aiming at the state master-slave synchronization problem of discrete-time output-coupled neural networks (OCNNs), this study proposes a novel control framework. Deviating from the existing studies, this study aims at the constraint that the output state cannot obtain all neuron state information, innovatively constructs full-dimensional observers in the master/slave system respectively to realize state reconstruction, which solves the problem of missing reference trajectory caused by incomplete state information of the master system, and realizes state synchronization for the first time under the output-coupled framework. A round-robin (RR) protocol is introduced to dynamically schedule the output-coupled communication among neurons to address the transmission efficiency bottleneck under limited bandwidth resources. To allocate communication resources more intelligently, a novel event-triggered (ET) mechanism is designed. Under this mechanism, the triggering threshold is constructed based on the last triggering instant and is updated according to the most current triggering instant. It significantly reduces overall resource consumption while ensuring control performance. Then, the joint Lyapunov function is constructed based on the designed observer-controller-protocol interaction dynamic model. It derived a quantitative relationship between the triggering frequency and the error decay boundary, thereby providing a quantifiable basis for precision tuning in network control. Finally, the effectiveness of the proposed method is verified by circuit simulation.
针对离散时间输出耦合神经网络(ocnn)的状态主从同步问题,提出了一种新的控制框架。与已有研究不同,本研究针对输出状态无法获取全部神经元状态信息的约束,创新地在主从系统中分别构建全维观测器实现状态重构,解决了主从系统状态信息不完整导致的参考轨迹缺失问题,首次实现了输出耦合框架下的状态同步。引入轮循(RR)协议来动态调度神经元之间的输出耦合通信,以解决有限带宽资源下的传输效率瓶颈。为了更智能地分配通信资源,设计了一种新的事件触发机制。在该机制下,触发阈值根据最后一个触发瞬间构造,并根据最新的触发瞬间更新。它在确保控制性能的同时显著降低了整体资源消耗。然后,基于所设计的观察者-控制器-协议交互动态模型,构造了联合Lyapunov函数。导出了触发频率与误差衰减边界之间的定量关系,从而为网络控制中的精确调谐提供了可量化的依据。最后,通过电路仿真验证了所提方法的有效性。
{"title":"Dual-observer-based event-triggered state synchronization for discrete-time output-coupled neural networks under Round-Robin protocol","authors":"Zhihong Liang ,&nbsp;Huaguang Zhang ,&nbsp;Juan Zhang ,&nbsp;Qiongwen Zhang","doi":"10.1016/j.eswa.2026.131155","DOIUrl":"10.1016/j.eswa.2026.131155","url":null,"abstract":"<div><div>Aiming at the state master-slave synchronization problem of discrete-time output-coupled neural networks (OCNNs), this study proposes a novel control framework. Deviating from the existing studies, this study aims at the constraint that the output state cannot obtain all neuron state information, innovatively constructs full-dimensional observers in the master/slave system respectively to realize state reconstruction, which solves the problem of missing reference trajectory caused by incomplete state information of the master system, and realizes state synchronization for the first time under the output-coupled framework. A round-robin (RR) protocol is introduced to dynamically schedule the output-coupled communication among neurons to address the transmission efficiency bottleneck under limited bandwidth resources. To allocate communication resources more intelligently, a novel event-triggered (ET) mechanism is designed. Under this mechanism, the triggering threshold is constructed based on the last triggering instant and is updated according to the most current triggering instant. It significantly reduces overall resource consumption while ensuring control performance. Then, the joint Lyapunov function is constructed based on the designed observer-controller-protocol interaction dynamic model. It derived a quantitative relationship between the triggering frequency and the error decay boundary, thereby providing a quantifiable basis for precision tuning in network control. Finally, the effectiveness of the proposed method is verified by circuit simulation.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"310 ","pages":"Article 131155"},"PeriodicalIF":7.5,"publicationDate":"2026-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146080881","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A perception-enhanced multi-agent deep reinforcement learning method for multi-UAV cooperative pursuit 一种基于感知增强的多智能体深度强化学习的多无人机协同追踪方法
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-26 DOI: 10.1016/j.eswa.2026.131334
Xiong Liqin , Chen Xiliang , Luo Xijian , Cao Lei
Multi-UAV cooperative pursuit is an important branch in the field of multi-robot collaboration, widely applied in critical tasks such as cooperative reconnaissance and emergency rescue. Existing researches primarily focus on the constant-speed cooperative pursuit problem in fully observable environments, while paying less attention to the issue of pursuing fast-evading targets in partially observable settings. Therefore, this paper proposes a perception-enhanced multi-agent deep reinforcement learning method to enable pursuing UAVs to learn cooperation through local observations. Firstly, a pursuit judgment model based on Apollonius circle is constructed for the bounded multi-UAV cooperative pursuit problem, and a rigorous theoretical proof is provided for the boundary condition of successful pursuit. Subsequently, a dynamic multi-agent interaction graph is established based on the real-time connectivity among the pursuing UAVs, and then information features are extracted from neighbors using a two-layer graph attention network to enhance their perceptual capability. Finally, a joint reward function incorporating multiple types of rewards is designed to reflect task requirements, and the framework of centralized training with decentralized execution is utilized to train the policies of pursuing UAVs, promoting them to learn autonomous cooperation. To verify the effectiveness of our method, extensive comparative experiments are conducted in various scenarios with different evasion strategies. Experimental results show that in almost all scenarios, our method outperforms other methods in terms of success rate, stability, and time consumption. Notably, it improves the success rate by up to several times (over ten times in some cases) and reduces the average pursuit steps by a maximum of 71.42%.
多无人机协同追踪是多机器人协同领域的一个重要分支,广泛应用于协同侦察、应急救援等关键任务。现有的研究主要集中在完全可观察环境下的恒速协同追捕问题,而对部分可观察环境下快速躲避目标的追捕问题关注较少。因此,本文提出了一种感知增强的多智能体深度强化学习方法,使追击无人机能够通过局部观察学习合作。首先,针对有界多无人机协同追捕问题,建立了基于阿波罗尼乌斯圆的追捕判断模型,并对成功追捕的边界条件提供了严格的理论证明;随后,基于跟踪无人机之间的实时连通性,建立了动态多智能体交互图,然后利用两层图关注网络从邻居中提取信息特征,增强其感知能力。最后,设计了一个包含多种奖励类型的联合奖励函数来反映任务需求,并利用集中训练分散执行的框架来训练无人机的追击策略,促进无人机学习自主合作。为了验证我们的方法的有效性,我们在不同的逃避策略下进行了大量的对比实验。实验结果表明,在几乎所有场景下,我们的方法在成功率、稳定性和耗时方面都优于其他方法。值得注意的是,它将成功率提高了几倍(在某些情况下超过10倍),并将平均追踪步骤减少了71.42%。
{"title":"A perception-enhanced multi-agent deep reinforcement learning method for multi-UAV cooperative pursuit","authors":"Xiong Liqin ,&nbsp;Chen Xiliang ,&nbsp;Luo Xijian ,&nbsp;Cao Lei","doi":"10.1016/j.eswa.2026.131334","DOIUrl":"10.1016/j.eswa.2026.131334","url":null,"abstract":"<div><div>Multi-UAV cooperative pursuit is an important branch in the field of multi-robot collaboration, widely applied in critical tasks such as cooperative reconnaissance and emergency rescue. Existing researches primarily focus on the constant-speed cooperative pursuit problem in fully observable environments, while paying less attention to the issue of pursuing fast-evading targets in partially observable settings. Therefore, this paper proposes a perception-enhanced multi-agent deep reinforcement learning method to enable pursuing UAVs to learn cooperation through local observations. Firstly, a pursuit judgment model based on Apollonius circle is constructed for the bounded multi-UAV cooperative pursuit problem, and a rigorous theoretical proof is provided for the boundary condition of successful pursuit. Subsequently, a dynamic multi-agent interaction graph is established based on the real-time connectivity among the pursuing UAVs, and then information features are extracted from neighbors using a two-layer graph attention network to enhance their perceptual capability. Finally, a joint reward function incorporating multiple types of rewards is designed to reflect task requirements, and the framework of centralized training with decentralized execution is utilized to train the policies of pursuing UAVs, promoting them to learn autonomous cooperation. To verify the effectiveness of our method, extensive comparative experiments are conducted in various scenarios with different evasion strategies. Experimental results show that in almost all scenarios, our method outperforms other methods in terms of success rate, stability, and time consumption. Notably, it improves the success rate by up to several times (over ten times in some cases) and reduces the average pursuit steps by a maximum of 71.42%.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"311 ","pages":"Article 131334"},"PeriodicalIF":7.5,"publicationDate":"2026-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146070732","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A novel process dynamic guided fusion loss for soft sensor modeling in complex industrial processes 一种用于复杂工业过程软传感器建模的过程动态引导融合损失新方法
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-26 DOI: 10.1016/j.eswa.2026.131340
Yulong Wang, Jiayi Zhou, Fanlei Lu, Xu Tang, Xiaoli Wang, Chunhua Yang
Soft sensors are indispensable tools for inferring hard-to-measure quality variables in modern complex industrial processes. However, the training objectives of most deep learning-based soft sensor models typically focus on minimizing point-wise errors, a strategy that often fails to capture crucial process dynamics. This limitation directly compromises the reliability of soft sensor models in practical process control. This paper introduces the Process-Dynamic Guided Fusion Loss (PD-GFL), a model-agnostic, composite loss function designed to address this gap. PD-GFL guides model optimization from three dimensions: (i) aligning slow-varying trends to capture long-term system states, (ii) matching local statistical features to preserve distributional properties, and (iii) synchronizing differential dynamic patterns to ensure the process’s intrinsic inertia and smoothness. These objectives are integrated via an adaptive uncertainty-based weighting scheme, which enables the seamless integration of PD-GFL with diverse deep learning architectures. Extensive experiments on public benchmarks and a real-world industrial dataset demonstrate the superiority of PD-GFL, yielding improvements of up to 30% in MSE and 49% in MAPE over standard training. Notably, PD-GFL empowers a simple MLP backbone to improve its MSE by approximately 6%, achieving accuracy competitive with advanced Transformer models. Our findings show that improving the training objective is an effective alternative to simply making models more complex, offering a practical way to build more reliable and efficient soft sensors for industrial applications.
在现代复杂的工业过程中,软传感器是推断难以测量的质量变量不可或缺的工具。然而,大多数基于深度学习的软传感器模型的训练目标通常集中在最小化逐点误差上,这种策略通常无法捕捉关键的过程动态。这种限制直接影响了软测量模型在实际过程控制中的可靠性。本文介绍了过程动态引导融合损失(PD-GFL),一种与模型无关的复合损失函数,旨在解决这一差距。PD-GFL从三个维度指导模型优化:(i)调整缓慢变化的趋势以捕获长期系统状态,(ii)匹配局部统计特征以保持分布特性,以及(iii)同步差分动态模式以确保过程的固有惯性和平滑性。这些目标通过基于不确定性的自适应加权方案进行集成,从而实现PD-GFL与各种深度学习架构的无缝集成。在公共基准测试和现实世界的工业数据集上进行的大量实验证明了PD-GFL的优越性,与标准训练相比,MSE和MAPE分别提高了30%和49%。值得注意的是,PD-GFL使简单的MLP骨干能够将其MSE提高约6%,实现与先进Transformer模型竞争的精度。我们的研究结果表明,改进训练目标是简单地使模型更复杂的有效替代方案,为工业应用构建更可靠和高效的软传感器提供了一种实用的方法。
{"title":"A novel process dynamic guided fusion loss for soft sensor modeling in complex industrial processes","authors":"Yulong Wang,&nbsp;Jiayi Zhou,&nbsp;Fanlei Lu,&nbsp;Xu Tang,&nbsp;Xiaoli Wang,&nbsp;Chunhua Yang","doi":"10.1016/j.eswa.2026.131340","DOIUrl":"10.1016/j.eswa.2026.131340","url":null,"abstract":"<div><div>Soft sensors are indispensable tools for inferring hard-to-measure quality variables in modern complex industrial processes. However, the training objectives of most deep learning-based soft sensor models typically focus on minimizing point-wise errors, a strategy that often fails to capture crucial process dynamics. This limitation directly compromises the reliability of soft sensor models in practical process control. This paper introduces the Process-Dynamic Guided Fusion Loss (PD-GFL), a model-agnostic, composite loss function designed to address this gap. PD-GFL guides model optimization from three dimensions: (i) aligning slow-varying trends to capture long-term system states, (ii) matching local statistical features to preserve distributional properties, and (iii) synchronizing differential dynamic patterns to ensure the process’s intrinsic inertia and smoothness. These objectives are integrated via an adaptive uncertainty-based weighting scheme, which enables the seamless integration of PD-GFL with diverse deep learning architectures. Extensive experiments on public benchmarks and a real-world industrial dataset demonstrate the superiority of PD-GFL, yielding improvements of up to 30% in MSE and 49% in MAPE over standard training. Notably, PD-GFL empowers a simple MLP backbone to improve its MSE by approximately 6%, achieving accuracy competitive with advanced Transformer models. Our findings show that improving the training objective is an effective alternative to simply making models more complex, offering a practical way to build more reliable and efficient soft sensors for industrial applications.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"310 ","pages":"Article 131340"},"PeriodicalIF":7.5,"publicationDate":"2026-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146081049","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
JSDU-Net: Joint sensitivity-learning driven deep unfolding network for accelerated radial MRI reconstruction JSDU-Net:关节敏感性学习驱动的径向MRI加速重建深度展开网络
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-26 DOI: 10.1016/j.eswa.2026.131351
Biao Qu , Huajun She , Qingxia Wu , Pingping Jie , Qi Yao , Liulu Zhang , Yuting Fu , Yamei Luo , Taishan Kang , Gaofeng Zheng
Radial Magnetic Resonance Imaging (MRI) enables high acceleration imaging. However, reconstructing high-quality images from highly undersampled k-space data remains a challenge due to the difficulty in accurately estimating coil sensitivity maps from limited autocalibration signal. The autocalibration signal is extracted from the central k-space in existing compressed sensing and deep learning methods. This extraction makes the low-resolution sensitivity maps lack high-frequency details, leading to suboptimal reconstructions. To address this problem, we propose a Joint Sensitivity-learning driven Deep Unfolding Network (JSDU-Net) for accelerated radial MRI. Sensitivity maps are firstly estimated from all available k-space data, including the low and high-frequency parts and then updated in the reconstruction. JSDU-Net unfolds the iterative reconstruction process into a deep neural network and introduces a novel sensitivity learning strategy that alternately updates sensitivity maps and image estimates in each iteration. This joint optimization facilitates accurate sensitivity estimation by capturing high-frequency information. Extensive experiments demonstrate that JSDU-Net achieves superior performance in detail preservation, artifact suppression, and reconstruction efficiency. Blinded evaluations by clinical radiologists show that the reconstructed images exhibit excellent diagnostic value, suggesting the potential of JSDU-Net for clinical applications of radial MRI.
径向磁共振成像(MRI)可以实现高加速度成像。然而,由于难以从有限的自动校准信号中准确估计线圈灵敏度图,因此从高度欠采样的k空间数据中重建高质量图像仍然是一个挑战。在现有的压缩感知和深度学习方法中,自动校准信号是从中心k空间中提取的。这种提取使得低分辨率灵敏度图缺乏高频细节,导致次优重建。为了解决这个问题,我们提出了一个联合灵敏度学习驱动的深度展开网络(JSDU-Net),用于加速径向MRI。首先从所有可用的k空间数据(包括低频和高频部分)估计灵敏度图,然后在重建中更新。JSDU-Net将迭代重建过程展开为一个深度神经网络,并引入了一种新的灵敏度学习策略,在每次迭代中交替更新灵敏度图和图像估计。这种联合优化通过捕获高频信息促进准确的灵敏度估计。大量的实验表明,JSDU-Net在细节保存、伪迹抑制和重建效率等方面都具有优异的性能。临床放射科医师的盲法评价表明,重建图像具有良好的诊断价值,提示JSDU-Net在放射MRI临床应用的潜力。
{"title":"JSDU-Net: Joint sensitivity-learning driven deep unfolding network for accelerated radial MRI reconstruction","authors":"Biao Qu ,&nbsp;Huajun She ,&nbsp;Qingxia Wu ,&nbsp;Pingping Jie ,&nbsp;Qi Yao ,&nbsp;Liulu Zhang ,&nbsp;Yuting Fu ,&nbsp;Yamei Luo ,&nbsp;Taishan Kang ,&nbsp;Gaofeng Zheng","doi":"10.1016/j.eswa.2026.131351","DOIUrl":"10.1016/j.eswa.2026.131351","url":null,"abstract":"<div><div>Radial Magnetic Resonance Imaging (MRI) enables high acceleration imaging. However, reconstructing high-quality images from highly undersampled k-space data remains a challenge due to the difficulty in accurately estimating coil sensitivity maps from limited autocalibration signal. The autocalibration signal is extracted from the central k-space in existing compressed sensing and deep learning methods. This extraction makes the low-resolution sensitivity maps lack high-frequency details, leading to suboptimal reconstructions. To address this problem, we propose a Joint Sensitivity-learning driven Deep Unfolding Network (JSDU-Net) for accelerated radial MRI. Sensitivity maps are firstly estimated from all available k-space data, including the low and high-frequency parts and then updated in the reconstruction. JSDU-Net unfolds the iterative reconstruction process into a deep neural network and introduces a novel sensitivity learning strategy that alternately updates sensitivity maps and image estimates in each iteration. This joint optimization facilitates accurate sensitivity estimation by capturing high-frequency information. Extensive experiments demonstrate that JSDU-Net achieves superior performance in detail preservation, artifact suppression, and reconstruction efficiency. Blinded evaluations by clinical radiologists show that the reconstructed images exhibit excellent diagnostic value, suggesting the potential of JSDU-Net for clinical applications of radial MRI.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"310 ","pages":"Article 131351"},"PeriodicalIF":7.5,"publicationDate":"2026-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146080973","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SCID-Net: Few-shot deep-hole defect instance segmentation via multi-grained feature coupling and instance-aware inference decoupling SCID-Net:基于多粒度特征耦合和实例感知推理解耦的少弹深孔缺陷实例分割
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-25 DOI: 10.1016/j.eswa.2026.131341
Zongyang Zhao , Jiehu Kang , Yichen Xu , Jian Liang , Luyuan Feng , Yuqi Ren , Ting Xue , Bin Wu
Accurate instance segmentation of deep-hole inner surface defects is critical for ensuring the structural integrity and functional reliability of high-precision industrial components. However, this task remains highly challenging due to the extreme scarcity of annotated data, along with the irregular morphology, weak texture, and dense, random spatial distribution of defects. Existing instance segmentation methods typically rely on large-scale supervision, which is prohibitively expensive and often infeasible in real-world manufacturing. While few-shot learning offers a promising alternative, current models primarily focus on semantic segmentation and fail to delineate individual defect instances with accurate boundaries and counts. Moreover, they lack adaptive mechanisms to model fine-grained morphological variations of defect regions and are susceptible to foreground–background ambiguity induced by incomplete annotations, resulting in classification bias during inspection. To address these limitations, we propose SCID-Net, a novel few-shot defect instance segmentation framework based on multi-granularity feature coupling and instance-aware inference decoupling. Specifically, we introduce a Multi-Grained Coupling Module (GCM) to facilitate hierarchical bi-directional interaction between support and query features, enriching both class-level prototypes and instance-specific representations. Built upon this, the Instance-Aware Inference Decoupling Module (IAM) decouples dense inference into specialized pathways, and further integrates adaptive spatial modulation and prototype-driven semantic alignment to suppress noise from incomplete annotations. Extensive experiments on a proprietary industrial deep-hole defect dataset demonstrate that SCID-Net achieves state-of-the-art performance under few-shot settings. Moreover, evaluations on NEU-Seg and MS COCO further validate the exceptional generalization capability of SCID-Net, highlighting its versatility in both challenging industrial environments and diverse real-world scenarios.
深孔内表面缺陷的准确实例分割是保证高精度工业零部件结构完整性和功能可靠性的关键。然而,由于标注数据的极度稀缺,以及缺陷的不规则形态、弱纹理和密集随机的空间分布,这项任务仍然具有很高的挑战性。现有的实例分割方法通常依赖于大规模的监督,这是非常昂贵的,而且在现实世界的制造中往往是不可行的。虽然few-shot学习提供了一个很有前途的选择,但当前的模型主要关注语义分割,并且无法用准确的边界和计数来描绘单个缺陷实例。此外,它们缺乏自适应机制来模拟缺陷区域的细粒度形态变化,并且容易受到不完整注释引起的前景和背景模糊的影响,从而导致检查过程中的分类偏差。为了解决这些限制,我们提出了一种基于多粒度特征耦合和实例感知推理解耦的新型少镜头缺陷实例分割框架SCID-Net。具体来说,我们引入了一个多粒度耦合模块(GCM)来促进支持和查询特性之间的分层双向交互,丰富类级原型和实例特定表示。在此基础上,实例感知推理解耦模块(IAM)将密集推理解耦到专门的路径中,并进一步集成自适应空间调制和原型驱动的语义对齐,以抑制来自不完整注释的噪声。在一个专有的工业深孔缺陷数据集上进行的大量实验表明,SCID-Net在很少的射击设置下就能达到最先进的性能。此外,对NEU-Seg和MS COCO的评估进一步验证了SCID-Net卓越的泛化能力,突出了其在具有挑战性的工业环境和多种现实场景中的通用性。
{"title":"SCID-Net: Few-shot deep-hole defect instance segmentation via multi-grained feature coupling and instance-aware inference decoupling","authors":"Zongyang Zhao ,&nbsp;Jiehu Kang ,&nbsp;Yichen Xu ,&nbsp;Jian Liang ,&nbsp;Luyuan Feng ,&nbsp;Yuqi Ren ,&nbsp;Ting Xue ,&nbsp;Bin Wu","doi":"10.1016/j.eswa.2026.131341","DOIUrl":"10.1016/j.eswa.2026.131341","url":null,"abstract":"<div><div>Accurate instance segmentation of deep-hole inner surface defects is critical for ensuring the structural integrity and functional reliability of high-precision industrial components. However, this task remains highly challenging due to the extreme scarcity of annotated data, along with the irregular morphology, weak texture, and dense, random spatial distribution of defects. Existing instance segmentation methods typically rely on large-scale supervision, which is prohibitively expensive and often infeasible in real-world manufacturing. While few-shot learning offers a promising alternative, current models primarily focus on semantic segmentation and fail to delineate individual defect instances with accurate boundaries and counts. Moreover, they lack adaptive mechanisms to model fine-grained morphological variations of defect regions and are susceptible to foreground–background ambiguity induced by incomplete annotations, resulting in classification bias during inspection. To address these limitations, we propose SCID-Net, a novel few-shot defect instance segmentation framework based on multi-granularity feature coupling and instance-aware inference decoupling. Specifically, we introduce a Multi-Grained Coupling Module (GCM) to facilitate hierarchical bi-directional interaction between support and query features, enriching both class-level prototypes and instance-specific representations. Built upon this, the Instance-Aware Inference Decoupling Module (IAM) decouples dense inference into specialized pathways, and further integrates adaptive spatial modulation and prototype-driven semantic alignment to suppress noise from incomplete annotations. Extensive experiments on a proprietary industrial deep-hole defect dataset demonstrate that SCID-Net achieves state-of-the-art performance under few-shot settings. Moreover, evaluations on NEU-Seg and MS COCO further validate the exceptional generalization capability of SCID-Net, highlighting its versatility in both challenging industrial environments and diverse real-world scenarios.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"310 ","pages":"Article 131341"},"PeriodicalIF":7.5,"publicationDate":"2026-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146080964","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Expert Systems with Applications
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1