首页 > 最新文献

Expert Systems with Applications最新文献

英文 中文
Amplitude -phase decomposition-based latent diffusion model for underwater image enhancement 基于幅相分解的水下图像潜扩散增强模型
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-05-25 Epub Date: 2026-01-29 DOI: 10.1016/j.eswa.2026.131296
Hansen Zhang , Miao Yang , Can Pan , Leyuan Wang , Jiaju Tao
Underwater images commonly suffer from blurring, low contrast, and color distortion due to light scattering effects and wavelength-dependent absorption. From a frequency-domain perspective, these degradations significantly impair the spectral representation: they not only attenuate high-frequency amplitudes (causing detail loss) but also weaken specific color channel energies (inducing color deviation) while reducing overall spectral energy (leading to contrast deterioration). To address these challenges, this paper proposes an innovative two-stage underwater image enhancement framework, termed APD-LDM. In the first stage, we design an Amplitude-Phase Decomposition Network (APDNet) that performs end-to-end learning on paired underwater image data to preliminarily recover amplitude information degraded by absorption and scattering effects. The second stage employs a conditional diffusion model for refined reconstruction, where latent representations of degraded images serve as conditional constraints to guide the diffusion process toward more realistic underwater image features. Additionally, we introduce a self-constrained consistency loss function to further optimize network training. Extensive experiments demonstrate that the proposed method achieves superior effectiveness and robustness in both subjective visual quality and objective metrics. The code is available at https://github.com/JOU-UIP/APD-LDM.
由于光散射效应和波长依赖的吸收,水下图像通常会出现模糊、低对比度和颜色失真。从频域的角度来看,这些退化严重损害了光谱表示:它们不仅衰减了高频幅度(导致细节损失),而且减弱了特定颜色通道能量(诱导颜色偏差),同时降低了总体光谱能量(导致对比度下降)。为了解决这些挑战,本文提出了一种创新的两阶段水下图像增强框架,称为APD-LDM。在第一阶段,我们设计了一个幅相分解网络(APDNet),对成对的水下图像数据进行端到端学习,初步恢复被吸收和散射效应退化的幅值信息。第二阶段采用条件扩散模型进行精细重建,其中退化图像的潜在表示作为条件约束,引导扩散过程向更真实的水下图像特征扩散。此外,我们引入了自约束一致性损失函数来进一步优化网络训练。大量的实验表明,该方法在主观视觉质量和客观度量方面都具有良好的有效性和鲁棒性。代码可在https://github.com/JOU-UIP/APD-LDM上获得。
{"title":"Amplitude -phase decomposition-based latent diffusion model for underwater image enhancement","authors":"Hansen Zhang ,&nbsp;Miao Yang ,&nbsp;Can Pan ,&nbsp;Leyuan Wang ,&nbsp;Jiaju Tao","doi":"10.1016/j.eswa.2026.131296","DOIUrl":"10.1016/j.eswa.2026.131296","url":null,"abstract":"<div><div>Underwater images commonly suffer from blurring, low contrast, and color distortion due to light scattering effects and wavelength-dependent absorption. From a frequency-domain perspective, these degradations significantly impair the spectral representation: they not only attenuate high-frequency amplitudes (causing detail loss) but also weaken specific color channel energies (inducing color deviation) while reducing overall spectral energy (leading to contrast deterioration). To address these challenges, this paper proposes an innovative two-stage underwater image enhancement framework, termed APD-LDM. In the first stage, we design an Amplitude-Phase Decomposition Network (APDNet) that performs end-to-end learning on paired underwater image data to preliminarily recover amplitude information degraded by absorption and scattering effects. The second stage employs a conditional diffusion model for refined reconstruction, where latent representations of degraded images serve as conditional constraints to guide the diffusion process toward more realistic underwater image features. Additionally, we introduce a self-constrained consistency loss function to further optimize network training. Extensive experiments demonstrate that the proposed method achieves superior effectiveness and robustness in both subjective visual quality and objective metrics. The code is available at <span><span>https://github.com/JOU-UIP/APD-LDM</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"312 ","pages":"Article 131296"},"PeriodicalIF":7.5,"publicationDate":"2026-05-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146192779","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MatNet : Multi-scale adaptive time series forecasting network with bidirectional collaborative pathways MatNet:具有双向协同路径的多尺度自适应时间序列预测网络
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-05-25 Epub Date: 2026-02-03 DOI: 10.1016/j.eswa.2026.131493
Guangming Zi, Yujun Zhu, Xin He, Yong Xu, Qun Fang
In long-term time series forecasting (LSTF), a fundamental challenge lies in simultaneously capturing fine-grained local dynamics and long-range global dependencies within inherently complex and non-stationary temporal series. However, most existing forecasting architectures rely on single-structure paradigms, each exhibiting inherent representational biases–for example, CNNs are constrained by limited receptive fields, while Transformers often overlook fine-grained local patterns. More critically, these architectures typically operate in isolation, lacking collaborative mechanisms to effectively integrate their complementary modeling capabilities. To address these limitations, we propose MatNet, a Multi-scale Adaptive Forecasting Network with a novel bidirectional collaborative architecture designed to establish bidirectional collaborative pathways between CNN and Transformer branches. Within this architecture, local representations extracted by CNNs are leveraged to refine and enrich the global context modeled by Transformers, thereby improving the model’s sensitivity to fine-grained temporal structures. Conversely, global dependencies captured by Transformer provide high-level semantic guidance to CNNs, enabling them to focus on contextually salient local regions and enhance representation coherence. Additionally, we introduce a Dynamic Temporal-Aware Router that adaptively extracts and fuses temporal features across multiple scales, enabling adaptive multi-scale modeling. Extensive experiments on nine public datasets demonstrate that MatNet consistently outperforms existing state-of-the-art methods in forecasting accuracy.
在长期时间序列预测(LSTF)中,一个基本的挑战在于同时捕获固有的复杂和非平稳时间序列中的细粒度局部动态和长期全局依赖关系。然而,大多数现有的预测架构依赖于单结构范式,每个都表现出固有的表征偏差——例如,cnn受到有限的接受域的约束,而变形金刚经常忽略细粒度的局部模式。更关键的是,这些体系结构通常是孤立运行的,缺乏协作机制来有效地集成它们互补的建模功能。为了解决这些限制,我们提出了MatNet,这是一个多尺度自适应预测网络,具有新颖的双向协作架构,旨在建立CNN和Transformer分支之间的双向协作路径。在该体系结构中,利用cnn提取的局部表示来细化和丰富变形金刚建模的全局上下文,从而提高模型对细粒度时间结构的敏感性。相反,Transformer捕获的全局依赖关系为cnn提供了高级语义指导,使它们能够专注于上下文显著的局部区域并增强表示一致性。此外,我们还引入了一个动态时间感知路由器,该路由器可以自适应地提取和融合多尺度的时间特征,从而实现自适应多尺度建模。在9个公共数据集上进行的大量实验表明,MatNet在预测准确性方面始终优于现有的最先进的方法。
{"title":"MatNet : Multi-scale adaptive time series forecasting network with bidirectional collaborative pathways","authors":"Guangming Zi,&nbsp;Yujun Zhu,&nbsp;Xin He,&nbsp;Yong Xu,&nbsp;Qun Fang","doi":"10.1016/j.eswa.2026.131493","DOIUrl":"10.1016/j.eswa.2026.131493","url":null,"abstract":"<div><div>In long-term time series forecasting (LSTF), a fundamental challenge lies in simultaneously capturing fine-grained local dynamics and long-range global dependencies within inherently complex and non-stationary temporal series. However, most existing forecasting architectures rely on single-structure paradigms, each exhibiting inherent representational biases–for example, CNNs are constrained by limited receptive fields, while Transformers often overlook fine-grained local patterns. More critically, these architectures typically operate in isolation, lacking collaborative mechanisms to effectively integrate their complementary modeling capabilities. To address these limitations, we propose MatNet, a Multi-scale Adaptive Forecasting Network with a novel bidirectional collaborative architecture designed to establish bidirectional collaborative pathways between CNN and Transformer branches. Within this architecture, local representations extracted by CNNs are leveraged to refine and enrich the global context modeled by Transformers, thereby improving the model’s sensitivity to fine-grained temporal structures. Conversely, global dependencies captured by Transformer provide high-level semantic guidance to CNNs, enabling them to focus on contextually salient local regions and enhance representation coherence. Additionally, we introduce a Dynamic Temporal-Aware Router that adaptively extracts and fuses temporal features across multiple scales, enabling adaptive multi-scale modeling. Extensive experiments on nine public datasets demonstrate that MatNet consistently outperforms existing state-of-the-art methods in forecasting accuracy.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"312 ","pages":"Article 131493"},"PeriodicalIF":7.5,"publicationDate":"2026-05-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146192696","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A robust model on the location of temporary medical centers considering secondary disasters 考虑二次灾害的临时医疗中心位置的鲁棒模型
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-05-25 Epub Date: 2026-02-05 DOI: 10.1016/j.eswa.2026.131532
Hongmei Li , Dongxia Qu , Taibo Luo
Natural disasters occur frequently in the real world, and the secondary disasters they trigger can also result in significant losses. Well-organized disaster response is essential for providing timely, effective medical services in such emergencies. This paper addresses a post-disaster emergency response problem involving the determination of the location and number of temporary medical centers (TMCs) and the planning of casualty transportation, considering both primary and secondary disasters under uncertainty. A two-stage robust optimization model based on the minimax regret criterion is proposed. In the first stage, the locations and allocations for primary disaster casualties are determined prior to the occurrence of secondary disasters. In the second stage, casualties resulting from secondary disasters are transported to the established TMCs. To handle uncertainty in the number and spatial distribution of secondary disaster casualties, a minimax regret approach is employed across a set of predefined scenarios. To enhance computational efficiency, a scenario relaxation algorithm based on row generation is developed. Case studies based on the Lushan Earthquake are conducted to validate the feasibility and effectiveness of the model. Results demonstrate that incorporating secondary disasters significantly improves the efficiency of casualty treatment compared to models considering only primary disasters. Under the uncertainty of secondary disasters, constructing a limited number of critical scenarios is sufficient, and large-capacity TMCs are more recommended.
自然灾害在现实世界中频繁发生,其引发的次生灾害也会造成重大损失。组织良好的灾害应对对于在此类紧急情况下提供及时有效的医疗服务至关重要。本文研究了在不确定性条件下,考虑初级灾害和次级灾害的灾后应急响应问题,涉及临时医疗中心(tmc)的位置和数量的确定以及伤员运输的规划。提出了一种基于极大极小后悔准则的两阶段鲁棒优化模型。在第一阶段,在次生灾害发生前确定一次灾害伤亡的地点和分配。在第二阶段,由次生灾害造成的伤亡人员被运送到已建立的灾害管理中心。为了处理次生灾害伤亡人数和空间分布的不确定性,在一组预定义的场景中采用了最小最大遗憾方法。为了提高计算效率,提出了一种基于行生成的场景松弛算法。以芦山地震为例,验证了模型的可行性和有效性。结果表明,与只考虑初级灾害的模型相比,纳入次生灾害的模型显著提高了伤亡处理的效率。在次生灾害的不确定性下,构建有限数量的关键场景就足够了,更推荐大容量的tmc。
{"title":"A robust model on the location of temporary medical centers considering secondary disasters","authors":"Hongmei Li ,&nbsp;Dongxia Qu ,&nbsp;Taibo Luo","doi":"10.1016/j.eswa.2026.131532","DOIUrl":"10.1016/j.eswa.2026.131532","url":null,"abstract":"<div><div>Natural disasters occur frequently in the real world, and the secondary disasters they trigger can also result in significant losses. Well-organized disaster response is essential for providing timely, effective medical services in such emergencies. This paper addresses a post-disaster emergency response problem involving the determination of the location and number of temporary medical centers (TMCs) and the planning of casualty transportation, considering both primary and secondary disasters under uncertainty. A two-stage robust optimization model based on the minimax regret criterion is proposed. In the first stage, the locations and allocations for primary disaster casualties are determined prior to the occurrence of secondary disasters. In the second stage, casualties resulting from secondary disasters are transported to the established TMCs. To handle uncertainty in the number and spatial distribution of secondary disaster casualties, a minimax regret approach is employed across a set of predefined scenarios. To enhance computational efficiency, a scenario relaxation algorithm based on row generation is developed. Case studies based on the Lushan Earthquake are conducted to validate the feasibility and effectiveness of the model. Results demonstrate that incorporating secondary disasters significantly improves the efficiency of casualty treatment compared to models considering only primary disasters. Under the uncertainty of secondary disasters, constructing a limited number of critical scenarios is sufficient, and large-capacity TMCs are more recommended.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"312 ","pages":"Article 131532"},"PeriodicalIF":7.5,"publicationDate":"2026-05-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146192699","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Budget-constrained workflow scheduling using task prediction in hybrid environments 混合环境下使用任务预测的预算约束工作流调度
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-05-25 Epub Date: 2026-01-31 DOI: 10.1016/j.eswa.2026.131436
Changhong Tai , Huiying Jin , Qi Wang , Hai Dong , Pengcheng Zhang
This paper addresses the challenges of workflow scheduling in hybrid cloud environments, where unknown task execution times and resource demand deviations often lead to prolonged makespan and scheduling failures. We propose FBiLSTM-DPBWS, a novel budget-constrained workflow scheduling algorithm. The core contributions of this algorithm are reflected at two levels: Firstly, a novel FBiLSTM regression prediction model is proposed. By integrating the Flash Attention mechanism with bidirectional long short-term memory networks, it can accurately and synchronously predict the instruction counts of all subtasks based solely on the directed acyclic graph structure and prior information about the workflow before task execution, thereby estimating their execution times on heterogeneous resources. This fundamentally addresses the challenge of unknown execution times. Secondly, a dynamic critical-path-priority-based budget-constrained workflow scheduling algorithm, DPBWS, is designed. Instead of simply combining virtual machine and serverless function resources, this algorithm dynamically adjusts the priority of critical-path tasks. It adaptively selects the optimal resource type based on predicted instruction counts, real-time budget consumption, and the task’s compute or data-intensive characteristics. The algorithm explicitly accounts for the fundamental billing differences between these resource types (whole-unit rental per billing period vs. fine-grained billing based on actual execution time), thereby minimizing makespan and maximizing scheduling success rates under limited budgets. Experiments conducted on six real-world and five large-scale synthetic datasets demonstrate that the FBiLSTM model achieves prediction accuracies ranging from 97.80% to 99.62%. Under the same budget, DPBWS significantly reduces makespan compared to the best-performing baseline and achieves scheduling success rates from 98.62% to 100% across all datasets. These results confirm the superiority, robustness, and practical applicability of the proposed method in hybrid cloud environments.
本文讨论了混合云环境中工作流调度的挑战,在混合云环境中,未知的任务执行时间和资源需求偏差通常会导致长时间的完工时间和调度失败。提出了一种新的预算约束工作流调度算法FBiLSTM-DPBWS。该算法的核心贡献体现在两个层面:首先,提出了一种新的FBiLSTM回归预测模型;该算法将Flash注意机制与双向长短期记忆网络相结合,仅根据任务执行前的有向无环图结构和工作流程的先验信息,就能准确、同步地预测子任务的指令数,从而估计子任务在异构资源上的执行时间。这从根本上解决了未知执行时间的挑战。其次,设计了基于动态关键路径优先级的预算约束工作流调度算法DPBWS。该算法不是简单地结合虚拟机和无服务器功能资源,而是动态调整关键路径任务的优先级。它根据预测的指令计数、实时预算消耗以及任务的计算或数据密集型特征,自适应地选择最优资源类型。该算法显式地考虑了这些资源类型之间的基本计费差异(每个计费周期的全单元租金与基于实际执行时间的细粒度计费),从而在有限的预算下最小化完工时间并最大化调度成功率。在6个真实数据集和5个大规模合成数据集上进行的实验表明,FBiLSTM模型的预测准确率在97.80% ~ 99.62%之间。在相同的预算下,与性能最好的基线相比,DPBWS显著减少了完工时间,并在所有数据集上实现了从98.62%到100%的调度成功率。这些结果证实了该方法在混合云环境下的优越性、鲁棒性和实用性。
{"title":"Budget-constrained workflow scheduling using task prediction in hybrid environments","authors":"Changhong Tai ,&nbsp;Huiying Jin ,&nbsp;Qi Wang ,&nbsp;Hai Dong ,&nbsp;Pengcheng Zhang","doi":"10.1016/j.eswa.2026.131436","DOIUrl":"10.1016/j.eswa.2026.131436","url":null,"abstract":"<div><div>This paper addresses the challenges of workflow scheduling in hybrid cloud environments, where unknown task execution times and resource demand deviations often lead to prolonged makespan and scheduling failures. We propose FBiLSTM-DPBWS, a novel budget-constrained workflow scheduling algorithm. The core contributions of this algorithm are reflected at two levels: Firstly, a novel FBiLSTM regression prediction model is proposed. By integrating the Flash Attention mechanism with bidirectional long short-term memory networks, it can accurately and synchronously predict the instruction counts of all subtasks based solely on the directed acyclic graph structure and prior information about the workflow before task execution, thereby estimating their execution times on heterogeneous resources. This fundamentally addresses the challenge of unknown execution times. Secondly, a dynamic critical-path-priority-based budget-constrained workflow scheduling algorithm, DPBWS, is designed. Instead of simply combining virtual machine and serverless function resources, this algorithm dynamically adjusts the priority of critical-path tasks. It adaptively selects the optimal resource type based on predicted instruction counts, real-time budget consumption, and the task’s compute or data-intensive characteristics. The algorithm explicitly accounts for the fundamental billing differences between these resource types (whole-unit rental per billing period vs. fine-grained billing based on actual execution time), thereby minimizing makespan and maximizing scheduling success rates under limited budgets. Experiments conducted on six real-world and five large-scale synthetic datasets demonstrate that the FBiLSTM model achieves prediction accuracies ranging from 97.80% to 99.62%. Under the same budget, DPBWS significantly reduces makespan compared to the best-performing baseline and achieves scheduling success rates from 98.62% to 100% across all datasets. These results confirm the superiority, robustness, and practical applicability of the proposed method in hybrid cloud environments.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"312 ","pages":"Article 131436"},"PeriodicalIF":7.5,"publicationDate":"2026-05-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146192852","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Local sharpness aware minimization in decentralized federated learning with privacy protection 具有隐私保护的分散联邦学习中的局部锐度感知最小化
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-05-25 Epub Date: 2026-02-02 DOI: 10.1016/j.eswa.2026.131510
Jifei Hu , Yanli Li , Huayong Xie , Lijun Xu , Hang Zhang , Xinqiang Zhou
Federated learning (FL) enables distributed model training while preserving data privacy, but it still faces challenges from data heterogeneity and privacy constraints. Existing optimization methods aiming to flatten loss landscapes often fail to resolve inconsistencies between local and global flatness due to hyperparameter dependencies and centralized aggregation. Moreover, privacy-preserving techniques such as differential privacy (DP) can distort gradients, leading to sharper loss landscapes and hindered convergence. To tackle these issues, we propose DFedLSAM (Decentralized Federated Local Sharpness-Aware Minimization), a novel framework that eliminates the central server and uses the Sharpness-Aware Minimization (SAM) optimizer on the client side to maintain locally flattened loss landscapes. DFedLSAM adopts a dual-model architecture, where each client trains a sharing model for cross-client knowledge exchange and a private model updated via knowledge distillation (KD) from the sharing model’s soft logits, thereby reducing data heterogeneity and mitigating DP-induced sharpness. Building on this design, we introduce a perturbation-based SAM variant, integrated into the framework as DFedLSAM-Pert, which allocates perturbations according to layer-wise sensitivity and improves generalization without sacrificing privacy. Extensive experiments on benchmark image datasets and real-world medical datasets show that DFedLSAM and its perturbation-based extension DFedLSAM-Pert outperform existing baselines, especially in non-IID settings and under strict privacy budgets. These results indicate that DFedLSAM and DFedLSAM-Pert provide practical solutions for privacy-preserving FL in healthcare and other sensitive application domains.
联邦学习(FL)在保护数据隐私的同时支持分布式模型训练,但它仍然面临数据异构和隐私约束的挑战。现有的旨在平坦化损失景观的优化方法往往由于超参数依赖和集中式聚集而无法解决局部平坦度与全局平坦度之间的不一致性。此外,差分隐私(DP)等隐私保护技术会扭曲梯度,导致更清晰的损失景观并阻碍收敛。为了解决这些问题,我们提出了DFedLSAM(去中心化联邦本地锐度感知最小化),这是一个新的框架,它消除了中央服务器,并在客户端使用锐度感知最小化(SAM)优化器来维护本地平坦的损失景观。DFedLSAM采用双模型架构,每个客户端训练一个用于跨客户端知识交换的共享模型和一个通过共享模型软逻辑的知识蒸馏(KD)更新的私有模型,从而降低了数据异构性并减轻了dp引起的锐度。在此设计的基础上,我们引入了一个基于扰动的SAM变体,作为DFedLSAM-Pert集成到框架中,它根据分层灵敏度分配扰动,并在不牺牲隐私的情况下提高泛化。在基准图像数据集和真实医疗数据集上的大量实验表明,DFedLSAM及其基于扰动的扩展DFedLSAM- pert优于现有基线,特别是在非iid设置和严格的隐私预算下。这些结果表明,DFedLSAM和DFedLSAM- pert为医疗保健和其他敏感应用领域的隐私保护FL提供了实用的解决方案。
{"title":"Local sharpness aware minimization in decentralized federated learning with privacy protection","authors":"Jifei Hu ,&nbsp;Yanli Li ,&nbsp;Huayong Xie ,&nbsp;Lijun Xu ,&nbsp;Hang Zhang ,&nbsp;Xinqiang Zhou","doi":"10.1016/j.eswa.2026.131510","DOIUrl":"10.1016/j.eswa.2026.131510","url":null,"abstract":"<div><div>Federated learning (FL) enables distributed model training while preserving data privacy, but it still faces challenges from data heterogeneity and privacy constraints. Existing optimization methods aiming to flatten loss landscapes often fail to resolve inconsistencies between local and global flatness due to hyperparameter dependencies and centralized aggregation. Moreover, privacy-preserving techniques such as differential privacy (DP) can distort gradients, leading to sharper loss landscapes and hindered convergence. To tackle these issues, we propose DFedLSAM (Decentralized Federated Local Sharpness-Aware Minimization), a novel framework that eliminates the central server and uses the Sharpness-Aware Minimization (SAM) optimizer on the client side to maintain locally flattened loss landscapes. DFedLSAM adopts a dual-model architecture, where each client trains a sharing model for cross-client knowledge exchange and a private model updated via knowledge distillation (KD) from the sharing model’s soft logits, thereby reducing data heterogeneity and mitigating DP-induced sharpness. Building on this design, we introduce a perturbation-based SAM variant, integrated into the framework as DFedLSAM-Pert, which allocates perturbations according to layer-wise sensitivity and improves generalization without sacrificing privacy. Extensive experiments on benchmark image datasets and real-world medical datasets show that DFedLSAM and its perturbation-based extension DFedLSAM-Pert outperform existing baselines, especially in non-IID settings and under strict privacy budgets. These results indicate that DFedLSAM and DFedLSAM-Pert provide practical solutions for privacy-preserving FL in healthcare and other sensitive application domains.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"312 ","pages":"Article 131510"},"PeriodicalIF":7.5,"publicationDate":"2026-05-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146122658","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Discovering anomalous sequences in attributed graphs: A parameter-light approach 发现属性图中的异常序列:一种轻参数的方法
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-05-25 Epub Date: 2026-02-02 DOI: 10.1016/j.eswa.2026.131467
Cheng He , Xinyang Chen , Amaël Broustet , Guoting Chen
Graphs have been widely used across scientific disciplines, from sociology to biology, particularly when modeling temporal evolution. Although many algorithms have been developed to discover patterns in graphs, they face three main limitations. First, most algorithms assume that each node or edge is associated with a single attribute, whereas real-world applications often involve multiple attributes to capture events more comprehensively. Second, existing methods typically require tuning several hyperparameters, which can vary significantly across different datasets. Third, most approaches focus on identifying frequent patterns, often overlooking rare but meaningful ones. To address these limitations, this paper proposes a framework for discovering anomalous sequences in attributed graphs. Instead of relying on frequency-based measures, the framework adopts an entropy-based method for pattern mining, thereby requiring at most one hyperparameter. Experimental results on real-world datasets demonstrate the effectiveness of the proposed approach in detecting anomalous sequences. Moreover, we extend the framework to applications in optics, where it is used to evaluate phase differences.
从社会学到生物学,图形在科学学科中被广泛使用,特别是在建模时间进化时。尽管已经开发了许多算法来发现图中的模式,但它们面临三个主要限制。首先,大多数算法假设每个节点或边缘与单个属性相关联,而实际应用程序通常涉及多个属性以更全面地捕获事件。其次,现有的方法通常需要调优几个超参数,这在不同的数据集上可能会有很大的不同。第三,大多数方法专注于识别频繁的模式,往往忽略了罕见但有意义的模式。为了解决这些限制,本文提出了一个发现属性图中异常序列的框架。该框架不依赖基于频率的度量,而是采用基于熵的方法进行模式挖掘,因此最多只需要一个超参数。在实际数据集上的实验结果证明了该方法在检测异常序列方面的有效性。此外,我们将该框架扩展到光学应用中,用于评估相位差。
{"title":"Discovering anomalous sequences in attributed graphs: A parameter-light approach","authors":"Cheng He ,&nbsp;Xinyang Chen ,&nbsp;Amaël Broustet ,&nbsp;Guoting Chen","doi":"10.1016/j.eswa.2026.131467","DOIUrl":"10.1016/j.eswa.2026.131467","url":null,"abstract":"<div><div>Graphs have been widely used across scientific disciplines, from sociology to biology, particularly when modeling temporal evolution. Although many algorithms have been developed to discover patterns in graphs, they face three main limitations. First, most algorithms assume that each node or edge is associated with a single attribute, whereas real-world applications often involve multiple attributes to capture events more comprehensively. Second, existing methods typically require tuning several hyperparameters, which can vary significantly across different datasets. Third, most approaches focus on identifying frequent patterns, often overlooking rare but meaningful ones. To address these limitations, this paper proposes a framework for discovering anomalous sequences in attributed graphs. Instead of relying on frequency-based measures, the framework adopts an entropy-based method for pattern mining, thereby requiring at most one hyperparameter. Experimental results on real-world datasets demonstrate the effectiveness of the proposed approach in detecting anomalous sequences. Moreover, we extend the framework to applications in optics, where it is used to evaluate phase differences.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"312 ","pages":"Article 131467"},"PeriodicalIF":7.5,"publicationDate":"2026-05-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146122664","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Safety-assured decision support for ASV navigation via hybrid graph planning and timed automata verification 通过混合图形规划和定时自动机验证的ASV导航安全决策支持
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-05-25 Epub Date: 2026-02-05 DOI: 10.1016/j.eswa.2026.131367
Huilin Ge , Meng Li , Guanghui Wen , Yu Lu
Autonomous surface vehicles (ASVs) require reliable safety assurance to operate in complex and dynamic marine environments. This paper develops an integrated decision-support framework that couples hybrid graph-based path planning with formal verification to guarantee safe and reachable navigation. A composite roadmap is generated using the proposed HVV-E planner, which combines Voronoi-based global planning and visibility-graph refinement to produce collision-free and energy-aware trajectories. To ensure the trustworthiness of candidate routes, the navigation process is abstracted into a Linearly Priced Timed Automata (LPTA) model and formally verified using UPPAAL. Both qualitative properties, including obstacle avoidance and deadlock-freeness, and quantitative mission constraints, such as bounded travel time and energy consumption, are examined. The verification results provide explicit explanations of why a route is safe or unsafe, enabling early identification of infeasible or risky mission configurations. Experiments conducted in realistic Singapore Strait scenarios demonstrate that the proposed framework delivers transparent, safety-assured, and energy-aware navigation support for real-world ASV missions. The results highlight the value of integrating formal reasoning with intelligent planning to advance explainable and trustworthy autonomous maritime systems.
自动水面车辆(asv)需要可靠的安全保证才能在复杂和动态的海洋环境中运行。本文开发了一种集成的决策支持框架,将基于混合图的路径规划与形式验证相结合,以保证安全可达的导航。利用提出的HVV-E规划器生成复合路线图,该规划器结合了基于voronoi的全局规划和可见性图的改进,以生成无碰撞和能量感知的轨迹。为了保证候选路径的可信度,将导航过程抽象为线性定价时间自动机(LPTA)模型,并使用UPPAAL进行形式化验证。定性性质,包括避障和无死锁,定量任务约束,如有限的旅行时间和能量消耗,进行了检查。验证结果明确解释了路线安全或不安全的原因,从而能够及早识别不可行或有风险的任务配置。在现实的新加坡海峡场景中进行的实验表明,所提出的框架为现实世界的ASV任务提供了透明、安全、节能的导航支持。结果强调了将形式推理与智能规划相结合以推进可解释和可信赖的自主海事系统的价值。
{"title":"Safety-assured decision support for ASV navigation via hybrid graph planning and timed automata verification","authors":"Huilin Ge ,&nbsp;Meng Li ,&nbsp;Guanghui Wen ,&nbsp;Yu Lu","doi":"10.1016/j.eswa.2026.131367","DOIUrl":"10.1016/j.eswa.2026.131367","url":null,"abstract":"<div><div>Autonomous surface vehicles (ASVs) require reliable safety assurance to operate in complex and dynamic marine environments. This paper develops an integrated decision-support framework that couples hybrid graph-based path planning with formal verification to guarantee safe and reachable navigation. A composite roadmap is generated using the proposed HVV-E planner, which combines Voronoi-based global planning and visibility-graph refinement to produce collision-free and energy-aware trajectories. To ensure the trustworthiness of candidate routes, the navigation process is abstracted into a Linearly Priced Timed Automata (LPTA) model and formally verified using UPPAAL. Both qualitative properties, including obstacle avoidance and deadlock-freeness, and quantitative mission constraints, such as bounded travel time and energy consumption, are examined. The verification results provide explicit explanations of why a route is safe or unsafe, enabling early identification of infeasible or risky mission configurations. Experiments conducted in realistic Singapore Strait scenarios demonstrate that the proposed framework delivers transparent, safety-assured, and energy-aware navigation support for real-world ASV missions. The results highlight the value of integrating formal reasoning with intelligent planning to advance explainable and trustworthy autonomous maritime systems.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"312 ","pages":"Article 131367"},"PeriodicalIF":7.5,"publicationDate":"2026-05-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146191894","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Temporal three-way decision-making for emergency admission integrating multigranulation neighborhood rough set with Gaussian mixture-hidden Markov model 基于高斯混合-隐马尔可夫模型的多粒邻域粗糙集急诊入院时间三向决策
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-05-25 Epub Date: 2026-02-05 DOI: 10.1016/j.eswa.2026.131458
Meng Zhang, Jianghua Zhang, Dongchen Gao, Weibo Liu
Accurate and timely admission prediction in emergency departments is essential for improving resource allocation, enhancing patient outcomes, and mitigating overcrowding. However, the progression of emergency patients often exhibits strong temporal dynamics, and clinical decisions typically involve not only admission and non-admission but also an intermediate state of wait-and-see. To address this challenge, this study proposes a novel temporal three-way decision-making method that integrates Temporal Feature-based Multigranulation Neighborhood Rough Set (TMNRS) with Gaussian Mixture-Hidden Markov Model (GMM-HMM). Specifically, TMNRS is utilized to quantify and characterize the initial distribution of patient states from both theoretical and data-driven perspectives, thereby providing parameter support for subsequent modeling. Building on this foundation, GMM-HMM is employed to capture the dynamic evolution of patients’ conditions across three states over time. This integration facilitates interpretable state representation of the model. Finally, comprehensive experiments conducted on real-world clinical data, including comparisons with multiple benchmark models, demonstrate competitive and rob ust performance of the proposed approach in supporting temporal three-way admission decision-making for emergency patients.
在急诊科准确和及时的入院预测是必不可少的,以改善资源分配,提高患者的结果,并缓解过度拥挤。然而,急诊患者的进展往往表现出强烈的时间动态,临床决策通常不仅涉及入院和不入院,还包括观望的中间状态。为了解决这一挑战,本研究提出了一种新的时间三向决策方法,该方法将基于时间特征的多粒邻域粗糙集(TMNRS)与高斯混合-隐马尔可夫模型(GMM-HMM)相结合。具体而言,利用TMNRS从理论和数据驱动的角度对患者状态的初始分布进行量化和表征,从而为后续建模提供参数支持。在此基础上,GMM-HMM被用来捕捉三个州的患者病情随时间的动态演变。这种集成促进了模型的可解释状态表示。最后,在现实世界的临床数据上进行了全面的实验,包括与多个基准模型的比较,证明了所提出的方法在支持急诊患者的时间三方入院决策方面的竞争性和公平性。
{"title":"Temporal three-way decision-making for emergency admission integrating multigranulation neighborhood rough set with Gaussian mixture-hidden Markov model","authors":"Meng Zhang,&nbsp;Jianghua Zhang,&nbsp;Dongchen Gao,&nbsp;Weibo Liu","doi":"10.1016/j.eswa.2026.131458","DOIUrl":"10.1016/j.eswa.2026.131458","url":null,"abstract":"<div><div>Accurate and timely admission prediction in emergency departments is essential for improving resource allocation, enhancing patient outcomes, and mitigating overcrowding. However, the progression of emergency patients often exhibits strong temporal dynamics, and clinical decisions typically involve not only admission and non-admission but also an intermediate state of wait-and-see. To address this challenge, this study proposes a novel temporal three-way decision-making method that integrates Temporal Feature-based Multigranulation Neighborhood Rough Set (TMNRS) with Gaussian Mixture-Hidden Markov Model (GMM-HMM). Specifically, TMNRS is utilized to quantify and characterize the initial distribution of patient states from both theoretical and data-driven perspectives, thereby providing parameter support for subsequent modeling. Building on this foundation, GMM-HMM is employed to capture the dynamic evolution of patients’ conditions across three states over time. This integration facilitates interpretable state representation of the model. Finally, comprehensive experiments conducted on real-world clinical data, including comparisons with multiple benchmark models, demonstrate competitive and rob ust performance of the proposed approach in supporting temporal three-way admission decision-making for emergency patients.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"312 ","pages":"Article 131458"},"PeriodicalIF":7.5,"publicationDate":"2026-05-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146192590","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MSIF-SSTR: A “Quick smuggler” smuggling speedboat trajectory recognition method based on multi-source information fusion MSIF-SSTR:基于多源信息融合的“快速走私者”走私快艇轨迹识别方法
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-05-25 Epub Date: 2026-02-04 DOI: 10.1016/j.eswa.2026.131525
Zhuhua Hu , Yifeng Sun , Yaochi Zhao , Wei Wu , Lingkai Kong , Keli Chen
Cooperating with maritime administrative departments to identify smuggling activities and enhance the control ability of nearshore vessels holds significant practical significance. However, existing research mostly relies on basic AIS data and simple features, making it difficult to deal with complex vessel behaviors. Especially when identifying covert and flexible smuggling activities, it is prone to misjudgment and has limited effectiveness. In real-world enforcement, distinguishing truly suspicious “Quick Smuggler” smuggling from benign high-speed transit requires modeling subtle, deep-level spatio-temporal cues that couple motion dynamics with external conditions (e.g., wind, wave, visibility) and context. Simple linear mappings and shallow temporal encoders often overfit speed bursts or local detours, causing elevated false alarms. By contrast, dilated-convolutional receptive fields in TCNs capture multi-scale temporal dependencies efficiently, while KAN layers provide adaptive nonlinear function bases to fit complex, locally varying trajectory patterns. This synergy is particularly suited to covert nighttime operations under shifting sea states, where genuine smuggling exhibits trajectory micro-structures and weather-conditioned behaviors that are hard to emulate by normal craft. To address these challenges, this study proposes a Multi-Source Information Fusion-based “Quick Smuggler” Smuggling Speedboat Trajectory Recognition method (MSIF-SSTR). First, we construct the HN_BF dataset, comprising real-world nighttime radar trajectories from the Qiongzhou Strait and corresponding meteorological data. Next, parallel TCN networks are employed to separately extract motion features, and meteorological features, enabling the model to better capture global temporal dependencies during feature extraction. Finally, the fused features are fed into an LSTM for classification, while a Kolmogorov-arnold networks (KAN) module replaces traditional fully connected layers to improve the representation of complex trajectory patterns. Experimental results demonstrate that MSIF-SSTR achieves F1-scores exceeding 94.2% on the HN_BF dataset, outperforming state-of-the-art methods with higher computational efficiency. Field applications confirm the model’s robustness.
与海事管理部门合作,查清走私活动,提高近岸船舶的管制能力,具有重要的现实意义。然而,现有的研究大多依赖于基本的AIS数据和简单的特征,难以处理复杂的船舶行为。特别是在识别隐蔽和灵活的走私活动时,容易出现误判,效果有限。在现实世界的执法中,区分真正可疑的“快速走私者”走私和良性的高速运输需要建模微妙的、深层次的时空线索,这些线索将运动动力学与外部条件(例如,风、波浪、能见度)和环境相结合。简单的线性映射和浅时间编码器经常过拟合速度突发或局部弯路,导致误报警升高。相比之下,tcnn中的扩展卷积接受场有效地捕获多尺度时间依赖性,而KAN层提供自适应非线性函数基来拟合复杂的局部变化轨迹模式。这种协同作用特别适合在变化的海况下进行夜间秘密行动,因为真正的走私显示出常规船只难以模仿的轨迹微观结构和受天气影响的行为。针对这些挑战,本研究提出了一种基于多源信息融合的“快速走私者”走私快艇轨迹识别方法(MSIF-SSTR)。首先,我们构建了HN_BF数据集,该数据集包含琼州海峡真实的夜间雷达轨迹和相应的气象数据。其次,采用并行TCN网络分别提取运动特征和气象特征,使模型在特征提取过程中更好地捕获全局时间依赖关系。最后,将融合的特征输入到LSTM中进行分类,而Kolmogorov-arnold网络(KAN)模块取代传统的全连接层,以改善复杂轨迹模式的表示。实验结果表明,MSIF-SSTR在HN_BF数据集上的f1得分超过94.2%,计算效率高于现有方法。现场应用验证了模型的鲁棒性。
{"title":"MSIF-SSTR: A “Quick smuggler” smuggling speedboat trajectory recognition method based on multi-source information fusion","authors":"Zhuhua Hu ,&nbsp;Yifeng Sun ,&nbsp;Yaochi Zhao ,&nbsp;Wei Wu ,&nbsp;Lingkai Kong ,&nbsp;Keli Chen","doi":"10.1016/j.eswa.2026.131525","DOIUrl":"10.1016/j.eswa.2026.131525","url":null,"abstract":"<div><div>Cooperating with maritime administrative departments to identify smuggling activities and enhance the control ability of nearshore vessels holds significant practical significance. However, existing research mostly relies on basic AIS data and simple features, making it difficult to deal with complex vessel behaviors. Especially when identifying covert and flexible smuggling activities, it is prone to misjudgment and has limited effectiveness. In real-world enforcement, distinguishing truly suspicious “Quick Smuggler” smuggling from benign high-speed transit requires modeling subtle, deep-level spatio-temporal cues that couple motion dynamics with external conditions (e.g., wind, wave, visibility) and context. Simple linear mappings and shallow temporal encoders often overfit speed bursts or local detours, causing elevated false alarms. By contrast, dilated-convolutional receptive fields in TCNs capture multi-scale temporal dependencies efficiently, while KAN layers provide adaptive nonlinear function bases to fit complex, locally varying trajectory patterns. This synergy is particularly suited to covert nighttime operations under shifting sea states, where genuine smuggling exhibits trajectory micro-structures and weather-conditioned behaviors that are hard to emulate by normal craft. To address these challenges, this study proposes a Multi-Source Information Fusion-based “Quick Smuggler” Smuggling Speedboat Trajectory Recognition method (MSIF-SSTR). First, we construct the HN_BF dataset, comprising real-world nighttime radar trajectories from the Qiongzhou Strait and corresponding meteorological data. Next, parallel TCN networks are employed to separately extract motion features, and meteorological features, enabling the model to better capture global temporal dependencies during feature extraction. Finally, the fused features are fed into an LSTM for classification, while a Kolmogorov-arnold networks (KAN) module replaces traditional fully connected layers to improve the representation of complex trajectory patterns. Experimental results demonstrate that MSIF-SSTR achieves F1-scores exceeding 94.2% on the HN_BF dataset, outperforming state-of-the-art methods with higher computational efficiency. Field applications confirm the model’s robustness.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"312 ","pages":"Article 131525"},"PeriodicalIF":7.5,"publicationDate":"2026-05-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146192599","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Neural clothing tryer: Customized virtual try-on via semantic enhancement and controlling diffusion model 神经试衣机:通过语义增强和控制扩散模型定制虚拟试衣机
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-05-25 Epub Date: 2026-02-02 DOI: 10.1016/j.eswa.2026.131390
Zhijing Yang , Weiwei Zhang , Mingliang Yang , Siyuan Peng , Yukai Shi , Junpeng Tan , Tianshui Chen , Liruo Zhong
This work aims to address a novel Customized Virtual Try-ON (Cu-VTON) task, enabling the superimposition of a specified garment onto a model that can be customized in terms of appearance, posture, and additional attributes. Compared with traditional VTON task, it enables users to tailor digital avatars to their individual preferences, thereby enhancing the virtual fitting experience with greater flexibility and engagement. To address this task, we introduce a Neural Clothing Tryer (NCT) framework, which exploits the advanced diffusion models equipped with semantic enhancement and controlling modules to better preserve semantic characterization and textural details of the garment and meanwhile facilitating the flexible editing of the model’s postures and appearances. Specifically, NCT introduces a semantic-enhanced module to take semantic descriptions of garments and utilizes a visual-language encoder to learn aligned features across modalities. The aligned features are served as condition input to the diffusion model to enhance the preservation of the garment’s semantics. Then, a semantic controlling module is designed to take the garment image, tailored posture image, and semantic description as input to maintain garment details while simultaneously editing model postures, expressions, and various attributes. Extensive experiments on the open available benchmark demonstrate the superior performance of the proposed NCT framework.
这项工作旨在解决一种新颖的定制虚拟试穿(Cu-VTON)任务,将指定的服装叠加到可以根据外观,姿势和其他属性定制的模型上。与传统的VTON任务相比,它使用户能够根据个人喜好定制数字化身,从而提高虚拟试衣体验的灵活性和参与度。为了解决这个问题,我们引入了一个Neural Clothing Tryer (NCT)框架,该框架利用配备语义增强和控制模块的先进扩散模型来更好地保留服装的语义特征和纹理细节,同时促进模型的姿态和外观的灵活编辑。具体来说,NCT引入了一个语义增强模块来获取服装的语义描述,并利用视觉语言编码器来学习跨模态的对齐特征。将对齐的特征作为扩散模型的条件输入,以增强服装语义的保存。然后,设计语义控制模块,以服装图像、剪裁姿态图像、语义描述为输入,维护服装细节,同时编辑模型姿态、表情和各种属性。在开放可用基准上的大量实验证明了所提出的NCT框架的优越性能。
{"title":"Neural clothing tryer: Customized virtual try-on via semantic enhancement and controlling diffusion model","authors":"Zhijing Yang ,&nbsp;Weiwei Zhang ,&nbsp;Mingliang Yang ,&nbsp;Siyuan Peng ,&nbsp;Yukai Shi ,&nbsp;Junpeng Tan ,&nbsp;Tianshui Chen ,&nbsp;Liruo Zhong","doi":"10.1016/j.eswa.2026.131390","DOIUrl":"10.1016/j.eswa.2026.131390","url":null,"abstract":"<div><div>This work aims to address a novel Customized Virtual Try-ON (Cu-VTON) task, enabling the superimposition of a specified garment onto a model that can be customized in terms of appearance, posture, and additional attributes. Compared with traditional VTON task, it enables users to tailor digital avatars to their individual preferences, thereby enhancing the virtual fitting experience with greater flexibility and engagement. To address this task, we introduce a Neural Clothing Tryer (NCT) framework, which exploits the advanced diffusion models equipped with semantic enhancement and controlling modules to better preserve semantic characterization and textural details of the garment and meanwhile facilitating the flexible editing of the model’s postures and appearances. Specifically, NCT introduces a semantic-enhanced module to take semantic descriptions of garments and utilizes a visual-language encoder to learn aligned features across modalities. The aligned features are served as condition input to the diffusion model to enhance the preservation of the garment’s semantics. Then, a semantic controlling module is designed to take the garment image, tailored posture image, and semantic description as input to maintain garment details while simultaneously editing model postures, expressions, and various attributes. Extensive experiments on the open available benchmark demonstrate the superior performance of the proposed NCT framework.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"312 ","pages":"Article 131390"},"PeriodicalIF":7.5,"publicationDate":"2026-05-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146192843","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Expert Systems with Applications
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1