首页 > 最新文献

Expert Systems with Applications最新文献

英文 中文
Efficient workflow offloading in private clouds using serverless computing 使用无服务器计算在私有云中高效地卸载工作流
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-31 DOI: 10.1016/j.eswa.2026.131460
Shukun Yu , Quanwang Wu , Kun Cai , Taolin Guo , Zhuo Jiang , Tianhao Sun
Although private clouds provide workflow execution environments with enhanced data security, greater administrative control, and predictable resource provisioning, they often suffer from limited elasticity and struggle to adapt to dynamic and bursty workload patterns. Serverless computing, characterized by elastic scaling, pay-per-use pricing model, and event-driven execution, presents a promising paradigm to address these limitations. Hence, this paper explores integrating private clouds with serverless computing to enable efficient workflow offloading. A systematic workflow offloading framework is established for minimizing monetary costs under workflow deadlines. A Deadline-Aware Task Offloading (DATO) heuristic algorithm is proposed, which strategically offloads workflow tasks between serverless platforms and private cloud resources. It employs latest-start-time prioritization to sort tasks from different workflows, and dynamically determines optimal offloading decisions by balancing task characteristics, resource availability, and cost considerations. Evaluation experiments have been carried out with realistic workflows and platform settings. The results demonstrate the excellent performance of DATO in reducing costs under deadline constraints compared with traditional approaches.
尽管私有云为工作流执行环境提供了增强的数据安全性、更好的管理控制和可预测的资源供应,但它们的弹性往往有限,难以适应动态和突发的工作负载模式。无服务器计算的特点是弹性伸缩、按使用付费的定价模型和事件驱动的执行,为解决这些限制提供了一个很有前途的范例。因此,本文探讨了将私有云与无服务器计算集成在一起,以实现高效的工作流卸载。建立了一个系统的工作流卸载框架,以最大限度地减少工作流截止日期下的货币成本。提出了一种基于截止日期感知的任务卸载(DATO)启发式算法,在无服务器平台和私有云资源之间策略性地进行工作流任务的卸载。它采用最新开始时间优先级来对来自不同工作流的任务进行排序,并通过平衡任务特征、资源可用性和成本考虑来动态确定最佳卸载决策。在实际工作流程和平台设置下进行了评估实验。结果表明,与传统方法相比,DATO在期限约束下降低成本方面具有优异的性能。
{"title":"Efficient workflow offloading in private clouds using serverless computing","authors":"Shukun Yu ,&nbsp;Quanwang Wu ,&nbsp;Kun Cai ,&nbsp;Taolin Guo ,&nbsp;Zhuo Jiang ,&nbsp;Tianhao Sun","doi":"10.1016/j.eswa.2026.131460","DOIUrl":"10.1016/j.eswa.2026.131460","url":null,"abstract":"<div><div>Although private clouds provide workflow execution environments with enhanced data security, greater administrative control, and predictable resource provisioning, they often suffer from limited elasticity and struggle to adapt to dynamic and bursty workload patterns. Serverless computing, characterized by elastic scaling, pay-per-use pricing model, and event-driven execution, presents a promising paradigm to address these limitations. Hence, this paper explores integrating private clouds with serverless computing to enable efficient workflow offloading. A systematic workflow offloading framework is established for minimizing monetary costs under workflow deadlines. A Deadline-Aware Task Offloading (DATO) heuristic algorithm is proposed, which strategically offloads workflow tasks between serverless platforms and private cloud resources. It employs latest-start-time prioritization to sort tasks from different workflows, and dynamically determines optimal offloading decisions by balancing task characteristics, resource availability, and cost considerations. Evaluation experiments have been carried out with realistic workflows and platform settings. The results demonstrate the excellent performance of DATO in reducing costs under deadline constraints compared with traditional approaches.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"312 ","pages":"Article 131460"},"PeriodicalIF":7.5,"publicationDate":"2026-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146122665","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A depthwise convolutional variational autoencoder for anomaly detection in complex traffic scenarios from UAV views 一种基于深度卷积变分自编码器的无人机视角复杂交通场景异常检测
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-30 DOI: 10.1016/j.eswa.2026.131425
Arslan Saleem, Cem Direkoglu
Traffic anomaly detection (AD) is essential for improving public safety, reducing risks, and enabling quick responses in intelligent surveillance systems. Aerial traffic monitoring, particularly using Unmanned Aerial Vehicles (UAV), has gained attention due to its potential to address challenges like dynamic urban environments, yet it remains underexplored. Detecting anomalies in drone-captured video involves unique obstacles: rare events, small and overlapping objects, multi-scale targets, and complex backgrounds. To address these challenges, we propose the Depthwise Convolutional Variational Autoencoder (DwCVAE), a novel model designed to enhance AD in drone-based traffic surveillance. DwCVAE leverages depthwise convolutions, which allow efficient and detailed feature extraction, improving model sensitivity to subtle and multi-scale anomalies. The proposed DwCVAE adopts an encoder-latent-decoder VAE architecture, in which stacked depthwise convolutional layers in the encoder emphasize spatially localized feature learning while maintaining channel-wise efficiency, and a compact variational latent space captures the distribution of normal traffic dynamics. Built on variational autoencoder (VAE) architecture, DwCVAE creates compact latent representations that capture normal traffic patterns, enabling reliable detection of deviations. Anomalies are identified through reconstruction-based scoring, where events that deviate from the learned normal representations yield higher reconstruction errors. This depthwise approach marks a key innovation, optimizing both computational efficiency and detection accuracy. We design four additional models: Convolutional Variational Autoencoder (CVAE), Dilated Convolutional VAE (DCVAE), Separable Convolutional VAE (SCVAE), and Convolutional LSTM VAE (CLSTMVAE) to systematically assess the effectiveness of DwCVAE. Additionally, we evaluate DwCVAE against state-of-the-art weakly supervised and unsupervised models on two benchmark datasets, Drone-Anomaly and UIT-Adrone. DwCVAE achieves an AUC of 74.95 with an EER of 0.30 on Drone-Anomaly, and an AUC of 79.77 with an EER of 0.27 on UIT-ADrone, demonstrating its superior performance in complex aerial surveillance tasks.
在智能监控系统中,交通异常检测(Traffic anomaly detection, AD)对于提高公共安全、降低风险和实现快速响应至关重要。空中交通监控,特别是使用无人机(UAV),由于其应对动态城市环境等挑战的潜力而受到关注,但仍未得到充分开发。检测无人机捕获的视频异常涉及独特的障碍:罕见事件,小而重叠的物体,多尺度目标和复杂的背景。为了解决这些挑战,我们提出了深度卷积变分自编码器(DwCVAE),这是一种新的模型,旨在增强基于无人机的交通监控中的AD。DwCVAE利用深度卷积,允许高效和详细的特征提取,提高模型对微妙和多尺度异常的敏感性。本文提出的DwCVAE采用编码器-潜伏-解码器的VAE架构,在保持信道效率的同时,编码器中堆叠的深度卷积层强调空间局部特征学习,紧凑的变分潜伏空间捕获正常流量动态分布。DwCVAE建立在变分自编码器(VAE)架构上,创建了紧凑的潜在表示,可以捕获正常的流量模式,从而能够可靠地检测偏差。通过基于重建的评分来识别异常,其中偏离学习到的正常表示的事件会产生更高的重建误差。这种深入的方法标志着一项关键的创新,优化了计算效率和检测精度。我们设计了四个额外的模型:卷积变分自编码器(CVAE)、扩张卷积VAE (DCVAE)、可分离卷积VAE (SCVAE)和卷积LSTMVAE (CLSTMVAE)来系统地评估DwCVAE的有效性。此外,我们在两个基准数据集(Drone-Anomaly和unit - drone)上针对最先进的弱监督和无监督模型评估DwCVAE。DwCVAE在Drone-Anomaly上AUC为74.95,EER为0.30;在unit - drone上AUC为79.77,EER为0.27,在复杂的空中监视任务中表现出优越的性能。
{"title":"A depthwise convolutional variational autoencoder for anomaly detection in complex traffic scenarios from UAV views","authors":"Arslan Saleem,&nbsp;Cem Direkoglu","doi":"10.1016/j.eswa.2026.131425","DOIUrl":"10.1016/j.eswa.2026.131425","url":null,"abstract":"<div><div>Traffic anomaly detection (AD) is essential for improving public safety, reducing risks, and enabling quick responses in intelligent surveillance systems. Aerial traffic monitoring, particularly using Unmanned Aerial Vehicles (UAV), has gained attention due to its potential to address challenges like dynamic urban environments, yet it remains underexplored. Detecting anomalies in drone-captured video involves unique obstacles: rare events, small and overlapping objects, multi-scale targets, and complex backgrounds. To address these challenges, we propose the Depthwise Convolutional Variational Autoencoder (DwCVAE), a novel model designed to enhance AD in drone-based traffic surveillance. DwCVAE leverages depthwise convolutions, which allow efficient and detailed feature extraction, improving model sensitivity to subtle and multi-scale anomalies. The proposed DwCVAE adopts an encoder-latent-decoder VAE architecture, in which stacked depthwise convolutional layers in the encoder emphasize spatially localized feature learning while maintaining channel-wise efficiency, and a compact variational latent space captures the distribution of normal traffic dynamics. Built on variational autoencoder (VAE) architecture, DwCVAE creates compact latent representations that capture normal traffic patterns, enabling reliable detection of deviations. Anomalies are identified through reconstruction-based scoring, where events that deviate from the learned normal representations yield higher reconstruction errors. This depthwise approach marks a key innovation, optimizing both computational efficiency and detection accuracy. We design four additional models: Convolutional Variational Autoencoder (CVAE), Dilated Convolutional VAE (DCVAE), Separable Convolutional VAE (SCVAE), and Convolutional LSTM VAE (CLSTMVAE) to systematically assess the effectiveness of DwCVAE. Additionally, we evaluate DwCVAE against state-of-the-art weakly supervised and unsupervised models on two benchmark datasets, Drone-Anomaly and UIT-Adrone. DwCVAE achieves an AUC of 74.95 with an EER of 0.30 on Drone-Anomaly, and an AUC of 79.77 with an EER of 0.27 on UIT-ADrone, demonstrating its superior performance in complex aerial surveillance tasks.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"312 ","pages":"Article 131425"},"PeriodicalIF":7.5,"publicationDate":"2026-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146122667","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dynamic 3D Gaussian SLAM via motion suppression and incremental optimization 动态三维高斯SLAM通过运动抑制和增量优化
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-30 DOI: 10.1016/j.eswa.2026.131329
Longxin Zhang , Shuping Ye , Benlian Xu , Shuting Le , Xu Zhou , Mingli Lu , Jinliang Cong , Jian Shi
Recent advancements in 3D Gaussian Splatting (3DGS) have demonstrated significant potential for high-fidelity scene reconstruction and real-time rendering. However, applying 3DGS-based Simultaneous Localization and Mapping (SLAM) in dynamic environments remains challenging due to disturbance from moving objects, which corrupt the Gaussian initialization and optimization processes. To address these issues, we propose a dynamic 3D Gaussian SLAM framework that integrates a dynamic pollution suppression mechanism and an incremental Gaussian optimization strategy. Our approach leverages semantic segmentation and depth fusion to actively mask dynamic regions during initialization, ensuring the integrity of the foundational Gaussian model. Furthermore, we introduce an image inpainting network to restore static content in masked areas, enabling high-quality densification of under-rendered regions. Experimental results on the TUM RGB-D dataset and real-world dynamic scenes demonstrate that our method significantly outperforms state-of-the-art approaches in both localization accuracy and rendering quality, achieving robust performance in highly dynamic environments.
3D高斯飞溅(3DGS)的最新进展已经证明了高保真场景重建和实时渲染的巨大潜力。然而,在动态环境中应用基于3dgs的同步定位和映射(SLAM)仍然具有挑战性,因为运动物体的干扰会破坏高斯初始化和优化过程。为了解决这些问题,我们提出了一个动态三维高斯SLAM框架,该框架集成了动态污染抑制机制和增量高斯优化策略。我们的方法利用语义分割和深度融合在初始化过程中主动掩盖动态区域,确保基础高斯模型的完整性。此外,我们还引入了一个图像绘制网络来恢复遮罩区域的静态内容,从而实现渲染不足区域的高质量密度化。在TUM RGB-D数据集和现实世界动态场景上的实验结果表明,我们的方法在定位精度和渲染质量方面都明显优于最先进的方法,在高动态环境中实现了稳健的性能。
{"title":"Dynamic 3D Gaussian SLAM via motion suppression and incremental optimization","authors":"Longxin Zhang ,&nbsp;Shuping Ye ,&nbsp;Benlian Xu ,&nbsp;Shuting Le ,&nbsp;Xu Zhou ,&nbsp;Mingli Lu ,&nbsp;Jinliang Cong ,&nbsp;Jian Shi","doi":"10.1016/j.eswa.2026.131329","DOIUrl":"10.1016/j.eswa.2026.131329","url":null,"abstract":"<div><div>Recent advancements in 3D Gaussian Splatting (3DGS) have demonstrated significant potential for high-fidelity scene reconstruction and real-time rendering. However, applying 3DGS-based Simultaneous Localization and Mapping (SLAM) in dynamic environments remains challenging due to disturbance from moving objects, which corrupt the Gaussian initialization and optimization processes. To address these issues, we propose a dynamic 3D Gaussian SLAM framework that integrates a dynamic pollution suppression mechanism and an incremental Gaussian optimization strategy. Our approach leverages semantic segmentation and depth fusion to actively mask dynamic regions during initialization, ensuring the integrity of the foundational Gaussian model. Furthermore, we introduce an image inpainting network to restore static content in masked areas, enabling high-quality densification of under-rendered regions. Experimental results on the TUM RGB-D dataset and real-world dynamic scenes demonstrate that our method significantly outperforms state-of-the-art approaches in both localization accuracy and rendering quality, achieving robust performance in highly dynamic environments.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"311 ","pages":"Article 131329"},"PeriodicalIF":7.5,"publicationDate":"2026-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146081700","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DCCL: Question-guided dual-channel contrastive learning framework for emotion-cause pair extraction 基于问题导向的双通道情感原因对提取对比学习框架
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-29 DOI: 10.1016/j.eswa.2026.131357
Hongyang Wang, Yajun Du, Jia Liu, Xianyong Li, Xiaoliang Chen, Yanli Lee, Qing Qi, Wanjie Zhang
The emotion-cause pair extraction (ECPE) task aims to identify emotion clauses and their corresponding cause clauses from document-level text. It has important applications in a wide range of scenarios, including public opinion monitoring and user feedback analysis. Although research has made initial progress on this task, existing methods still face challenges in identifying implicit emotions. Firstly, the lack of explicit semantic guidance leads to insufficient discriminative power, especially when dealing with ambiguous emotional expressions. Secondly, existing methods primarily focus on modeling intra-sentence relationships, which limits their ability to jointly capture cross-sentence temporal dependencies and global semantic information. To address the challenges of emotion-cause pair extraction, we propose a question-guided dual-channel contrastive learning framework, DCCL. Firstly, the DCCL employs a question formulation based on machine reading comprehension (MRC) to guide the model in capturing the emotion-cause relationship between clauses. Furthermore, task-specific queries are explicitly injected into the input, making the model more aware of the task objective. Secondly, in DCCL, we design a dual-channel network combining query-aware clause-level Transformer and BiLSTM to enhance the model’s ability to capture temporal and global contextual dependencies, which enables DCCL to capture the temporal and global contextual relationships between clauses more fully. Thirdly, the DCCL incorporates supervised contrastive learning. We leverage positive and negative samples to incorporate contrastive learning into each channel, which optimizes the representation space and enhances the model’s ability to recognize ambiguous emotions and boundary conditions. We conducted experiments on three mainstream tasks, namely emotion cause pair extraction, emotion extraction, and cause extraction, on the ECPE benchmark dataset. The results show that DCCL improves the F1 scores of the best baseline models such as CD-MRC, SEG, ect by 1.53%, 4.41%, respectively in the emotion-cause pair extraction task, 0.81%, 4.37%, respectively in the emotion extraction task, and 0.62%, 1.27%, respectively in the cause extraction task. Moreover, compared with the large language model baseline LLM-MTLN, DCCL further improves F1 by 2.48%, 4.50%, and 0.63% on these three tasks, respectively.
情感-原因对抽取(ECPE)任务旨在从文档级文本中识别情感子句及其对应的原因子句。它在广泛的场景中具有重要的应用,包括舆情监测和用户反馈分析。虽然研究在这项任务上取得了初步进展,但现有的方法在识别内隐情绪方面仍然面临挑战。首先,缺乏明确的语义引导导致辨别能力不足,尤其是在处理模棱两可的情绪表达时。其次,现有的方法主要关注句子内关系的建模,这限制了它们联合捕获跨句子时间依赖关系和全局语义信息的能力。为了解决情感-原因对提取的挑战,我们提出了一个问题引导的双通道对比学习框架,DCCL。首先,DCCL采用基于机器阅读理解(MRC)的问题提法来指导模型捕捉子句之间的情感-原因关系。此外,特定于任务的查询被显式地注入到输入中,使模型更加了解任务目标。其次,在DCCL中,我们设计了一个结合查询感知子句级Transformer和BiLSTM的双通道网络,以增强模型捕获时态和全局上下文依赖关系的能力,使DCCL能够更充分地捕获子句之间的时态和全局上下文关系。第三,DCCL结合了监督对比学习。我们利用正样本和负样本将对比学习纳入每个通道,这优化了表示空间,增强了模型识别模糊情绪和边界条件的能力。我们在ECPE基准数据集上对情感原因对提取、情感提取和原因提取三个主流任务进行了实验。结果表明,DCCL在情绪-原因对提取任务中分别提高了CD-MRC、SEG等最佳基线模型的F1得分1.53%、4.41%,在情绪提取任务中分别提高了0.81%、4.37%,在原因提取任务中分别提高了0.62%、1.27%。此外,与大型语言模型基线LLM-MTLN相比,DCCL在这三个任务上的F1分别提高了2.48%、4.50%和0.63%。
{"title":"DCCL: Question-guided dual-channel contrastive learning framework for emotion-cause pair extraction","authors":"Hongyang Wang,&nbsp;Yajun Du,&nbsp;Jia Liu,&nbsp;Xianyong Li,&nbsp;Xiaoliang Chen,&nbsp;Yanli Lee,&nbsp;Qing Qi,&nbsp;Wanjie Zhang","doi":"10.1016/j.eswa.2026.131357","DOIUrl":"10.1016/j.eswa.2026.131357","url":null,"abstract":"<div><div>The emotion-cause pair extraction (ECPE) task aims to identify emotion clauses and their corresponding cause clauses from document-level text. It has important applications in a wide range of scenarios, including public opinion monitoring and user feedback analysis. Although research has made initial progress on this task, existing methods still face challenges in identifying implicit emotions. Firstly, the lack of explicit semantic guidance leads to insufficient discriminative power, especially when dealing with ambiguous emotional expressions. Secondly, existing methods primarily focus on modeling intra-sentence relationships, which limits their ability to jointly capture cross-sentence temporal dependencies and global semantic information. To address the challenges of emotion-cause pair extraction, we propose a question-guided dual-channel contrastive learning framework, DCCL. Firstly, the DCCL employs a question formulation based on machine reading comprehension (MRC) to guide the model in capturing the emotion-cause relationship between clauses. Furthermore, task-specific queries are explicitly injected into the input, making the model more aware of the task objective. Secondly, in DCCL, we design a dual-channel network combining query-aware clause-level Transformer and BiLSTM to enhance the model’s ability to capture temporal and global contextual dependencies, which enables DCCL to capture the temporal and global contextual relationships between clauses more fully. Thirdly, the DCCL incorporates supervised contrastive learning. We leverage positive and negative samples to incorporate contrastive learning into each channel, which optimizes the representation space and enhances the model’s ability to recognize ambiguous emotions and boundary conditions. We conducted experiments on three mainstream tasks, namely emotion cause pair extraction, emotion extraction, and cause extraction, on the ECPE benchmark dataset. The results show that DCCL improves the F1 scores of the best baseline models such as CD-MRC, SEG, ect by 1.53%, 4.41%, respectively in the emotion-cause pair extraction task, 0.81%, 4.37%, respectively in the emotion extraction task, and 0.62%, 1.27%, respectively in the cause extraction task. Moreover, compared with the large language model baseline LLM-MTLN, DCCL further improves F1 by 2.48%, 4.50%, and 0.63% on these three tasks, respectively.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"311 ","pages":"Article 131357"},"PeriodicalIF":7.5,"publicationDate":"2026-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146070882","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Preference learning based on maximizing membership degree with heterogeneous information for landslide early warning 基于异构信息最大隶属度的偏好学习滑坡预警
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-29 DOI: 10.1016/j.eswa.2026.131381
Jiajia Jiang , Min Zhan , Gaocan Gong , Lin Wang , Quanbo Zha
Preference learning involves developing a model that reflects a decision-maker's preference based on the provided information. Existing preference learning models for multi-criteria classification overlook exploring the membership degree of an alternative to a predefined class, especially in the face of heterogeneous types of criteria information. To address this issue, this paper proposes a preference learning model based on membership degree maximization with heterogeneous information. First, based on the additive utility function, we construct a preference model that integrates numerical and linguistic criteria information to express the utilities of alternatives. Within this model, four types of utility functions are considered to describe the variation characteristics of criteria. Next, triangular fuzzy numbers are employed to capture the membership degree of each alternative within the predefined classes, and a learning model is developed by maximizing the membership degrees of alternatives to their corresponding predefined classes. Finally, the proposed model is applied to landslide early warning to verify its feasibility.
偏好学习涉及开发一个模型,该模型可以根据提供的信息反映决策者的偏好。现有的多准则分类偏好学习模型忽略了对预定义类的替代选择的隶属度的探索,特别是在面对异构类型的准则信息时。针对这一问题,本文提出了一种基于隶属度最大化的异构信息偏好学习模型。首先,基于可加性效用函数,构建了一个综合数值和语言标准信息的偏好模型来表达备选方案的效用。在该模型中,考虑了四种类型的效用函数来描述标准的变化特征。其次,利用三角模糊数捕获预定义类中每个备选方案的隶属度,并通过最大化备选方案与其对应的预定义类的隶属度来建立学习模型。最后,将该模型应用于滑坡预警,验证了该模型的可行性。
{"title":"Preference learning based on maximizing membership degree with heterogeneous information for landslide early warning","authors":"Jiajia Jiang ,&nbsp;Min Zhan ,&nbsp;Gaocan Gong ,&nbsp;Lin Wang ,&nbsp;Quanbo Zha","doi":"10.1016/j.eswa.2026.131381","DOIUrl":"10.1016/j.eswa.2026.131381","url":null,"abstract":"<div><div>Preference learning involves developing a model that reflects a decision-maker's preference based on the provided information. Existing preference learning models for multi-criteria classification overlook exploring the membership degree of an alternative to a predefined class, especially in the face of heterogeneous types of criteria information. To address this issue, this paper proposes a preference learning model based on membership degree maximization with heterogeneous information. First, based on the additive utility function, we construct a preference model that integrates numerical and linguistic criteria information to express the utilities of alternatives. Within this model, four types of utility functions are considered to describe the variation characteristics of criteria. Next, triangular fuzzy numbers are employed to capture the membership degree of each alternative within the predefined classes, and a learning model is developed by maximizing the membership degrees of alternatives to their corresponding predefined classes. Finally, the proposed model is applied to landslide early warning to verify its feasibility.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"312 ","pages":"Article 131381"},"PeriodicalIF":7.5,"publicationDate":"2026-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146122666","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MAGF: Multi-scale attention and gated fusion for multi-modal glaucoma grading MAGF:多模式青光眼分级的多尺度关注和门控融合
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-29 DOI: 10.1016/j.eswa.2026.131388
Haixi Cheng , Chaoqun Hong , Bo Zhang , Huihui Fang , Yanwu Xu , Si Yong Yeo
Glaucoma is one of the leading causes of irreversible blindness worldwide. Color fundus photography (CFP) and optical coherence tomography (OCT) are two primary imaging modalities for glaucoma diagnosis. Recently, multi-modal approaches that combine CFP and OCT have demonstrated higher diagnostic accuracy compared to single-modal methods. However, the high similarity among medical image poses presents a challenge for extracting effective features. Additionally, low-quality features can degrade fusion performance, potentially leading to inaccurate grading results. To address these challenges, we propose a Multi-scale Attention and Gated Fusion (MAGF) framework, which incorporates a dual-branch feature extraction architecture with targeted attention, a Multi-scale Attention Fusion Module (MAFM) for enhancing OCT features, and a Gated Fusion Module (GFM) for adaptive integration of CFP and OCT modalities. Extensive experiments demonstrate that our method achieves state-of-the-art (SOTA) performance in glaucoma grading.
青光眼是世界范围内导致不可逆失明的主要原因之一。彩色眼底摄影(CFP)和光学相干断层扫描(OCT)是青光眼诊断的两种主要成像方式。最近,与单模态方法相比,结合CFP和OCT的多模态方法显示出更高的诊断准确性。然而,医学图像之间的高度相似性给有效特征的提取带来了挑战。此外,低质量的特征会降低融合性能,可能导致不准确的分级结果。为了应对这些挑战,我们提出了一个多尺度注意力和门控融合(MAGF)框架,该框架包括一个具有目标注意力的双分支特征提取架构,一个用于增强OCT特征的多尺度注意力融合模块(MAFM),以及一个用于自适应集成CFP和OCT模式的门控融合模块(GFM)。大量的实验表明,我们的方法在青光眼分级中达到了最先进的(SOTA)性能。
{"title":"MAGF: Multi-scale attention and gated fusion for multi-modal glaucoma grading","authors":"Haixi Cheng ,&nbsp;Chaoqun Hong ,&nbsp;Bo Zhang ,&nbsp;Huihui Fang ,&nbsp;Yanwu Xu ,&nbsp;Si Yong Yeo","doi":"10.1016/j.eswa.2026.131388","DOIUrl":"10.1016/j.eswa.2026.131388","url":null,"abstract":"<div><div>Glaucoma is one of the leading causes of irreversible blindness worldwide. Color fundus photography (CFP) and optical coherence tomography (OCT) are two primary imaging modalities for glaucoma diagnosis. Recently, multi-modal approaches that combine CFP and OCT have demonstrated higher diagnostic accuracy compared to single-modal methods. However, the high similarity among medical image poses presents a challenge for extracting effective features. Additionally, low-quality features can degrade fusion performance, potentially leading to inaccurate grading results. To address these challenges, we propose a Multi-scale Attention and Gated Fusion (MAGF) framework, which incorporates a dual-branch feature extraction architecture with targeted attention, a Multi-scale Attention Fusion Module (MAFM) for enhancing OCT features, and a Gated Fusion Module (GFM) for adaptive integration of CFP and OCT modalities. Extensive experiments demonstrate that our method achieves state-of-the-art (SOTA) performance in glaucoma grading.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"312 ","pages":"Article 131388"},"PeriodicalIF":7.5,"publicationDate":"2026-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146122714","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An integrated data-driven-simulation-optimization model: insights into controlling invasive plants in China 数据驱动的综合模拟优化模型:中国入侵植物控制的启示
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-29 DOI: 10.1016/j.eswa.2026.131261
Shuhua Zhang , Ming Liu , Dong Li
The rapid spread of invasive plants such as Spartina alterniflora has emerged as a major ecological and economic threats to coastal wetlands, while existing management strategies often fail to adapt to dynamic invasion processes and limited financial resources. To address this challenge, this study develops a novel data-driven-simulation–optimization (DDSO) framework that enables dynamic and spatially explicit management of biological invasions. The core innovation lies in coupling data-driven ecological parameterization based on multi-source observations with a simulation model that captures life-cycle transitions and spatial dispersal, and a mixed-integer optimization module that allocates control budgets and intervention intensities across space and time. By integrating heterogeneous environmental, biological, and management data, the framework constructs time-varying ecological parameters that reflect evolving invasion conditions and underlying ecological processes. The optimization component then generates cost-effective intervention schedules under fixed budget constraints. Comparative evaluation against system dynamics (SD) and simulation–optimization (SO) models shows that DDSO outperforms conventional approaches not only in budget efficiency, but also by revealing counterintuitive management logics: management effectiveness hinges more on the presence of a coordinated optimization framework than on investment scale, and economically efficient strategies inherently favor highly uneven spatial resource allocation. These mechanism-level insights underscore the importance of early intervention and cross-regional coordination, establishing DDSO as a policy-relevant framework for adaptive invasive species management.
互花米草等入侵植物的迅速蔓延已成为滨海湿地的主要生态和经济威胁,而现有的管理策略往往不能适应动态的入侵过程和有限的财政资源。为了应对这一挑战,本研究开发了一种新的数据驱动模拟优化(DDSO)框架,可以对生物入侵进行动态和空间明确的管理。核心创新在于将基于多源观测的数据驱动的生态参数化与捕获生命周期过渡和空间分散的模拟模型以及跨空间和时间分配控制预算和干预强度的混合整数优化模块相结合。通过整合异质的环境、生物和管理数据,该框架构建了反映不断演变的入侵条件和潜在生态过程的时变生态参数。然后,优化组件在固定预算约束下生成具有成本效益的干预计划。与系统动力学模型(SD)和模拟优化模型(SO)的比较表明,DDSO不仅在预算效率方面优于传统方法,而且揭示了反直觉的管理逻辑:管理有效性更多地取决于协调优化框架的存在,而不是投资规模,经济高效的策略本质上倾向于高度不均衡的空间资源配置。这些机制层面的见解强调了早期干预和跨区域协调的重要性,将DDSO建立为适应性入侵物种管理的政策相关框架。
{"title":"An integrated data-driven-simulation-optimization model: insights into controlling invasive plants in China","authors":"Shuhua Zhang ,&nbsp;Ming Liu ,&nbsp;Dong Li","doi":"10.1016/j.eswa.2026.131261","DOIUrl":"10.1016/j.eswa.2026.131261","url":null,"abstract":"<div><div>The rapid spread of invasive plants such as Spartina alterniflora has emerged as a major ecological and economic threats to coastal wetlands, while existing management strategies often fail to adapt to dynamic invasion processes and limited financial resources. To address this challenge, this study develops a novel data-driven-simulation–optimization (DDSO) framework that enables dynamic and spatially explicit management of biological invasions. The core innovation lies in coupling data-driven ecological parameterization based on multi-source observations with a simulation model that captures life-cycle transitions and spatial dispersal, and a mixed-integer optimization module that allocates control budgets and intervention intensities across space and time. By integrating heterogeneous environmental, biological, and management data, the framework constructs time-varying ecological parameters that reflect evolving invasion conditions and underlying ecological processes. The optimization component then generates cost-effective intervention schedules under fixed budget constraints. Comparative evaluation against system dynamics (SD) and simulation–optimization (SO) models shows that DDSO outperforms conventional approaches not only in budget efficiency, but also by revealing counterintuitive management logics: management effectiveness hinges more on the presence of a coordinated optimization framework than on investment scale, and economically efficient strategies inherently favor highly uneven spatial resource allocation. These mechanism-level insights underscore the importance of early intervention and cross-regional coordination, establishing DDSO as a policy-relevant framework for adaptive invasive species management.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"310 ","pages":"Article 131261"},"PeriodicalIF":7.5,"publicationDate":"2026-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146080880","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A2-CMA-ES: a hybrid CMA-ES with adaptive archive for complex engineering design A2-CMA-ES:具有自适应存档的复杂工程设计混合CMA-ES
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-28 DOI: 10.1016/j.eswa.2026.131236
Mao Xi , Longyan Xu , Zaihan He , Jiali Chen , Chenyang Bai , Ren Gao
Aiming at the problems of premature convergence, insufficient population diversity and decreased convergence rate in high-dimensional optimization of metaheuristic algorithms, this paper proposes an adaptive archiving hybrid CMA-ES (A2-CMA-ES) for complex engineering design. This method improves population diversity and inhibits premature convergence by introducing external archive fusion mean update with cost information. Secondly, a step size adaptive adjustment strategy is designed, which combines nonlinear suppression and reward amplification mechanism to alleviate the abnormal fluctuation of step size and enhance the global search ability. Furthermore, a multiplicative growth strategy of population size based on statistical stagnation criterion is established, and the search distribution is dynamically expanded through soft restart mechanism to reduce estimation noise. In order to ensure the high-dimensional numerical stability, a monitoring mechanism of covariance matrix condition number and convex combination regularization correction mechanism are constructed to ensure the robustness of anisotropic sampling basis. In this stable framework, a directional mining module based on probabilistic local model is embedded, and anisotropic fine sampling is carried out in the neighborhood of historical optimal solution based on the learned local geometry. Finally, combined with the dynamic learning rate regulation mechanism, the covariance matrix update parameters are adaptively adjusted to balance the global exploration and local development. The results of ablation experiments show that the coordination mechanism of the above components makes the algorithm effectively deal with complex high-dimensional optimization problems. In the CEC2017 benchmark test, A2-CMA-ES was compared with seven advanced algorithms in multi-dimensional scenarios, and further applied to engineering problems such as pressure vessel design, cantilever beam design, three-bar truss optimization, and UAV path planning. The results show that the algorithm is competitive in terms of convergence speed, accuracy and robustness.
针对元启发式算法在高维优化中存在过早收敛、种群多样性不足和收敛速度下降等问题,提出了一种用于复杂工程设计的自适应归档混合CMA-ES (A2-CMA-ES)算法。该方法通过引入带有成本信息的外部档案融合均值更新,提高了种群多样性,抑制了过早收敛。其次,设计了一种步长自适应调整策略,将非线性抑制和奖励放大机制相结合,缓解了步长异常波动,增强了全局搜索能力;建立了基于统计停滞准则的种群规模乘法增长策略,并通过软重启机制动态扩展搜索分布,降低估计噪声。为了保证高维数值稳定性,构建了协方差矩阵条件数监测机制和凸组合正则化校正机制,保证了各向异性采样基的鲁棒性。在此稳定框架中,嵌入基于概率局部模型的定向挖掘模块,并基于学习到的局部几何在历史最优解的邻域进行各向异性精细采样。最后,结合动态学习率调节机制,自适应调整协方差矩阵更新参数,平衡全局勘探和局部开发。烧蚀实验结果表明,上述各部分的协调机制使得该算法能够有效地处理复杂的高维优化问题。在CEC2017基准测试中,将A2-CMA-ES与7种先进算法在多维场景下进行对比,并进一步应用于压力容器设计、悬臂梁设计、三杆桁架优化、无人机路径规划等工程问题。结果表明,该算法在收敛速度、精度和鲁棒性方面具有一定的竞争力。
{"title":"A2-CMA-ES: a hybrid CMA-ES with adaptive archive for complex engineering design","authors":"Mao Xi ,&nbsp;Longyan Xu ,&nbsp;Zaihan He ,&nbsp;Jiali Chen ,&nbsp;Chenyang Bai ,&nbsp;Ren Gao","doi":"10.1016/j.eswa.2026.131236","DOIUrl":"10.1016/j.eswa.2026.131236","url":null,"abstract":"<div><div>Aiming at the problems of premature convergence, insufficient population diversity and decreased convergence rate in high-dimensional optimization of metaheuristic algorithms, this paper proposes an adaptive archiving hybrid CMA-ES (A<sup>2</sup>-CMA-ES) for complex engineering design. This method improves population diversity and inhibits premature convergence by introducing external archive fusion mean update with cost information. Secondly, a step size adaptive adjustment strategy is designed, which combines nonlinear suppression and reward amplification mechanism to alleviate the abnormal fluctuation of step size and enhance the global search ability. Furthermore, a multiplicative growth strategy of population size based on statistical stagnation criterion is established, and the search distribution is dynamically expanded through soft restart mechanism to reduce estimation noise. In order to ensure the high-dimensional numerical stability, a monitoring mechanism of covariance matrix condition number and convex combination regularization correction mechanism are constructed to ensure the robustness of anisotropic sampling basis. In this stable framework, a directional mining module based on probabilistic local model is embedded, and anisotropic fine sampling is carried out in the neighborhood of historical optimal solution based on the learned local geometry. Finally, combined with the dynamic learning rate regulation mechanism, the covariance matrix update parameters are adaptively adjusted to balance the global exploration and local development. The results of ablation experiments show that the coordination mechanism of the above components makes the algorithm effectively deal with complex high-dimensional optimization problems. In the CEC2017 benchmark test, A<sup>2</sup>-CMA-ES was compared with seven advanced algorithms in multi-dimensional scenarios, and further applied to engineering problems such as pressure vessel design, cantilever beam design, three-bar truss optimization, and UAV path planning. The results show that the algorithm is competitive in terms of convergence speed, accuracy and robustness.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"310 ","pages":"Article 131236"},"PeriodicalIF":7.5,"publicationDate":"2026-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146081045","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Integrated multi-factory production scheduling and vehicle routing with factory eligibility 集成多工厂生产调度和车辆路线与工厂合格
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-28 DOI: 10.1016/j.eswa.2026.131291
Kai Wang , Yun Huang
To enhance the operational performance of supply chains under the trends of globalization and customization, integrated multi-factory production and distribution has recently attracted increasing attention. This paper presents a novel integrated multi-factory production scheduling and vehicle routing problem. In this problem, a set of customer orders is first assigned to several distributed factories for production, each of which is arranged as a hybrid flow shop (HFS). Owing to the technical or physical aspects, factory eligibility is considered in the production stage, where some orders can only be processed in a subset of factories. The finished products are then delivered by capacitated vehicles, subject to customer time windows. As a combination of the distributed HFS scheduling problem and the vehicle routing problem, three types of decisions have to be made, namely factory allocation, job scheduling, and vehicle assignment and routing. Considering the NP-hardness of the studied problem, a hybrid algorithm that integrates a distribution estimation algorithm (EDA) with an adaptive large neighborhood search (ALNS) is developed to generate solutions. To improve the local search capability of this algorithm, Q-Learning is employed to dynamically determine the destroy-and-repair operators of ALNS. Computational results on both small-sized and large-sized test problems indicate the superiority of the proposed algorithm.
在全球化和定制化的趋势下,为了提高供应链的运营绩效,多工厂生产与配送一体化日益受到人们的关注。提出了一种新的多工厂生产调度与车辆路径集成问题。在该问题中,首先将一组客户订单分配给几个分布式工厂进行生产,每个工厂被安排为一个混合流车间(HFS)。由于技术或物理方面的原因,工厂合格性是在生产阶段考虑的,其中一些订单只能在工厂的子集中处理。然后,根据客户的时间窗口,由有能力的车辆交付成品。作为分布式HFS调度问题和车辆路径问题的结合,需要做出三种决策,即工厂分配、作业调度和车辆分配与路径。考虑到所研究问题的np -硬度,提出了一种将分布估计算法(EDA)与自适应大邻域搜索(ALNS)相结合的混合算法来生成解。为了提高算法的局部搜索能力,采用Q-Learning动态确定ALNS的销毁和修复算子。小型和大型测试问题的计算结果都表明了该算法的优越性。
{"title":"Integrated multi-factory production scheduling and vehicle routing with factory eligibility","authors":"Kai Wang ,&nbsp;Yun Huang","doi":"10.1016/j.eswa.2026.131291","DOIUrl":"10.1016/j.eswa.2026.131291","url":null,"abstract":"<div><div>To enhance the operational performance of supply chains under the trends of globalization and customization, integrated multi-factory production and distribution has recently attracted increasing attention. This paper presents a novel integrated multi-factory production scheduling and vehicle routing problem. In this problem, a set of customer orders is first assigned to several distributed factories for production, each of which is arranged as a hybrid flow shop (HFS). Owing to the technical or physical aspects, factory eligibility is considered in the production stage, where some orders can only be processed in a subset of factories. The finished products are then delivered by capacitated vehicles, subject to customer time windows. As a combination of the distributed HFS scheduling problem and the vehicle routing problem, three types of decisions have to be made, namely factory allocation, job scheduling, and vehicle assignment and routing. Considering the NP-hardness of the studied problem, a hybrid algorithm that integrates a distribution estimation algorithm (EDA) with an adaptive large neighborhood search (ALNS) is developed to generate solutions. To improve the local search capability of this algorithm, Q-Learning is employed to dynamically determine the destroy-and-repair operators of ALNS. Computational results on both small-sized and large-sized test problems indicate the superiority of the proposed algorithm.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"310 ","pages":"Article 131291"},"PeriodicalIF":7.5,"publicationDate":"2026-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146080791","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DBANet: A dual-branch dynamic convolutional temporal attention network for few-shot wind turbine bearing fault diagnosis DBANet:基于双支路动态卷积时间关注网络的风电轴承故障诊断
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-01-28 DOI: 10.1016/j.eswa.2026.131361
Yazhou Du , Bokang Sun , Boyang Ma , Chao Shuai , Jianhao Yang , Yuanchao Lv , Dongchen Wang , Xuewei Wang , Mingyang Liu
Addressing the challenge where high-value fault samples of wind turbines under real operating conditions are extremely scarce-leading to ineffective feature extraction and poor generalization in existing deep learning methods due to insufficient training data-this paper proposes a Dual-Branch Dynamic Convolutional Temporal Attention Network (DBANet) for few-shot fault diagnosis. First, a time-frequency dual-branch architecture is constructed incorporating a dynamic convolution module. By utilizing an adaptive weighting mechanism of multi-expert kernels, the feature extraction strategy is dynamically adjusted to capture subtle fault signatures under few-shot conditions. Second, a Bidirectional Long Short-Term Memory (BiLSTM) network is integrated with a Multi-Head Attention (MHA) mechanism. This combination deeply mines temporal dependencies while precisely focusing on critical fault frequency bands, thereby enhancing the saliency of feature representation. Extensive validation on multiple public bearing datasets and real-world operational data from the Liaoning Datang Hongshan Wind Farm demonstrates that DBANet performs exceptionally well across diverse datasets and sample-limited environments. Specifically, in the validation using real wind farm data, the proposed method achieved an accuracy of 88.33% even in extremely data-scarce scenarios with only 5 training samples per class. The average accuracy across different training sample sizes reached 93.54%, representing a performance improvement of over 16% compared to state-of-the-art methods. These results fully demonstrate the superiority of the proposed method and its significant value for engineering applications.
针对风电机组实际运行状态下的高值故障样本极其稀少,现有深度学习方法由于训练数据不足导致特征提取效果不佳、泛化效果差的问题,本文提出了一种双分支动态卷积时间注意网络(DBANet)进行少采样故障诊断。首先,构建了包含动态卷积模块的时频双分支结构。利用多专家核的自适应加权机制,动态调整特征提取策略,以捕获少量条件下的细微故障特征。第二,双向长短期记忆(BiLSTM)网络与多头注意(MHA)机制相结合。这种组合深度挖掘了时间依赖性,同时精确地关注关键故障频带,从而增强了特征表示的显著性。对多个公共轴承数据集和辽宁大唐洪山风电场的实际运行数据的广泛验证表明,DBANet在不同数据集和样本有限的环境中表现得非常好。具体而言,在使用真实风电场数据的验证中,即使在数据极其稀缺的场景下,每类只有5个训练样本,本文方法的准确率也达到了88.33%。不同训练样本大小的平均准确率达到93.54%,与最先进的方法相比,性能提高了16%以上。这些结果充分证明了该方法的优越性,具有重要的工程应用价值。
{"title":"DBANet: A dual-branch dynamic convolutional temporal attention network for few-shot wind turbine bearing fault diagnosis","authors":"Yazhou Du ,&nbsp;Bokang Sun ,&nbsp;Boyang Ma ,&nbsp;Chao Shuai ,&nbsp;Jianhao Yang ,&nbsp;Yuanchao Lv ,&nbsp;Dongchen Wang ,&nbsp;Xuewei Wang ,&nbsp;Mingyang Liu","doi":"10.1016/j.eswa.2026.131361","DOIUrl":"10.1016/j.eswa.2026.131361","url":null,"abstract":"<div><div>Addressing the challenge where high-value fault samples of wind turbines under real operating conditions are extremely scarce-leading to ineffective feature extraction and poor generalization in existing deep learning methods due to insufficient training data-this paper proposes a Dual-Branch Dynamic Convolutional Temporal Attention Network (DBANet) for few-shot fault diagnosis. First, a time-frequency dual-branch architecture is constructed incorporating a dynamic convolution module. By utilizing an adaptive weighting mechanism of multi-expert kernels, the feature extraction strategy is dynamically adjusted to capture subtle fault signatures under few-shot conditions. Second, a Bidirectional Long Short-Term Memory (BiLSTM) network is integrated with a Multi-Head Attention (MHA) mechanism. This combination deeply mines temporal dependencies while precisely focusing on critical fault frequency bands, thereby enhancing the saliency of feature representation. Extensive validation on multiple public bearing datasets and real-world operational data from the Liaoning Datang Hongshan Wind Farm demonstrates that DBANet performs exceptionally well across diverse datasets and sample-limited environments. Specifically, in the validation using real wind farm data, the proposed method achieved an accuracy of 88.33% even in extremely data-scarce scenarios with only 5 training samples per class. The average accuracy across different training sample sizes reached 93.54%, representing a performance improvement of over 16% compared to state-of-the-art methods. These results fully demonstrate the superiority of the proposed method and its significant value for engineering applications.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"311 ","pages":"Article 131361"},"PeriodicalIF":7.5,"publicationDate":"2026-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146081697","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Expert Systems with Applications
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1