首页 > 最新文献

Expert Systems with Applications最新文献

英文 中文
DORF-EASNet: physics-driven real-time seafloor classification via entropy‑regularized acoustic features and adaptive model activation DORF-EASNet:物理驱动的实时海底分类,通过熵正则化声学特征和自适应模型激活
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-05-25 Epub Date: 2026-02-04 DOI: 10.1016/j.eswa.2026.131461
Xi Zhao, Qiangqiang Yuan, Quanyin Zhang, Jiadan Xu
Real-time seabed sediment classification (SSC) is crucial for underwater navigation, operations, and habitat assessment. Conventional methods relying on post-mission multibeam-echosounder (MBES) data processing impede in situ decision-making. We propose a novel, real-time SSC method deployable on both shipborne and Autonomous Underwater Vehicle (AUV) platforms, integrating three core components. Primarily, an efficient preprocessing pipeline comprising georeferencing, radiometric normalization, noise suppression, and incidence-angle correction enables rapid conversion of raw MBES backscatter into geometry-consistent tiles, supporting real-time operation with sub-second responsiveness. Afterwards, the system extracts multi-modal descriptors by combining entropy-regularised angular-response fitting for acoustic backscatter, object-level texture analysis using adaptive graph segmentation, and curvature-aware terrain metrics derived from quadratic surface fitting under entropy constraints by considering the physical responses and spatial distribution of MBES images and point clouds. Finally, a Dynamic Optimal Random Forest with Entropy-Adaptive Subnetwork Selection (DORF-EASNet) dynamically selects between a global classifier and lightweight domain-specific sub-models to match local acoustic complexity, achieving a balance between inference efficiency and physical interpretability. Field experiments conducted in Jiaozhou Bay and the South China Sea demonstrate the proposed framework’s robustness across platforms and sensing configurations, achieving macro-F1 scores of 0.881 and 0.913, respectively, while maintaining real-time processing capability exceeding that of conventional offline methods.
实时海底沉积物分类(SSC)对水下导航、作业和栖息地评估至关重要。依靠任务后多波束测深(MBES)数据处理的传统方法阻碍了现场决策。我们提出了一种新型的实时SSC方法,可部署在舰载和自主水下航行器(AUV)平台上,集成了三个核心组件。首先,高效的预处理管道包括地理参考、辐射归一化、噪声抑制和入射角校正,可以将原始MBES背散射快速转换为几何一致的瓷砖,支持亚秒级响应的实时操作。然后,考虑MBES图像和点云的物理响应和空间分布,结合声学后向散射的熵正则化角响应拟合、自适应图分割的目标级纹理分析以及熵约束下二次曲面拟合的曲率感知地形度量,提取多模态描述符。最后,基于熵自适应子网络选择的动态最优随机森林(DORF-EASNet)在全局分类器和轻量化领域特定子模型之间动态选择以匹配局部声学复杂性,实现了推理效率和物理可解释性之间的平衡。在胶州湾和南海进行的现场实验表明,该框架具有跨平台、跨感知配置的鲁棒性,宏观f1得分分别达到0.881和0.913,同时保持了超过传统离线方法的实时处理能力。
{"title":"DORF-EASNet: physics-driven real-time seafloor classification via entropy‑regularized acoustic features and adaptive model activation","authors":"Xi Zhao,&nbsp;Qiangqiang Yuan,&nbsp;Quanyin Zhang,&nbsp;Jiadan Xu","doi":"10.1016/j.eswa.2026.131461","DOIUrl":"10.1016/j.eswa.2026.131461","url":null,"abstract":"<div><div>Real-time seabed sediment classification (SSC) is crucial for underwater navigation, operations, and habitat assessment. Conventional methods relying on post-mission multibeam-echosounder (MBES) data processing impede in situ decision-making. We propose a novel, real-time SSC method deployable on both shipborne and Autonomous Underwater Vehicle (AUV) platforms, integrating three core components. Primarily, an efficient preprocessing pipeline comprising georeferencing, radiometric normalization, noise suppression, and incidence-angle correction enables rapid conversion of raw MBES backscatter into geometry-consistent tiles, supporting real-time operation with sub-second responsiveness. Afterwards, the system extracts multi-modal descriptors by combining entropy-regularised angular-response fitting for acoustic backscatter, object-level texture analysis using adaptive graph segmentation, and curvature-aware terrain metrics derived from quadratic surface fitting under entropy constraints by considering the physical responses and spatial distribution of MBES images and point clouds. Finally, a Dynamic Optimal Random Forest with Entropy-Adaptive Subnetwork Selection (DORF-EASNet) dynamically selects between a global classifier and lightweight domain-specific sub-models to match local acoustic complexity, achieving a balance between inference efficiency and physical interpretability. Field experiments conducted in Jiaozhou Bay and the South China Sea demonstrate the proposed framework’s robustness across platforms and sensing configurations, achieving macro-F1 scores of 0.881 and 0.913, respectively, while maintaining real-time processing capability exceeding that of conventional offline methods.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"312 ","pages":"Article 131461"},"PeriodicalIF":7.5,"publicationDate":"2026-05-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146122660","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Preference learning based on maximizing membership degree with heterogeneous information for landslide early warning 基于异构信息最大隶属度的偏好学习滑坡预警
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-05-25 Epub Date: 2026-01-29 DOI: 10.1016/j.eswa.2026.131381
Jiajia Jiang , Min Zhan , Gaocan Gong , Lin Wang , Quanbo Zha
Preference learning involves developing a model that reflects a decision-maker's preference based on the provided information. Existing preference learning models for multi-criteria classification overlook exploring the membership degree of an alternative to a predefined class, especially in the face of heterogeneous types of criteria information. To address this issue, this paper proposes a preference learning model based on membership degree maximization with heterogeneous information. First, based on the additive utility function, we construct a preference model that integrates numerical and linguistic criteria information to express the utilities of alternatives. Within this model, four types of utility functions are considered to describe the variation characteristics of criteria. Next, triangular fuzzy numbers are employed to capture the membership degree of each alternative within the predefined classes, and a learning model is developed by maximizing the membership degrees of alternatives to their corresponding predefined classes. Finally, the proposed model is applied to landslide early warning to verify its feasibility.
偏好学习涉及开发一个模型,该模型可以根据提供的信息反映决策者的偏好。现有的多准则分类偏好学习模型忽略了对预定义类的替代选择的隶属度的探索,特别是在面对异构类型的准则信息时。针对这一问题,本文提出了一种基于隶属度最大化的异构信息偏好学习模型。首先,基于可加性效用函数,构建了一个综合数值和语言标准信息的偏好模型来表达备选方案的效用。在该模型中,考虑了四种类型的效用函数来描述标准的变化特征。其次,利用三角模糊数捕获预定义类中每个备选方案的隶属度,并通过最大化备选方案与其对应的预定义类的隶属度来建立学习模型。最后,将该模型应用于滑坡预警,验证了该模型的可行性。
{"title":"Preference learning based on maximizing membership degree with heterogeneous information for landslide early warning","authors":"Jiajia Jiang ,&nbsp;Min Zhan ,&nbsp;Gaocan Gong ,&nbsp;Lin Wang ,&nbsp;Quanbo Zha","doi":"10.1016/j.eswa.2026.131381","DOIUrl":"10.1016/j.eswa.2026.131381","url":null,"abstract":"<div><div>Preference learning involves developing a model that reflects a decision-maker's preference based on the provided information. Existing preference learning models for multi-criteria classification overlook exploring the membership degree of an alternative to a predefined class, especially in the face of heterogeneous types of criteria information. To address this issue, this paper proposes a preference learning model based on membership degree maximization with heterogeneous information. First, based on the additive utility function, we construct a preference model that integrates numerical and linguistic criteria information to express the utilities of alternatives. Within this model, four types of utility functions are considered to describe the variation characteristics of criteria. Next, triangular fuzzy numbers are employed to capture the membership degree of each alternative within the predefined classes, and a learning model is developed by maximizing the membership degrees of alternatives to their corresponding predefined classes. Finally, the proposed model is applied to landslide early warning to verify its feasibility.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"312 ","pages":"Article 131381"},"PeriodicalIF":7.5,"publicationDate":"2026-05-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146122666","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A depthwise convolutional variational autoencoder for anomaly detection in complex traffic scenarios from UAV views 一种基于深度卷积变分自编码器的无人机视角复杂交通场景异常检测
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-05-25 Epub Date: 2026-01-30 DOI: 10.1016/j.eswa.2026.131425
Arslan Saleem, Cem Direkoglu
Traffic anomaly detection (AD) is essential for improving public safety, reducing risks, and enabling quick responses in intelligent surveillance systems. Aerial traffic monitoring, particularly using Unmanned Aerial Vehicles (UAV), has gained attention due to its potential to address challenges like dynamic urban environments, yet it remains underexplored. Detecting anomalies in drone-captured video involves unique obstacles: rare events, small and overlapping objects, multi-scale targets, and complex backgrounds. To address these challenges, we propose the Depthwise Convolutional Variational Autoencoder (DwCVAE), a novel model designed to enhance AD in drone-based traffic surveillance. DwCVAE leverages depthwise convolutions, which allow efficient and detailed feature extraction, improving model sensitivity to subtle and multi-scale anomalies. The proposed DwCVAE adopts an encoder-latent-decoder VAE architecture, in which stacked depthwise convolutional layers in the encoder emphasize spatially localized feature learning while maintaining channel-wise efficiency, and a compact variational latent space captures the distribution of normal traffic dynamics. Built on variational autoencoder (VAE) architecture, DwCVAE creates compact latent representations that capture normal traffic patterns, enabling reliable detection of deviations. Anomalies are identified through reconstruction-based scoring, where events that deviate from the learned normal representations yield higher reconstruction errors. This depthwise approach marks a key innovation, optimizing both computational efficiency and detection accuracy. We design four additional models: Convolutional Variational Autoencoder (CVAE), Dilated Convolutional VAE (DCVAE), Separable Convolutional VAE (SCVAE), and Convolutional LSTM VAE (CLSTMVAE) to systematically assess the effectiveness of DwCVAE. Additionally, we evaluate DwCVAE against state-of-the-art weakly supervised and unsupervised models on two benchmark datasets, Drone-Anomaly and UIT-Adrone. DwCVAE achieves an AUC of 74.95 with an EER of 0.30 on Drone-Anomaly, and an AUC of 79.77 with an EER of 0.27 on UIT-ADrone, demonstrating its superior performance in complex aerial surveillance tasks.
在智能监控系统中,交通异常检测(Traffic anomaly detection, AD)对于提高公共安全、降低风险和实现快速响应至关重要。空中交通监控,特别是使用无人机(UAV),由于其应对动态城市环境等挑战的潜力而受到关注,但仍未得到充分开发。检测无人机捕获的视频异常涉及独特的障碍:罕见事件,小而重叠的物体,多尺度目标和复杂的背景。为了解决这些挑战,我们提出了深度卷积变分自编码器(DwCVAE),这是一种新的模型,旨在增强基于无人机的交通监控中的AD。DwCVAE利用深度卷积,允许高效和详细的特征提取,提高模型对微妙和多尺度异常的敏感性。本文提出的DwCVAE采用编码器-潜伏-解码器的VAE架构,在保持信道效率的同时,编码器中堆叠的深度卷积层强调空间局部特征学习,紧凑的变分潜伏空间捕获正常流量动态分布。DwCVAE建立在变分自编码器(VAE)架构上,创建了紧凑的潜在表示,可以捕获正常的流量模式,从而能够可靠地检测偏差。通过基于重建的评分来识别异常,其中偏离学习到的正常表示的事件会产生更高的重建误差。这种深入的方法标志着一项关键的创新,优化了计算效率和检测精度。我们设计了四个额外的模型:卷积变分自编码器(CVAE)、扩张卷积VAE (DCVAE)、可分离卷积VAE (SCVAE)和卷积LSTMVAE (CLSTMVAE)来系统地评估DwCVAE的有效性。此外,我们在两个基准数据集(Drone-Anomaly和unit - drone)上针对最先进的弱监督和无监督模型评估DwCVAE。DwCVAE在Drone-Anomaly上AUC为74.95,EER为0.30;在unit - drone上AUC为79.77,EER为0.27,在复杂的空中监视任务中表现出优越的性能。
{"title":"A depthwise convolutional variational autoencoder for anomaly detection in complex traffic scenarios from UAV views","authors":"Arslan Saleem,&nbsp;Cem Direkoglu","doi":"10.1016/j.eswa.2026.131425","DOIUrl":"10.1016/j.eswa.2026.131425","url":null,"abstract":"<div><div>Traffic anomaly detection (AD) is essential for improving public safety, reducing risks, and enabling quick responses in intelligent surveillance systems. Aerial traffic monitoring, particularly using Unmanned Aerial Vehicles (UAV), has gained attention due to its potential to address challenges like dynamic urban environments, yet it remains underexplored. Detecting anomalies in drone-captured video involves unique obstacles: rare events, small and overlapping objects, multi-scale targets, and complex backgrounds. To address these challenges, we propose the Depthwise Convolutional Variational Autoencoder (DwCVAE), a novel model designed to enhance AD in drone-based traffic surveillance. DwCVAE leverages depthwise convolutions, which allow efficient and detailed feature extraction, improving model sensitivity to subtle and multi-scale anomalies. The proposed DwCVAE adopts an encoder-latent-decoder VAE architecture, in which stacked depthwise convolutional layers in the encoder emphasize spatially localized feature learning while maintaining channel-wise efficiency, and a compact variational latent space captures the distribution of normal traffic dynamics. Built on variational autoencoder (VAE) architecture, DwCVAE creates compact latent representations that capture normal traffic patterns, enabling reliable detection of deviations. Anomalies are identified through reconstruction-based scoring, where events that deviate from the learned normal representations yield higher reconstruction errors. This depthwise approach marks a key innovation, optimizing both computational efficiency and detection accuracy. We design four additional models: Convolutional Variational Autoencoder (CVAE), Dilated Convolutional VAE (DCVAE), Separable Convolutional VAE (SCVAE), and Convolutional LSTM VAE (CLSTMVAE) to systematically assess the effectiveness of DwCVAE. Additionally, we evaluate DwCVAE against state-of-the-art weakly supervised and unsupervised models on two benchmark datasets, Drone-Anomaly and UIT-Adrone. DwCVAE achieves an AUC of 74.95 with an EER of 0.30 on Drone-Anomaly, and an AUC of 79.77 with an EER of 0.27 on UIT-ADrone, demonstrating its superior performance in complex aerial surveillance tasks.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"312 ","pages":"Article 131425"},"PeriodicalIF":7.5,"publicationDate":"2026-05-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146122667","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MAGF: Multi-scale attention and gated fusion for multi-modal glaucoma grading MAGF:多模式青光眼分级的多尺度关注和门控融合
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-05-25 Epub Date: 2026-01-29 DOI: 10.1016/j.eswa.2026.131388
Haixi Cheng , Chaoqun Hong , Bo Zhang , Huihui Fang , Yanwu Xu , Si Yong Yeo
Glaucoma is one of the leading causes of irreversible blindness worldwide. Color fundus photography (CFP) and optical coherence tomography (OCT) are two primary imaging modalities for glaucoma diagnosis. Recently, multi-modal approaches that combine CFP and OCT have demonstrated higher diagnostic accuracy compared to single-modal methods. However, the high similarity among medical image poses presents a challenge for extracting effective features. Additionally, low-quality features can degrade fusion performance, potentially leading to inaccurate grading results. To address these challenges, we propose a Multi-scale Attention and Gated Fusion (MAGF) framework, which incorporates a dual-branch feature extraction architecture with targeted attention, a Multi-scale Attention Fusion Module (MAFM) for enhancing OCT features, and a Gated Fusion Module (GFM) for adaptive integration of CFP and OCT modalities. Extensive experiments demonstrate that our method achieves state-of-the-art (SOTA) performance in glaucoma grading.
青光眼是世界范围内导致不可逆失明的主要原因之一。彩色眼底摄影(CFP)和光学相干断层扫描(OCT)是青光眼诊断的两种主要成像方式。最近,与单模态方法相比,结合CFP和OCT的多模态方法显示出更高的诊断准确性。然而,医学图像之间的高度相似性给有效特征的提取带来了挑战。此外,低质量的特征会降低融合性能,可能导致不准确的分级结果。为了应对这些挑战,我们提出了一个多尺度注意力和门控融合(MAGF)框架,该框架包括一个具有目标注意力的双分支特征提取架构,一个用于增强OCT特征的多尺度注意力融合模块(MAFM),以及一个用于自适应集成CFP和OCT模式的门控融合模块(GFM)。大量的实验表明,我们的方法在青光眼分级中达到了最先进的(SOTA)性能。
{"title":"MAGF: Multi-scale attention and gated fusion for multi-modal glaucoma grading","authors":"Haixi Cheng ,&nbsp;Chaoqun Hong ,&nbsp;Bo Zhang ,&nbsp;Huihui Fang ,&nbsp;Yanwu Xu ,&nbsp;Si Yong Yeo","doi":"10.1016/j.eswa.2026.131388","DOIUrl":"10.1016/j.eswa.2026.131388","url":null,"abstract":"<div><div>Glaucoma is one of the leading causes of irreversible blindness worldwide. Color fundus photography (CFP) and optical coherence tomography (OCT) are two primary imaging modalities for glaucoma diagnosis. Recently, multi-modal approaches that combine CFP and OCT have demonstrated higher diagnostic accuracy compared to single-modal methods. However, the high similarity among medical image poses presents a challenge for extracting effective features. Additionally, low-quality features can degrade fusion performance, potentially leading to inaccurate grading results. To address these challenges, we propose a Multi-scale Attention and Gated Fusion (MAGF) framework, which incorporates a dual-branch feature extraction architecture with targeted attention, a Multi-scale Attention Fusion Module (MAFM) for enhancing OCT features, and a Gated Fusion Module (GFM) for adaptive integration of CFP and OCT modalities. Extensive experiments demonstrate that our method achieves state-of-the-art (SOTA) performance in glaucoma grading.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"312 ","pages":"Article 131388"},"PeriodicalIF":7.5,"publicationDate":"2026-05-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146122714","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A heterogeneous Hopfield neural network with discrete memristor: modeling, dynamics, and application in medical image encryption 具有离散忆阻器的异构Hopfield神经网络:建模、动态和在医学图像加密中的应用
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-05-25 Epub Date: 2026-02-01 DOI: 10.1016/j.eswa.2026.131457
Huiqun Zou , Yang Lu , Wenjiao Li , Wenhui Li , Xiuli Chai
Memristors and activation functions critically shape the nonlinear dynamics of Hopfield neural networks. While previous studies explored memristor modeling and heterogeneous activation separately, their combination to form heterogeneous memristive networks remains insufficiently explored. This paper bridges this gap by proposing a novel Heterogeneous Hopfield Neural Network with Discrete Memristor (HHNN-DM), coupling a discrete memristor with heterogeneous activations to mimic neural diversity. By analyzing dissipation, equilibrium stability, bifurcation diagrams, and Lyapunov exponents, we demonstrate that heterogeneous activation mechanisms significantly enhance network complexity and unpredictability under memristive interactions. This synergy gives rise to rich chaotic behaviors, such as periodic orbits, bifurcations, transient chaos, and chaotic bursting. As these biologically inspired chaotic dynamics, the resulting high-quality chaotic sequences are well suited for cryptographic applications. Furthermore, a Heterogeneous Hopfield Neural Network–based Medical Image Encryption Algorithm (HHNN-MIEA) is developed to enhance security in remote medical image transmission, integrating an X-fractal curve sorting matrix for permutation with multi-logical diffusion driven by chaotic sequences. Experimental results verify that the HHNN-MIEA achieves high security in aspects such as key sensitivity, information entropy without compromising efficiency, highlighting its effectiveness, robustness and reliable solution for secure medical image transmission.
记忆电阻器和激活函数是Hopfield神经网络非线性动力学的关键。虽然以前的研究分别探讨了忆阻器建模和异质激活,但它们的组合形成异质忆阻网络的探索仍然不够充分。本文通过提出一种具有离散忆阻器的新型异质Hopfield神经网络(HHNN-DM)来弥补这一差距,将离散忆阻器与异质激活耦合起来以模拟神经多样性。通过对耗散、平衡稳定性、分岔图和Lyapunov指数的分析,我们证明了在记忆相互作用下,异质激活机制显著提高了网络的复杂性和不可预测性。这种协同作用产生了丰富的混沌行为,如周期轨道、分岔、瞬态混沌和混沌爆发。由于这些受生物学启发的混沌动力学,由此产生的高质量混沌序列非常适合密码学应用。在此基础上,提出了一种基于异构Hopfield神经网络的医学图像加密算法(HHNN-MIEA),将x分形曲线排序矩阵与混沌序列驱动的多逻辑扩散相结合,提高了医学图像远程传输的安全性。实验结果表明,在不影响效率的前提下,HHNN-MIEA在密钥灵敏度、信息熵等方面实现了较高的安全性,突出了其有效性、鲁棒性和可靠的医学图像安全传输解决方案。
{"title":"A heterogeneous Hopfield neural network with discrete memristor: modeling, dynamics, and application in medical image encryption","authors":"Huiqun Zou ,&nbsp;Yang Lu ,&nbsp;Wenjiao Li ,&nbsp;Wenhui Li ,&nbsp;Xiuli Chai","doi":"10.1016/j.eswa.2026.131457","DOIUrl":"10.1016/j.eswa.2026.131457","url":null,"abstract":"<div><div>Memristors and activation functions critically shape the nonlinear dynamics of Hopfield neural networks. While previous studies explored memristor modeling and heterogeneous activation separately, their combination to form heterogeneous memristive networks remains insufficiently explored. This paper bridges this gap by proposing a novel Heterogeneous Hopfield Neural Network with Discrete Memristor (HHNN-DM), coupling a discrete memristor with heterogeneous activations to mimic neural diversity. By analyzing dissipation, equilibrium stability, bifurcation diagrams, and Lyapunov exponents, we demonstrate that heterogeneous activation mechanisms significantly enhance network complexity and unpredictability under memristive interactions. This synergy gives rise to rich chaotic behaviors, such as periodic orbits, bifurcations, transient chaos, and chaotic bursting. As these biologically inspired chaotic dynamics, the resulting high-quality chaotic sequences are well suited for cryptographic applications. Furthermore, a Heterogeneous Hopfield Neural Network–based Medical Image Encryption Algorithm (HHNN-MIEA) is developed to enhance security in remote medical image transmission, integrating an X-fractal curve sorting matrix for permutation with multi-logical diffusion driven by chaotic sequences. Experimental results verify that the HHNN-MIEA achieves high security in aspects such as key sensitivity, information entropy without compromising efficiency, highlighting its effectiveness, robustness and reliable solution for secure medical image transmission.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"312 ","pages":"Article 131457"},"PeriodicalIF":7.5,"publicationDate":"2026-05-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146122715","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Local attention alignment fusion network for domain adaptive water body segmentation 区域自适应水体分割的局部注意力对齐融合网络
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-05-25 Epub Date: 2026-02-02 DOI: 10.1016/j.eswa.2026.131382
Hao Liu , Xiaobin Zhu , Xu Qizhi , Chun Yang , Hongyang Zhou , Yongjie Xia , Xucheng Yin
Water body segmentation is crucial for various tasks, e.g., disaster early warning and ecological management. Existing deep learning-based methods mainly focus on specific scenarios, often encountering significant performance drops on multi-source satellite data with large domain differences. In this paper, we propose a novel Local Attention Alignment Fusion Network for domain adaptive water body segmentation (dubbed LAAFNet). Our LAAFNet explores spatial relationships between pseudo-RGB and pseudo-NIR images, and then extracts invariant features via local attention to improve the representative capability of cross-domain features. Moreover, we design a novel Difficult Sample Point Loss (DSPLoss) to address the presence of potential positive samples within negative regions across domains through a pixel-level contrastive learning strategy. DSPLoss leverages a Cauchy-Schwarz-based constraint to regulate the upper bound of feature similarity in the pixel-level inner product space. This constraint enhances the separation between water bodies and background in hard samples, allowing the model to learn a clearer decision boundary and thereby improving its generalization capability. Notably, we construct a large-scale Water Generation Testing Dataset (WGTDataset) to evaluate water body segmentation in real-world applications. Experimental results demonstrate that the LAAFNet outperforms the state-of-the-art (SOTA) methods. The codes and dataset are available on: https://github.com/LH325/LAAFNet.
水体分割对于灾害预警、生态管理等工作至关重要。现有的基于深度学习的方法主要针对特定场景,在多源卫星数据域差异较大的情况下,往往会出现显著的性能下降。在本文中,我们提出了一种新的区域自适应水体分割的局部注意力对齐融合网络(LAAFNet)。我们的LAAFNet探索伪rgb和伪nir图像之间的空间关系,然后通过局部关注提取不变性特征,以提高跨域特征的代表能力。此外,我们设计了一种新的困难样本点损失(DSPLoss),通过像素级对比学习策略来解决跨域负区域内潜在正样本的存在。DSPLoss利用基于cauchy - schwarz的约束来调节像素级内积空间中特征相似度的上界。该约束增强了硬样本中水体与背景的分离,使模型能够学习到更清晰的决策边界,从而提高模型的泛化能力。值得注意的是,我们构建了一个大规模的水生成测试数据集(WGTDataset)来评估实际应用中的水体分割。实验结果表明,LAAFNet优于最先进的SOTA方法。代码和数据集可在https://github.com/LH325/LAAFNet上获得。
{"title":"Local attention alignment fusion network for domain adaptive water body segmentation","authors":"Hao Liu ,&nbsp;Xiaobin Zhu ,&nbsp;Xu Qizhi ,&nbsp;Chun Yang ,&nbsp;Hongyang Zhou ,&nbsp;Yongjie Xia ,&nbsp;Xucheng Yin","doi":"10.1016/j.eswa.2026.131382","DOIUrl":"10.1016/j.eswa.2026.131382","url":null,"abstract":"<div><div>Water body segmentation is crucial for various tasks, e.g., disaster early warning and ecological management. Existing deep learning-based methods mainly focus on specific scenarios, often encountering significant performance drops on multi-source satellite data with large domain differences. In this paper, we propose a novel Local Attention Alignment Fusion Network for domain adaptive water body segmentation (dubbed LAAFNet). Our LAAFNet explores spatial relationships between pseudo-RGB and pseudo-NIR images, and then extracts invariant features via local attention to improve the representative capability of cross-domain features. Moreover, we design a novel Difficult Sample Point Loss (DSPLoss) to address the presence of potential positive samples within negative regions across domains through a pixel-level contrastive learning strategy. DSPLoss leverages a Cauchy-Schwarz-based constraint to regulate the upper bound of feature similarity in the pixel-level inner product space. This constraint enhances the separation between water bodies and background in hard samples, allowing the model to learn a clearer decision boundary and thereby improving its generalization capability. Notably, we construct a large-scale Water Generation Testing Dataset (WGTDataset) to evaluate water body segmentation in real-world applications. Experimental results demonstrate that the LAAFNet outperforms the state-of-the-art (SOTA) methods. The codes and dataset are available on: <span><span>https://github.com/LH325/LAAFNet</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"312 ","pages":"Article 131382"},"PeriodicalIF":7.5,"publicationDate":"2026-05-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146122716","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AI-driven zero trust and blockchain framework for secure electric vehicle infrastructure 人工智能驱动的零信任和安全电动汽车基础设施区块链框架
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-05-25 Epub Date: 2026-02-08 DOI: 10.1016/j.eswa.2026.131577
Clement Daah , Ysabel Fallot , Amna Qureshi , Irfan Awan , Savas Konur
Electric vehicle (EV) charging infrastructures are increasingly exposed to sophisticated cyber threats, including replay, spoofing, privilege escalation, and geolocation-based attacks. While standards such as ISO 15118 and OCPP 2.0.1 provide interoperability and cryptographic guarantees, they rely on static policies or isolated detection mechanisms, leaving gaps against adaptive adversaries. This paper presents an AI-driven Zero Trust Blockchain (AI-ZTB) framework whose novelty lies in the system-level integration of identity and access management, AI-based risk assessment, and blockchain-backed decentralized auditability with IPFS-based evidence storage, while operational governance remains centrally managed by the service provider. Unlike prior AI-only or blockchain-only frameworks, AI-ZTB introduces a fully integrated and enforceable Zero Trust control loop in which AI-generated risk scores are operationally bound to access enforcement decisions through smart contracts, enabling adaptive, auditable, and context-aware security governance in real time. The framework was implemented in Python with Solidity smart contracts and evaluated through a large-scale network simulation involving batches of 10,000 EV-charging sessions, trained on a dataset of 50,000 legitimate and adversarial behaviours using Random Forest, Autoencoder, and Isolation Forest models. Results demonstrate that AI-ZTB achieves access-decision accuracy above 95%, reducing false acceptance and rejection rates to approximately 3%. A comparative analysis evaluates AI-ZTB against industry standards (ISO 15,118 and OCPP 2.0.1) as secure communication baselines, and against prior integrated frameworks from the literature, highlighting differences in architectural scope, policy enforceability, and auditability rather than protocol-level performance. Despite modest inference and logging overheads, performance remained within real-time operational tolerances. The framework establishes a robust foundation for securing EV infrastructures, with extensibility to smart grids and other cyber-physical environments.
电动汽车(EV)充电基础设施越来越容易受到复杂的网络威胁,包括重播、欺骗、特权升级和基于地理位置的攻击。虽然ISO 15118和OCPP 2.0.1等标准提供了互操作性和加密保证,但它们依赖于静态策略或孤立的检测机制,这给自适应对手留下了空白。本文提出了一个人工智能驱动的零信任区块链(AI-ZTB)框架,其新颖之处在于身份和访问管理的系统级集成,基于人工智能的风险评估,以及基于ipfs的证据存储的区块链支持的去中心化审计,而运营治理仍由服务提供商集中管理。与之前的纯人工智能或纯区块链框架不同,AI-ZTB引入了一个完全集成和可执行的零信任控制循环,其中人工智能生成的风险评分在操作上绑定到通过智能合约访问执行决策,从而实现自适应、可审计和上下文感知的实时安全治理。该框架是在Python中使用Solidity智能合约实现的,并通过大规模网络模拟进行评估,该模拟涉及10,000个电动汽车充电会话的批次,并使用随机森林,自动编码器和隔离森林模型在50,000个合法和对抗行为的数据集上进行训练。结果表明,AI-ZTB的访问决策准确率在95%以上,将误接受率和拒绝率降低到约3%。比较分析将AI-ZTB与行业标准(ISO 15,118和OCPP 2.0.1)作为安全通信基线进行比较,并与文献中的先前集成框架进行比较,突出了体系结构范围、策略可执行性和可审计性方面的差异,而不是协议级性能。尽管有适度的推断和日志开销,但性能仍保持在实时操作容许范围内。该框架为确保电动汽车基础设施的安全奠定了坚实的基础,并可扩展到智能电网和其他网络物理环境。
{"title":"AI-driven zero trust and blockchain framework for secure electric vehicle infrastructure","authors":"Clement Daah ,&nbsp;Ysabel Fallot ,&nbsp;Amna Qureshi ,&nbsp;Irfan Awan ,&nbsp;Savas Konur","doi":"10.1016/j.eswa.2026.131577","DOIUrl":"10.1016/j.eswa.2026.131577","url":null,"abstract":"<div><div>Electric vehicle (EV) charging infrastructures are increasingly exposed to sophisticated cyber threats, including replay, spoofing, privilege escalation, and geolocation-based attacks. While standards such as ISO 15118 and OCPP 2.0.1 provide interoperability and cryptographic guarantees, they rely on static policies or isolated detection mechanisms, leaving gaps against adaptive adversaries. This paper presents an AI-driven Zero Trust Blockchain (AI-ZTB) framework whose novelty lies in the system-level integration of identity and access management, AI-based risk assessment, and blockchain-backed decentralized auditability with IPFS-based evidence storage, while operational governance remains centrally managed by the service provider. Unlike prior AI-only or blockchain-only frameworks, AI-ZTB introduces a fully integrated and enforceable Zero Trust control loop in which AI-generated risk scores are operationally bound to access enforcement decisions through smart contracts, enabling adaptive, auditable, and context-aware security governance in real time. The framework was implemented in Python with Solidity smart contracts and evaluated through a large-scale network simulation involving batches of 10,000 EV-charging sessions, trained on a dataset of 50,000 legitimate and adversarial behaviours using Random Forest, Autoencoder, and Isolation Forest models. Results demonstrate that AI-ZTB achieves access-decision accuracy above 95%, reducing false acceptance and rejection rates to approximately 3%. A comparative analysis evaluates AI-ZTB against industry standards (ISO 15,118 and OCPP 2.0.1) as secure communication baselines, and against prior integrated frameworks from the literature, highlighting differences in architectural scope, policy enforceability, and auditability rather than protocol-level performance. Despite modest inference and logging overheads, performance remained within real-time operational tolerances. The framework establishes a robust foundation for securing EV infrastructures, with extensibility to smart grids and other cyber-physical environments.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"312 ","pages":"Article 131577"},"PeriodicalIF":7.5,"publicationDate":"2026-05-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146192588","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Federated learning with dynamics-aware loss for label noise 具有标签噪声动态感知损失的联邦学习
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-05-25 Epub Date: 2026-02-05 DOI: 10.1016/j.eswa.2026.131523
Chengtian Ouyang , Jihong Mao , Zhiquan Liu , Donglin Zhu , Changjun Zhou , Gangqiang Hu , Taiyong Li
In the domain of Internet of Things, federated learning is gradually becoming a key technology for achieving safe and efficient implementation of artificial intelligence. Through distributed collaboration mechanisms, it enables edge intelligence while protecting data privacy and reducing communication costs. In the real federated learning system, clients usually exhibit variable levels of label noise, and local training tends to overfit the label noise resulting in decreased generalization performance of the model. Despite the existence of many research findings on the problem of data heterogeneity, these methods are not effective in dealing with label noise. Thus, tackling label noise problem is one of the keys to facilitating the development of federal learning. In the research, an adaptive framework FedDAL is proposed to combat federated learning with label noise. In the pre-training stage, the server identifies noisy clients by the unreliability score. The module named distance-sensitive truncation is designed to improve identification accuracy. In the federated learning stage, noisy clients train local models by dynamics-aware loss to mitigate the adverse effects of label noise. Finally, the server carries out loss normalization and weight adjustment aggregation taking into account the data volume and the aggregate class mean loss. Experimental results on multiple datasets demonstrate that FedDAL effectively addresses label noise overfitting, improves model generalization performance and outperforms state-of-the-art methods across multiple distributions of label noise. Our code is available at https://github.com/Donglin0730/FedDAL.
在物联网领域,联邦学习正逐渐成为实现人工智能安全高效实施的关键技术。通过分布式协作机制,它在保护数据隐私和降低通信成本的同时实现边缘智能。在真实的联邦学习系统中,客户端通常表现出不同程度的标签噪声,而局部训练往往会过度拟合标签噪声,导致模型泛化性能下降。尽管存在许多关于数据异质性问题的研究成果,但这些方法在处理标签噪声方面并不有效。因此,解决标签噪音问题是促进联邦学习发展的关键之一。在研究中,提出了一种自适应框架FedDAL来对抗带有标签噪声的联邦学习。在预训练阶段,服务器通过不可靠性评分来识别有噪声的客户端。为了提高识别精度,设计了距离敏感截断模块。在联邦学习阶段,噪声客户端通过动态感知损失来训练局部模型,以减轻标签噪声的不利影响。最后,服务器根据数据量和汇总类平均损失进行损失归一化和权重调整聚合。在多个数据集上的实验结果表明,FedDAL有效地解决了标签噪声过拟合问题,提高了模型泛化性能,并且在标签噪声的多个分布中优于最先进的方法。我们的代码可在https://github.com/Donglin0730/FedDAL上获得。
{"title":"Federated learning with dynamics-aware loss for label noise","authors":"Chengtian Ouyang ,&nbsp;Jihong Mao ,&nbsp;Zhiquan Liu ,&nbsp;Donglin Zhu ,&nbsp;Changjun Zhou ,&nbsp;Gangqiang Hu ,&nbsp;Taiyong Li","doi":"10.1016/j.eswa.2026.131523","DOIUrl":"10.1016/j.eswa.2026.131523","url":null,"abstract":"<div><div>In the domain of Internet of Things, federated learning is gradually becoming a key technology for achieving safe and efficient implementation of artificial intelligence. Through distributed collaboration mechanisms, it enables edge intelligence while protecting data privacy and reducing communication costs. In the real federated learning system, clients usually exhibit variable levels of label noise, and local training tends to overfit the label noise resulting in decreased generalization performance of the model. Despite the existence of many research findings on the problem of data heterogeneity, these methods are not effective in dealing with label noise. Thus, tackling label noise problem is one of the keys to facilitating the development of federal learning. In the research, an adaptive framework FedDAL is proposed to combat federated learning with label noise. In the pre-training stage, the server identifies noisy clients by the unreliability score. The module named distance-sensitive truncation is designed to improve identification accuracy. In the federated learning stage, noisy clients train local models by dynamics-aware loss to mitigate the adverse effects of label noise. Finally, the server carries out loss normalization and weight adjustment aggregation taking into account the data volume and the aggregate class mean loss. Experimental results on multiple datasets demonstrate that FedDAL effectively addresses label noise overfitting, improves model generalization performance and outperforms state-of-the-art methods across multiple distributions of label noise. Our code is available at <span><span>https://github.com/Donglin0730/FedDAL</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"312 ","pages":"Article 131523"},"PeriodicalIF":7.5,"publicationDate":"2026-05-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146192585","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A cumulative, pipeline-oriented framework for extreme imbalanced classification: Large-scale benchmarking of 4,100 rare-event models 极端不平衡分类的累积、面向管道的框架:4,100个罕见事件模型的大规模基准测试
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-05-25 Epub Date: 2026-01-30 DOI: 10.1016/j.eswa.2026.131366
Jaime Villanueva-García , Ignacio Moral-Arce , Luis Javier García Villalba
Extreme class imbalance remains a long-standing challenge in supervised learning, particularly when the minority class corresponds to rare but high-impact events such as fraud, equipment failure, or retirement transitions. Although a wide range of imbalance-handling techniques has been proposed-including penalization, resampling, loss shaping, threshold calibration, optimization, and ensemble strategies-the field largely lacks an operational understanding of how these components should be combined, calibrated, and consolidated within end-to-end pipelines.
This paper introduces a cumulative, pipeline-oriented framework for imbalanced classification under extreme skew. Rather than categorizing learning algorithms or paradigms, the framework reorganizes the design space of imbalanced learning around where and how imbalance-aware interventions act within the modeling pipeline. By progressively integrating data-level adjustments, objective and decision-rule shaping, and higher-level consolidation mechanisms, the framework exposes the structural logic underlying effective rare-event classifiers.
We validate this framework through a large-scale benchmark comprising 4109 pipeline configurations applied to administrative microdata from the Spanish Social Security system, where monthly retirement detection exhibits a positive-class prevalence of 2.34%. The results reveal clear and interpretable performance regimes as pipeline complexity increases. Explicit pipeline-level ensemble architectures achieve the strongest recall and balance-sensitive metrics, while well-calibrated and optimized intermediate pipelines frequently attain comparable or superior performance in selected metrics-such as F1-score and Matthews correlation coefficient-relative to ensemble baselines, at substantially lower computational cost and with improved interpretability. Distributional and robustness analyses further show that consolidation mechanisms act primarily by stabilizing the upper tail of the performance distribution rather than by improving isolated best-case outcomes.
Beyond this application, the proposed framework provides a reproducible benchmarking protocol and a unifying operational lens for understanding when additional methodological sophistication yields robust, stable, and practically meaningful performance gains under extreme class imbalance.
极端的班级失衡仍然是监督学习的一个长期挑战,特别是当少数班级对应的是罕见但影响很大的事件,如欺诈、设备故障或退休过渡。尽管已经提出了广泛的不平衡处理技术,包括惩罚、重采样、损失整形、阈值校准、优化和集成策略,但该领域在很大程度上缺乏对如何将这些组件组合、校准和整合到端到端管道中的操作理解。本文介绍了一种用于极端偏斜下不平衡分类的累积式、面向管道的框架。该框架不是对学习算法或范式进行分类,而是围绕建模管道中感知不平衡的干预的位置和方式重新组织不平衡学习的设计空间。通过逐步集成数据级调整、目标和决策规则形成以及更高级别的整合机制,该框架揭示了有效罕见事件分类器的结构逻辑。我们通过一个大规模的基准来验证这个框架,该基准包括4109个管道配置,应用于西班牙社会保障系统的行政微观数据,其中每月退休检测显示出2.34%的正类患病率。随着管道复杂性的增加,结果揭示了清晰和可解释的性能机制。显式管道级集成架构实现了最强的召回和平衡敏感指标,而经过良好校准和优化的中间管道通常在选择的指标(如f1分数和马修斯相关系数)中获得与集成基线相当或更好的性能,计算成本大大降低,可解释性得到改善。分布和稳健性分析进一步表明,整合机制的作用主要是稳定性能分布的上尾,而不是改善孤立的最佳情况结果。除了这个应用程序之外,建议的框架还提供了一个可重复的基准测试协议和一个统一的操作透镜,用于理解在极端类不平衡的情况下,额外的方法复杂性何时产生健壮、稳定和实际有意义的性能增益。
{"title":"A cumulative, pipeline-oriented framework for extreme imbalanced classification: Large-scale benchmarking of 4,100 rare-event models","authors":"Jaime Villanueva-García ,&nbsp;Ignacio Moral-Arce ,&nbsp;Luis Javier García Villalba","doi":"10.1016/j.eswa.2026.131366","DOIUrl":"10.1016/j.eswa.2026.131366","url":null,"abstract":"<div><div>Extreme class imbalance remains a long-standing challenge in supervised learning, particularly when the minority class corresponds to rare but high-impact events such as fraud, equipment failure, or retirement transitions. Although a wide range of imbalance-handling techniques has been proposed-including penalization, resampling, loss shaping, threshold calibration, optimization, and ensemble strategies-the field largely lacks an operational understanding of how these components should be combined, calibrated, and consolidated within end-to-end pipelines.</div><div>This paper introduces a cumulative, pipeline-oriented framework for imbalanced classification under extreme skew. Rather than categorizing learning algorithms or paradigms, the framework reorganizes the design space of imbalanced learning around <em>where</em> and <em>how</em> imbalance-aware interventions act within the modeling pipeline. By progressively integrating data-level adjustments, objective and decision-rule shaping, and higher-level consolidation mechanisms, the framework exposes the structural logic underlying effective rare-event classifiers.</div><div>We validate this framework through a large-scale benchmark comprising 4109 pipeline configurations applied to administrative microdata from the Spanish Social Security system, where monthly retirement detection exhibits a positive-class prevalence of 2.34%. The results reveal clear and interpretable performance regimes as pipeline complexity increases. Explicit pipeline-level ensemble architectures achieve the strongest recall and balance-sensitive metrics, while well-calibrated and optimized intermediate pipelines frequently attain comparable or superior performance in selected metrics-such as F1-score and Matthews correlation coefficient-relative to ensemble baselines, at substantially lower computational cost and with improved interpretability. Distributional and robustness analyses further show that consolidation mechanisms act primarily by stabilizing the upper tail of the performance distribution rather than by improving isolated best-case outcomes.</div><div>Beyond this application, the proposed framework provides a reproducible benchmarking protocol and a unifying operational lens for understanding when additional methodological sophistication yields robust, stable, and practically meaningful performance gains under extreme class imbalance.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"312 ","pages":"Article 131366"},"PeriodicalIF":7.5,"publicationDate":"2026-05-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146192782","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-perspective domain-invariant network with energy density-based data augmentation for domain generalization fault diagnosis 基于能量密度数据增强的多视角域不变网络用于域泛化故障诊断
IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2026-05-25 Epub Date: 2026-02-08 DOI: 10.1016/j.eswa.2026.131583
Sukeun Hong, Jaewook Lee, Jongsoo Lee
Existing domain generalization fault diagnosis methods achieve satisfactory interpolation performance but struggle with extrapolation owing to two fundamental limitations: insufficient source domain coverage and the inability to verify whether learned features represent causal fault characteristics or spurious correlations. To address these challenges, this study proposes a multi-perspective domain-invariant network (MPDIN) with energy–density-based data augmentation. MPDIN employs bootstrap aggregation to train multiple feature extractors on strategically defined domain subsets, establishing hierarchical domain invariance by enforcing subset-level invariance through triplet loss and inter-subset consistency via correlation alignment. This multi-perspective framework effectively suppresses subset-specific spurious correlations while preserving genuine fault characteristics. The energy–density-based augmentation leverages the ω2-proportional relationship between rotational speed and vibration energy to generate realistic extrapolation data beyond source domain boundaries, utilizing raw short-time Fourier transform power spectrograms to preserve absolute energy information essential for physics-based scaling. Experimental validation across four diverse datasets demonstrated substantial improvements in challenging extrapolation scenarios, achieving gains of 19–47%, whereas conventional methods showed significant performance degradation. Manifold analysis confirmed continuity and complete target–source integration, validating the attainment of true domain-invariant learning. Although limitations exist in time-varying scenarios, the proposed methodology provides a principled framework for industrial deployment where targets frequently exceed training envelopes.
现有的领域泛化故障诊断方法可以获得令人满意的内插性能,但由于源域覆盖范围不足以及无法验证学习到的特征是代表因果故障特征还是虚假相关,因此在外推方面存在困难。为了解决这些挑战,本研究提出了一种基于能量密度的数据增强的多视角域不变网络(MPDIN)。MPDIN采用自举聚合在策略定义的领域子集上训练多个特征提取器,通过三元组损失强制子集级不变性建立层次域不变性,通过相关对齐强制子集间一致性建立层次域不变性。这种多视角框架有效地抑制了子集特定的伪相关,同时保留了真实的故障特征。基于能量密度的增强利用转速和振动能量之间的ω2比例关系,在源域边界之外生成真实的外推数据,利用原始的短时傅立叶变换功率谱来保留基于物理的缩放所必需的绝对能量信息。在四个不同数据集上的实验验证表明,在具有挑战性的外推场景中有了实质性的改进,实现了19-47%的增益,而传统方法表现出显著的性能下降。流形分析证实了连续性和完整的目标-源集成,验证了真正的领域不变学习的实现。虽然在时变的情况下存在局限性,但拟议的方法为工业部署提供了一个原则性框架,其中目标经常超过培训范围。
{"title":"Multi-perspective domain-invariant network with energy density-based data augmentation for domain generalization fault diagnosis","authors":"Sukeun Hong,&nbsp;Jaewook Lee,&nbsp;Jongsoo Lee","doi":"10.1016/j.eswa.2026.131583","DOIUrl":"10.1016/j.eswa.2026.131583","url":null,"abstract":"<div><div>Existing domain generalization fault diagnosis methods achieve satisfactory interpolation performance but struggle with extrapolation owing to two fundamental limitations: insufficient source domain coverage and the inability to verify whether learned features represent causal fault characteristics or spurious correlations. To address these challenges, this study proposes a multi-perspective domain-invariant network (MPDIN) with energy–density-based data augmentation. MPDIN employs bootstrap aggregation to train multiple feature extractors on strategically defined domain subsets, establishing hierarchical domain invariance by enforcing subset-level invariance through triplet loss and inter-subset consistency via correlation alignment. This multi-perspective framework effectively suppresses subset-specific spurious correlations while preserving genuine fault characteristics. The energy–density-based augmentation leverages the <span><math><mrow><msup><mrow><mi>ω</mi></mrow><mn>2</mn></msup></mrow></math></span>-proportional relationship between rotational speed and vibration energy to generate realistic extrapolation data beyond source domain boundaries, utilizing raw short-time Fourier transform power spectrograms to preserve absolute energy information essential for physics-based scaling. Experimental validation across four diverse datasets demonstrated substantial improvements in challenging extrapolation scenarios, achieving gains of 19–47%, whereas conventional methods showed significant performance degradation. Manifold analysis confirmed continuity and complete target–source integration, validating the attainment of true domain-invariant learning. Although limitations exist in time-varying scenarios, the proposed methodology provides a principled framework for industrial deployment where targets frequently exceed training envelopes.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"312 ","pages":"Article 131583"},"PeriodicalIF":7.5,"publicationDate":"2026-05-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146192784","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Expert Systems with Applications
全部 Adv. Atmos. Sci. ECOLOGY Carbon Balance Manage. ARCHAEOMETRY Clean-Soil Air Water Int. J. Earth Sci. ENVIRONMENT Appl. Geochem. IZV-PHYS SOLID EART+ Études Caribéennes Resour. Geol. Geobiology Communications Earth & Environment J. Atmos. Sol. Terr. Phys. Global Biogeochem. Cycles ENG SANIT AMBIENT EVOL MED PUBLIC HLTH Eurasian Journal of Emergency Medicine Geosci. J. 国际生物医学工程杂志 J. Afr. Earth. Sci. Environmental Control in Biology ERN: Other Macroeconomics: Aggregative Models (Topic) TERR ATMOS OCEAN SCI Essentials of Polymer Flooding Technique OCEAN SCI J Expert Opin. Pharmacother. 化工催化剂及甲醇技术 Geol. J. Am. J. Phys. Anthropol. Ocean and Coastal Research Am. Mineral. EXPERT REV ANTICANC Solid Earth Environmental Progress ERN: Other IO: Empirical Studies of Firms & Markets (Topic) Ecol. Monogr. Expert Rev. Clin. Immunol. PERIOD MINERAL B SOC GEOL MEX Annu. Rev. Earth Planet. Sci. Weather Forecasting ECOL RESTOR Surv. Geophys. GEOHERITAGE 胜利油田党校学报 Expert Rev. Mol. Diagn. Int. J. Geog. Inf. Sci. J. South Am. Earth Sci. GEOGR FIS DIN QUAT Acta Geophys. Archaeol. Anthropol. Sci. Atmos. Meas. Tech. Exp. Cell. Res. Meteorol. Z. Acta Palaeontol. Polonica Equine veterinary journal. Supplement Public Management Review ACTA GEOL POL ENVIRON HEALTH-GLOB Conserv. Biol. 遥感信息 Atmos. Sci. Lett. J. Atmos. Oceanic Technol. QUATERNAIRE ACTA GEOL SIN-ENGL Environ. Res. Lett. Eur. J. Control Seismol. Res. Lett. Big Earth Data CRIT REV ENV SCI TEC Miner. Deposita Org. Geochem. J GEOPHYS RES-OCEANS Ife Journal of Science Geochim. Cosmochim. Acta Int. J. Biometeorol. TECTONOPHYSICS IDOJARAS GROUNDWATER Exp. Biol. Med. Ann. Glaciol. Precambrian Res. Acta Oceanolog. Sin. Austrian J. Earth Sci. ATMOSPHERE-BASEL GEOCHEM GEOPHY GEOSY Atmos. Chem. Phys. Mar. Geod. GEOL BELG Appl. Clay Sci. J. Atmos. Chem. Ann. Geophys. Environ. Prot. Eng. Engineering, Technology & Applied Science Research BIOGEOSCIENCES Environ. Eng. Sci. Am. J. Sci. COMP BIOCHEM PHYS C AAPG Bull.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1