Pub Date: 2025-12-15 DOI: 10.1016/j.isatra.2025.12.020
Yu Hu, Aibing Qiu, Yintao Wang, Shengfeng Wang
Multirate sampled-data (MRSD) dynamic systems are abundant in modern engineering. Inconsistent sampling rates cause data asynchrony and alter system properties, which degrades fault diagnosis through delayed and missed detections. This paper proposes a fast-rate fault detection scheme for dynamic systems that is driven directly by MRSD. First, the lifting technique is employed to transform asynchronous MRSD into single-rate data sampled at the slow rate. An auxiliary lifted output is then constructed to compute a parity vector via subspace identification, which facilitates the design of a multi-dimensional diagnostic observer satisfying the Luenberger conditions. A post-filter then addresses the causality constraint, allowing fast-rate residual generation. Further, a fast-rate residual evaluation scheme is developed. The effectiveness and superiority of the proposed scheme are demonstrated on a heating, ventilation and air conditioning (HVAC) example.
Title: Multirate sampled data driven fast rate fault detection of dynamic systems.
Journal: ISA Transactions
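The lifting step mentioned in the abstract above can be illustrated with a minimal sketch. It assumes one fast-rate signal and one slow-rate signal whose period is `q` fast steps; the function name `lift` and the sample values are hypothetical and are not taken from the paper.

```python
def lift(fast, slow, q):
    """Stack q fast-rate samples with one slow-rate sample per slow period.

    fast: samples taken every fast step
    slow: samples taken every q fast steps
    Returns one lifted vector per complete slow period.
    """
    n_frames = min(len(fast) // q, len(slow))
    return [fast[k * q:(k + 1) * q] + [slow[k]] for k in range(n_frames)]


# Hypothetical data: the fast sensor is sampled three times per slow sample.
fast = [0.0, 0.1, 0.2, 0.3, 0.4, 0.5, 0.6]
slow = [10.0, 11.0, 12.0]
lifted = lift(fast, slow, q=3)
print(lifted)
```

Each lifted vector is available only once per slow period, which is why the abstract needs a further step (the post-filter) to recover fast-rate residuals.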
Pub Date: 2025-12-11 DOI: 10.1016/j.isatra.2025.12.015
Shuhan Zhang, Xiuxia Yin, Zhiwei Gao
This paper examines the prescribed-time consensus of multi-agent systems under time-varying communication delays and Denial-of-Service (DoS) attacks. Considering a general class of DoS attacks with limited duration, a novel control protocol that incorporates time-varying node delays, communication delays, and integral action is proposed to guarantee secure prescribed-time consensus. Moreover, a controller is designed using Artstein's reduction transformation, which effectively addresses the challenges posed by time-varying delays. Using the comparison principle and Lyapunov stability theory, the consensus convergence properties are analyzed and sufficient criteria are obtained. Furthermore, a distributed prescribed-time observer is introduced to guarantee that all follower agents obtain the leader's state information within the prescribed time, even if only a subset of them initially has access to it. Finally, a numerical simulation is presented to substantiate the robustness and implementability of the theoretical results.
Title: Prescribed-time consensus control of nonlinear time-delayed multi-agent systems under DoS attacks.
Journal: ISA Transactions
Pub Date: 2025-12-05 DOI: 10.1016/j.isatra.2025.12.006
Jian Huang, Hang Ruan, Jianbo Yu, Qingchao Jiang, Xiaofeng Yang
Recognizing nonstationarity is pivotal for trustworthy industrial process monitoring. Existing methods address this issue from a unimodal perspective, which struggles to capture intrinsic heterogeneity. To resolve this, we introduce a novel unsupervised multimodal nonstationary monitoring framework (UMNMF), integrating a bimodal paradigm with contrastive and adversarial schemes. Initially, the knowledge labeling unit (KLU) is established to generate pseudo-labels augmented with prior knowledge for semantic guidance. Subsequently, the dynamic alignment and encoding unit (DAEU) exploits contrastive language-image pre-training (CLIP) and the Vision Transformer (ViT) for modality-aware alignment through a pseudo-supervised contrastive mechanism. Furthermore, the association alignment and distillation unit (AADU) is devised to achieve decoupling through self-adversarial distribution regularization within a variational graph autoencoder (VGAE). The superior performance is substantiated by extensive experiments on three industrial processes, where the UMNMF attains an average fault detection rate exceeding 94 % and maintains a false alarm rate below 2.5 %. Additional ablation studies further confirm the contribution of each module to overall performance improvement.
Title: A bimodal framework for nonstationary process monitoring via collaborative contrastive and adversarial unsupervised learning.
Journal: ISA Transactions
Pub Date: 2025-12-04 DOI: 10.1016/j.isatra.2025.12.001
Lei Chen, Haoyan Dong, Shuaijie Chen, Kuangrong Hao
Real-world industrial fault diagnosis faces challenges from unknown fault types and limited labeled data, where existing methods often suffer from prototype collapse and unreliable clustering. This paper proposes an uncertainty-aware prototype learning framework with structural constraints for open-world semi-supervised fault diagnosis (OpenUPS). It introduces prototypes based on a simplex equiangular tight frame to enforce uniformly distributed and maximally separated class centers, effectively preventing collapse under limited supervision. To address the varying reliability of unlabeled data, an uncertainty-aware contrastive strategy adaptively selects informative pairs, enabling robust alignment of seen classes and progressive clustering of novel faults. Experiments on the Tennessee Eastman process and a real-world polyester esterification process demonstrate that OpenUPS outperforms existing methods, achieving strong generalization and adaptability for open-world industrial fault diagnosis.
Title: An uncertainty-aware prototype learning framework with structural constraints for open-world semi-supervised fault diagnosis.
Journal: ISA Transactions
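The simplex equiangular tight frame named in the abstract above has a standard closed form, sketched here as an illustration. The construction places K unit-norm prototypes in K dimensions with pairwise cosine -1/(K-1); the function names are hypothetical, and how the paper embeds the frame in its feature space is not stated in the abstract.

```python
import math


def simplex_etf(k):
    """Return k prototypes (rows) forming a simplex equiangular tight frame.

    Row i is sqrt(k / (k - 1)) * (e_i - 1/k * ones), which has unit norm
    and inner product -1 / (k - 1) with every other row.
    """
    scale = math.sqrt(k / (k - 1))
    return [
        [scale * ((1.0 if i == j else 0.0) - 1.0 / k) for j in range(k)]
        for i in range(k)
    ]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


prototypes = simplex_etf(4)
print(dot(prototypes[0], prototypes[0]))  # squared norm of one prototype
print(dot(prototypes[0], prototypes[1]))  # cosine between two prototypes
```

Because the pairwise angle is the largest that K equal-norm vectors can share, fixed prototypes of this form cannot collapse onto each other, which is the property the abstract relies on.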
Pub Date: 2025-11-27 DOI: 10.1016/j.isatra.2025.11.036
Qifu Chen, Zhuang Li, Weijun Li, Yunpeng Guo, Jianqi An, Jinhua She
The permeability index (PI) of a blast furnace (BF) is a key indicator of furnace performance, as it reflects the extent of indirect reduction, energy consumption, molten iron quality, and overall production efficiency. Accurate prediction of the PI is essential for ensuring stable and efficient BF performance. Because the operational parameters exhibit complex multi-time-scale characteristics, this paper presents a multi-step prediction model based on multi-time-scale analysis to capture their long-term evolution trends. First, the multi-time-scale characteristics of BF operation are analyzed from both the smelting mechanism and data-driven perspectives. Based on these characteristics, single-step prediction models of the PI are constructed on the long, medium, and short time scales, respectively. An iterative compensation strategy then extends each single-step model into a multi-step prediction framework, and the predictions from the different time scales are fused to obtain the future PI. Finally, a performance evaluation on actual industrial data verifies the advantages of the proposed iterative-compensation-based multi-step prediction method in terms of accuracy and stability.
Title: Multi-step prediction of blast furnace permeability index based on multi-time-scale analysis.
Journal: ISA Transactions
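Extending a single-step predictor to multiple steps, as the abstract above describes, is commonly done by feeding each prediction back as an input. The sketch below shows only that generic recursive loop; the paper's compensation term and multi-time-scale fusion are not reproduced, and `multi_step_forecast` and `mean_model` are hypothetical names.

```python
def multi_step_forecast(history, one_step_model, window, steps):
    """Roll a one-step-ahead model forward by reusing its own predictions."""
    buffer = list(history)
    predictions = []
    for _ in range(steps):
        next_value = one_step_model(buffer[-window:])
        predictions.append(next_value)
        buffer.append(next_value)  # the prediction becomes an input
    return predictions


def mean_model(recent):
    """Stand-in one-step model: the average of the most recent samples."""
    return sum(recent) / len(recent)


forecast = multi_step_forecast([1.0, 2.0, 3.0], mean_model, window=2, steps=3)
print(forecast)
```

In a plain recursive scheme like this one, errors accumulate with the horizon, which is the usual motivation for adding a compensation step.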
With the widespread deployment of intelligent sensors and advances in data storage, large volumes of process data are continuously collected, providing a foundation for developing soft sensors for multi-scale monitoring in complex industrial processes. This paper proposes an enhanced autoencoder-based temporal convolutional soft sensor model for industrial process monitoring, aiming to effectively capture multi-scale features and the dynamic evolution of process data. The proposed filter temporal convolutional network incorporates adaptive filter-response normalization, thereby enhancing multi-scale feature extraction and improving model generalization. Then, a multi-layer filter temporal convolutional autoencoder is developed to enable efficient multi-scale feature extraction and accurate process data reconstruction. Moreover, a multi-scale feature fusion module with a channel attention mechanism is designed to adaptively integrate temporal features and significantly enhance model robustness. Finally, a statistical metric based on reconstruction errors is established, and the Kullback-Leibler divergence is employed to determine control limits for fault detection. The superiority and effectiveness of the proposed method are validated through applications to the wastewater treatment process and the multiphase flow process.
Title: Soft sensors for industrial fault detection using multi-scale fusion temporal convolutional autoencoders.
Journal: ISA Transactions
Pub Date: 2025-11-26 DOI: 10.1016/j.isatra.2025.11.020
Huanqi Sun, Weili Xiong, Zhongmei Li, Wenxin Sun, Yiyang Chen, Hongtian Chen
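The abstract above does not say how the Kullback-Leibler divergence enters the control-limit calculation. As one possible reading, the sketch below compares the reconstruction errors in a monitoring window with fault-free reference errors, under the assumption that both are roughly Gaussian. The function names and error values are hypothetical.

```python
import math


def mean_std(values):
    """Sample mean and (unbiased) standard deviation."""
    m = sum(values) / len(values)
    var = sum((v - m) ** 2 for v in values) / (len(values) - 1)
    return m, math.sqrt(var)


def gaussian_kl(mean_p, std_p, mean_q, std_q):
    """KL(P || Q) for univariate Gaussians P and Q."""
    return (
        math.log(std_q / std_p)
        + (std_p ** 2 + (mean_p - mean_q) ** 2) / (2 * std_q ** 2)
        - 0.5
    )


# Hypothetical reconstruction errors: fault-free reference and a test window.
reference = [0.10, 0.12, 0.09, 0.11, 0.10, 0.13, 0.08, 0.11]
window = [0.30, 0.34, 0.28, 0.33]

kl = gaussian_kl(*mean_std(window), *mean_std(reference))
print(kl)
```

A control limit could then be chosen as a high quantile of this statistic over fault-free windows; that choice is also an assumption of the sketch.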
Pub Date: 2025-11-08 DOI: 10.1016/j.isatra.2025.11.001
Zhengxuan Zhang, Xu Yang, Yuri A W Shardt, Jingjing Gao, Jiarui Cui
Dynamic latent variable (DLV) models have been widely applied in industrial soft sensing due to their ability to extract features and capture dynamic behavior. However, conventional DLV models are limited to linear feature extraction and perform poorly with nonuniformly sampled data. Thus, this paper proposes a soft sensor for nonuniformly sampled nonlinear dynamic processes based on an irregular-time-interval latent probabilistic predictability embedding supervised deep network (ILPPSDN). First, a prediction regularization term is added to the decoding loss of the target-related autoencoder to model latent temporal dependencies and enhance feature predictability. Furthermore, the internal state derivative in the proposed irregular-time-interval variational recurrent neural network is parameterized by an ordinary differential equation network, integrating hidden-state evolution with state updates. In addition, all network components are jointly optimized through unified training. Then, an ILPPSDN-based soft sensor is developed for nonuniformly sampled nonlinear dynamic processes via pre-training and supervised fine-tuning. Finally, the results indicate that the proposed ILPPSDN can reduce the root mean square error by at least 26.1 %, 21.1 %, and 26.1 % at the uneven sampling ratios of 1/2, 2/3, and 3/4 in the debutanizer column. The corresponding reductions in the sulfur recovery unit are 21.1 %, 26.1 %, and 26.1 %. Additionally, in the ablation studies, the proposed method reduced the root mean square error by at least 5 % and 6 % in the two industrial cases, respectively.
Title: Soft sensor for nonuniform sampling nonlinear dynamic process using irregular-time-interval latent probabilistic predictability embedding supervised deep network.
Journal: ISA Transactions
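The percentage figures quoted in the abstract above are relative reductions in root mean square error (RMSE). The arithmetic is sketched below on made-up numbers; none of the values come from the paper, and the helper names are hypothetical.

```python
import math


def rmse(y_true, y_pred):
    """Root mean square error between two equal-length sequences."""
    squared = [(t - p) ** 2 for t, p in zip(y_true, y_pred)]
    return math.sqrt(sum(squared) / len(squared))


def relative_reduction(baseline_rmse, proposed_rmse):
    """Percentage by which the proposed RMSE is lower than the baseline."""
    return 100.0 * (baseline_rmse - proposed_rmse) / baseline_rmse


# Hypothetical targets and predictions, for illustration only.
y_true = [1.0, 2.0, 3.0, 4.0]
baseline_pred = [1.5, 2.5, 3.5, 4.5]
proposed_pred = [1.25, 2.25, 3.25, 4.25]

reduction = relative_reduction(
    rmse(y_true, baseline_pred), rmse(y_true, proposed_pred)
)
print(f"{reduction:.1f} %")
```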
Pub Date: 2025-11-04 DOI: 10.1016/j.isatra.2025.10.051
Yunkai Wu, Yu Tian, Yang Zhou, Xiangqian Liu
The power traction system of a high-speed train operates for extended periods in complex environments with high temperatures and vibrations, which makes it susceptible to various types of incipient faults. Missing sensor data under complex operating conditions make the diagnosis of incipient faults in practice even more challenging. To address these issues, this paper proposes an innovative data imputation method that combines kernel functions with the modified Akima (Makima) interpolation algorithm. The method can effectively handle scenarios in which a large number of sensor signals are continuously missing in high-speed train traction systems. Building on this, a real-time incipient fault diagnosis framework is proposed that combines Deep-Principal Component Analysis (Deep-PCA) with Multi-scale Principal Component Analysis (MSPCA). The framework enhances the capability to extract fault features from both horizontal and vertical perspectives, thereby improving the accuracy of fault detection. Furthermore, the continuous wavelet transform (CWT) is employed to amplify fault-related information within the dataset. The enhanced dataset, combined with a fault isolation criterion based on the reconstructed cumulative contribution rate, enables precise fault isolation.
Title: Deep-PCA and MSPCA based fault diagnosis of high-speed train traction systems under missing data conditions.
Journal: ISA Transactions
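The modified Akima (Makima) interpolation named in the abstract above can be sketched in plain Python. This covers only the interpolation part, using the commonly documented Makima slope weights; the kernel-function component of the paper's imputation method is not reproduced. The function names and sample data are hypothetical.

```python
from bisect import bisect_right


def makima_slopes(x, y):
    """Node derivatives for modified Akima interpolation (needs >= 3 points)."""
    d = [(y[i + 1] - y[i]) / (x[i + 1] - x[i]) for i in range(len(x) - 1)]
    # Extend the segment slopes by two on each side, as in Akima's scheme.
    left1 = 2 * d[0] - d[1]
    left2 = 2 * left1 - d[0]
    right1 = 2 * d[-1] - d[-2]
    right2 = 2 * right1 - d[-1]
    e = [left2, left1] + d + [right1, right2]
    slopes = []
    for i in range(len(x)):
        # Makima weights: the added |sum| / 2 term damps overshoot near flats.
        w1 = abs(e[i + 3] - e[i + 2]) + abs(e[i + 3] + e[i + 2]) / 2
        w2 = abs(e[i + 1] - e[i]) + abs(e[i + 1] + e[i]) / 2
        total = w1 + w2
        slopes.append((w1 * e[i + 1] + w2 * e[i + 2]) / total if total > 0 else 0.0)
    return slopes


def makima_eval(x, y, xq):
    """Evaluate the piecewise cubic Hermite interpolant at xq."""
    s = makima_slopes(x, y)
    k = min(max(bisect_right(x, xq) - 1, 0), len(x) - 2)
    h = x[k + 1] - x[k]
    t = (xq - x[k]) / h
    h00 = 2 * t ** 3 - 3 * t ** 2 + 1
    h10 = t ** 3 - 2 * t ** 2 + t
    h01 = -2 * t ** 3 + 3 * t ** 2
    h11 = t ** 3 - t ** 2
    return h00 * y[k] + h10 * h * s[k] + h01 * y[k + 1] + h11 * h * s[k + 1]


# Hypothetical sensor trace with the sample at time 3 missing.
times = [0.0, 1.0, 2.0, 4.0, 5.0]
values = [0.0, 2.0, 4.0, 8.0, 10.0]
filled = makima_eval(times, values, 3.0)
print(filled)
```

For long runs of consecutive missing samples, a single interpolant spans a wide gap, which is presumably where the paper's kernel-based component contributes.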
Anomaly detection of converter valves via acoustic analysis has been a hot topic in the high-voltage direct current research field. However, several factors have significantly hindered the practical deployment of such acoustic anomaly detection methods, including the infrequency of valve anomalies leading to imbalanced distributions of acoustic samples, limitations in computational resources, and inherent class imbalance in acoustic signals. To address these challenges, this paper proposes a novel unsupervised anomaly detection framework, named lightweight transformer-based generative adversarial networks (LT-GAN). It introduces two lightweight modules, MobileNet V2 and D-MobileNet V2, to perform downsampling and upsampling of Mel-spectrograms derived from acoustic signals. Furthermore, it incorporates a K-ViT block to enhance global representation learning of spectral images and reduce network parameters. Experiments on real-world acoustic datasets show the superiority of the proposed LT-GAN, which achieves an AUC of 0.9806 on the ROC curve, significantly outperforming baseline methods. In the low false-positive regions [0, 0.1], [0, 0.2], and [0, 0.3], the p-AUCs reach 0.9295, 0.9122, and 0.9559, respectively. Moreover, LT-GAN exhibits exceptional lightweight characteristics, with model complexity metrics of 9.626 million parameters, 0.506 GFLOPs, and a model size of 37.48 MB. These results validate the effectiveness of the proposed approach in terms of anomaly detection performance and resource efficiency.
Title: Lightweight transformer-based generative adversarial network for acoustic anomaly detection in converter valves.
Journal: ISA Transactions
Pub Date: 2025-10-31 DOI: 10.1016/j.isatra.2025.10.049
Mingzhu Tang, Chen Yin, Haijun Hu, Zhihong Wang, Fuqiang Xiong, Ying Wei, Zhiwen Chen
Pub Date: 2025-10-31 DOI: 10.1016/j.isatra.2025.10.048
Qiluo Xiong, Yanhui Ren, Fan Yang, Andrei Torgashov
In recent years, deep learning techniques have been increasingly adopted in soft sensor modeling, with the transformer architecture demonstrating notable advantages not only in natural language processing and image analysis but also in time-series modeling. Autoencoders, known for their ability to learn compact representations of process data, have also been widely applied for feature extraction in soft sensors. However, when dealing with multivariate process data, conventional autoencoder-based models often suffer from underfitting due to persistent reconstruction errors or overfitting when the reconstruction loss converges prematurely. These issues hinder effective feature learning and limit the model's generalization capability in real-world applications. To address these challenges, this paper proposes Resformer, a novel transformer-based architecture that incorporates residual feature compensation. Resformer employs a two-stage autoencoding structure to extract both primary and secondary features and fuses them via a cross-attention mechanism to enhance representation completeness. Time tokens are used as the basic modeling units to capture spatiotemporal dependencies among process variables, which are then mapped to the target quality variable through a dedicated decoding structure. Experimental results on the Tennessee Eastman (TE) process and an industrial alkylation process dataset demonstrate that Resformer, with residual compensation and spatiotemporal feature learning, significantly outperforms recent transformer-based variants while maintaining comparable architectural complexity suitable for practical deployment.
Title: Resformer: Time-token transformer with residual compensation for quality prediction in industrial processes.
Journal: ISA Transactions