Pub Date : 2025-12-05DOI: 10.1016/j.isatra.2025.12.006
Jian Huang, Hang Ruan, Jianbo Yu, Qingchao Jiang, Xiaofeng Yang
Recognizing nonstationarity is pivotal for trustworthy industrial process monitoring. Existing methods address this issue from a unimodal perspective, which struggles to capture intrinsic heterogeneity. To resolve this, we introduce a novel unsupervised multimodal nonstationary monitoring framework (UMNMF), integrating a bimodal paradigm with contrastive and adversarial schemes. Initially, the knowledge labeling unit (KLU) is established to generate pseudo-labels augmented with prior knowledge for semantic guidance. Subsequently, the dynamic alignment and encoding unit (DAEU) exploits contrastive language-image pre-training (CLIP) and the Vision Transformer (ViT) for modality-aware alignment through a pseudo-supervised contrastive mechanism. Furthermore, the association alignment and distillation unit (AADU) is devised to achieve decoupling through self-adversarial distribution regularization within a variational graph autoencoder (VGAE). The superior performance is substantiated by extensive experiments on three industrial processes, where the UMNMF attains an average fault detection rate exceeding 94 % and maintains a false alarm rate below 2.5 %. Additional ablation studies further confirm the contribution of each module to overall performance improvement.
{"title":"A bimodal framework for nonstationary process monitoring via collaborative contrastive and adversarial unsupervised learning.","authors":"Jian Huang, Hang Ruan, Jianbo Yu, Qingchao Jiang, Xiaofeng Yang","doi":"10.1016/j.isatra.2025.12.006","DOIUrl":"https://doi.org/10.1016/j.isatra.2025.12.006","url":null,"abstract":"<p><p>Recognizing nonstationarity is pivotal for trustworthy industrial process monitoring. Existing methods address this issue from a unimodal perspective, which struggles to capture intrinsic heterogeneity. To resolve this, we introduce a novel unsupervised multimodal nonstationary monitoring framework (UMNMF), integrating a bimodal paradigm with contrastive and adversarial schemes. Initially, the knowledge labeling unit (KLU) is established to generate pseudo-labels augmented with prior knowledge for semantic guidance. Subsequently, the dynamic alignment and encoding unit (DAEU) exploits contrastive language-image pre-training (CLIP) and the Vision Transformer (ViT) for modality-aware alignment through a pseudo-supervised contrastive mechanism. Furthermore, the association alignment and distillation unit (AADU) is devised to achieve decoupling through self-adversarial distribution regularization within a variational graph autoencoder (VGAE). The superior performance is substantiated by extensive experiments on three industrial processes, where the UMNMF attains an average fault detection rate exceeding 94 % and maintains a false alarm rate below 2.5 %. Additional ablation studies further confirm the contribution of each module to overall performance improvement.</p>","PeriodicalId":94059,"journal":{"name":"ISA transactions","volume":" ","pages":""},"PeriodicalIF":6.5,"publicationDate":"2025-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145727941","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-04DOI: 10.1016/j.isatra.2025.12.001
Lei Chen, Haoyan Dong, Shuaijie Chen, Kuangrong Hao
Real-world industrial fault diagnosis faces challenges from unknown fault types and limited labeled data, where existing methods often suffer from prototype collapse and unreliable clustering. This paper proposes an uncertainty-aware prototype learning framework with structural constraints for open-world semi-supervised fault diagnosis (OpenUPS). It introduces prototypes based on simplex equiangular tight frame to enforce uniformly distributed and maximally separated class centers, effectively preventing collapse under limited supervision. To address the varying reliability of unlabeled data, an uncertainty-aware contrastive strategy adaptively selects informative pairs, enabling robust alignment of seen classes and progressive clustering of novel faults. Experiments on the Tennessee Eastman process and a real-world polyester esterification process demonstrate that OpenUPS outperforms existing methods, achieving strong generalization and adaptability for open-world industrial fault diagnosis.
{"title":"An uncertainty-aware prototype learning framework with structural constraints for open-world semi-supervised fault diagnosis.","authors":"Lei Chen, Haoyan Dong, Shuaijie Chen, Kuangrong Hao","doi":"10.1016/j.isatra.2025.12.001","DOIUrl":"https://doi.org/10.1016/j.isatra.2025.12.001","url":null,"abstract":"<p><p>Real-world industrial fault diagnosis faces challenges from unknown fault types and limited labeled data, where existing methods often suffer from prototype collapse and unreliable clustering. This paper proposes an uncertainty-aware prototype learning framework with structural constraints for open-world semi-supervised fault diagnosis (OpenUPS). It introduces prototypes based on simplex equiangular tight frame to enforce uniformly distributed and maximally separated class centers, effectively preventing collapse under limited supervision. To address the varying reliability of unlabeled data, an uncertainty-aware contrastive strategy adaptively selects informative pairs, enabling robust alignment of seen classes and progressive clustering of novel faults. Experiments on the Tennessee Eastman process and a real-world polyester esterification process demonstrate that OpenUPS outperforms existing methods, achieving strong generalization and adaptability for open-world industrial fault diagnosis.</p>","PeriodicalId":94059,"journal":{"name":"ISA transactions","volume":" ","pages":""},"PeriodicalIF":6.5,"publicationDate":"2025-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145770549","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-11-27DOI: 10.1016/j.isatra.2025.11.036
Qifu Chen, Zhuang Li, Weijun Li, Yunpeng Guo, Jianqi An, Jinhua She
The permeability index (PI) of a blast furnace (BF) is a key indicator of furnace performance, as it reflects the extent of indirect reduction, energy consumption, molten iron quality, and overall production efficiency. Accurate prediction of the PI is essential for ensuring stable and efficient BF performance. Due to the complex multi-time-scale characteristics of different operational parameters, this paper presents a multi-step prediction model based on multi-time-scale analysis to capture their long-term evolution trends. First, the multi-time-scale characteristics of BF operation are analyzed from both the smelting mechanism and data-driven perspectives. According to the characteristics, this paper constructs a single-step prediction model of PI on the long-time-scale, medium-time-scale, and short-time-scale, respectively, and introduces an iterative compensation strategy to extend each single-step model into a multi-step prediction framework, and then fuses the prediction results under the multi-time-scale to obtain the results of the future PI. Finally, the performance evaluation is shown based on actual industrial data, which verifies the significant advantages of the proposed multi-step prediction method based on iterative compensation in terms of accuracy and stability.
{"title":"Multi-step prediction of blast furnace permeability index based on multi-time-scale analysis.","authors":"Qifu Chen, Zhuang Li, Weijun Li, Yunpeng Guo, Jianqi An, Jinhua She","doi":"10.1016/j.isatra.2025.11.036","DOIUrl":"https://doi.org/10.1016/j.isatra.2025.11.036","url":null,"abstract":"<p><p>The permeability index (PI) of a blast furnace (BF) is a key indicator of furnace performance, as it reflects the extent of indirect reduction, energy consumption, molten iron quality, and overall production efficiency. Accurate prediction of the PI is essential for ensuring stable and efficient BF performance. Due to the complex multi-time-scale characteristics of different operational parameters, this paper presents a multi-step prediction model based on multi-time-scale analysis to capture their long-term evolution trends. First, the multi-time-scale characteristics of BF operation are analyzed from both the smelting mechanism and data-driven perspectives. According to the characteristics, this paper constructs a single-step prediction model of PI on the long-time-scale, medium-time-scale, and short-time-scale, respectively, and introduces an iterative compensation strategy to extend each single-step model into a multi-step prediction framework, and then fuses the prediction results under the multi-time-scale to obtain the results of the future PI. Finally, the performance evaluation is shown based on actual industrial data, which verifies the significant advantages of the proposed multi-step prediction method based on iterative compensation in terms of accuracy and stability.</p>","PeriodicalId":94059,"journal":{"name":"ISA transactions","volume":" ","pages":""},"PeriodicalIF":6.5,"publicationDate":"2025-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145673194","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
With the widespread deployment of intelligent sensors and advances in data storage, large volumes of process data are continuously collected, providing a foundation for developing soft sensors for multi-scale monitoring in complex industrial processes. This paper proposes an enhanced autoencoder-based temporal convolutional soft sensor model for industrial process monitoring, aiming to effectively capture multi-scale features and the dynamic evolution of process data. The proposed filter temporal convolutional network incorporates adaptive filter-response normalization, thereby enhancing multi-scale feature extraction and improving model generalization. Then, a multi-layer filter temporal convolutional autoencoder is developed to enable efficient multi-scale feature extraction and accurate process data reconstruction. Moreover, a multi-scale feature fusion module with a channel attention mechanism is designed to adaptively integrate temporal features and significantly enhance model robustness. Finally, a statistical metric based on reconstruction errors is established, and the Kullback-Leibler divergence is employed to determine control limits for fault detection. The superiority and effectiveness of the proposed method are validated through applications to the wastewater treatment process and the multiphase flow process.
{"title":"Soft sensors for industrial fault detection using multi-scale fusion temporal convolutional autoencoders.","authors":"Huanqi Sun, Weili Xiong, Zhongmei Li, Wenxin Sun, Yiyang Chen, Hongtian Chen","doi":"10.1016/j.isatra.2025.11.020","DOIUrl":"https://doi.org/10.1016/j.isatra.2025.11.020","url":null,"abstract":"<p><p>With the widespread deployment of intelligent sensors and advances in data storage, large volumes of process data are continuously collected, providing a foundation for developing soft sensors for multi-scale monitoring in complex industrial processes. This paper proposes an enhanced autoencoder-based temporal convolutional soft sensor model for industrial process monitoring, aiming to effectively capture multi-scale features and the dynamic evolution of process data. The proposed filter temporal convolutional network incorporates adaptive filter-response normalization, thereby enhancing multi-scale feature extraction and improving model generalization. Then, a multi-layer filter temporal convolutional autoencoder is developed to enable efficient multi-scale feature extraction and accurate process data reconstruction. Moreover, a multi-scale feature fusion module with a channel attention mechanism is designed to adaptively integrate temporal features and significantly enhance model robustness. Finally, a statistical metric based on reconstruction errors is established, and the Kullback-Leibler divergence is employed to determine control limits for fault detection. The superiority and effectiveness of the proposed method are validated through applications to the wastewater treatment process and the multiphase flow process.</p>","PeriodicalId":94059,"journal":{"name":"ISA transactions","volume":" ","pages":""},"PeriodicalIF":6.5,"publicationDate":"2025-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145835753","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-11-08DOI: 10.1016/j.isatra.2025.11.001
Zhengxuan Zhang, Xu Yang, Yuri A W Shardt, Jingjing Gao, Jiarui Cui
Dynamic latent variable (DLV) models have been widely applied in industrial soft sensing due to their ability to extract features and capture dynamic behavior. However, conventional DLV models are limited to linear feature extraction and perform poorly with nonuniformly sampled data. Thus, this paper proposes a soft sensor for a nonuniform sampling nonlinear dynamic process using irregular-time-interval latent probabilistic predictability embedding supervised deep network (ILPPSDN). First, a prediction regularization term is added to the decoding loss of the target-related autoencoder to model latent temporal dependencies and enhance feature predictability. Furthermore, the internal state derivative in the proposed irregular-time-interval variational recurrent neural network is parameterized by an ordinary differential equation network, integrating hidden-state evolution with state updates. In addition, all network components are jointly optimized through unified training. Then, an ILPPSDN-based soft sensor is developed for nonuniformly sampled nonlinear dynamic processes via pre-training and supervised fine-tuning. Finally, the results indicate that the proposed ILPPSDN can reduce the root mean square error by at least 26.1 %, 21.1 %, and 26.1 % at the uneven sampling ratios of 1/2, 2/3, and 3/4 in the debutanizer column. Correspondingly, in the sulfur recovery unit, these values are 21.1 %, 26.1 %, and 26.1 %. Additionally, in the ablation studies, the proposed method reduced the root mean square error by at least 5 % and 6 % in the two industrial cases, respectively.
{"title":"Soft sensor for nonuniform sampling nonlinear dynamic process using irregular-time-interval latent probabilistic predictability embedding supervised deep network.","authors":"Zhengxuan Zhang, Xu Yang, Yuri A W Shardt, Jingjing Gao, Jiarui Cui","doi":"10.1016/j.isatra.2025.11.001","DOIUrl":"https://doi.org/10.1016/j.isatra.2025.11.001","url":null,"abstract":"<p><p>Dynamic latent variable (DLV) models have been widely applied in industrial soft sensing due to their ability to extract features and capture dynamic behavior. However, conventional DLV models are limited to linear feature extraction and perform poorly with nonuniformly sampled data. Thus, this paper proposes a soft sensor for a nonuniform sampling nonlinear dynamic process using irregular-time-interval latent probabilistic predictability embedding supervised deep network (ILPPSDN). First, a prediction regularization term is added to the decoding loss of the target-related autoencoder to model latent temporal dependencies and enhance feature predictability. Furthermore, the internal state derivative in the proposed irregular-time-interval variational recurrent neural network is parameterized by an ordinary differential equation network, integrating hidden-state evolution with state updates. In addition, all network components are jointly optimized through unified training. Then, an ILPPSDN-based soft sensor is developed for nonuniformly sampled nonlinear dynamic processes via pre-training and supervised fine-tuning. Finally, the results indicate that the proposed ILPPSDN can reduce the root mean square error by at least 26.1 %, 21.1 %, and 26.1 % at the uneven sampling ratios of 1/2, 2/3, and 3/4 in the debutanizer column. Correspondingly, in the sulfur recovery unit, these values are 21.1 %, 26.1 %, and 26.1 %. Additionally, in the ablation studies, the proposed method reduced the root mean square error by at least 5 % and 6 % in the two industrial cases, respectively.</p>","PeriodicalId":94059,"journal":{"name":"ISA transactions","volume":" ","pages":""},"PeriodicalIF":6.5,"publicationDate":"2025-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145524927","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-11-04DOI: 10.1016/j.isatra.2025.10.051
Yunkai Wu, Yu Tian, Yang Zhou, Xiangqian Liu
The high-speed train power traction system operates for extended periods of time in complex environments with high temperatures and vibrations, making the system susceptible to various types of incipient faults. In addition, coupled with missing sensor data under complex operating conditions, the diagnosis of incipient faults under practical conditions becomes even more challenging. To address the aforementioned issues, this paper proposes an innovative data imputation method that combines kernel functions with the modified Akima (Makima) interpolation algorithm. This method can effectively address the scenario of a large number of sensor signals continuously missing in high-speed train traction systems. Expanding on this, a real-time incipient fault diagnosis framework is proposed, which combines Deep-Principal Component Analysis (Deep-PCA) with Multi-scale Principal Component Analysis (MSPCA). This framework enhances the capability to extract fault features from both horizontal and vertical perspectives, thereby improving the accuracy of fault detection. Furthermore, the continuous wavelet transform (CWT) is employed to amplify fault-related information within the dataset. This enhanced dataset, combined with a fault isolation criterion based on the reconstructed cumulative contribution rate, enables the achievement of precise fault isolation.
{"title":"Deep-PCA and MSPCA based fault diagnosis of high-speed train traction systems under missing data conditions.","authors":"Yunkai Wu, Yu Tian, Yang Zhou, Xiangqian Liu","doi":"10.1016/j.isatra.2025.10.051","DOIUrl":"https://doi.org/10.1016/j.isatra.2025.10.051","url":null,"abstract":"<p><p>The high-speed train power traction system operates for extended periods of time in complex environments with high temperatures and vibrations, making the system susceptible to various types of incipient faults. In addition, coupled with missing sensor data under complex operating conditions, the diagnosis of incipient faults under practical conditions becomes even more challenging. To address the aforementioned issues, this paper proposes an innovative data imputation method that combines kernel functions with the modified Akima (Makima) interpolation algorithm. This method can effectively address the scenario of a large number of sensor signals continuously missing in high-speed train traction systems. Expanding on this, a real-time incipient fault diagnosis framework is proposed, which combines Deep-Principal Component Analysis (Deep-PCA) with Multi-scale Principal Component Analysis (MSPCA). This framework enhances the capability to extract fault features from both horizontal and vertical perspectives, thereby improving the accuracy of fault detection. Furthermore, the continuous wavelet transform (CWT) is employed to amplify fault-related information within the dataset. This enhanced dataset, combined with a fault isolation criterion based on the reconstructed cumulative contribution rate, enables the achievement of precise fault isolation.</p>","PeriodicalId":94059,"journal":{"name":"ISA transactions","volume":" ","pages":""},"PeriodicalIF":6.5,"publicationDate":"2025-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145515063","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Anomaly detection of converter valves via acoustic analysis has been a hot topic in the high-voltage direct current research field. However, several factors have significantly hindered the practical deployment of such acoustic anomaly detection methods, including the infrequency of valve anomalies leading to imbalanced distributions of acoustic samples, limitations in computational resources, and inherent class imbalance in acoustic signals. To address these challenges, this paper proposes a novel unsupervised anomaly detection framework, named lightweight transformer-based generative adversarial networks (LT-GAN). It introduces two lightweight modules, MobileNet V2 and D-MobileNet V2, to perform downsampling and upsampling of Mel-spectrograms derived from acoustic signals. Furthermore, it incorporates a K-ViT block to enhance global representation learning of spectral images and reduce network parameters. Experiments on real-world acoustic datasets show the superiority of the proposed LT-GAN, which achieves an AUC of 0.9806 on the ROC curve, significantly outperforming baseline methods. In the low false-positive regions [0, 0.1], [0, 0.2], and [0, 0.3], the p-AUCs reach 0.9295, 0.9122, and 0.9559, respectively. Moreover, LT-GAN exhibits exceptional lightweight characteristics, with model complexity metrics of 9.626 million parameters, 0.506 GFLOPs, and a model size of 37.48 MB. These results validate the effectiveness of the proposed approach in terms of anomaly detection performance and resource efficiency.
{"title":"Lightweight transformer-based generative adversarial network for acoustic anomaly detection in converter valves.","authors":"Mingzhu Tang, Chen Yin, Haijun Hu, Zhihong Wang, Fuqiang Xiong, Ying Wei, Zhiwen Chen","doi":"10.1016/j.isatra.2025.10.049","DOIUrl":"https://doi.org/10.1016/j.isatra.2025.10.049","url":null,"abstract":"<p><p>Anomaly detection of converter valves via acoustic analysis has been a hot topic in the high-voltage direct current research field. However, several factors have significantly hindered the practical deployment of such acoustic anomaly detection methods, including the infrequency of valve anomalies leading to imbalanced distributions of acoustic samples, limitations in computational resources, and inherent class imbalance in acoustic signals. To address these challenges, this paper proposes a novel unsupervised anomaly detection framework, named lightweight transformer-based generative adversarial networks (LT-GAN). It introduces two lightweight modules, MobileNet V2 and D-MobileNet V2, to perform downsampling and upsampling of Mel-spectrograms derived from acoustic signals. Furthermore, it incorporates a K-ViT block to enhance global representation learning of spectral images and reduce network parameters. Experiments on real-world acoustic datasets show the superiority of the proposed LT-GAN, which achieves an AUC of 0.9806 on the ROC curve, significantly outperforming baseline methods. In the low false-positive regions [0, 0.1], [0, 0.2], and [0, 0.3], the p-AUCs reach 0.9295, 0.9122, and 0.9559, respectively. Moreover, LT-GAN exhibits exceptional lightweight characteristics, with model complexity metrics of 9.626 million parameters, 0.506 GFLOPs, and a model size of 37.48 MB. These results validate the effectiveness of the proposed approach in terms of anomaly detection performance and resource efficiency.</p>","PeriodicalId":94059,"journal":{"name":"ISA transactions","volume":" ","pages":""},"PeriodicalIF":6.5,"publicationDate":"2025-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145515142","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-10-31DOI: 10.1016/j.isatra.2025.10.048
Qiluo Xiong, Yanhui Ren, Fan Yang, Andrei Torgashov
In recent years, deep learning techniques have been increasingly adopted in soft sensor modeling, with the transformer architecture demonstrating notable advantages not only in natural language processing and image analysis but also in time-series modeling. Autoencoders, known for their ability to learn compact representations of process data, have also been widely applied for feature extraction in soft sensors. However, when dealing with multivariate process data, conventional autoencoder-based models often suffer from underfitting due to persistent reconstruction errors or overfitting when the reconstruction loss converges prematurely. These issues hinder effective feature learning and limit the model's generalization capability in real-world applications. To address these challenges, this paper proposes Resformer, a novel transformer-based architecture that incorporates residual feature compensation. Resformer employs a two-stage autoencoding structure to extract both primary and secondary features and fuses them via a cross-attention mechanism to enhance representation completeness. Time tokens are used as the basic modeling units to capture spatiotemporal dependencies among process variables, which are then mapped to the target quality variable through a dedicated decoding structure. Experimental results on the Tennessee Eastman (TE) process and an industrial alkylation process dataset demonstrate that Resformer, with residual compensation and spatiotemporal feature learning, significantly outperforms recent transformer-based variants while maintaining comparable architectural complexity suitable for practical deployment.
{"title":"Resformer: Time-token transformer with residual compensation for quality prediction in industrial processes.","authors":"Qiluo Xiong, Yanhui Ren, Fan Yang, Andrei Torgashov","doi":"10.1016/j.isatra.2025.10.048","DOIUrl":"https://doi.org/10.1016/j.isatra.2025.10.048","url":null,"abstract":"<p><p>In recent years, deep learning techniques have been increasingly adopted in soft sensor modeling, with the transformer architecture demonstrating notable advantages not only in natural language processing and image analysis but also in time-series modeling. Autoencoders, known for their ability to learn compact representations of process data, have also been widely applied for feature extraction in soft sensors. However, when dealing with multivariate process data, conventional autoencoder-based models often suffer from underfitting due to persistent reconstruction errors or overfitting when the reconstruction loss converges prematurely. These issues hinder effective feature learning and limit the model's generalization capability in real-world applications. To address these challenges, this paper proposes Resformer, a novel transformer-based architecture that incorporates residual feature compensation. Resformer employs a two-stage autoencoding structure to extract both primary and secondary features and fuses them via a cross-attention mechanism to enhance representation completeness. Time tokens are used as the basic modeling units to capture spatiotemporal dependencies among process variables, which are then mapped to the target quality variable through a dedicated decoding structure. Experimental results on the Tennessee Eastman (TE) process and an industrial alkylation process dataset demonstrate that Resformer, with residual compensation and spatiotemporal feature learning, significantly outperforms recent transformer-based variants while maintaining comparable architectural complexity suitable for practical deployment.</p>","PeriodicalId":94059,"journal":{"name":"ISA transactions","volume":" ","pages":""},"PeriodicalIF":6.5,"publicationDate":"2025-10-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145477339","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-10-30DOI: 10.1016/j.isatra.2025.10.050
Yixin Nie, Wanxu Cai, Anqi Wang, Fan Yang, Tao Zhang
Satellites are vital for communication and navigation, yet delayed or incomplete telemetry data hinder the evaluation of degradation processes and system reliability. As the core of satellite power systems, lithium batteries ensure operational stability. This study focuses on lithium batteries in low Earth orbit satellites and addresses telemetry delay-induced data loss. A data recovery method, termed the FAST Model, based on feature acquisition and short-sequence prediction, is proposed. The method effectively captures degradation trends and capacity fluctuations, including regeneration effects, with minimal data. In the satellite battery dataset with approximately 70 % missing data, the proposed model reduces recovery errors by more than 50 % compared with existing methods. Furthermore, the data recovered by the FAST Model enables more accurate estimation of the remaining useful life of batteries. Under the same prediction framework, the use of FAST-recovered data leads to a reduction of prediction errors by about 5 %-30 %, as evaluated by metrics such as MAE and RMSE, compared with other recovery approaches.
{"title":"FAST: A battery data recovery method for missing information due to delayed telemetry.","authors":"Yixin Nie, Wanxu Cai, Anqi Wang, Fan Yang, Tao Zhang","doi":"10.1016/j.isatra.2025.10.050","DOIUrl":"https://doi.org/10.1016/j.isatra.2025.10.050","url":null,"abstract":"<p><p>Satellites are vital for communication and navigation, yet delayed or incomplete telemetry data hinder the evaluation of degradation processes and system reliability. As the core of satellite power systems, lithium batteries ensure operational stability. This study focuses on lithium batteries in low Earth orbit satellites and addresses telemetry delay-induced data loss. A data recovery method, termed the FAST Model, based on feature acquisition and short-sequence prediction, is proposed. The method effectively captures degradation trends and capacity fluctuations, including regeneration effects, with minimal data. In the satellite battery dataset with approximately 70 % missing data, the proposed model reduces recovery errors by more than 50 % compared with existing methods. Furthermore, the data recovered by the FAST Model enables more accurate estimation of the remaining useful life of batteries. Under the same prediction framework, the use of FAST-recovered data leads to a reduction of prediction errors by about 5 %-30 %, as evaluated by metrics such as MAE and RMSE, compared with other recovery approaches.</p>","PeriodicalId":94059,"journal":{"name":"ISA transactions","volume":" ","pages":""},"PeriodicalIF":6.5,"publicationDate":"2025-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145446968","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-10-30DOI: 10.1016/j.isatra.2025.10.035
Liang Cao, Yang Liu, Jing Liu, Jianping Su
Climate change mitigation necessitates significant reductions in carbon emissions from traditional refining industries. However, accurately attributing renewable bio-feedstock contributions to CO2 emissions in refinery co-processing remains challenging due to dynamic feedstock variability and complex nonlinear process interactions. Existing static and linear methodologies inadequately address these real-time adaptive demands and intricate nonlinearities. To overcome these shortcomings, we propose a hybrid adaptive modeling framework integrating constrained Recursive Least Squares (RLS) with sparse Generalized Additive Models (GAM), complemented by conformal prediction for uncertainty quantification. Our method adaptively tracks feedstock contributions, ensures physical interpretability through non-negativity constraints. Validated on an industrial dataset of over 86,000 samples, the proposed framework achieves superior predictive accuracy (RMSE: 345.4, R2: 0.950), outperforming 15 baseline methods. More importantly, it delivers interpretable and uncertainty-aware estimates of renewable CO2 emissions in real time, establishing a practical pathway for transparent and data-driven refinery decarbonization.
{"title":"Physics-informed dynamic hybrid modeling for real-time renewable CO<sub>2</sub> tracking in refinery co-processing.","authors":"Liang Cao, Yang Liu, Jing Liu, Jianping Su","doi":"10.1016/j.isatra.2025.10.035","DOIUrl":"https://doi.org/10.1016/j.isatra.2025.10.035","url":null,"abstract":"<p><p>Climate change mitigation necessitates significant reductions in carbon emissions from traditional refining industries. However, accurately attributing renewable bio-feedstock contributions to CO<sub>2</sub> emissions in refinery co-processing remains challenging due to dynamic feedstock variability and complex nonlinear process interactions. Existing static and linear methodologies inadequately address these real-time adaptive demands and intricate nonlinearities. To overcome these shortcomings, we propose a hybrid adaptive modeling framework integrating constrained Recursive Least Squares (RLS) with sparse Generalized Additive Models (GAM), complemented by conformal prediction for uncertainty quantification. Our method adaptively tracks feedstock contributions, ensures physical interpretability through non-negativity constraints. Validated on an industrial dataset of over 86,000 samples, the proposed framework achieves superior predictive accuracy (RMSE: 345.4, R<sup>2</sup>: 0.950), outperforming 15 baseline methods. More importantly, it delivers interpretable and uncertainty-aware estimates of renewable CO<sub>2</sub> emissions in real time, establishing a practical pathway for transparent and data-driven refinery decarbonization.</p>","PeriodicalId":94059,"journal":{"name":"ISA transactions","volume":" ","pages":""},"PeriodicalIF":6.5,"publicationDate":"2025-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145472552","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}