首页 > 最新文献

Energy and AI最新文献

英文 中文
A charging demand prediction method for individual electric vehicle users based on dual-layer multisource data clustering and a LightGBM 基于双层多源数据聚类和LightGBM的个人电动汽车用户充电需求预测方法
IF 9.6 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-10-29 DOI: 10.1016/j.egyai.2025.100644
Yunfei Mu , Ruichao Zhou , Kangning Zhao , Hongjie Jia , Guoqiang Zu , Ye Yang
The charging behaviors of electric vehicle (EV) users exhibit high randomness and individual heterogeneity, with the key parameters such as the charging duration and charged energy levels displaying significant fluctuations. Compared with EV cluster-layer prediction, predicting the charging demands of individual users requires not only the analysis of more complex charging behaviors but also the establishment of a coupling model between exogenous variables (e.g., weather and holidays) and prediction accuracy, thereby imposing higher robustness requirements on prediction algorithms. An individual-user EV charging demand prediction method that integrates multisource data with a dual-layer clustering approach and a light gradient boosting machine (LightGBM) is proposed in this study to address these technical challenges. First, a multisource dataset that incorporates user charging behavior data and exogenous variables (meteorological factors and date types) is constructed. A dual-layer feature extraction mechanism consisting of data-layer clustering for charging type identification and user-layer clustering for user group classification is employed, thereby establishing a classification feature space that characterizes different charging types and user groups. A predictive model is subsequently developed using the LightGBM algorithm, which directly incorporates classification features as its inputs, effectively mitigating the information loss associated with the traditional categorical variable encoding process. Finally, employing EV users from a typical residential community in northern China as an empirical case, comparative experiments are performed to validate the proposed method, demonstrating its effectiveness at improving prediction accuracy.
电动汽车用户充电行为具有高度随机性和个体异质性,充电时间、充电能级等关键参数波动较大。与电动汽车簇层预测相比,个体用户充电需求预测不仅需要分析更复杂的充电行为,还需要建立外生变量(如天气、节假日)与预测精度之间的耦合模型,从而对预测算法的鲁棒性提出了更高的要求。为了解决这些技术难题,本文提出了一种将多源数据与双层聚类方法和光梯度增强机(LightGBM)相结合的个人用户电动汽车充电需求预测方法。首先,构建了包含用户收费行为数据和外生变量(气象因素和日期类型)的多源数据集。采用数据层聚类识别收费类型和用户层聚类分类用户组的双层特征提取机制,建立具有不同收费类型和用户组特征的分类特征空间。随后,使用LightGBM算法开发了预测模型,该算法直接将分类特征作为输入,有效地减轻了传统分类变量编码过程中相关的信息损失。最后,以中国北方典型住宅社区的电动汽车用户为实证案例,进行了对比实验,验证了该方法在提高预测精度方面的有效性。
{"title":"A charging demand prediction method for individual electric vehicle users based on dual-layer multisource data clustering and a LightGBM","authors":"Yunfei Mu ,&nbsp;Ruichao Zhou ,&nbsp;Kangning Zhao ,&nbsp;Hongjie Jia ,&nbsp;Guoqiang Zu ,&nbsp;Ye Yang","doi":"10.1016/j.egyai.2025.100644","DOIUrl":"10.1016/j.egyai.2025.100644","url":null,"abstract":"<div><div>The charging behaviors of electric vehicle (EV) users exhibit high randomness and individual heterogeneity, with the key parameters such as the charging duration and charged energy levels displaying significant fluctuations. Compared with EV cluster-layer prediction, predicting the charging demands of individual users requires not only the analysis of more complex charging behaviors but also the establishment of a coupling model between exogenous variables (e.g., weather and holidays) and prediction accuracy, thereby imposing higher robustness requirements on prediction algorithms. An individual-user EV charging demand prediction method that integrates multisource data with a dual-layer clustering approach and a light gradient boosting machine (LightGBM) is proposed in this study to address these technical challenges. First, a multisource dataset that incorporates user charging behavior data and exogenous variables (meteorological factors and date types) is constructed. A dual-layer feature extraction mechanism consisting of data-layer clustering for charging type identification and user-layer clustering for user group classification is employed, thereby establishing a classification feature space that characterizes different charging types and user groups. A predictive model is subsequently developed using the LightGBM algorithm, which directly incorporates classification features as its inputs, effectively mitigating the information loss associated with the traditional categorical variable encoding process. Finally, employing EV users from a typical residential community in northern China as an empirical case, comparative experiments are performed to validate the proposed method, demonstrating its effectiveness at improving prediction accuracy.</div></div>","PeriodicalId":34138,"journal":{"name":"Energy and AI","volume":"22 ","pages":"Article 100644"},"PeriodicalIF":9.6,"publicationDate":"2025-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145465937","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A comparative analysis of regression algorithms and a real world application of multivariable energy signatures 回归算法的比较分析和多变量能量特征的实际应用
IF 9.6 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-10-29 DOI: 10.1016/j.egyai.2025.100641
Simone Eiraudo , Daniele Salvatore Schiera , Luca Barbierato , Alena Trifirò , Lorenzo Bottaccioli , Andrea Lanzini
An ecosystem of energy models of buildings is needed to boost the retrofitting process to improve energy efficiency and meet sustainability goals. Such models should enhance the understanding of the energy behavior of a building, the impact of the external variables, and the causes of inefficiencies. Energy Signatures can fill this role, with particular regard to the consumption due to air conditioning. Univariate models, neglecting the impact of solar radiation, have been widely adopted for Energy Signature analysis. This paper presents Multivariable Energy Signatures considering outdoor temperature and solar radiation. The application on a real-world dataset of multivariable non-parametric approaches stands out from previous works in the ES sector. This led to a mean improvement of 0.768 to 0.804 of the coefficients of determination calculated over 103 real-world case studies. Moreover, Neural Networks outperformed several literature algorithms regarding accuracy, robustness, and scalability. The paper also discusses issues regarding the time resolution of input data and introduces appropriate visualization tools to employ Multivariable Energy Signatures as diagnostic tools.
需要一个建筑能源模型生态系统来促进改造过程,以提高能源效率和实现可持续发展目标。这样的模型应该加强对建筑能源行为的理解,外部变量的影响,以及效率低下的原因。能源签名可以填补这一角色,特别是考虑到由于空调的消耗。忽略太阳辐射影响的单变量模型已被广泛用于能量特征分析。本文提出了考虑室外温度和太阳辐射的多变量能量特征。在多变量非参数方法的真实数据集上的应用从ES部门的先前工作中脱颖而出。这导致在103个实际案例研究中计算的决定系数的平均改进为0.768至0.804。此外,神经网络在准确性、鲁棒性和可扩展性方面优于几种文献算法。本文还讨论了输入数据的时间分辨率问题,并介绍了适当的可视化工具,以使用多变量能量特征作为诊断工具。
{"title":"A comparative analysis of regression algorithms and a real world application of multivariable energy signatures","authors":"Simone Eiraudo ,&nbsp;Daniele Salvatore Schiera ,&nbsp;Luca Barbierato ,&nbsp;Alena Trifirò ,&nbsp;Lorenzo Bottaccioli ,&nbsp;Andrea Lanzini","doi":"10.1016/j.egyai.2025.100641","DOIUrl":"10.1016/j.egyai.2025.100641","url":null,"abstract":"<div><div>An ecosystem of energy models of buildings is needed to boost the retrofitting process to improve energy efficiency and meet sustainability goals. Such models should enhance the understanding of the energy behavior of a building, the impact of the external variables, and the causes of inefficiencies. Energy Signatures can fill this role, with particular regard to the consumption due to air conditioning. Univariate models, neglecting the impact of solar radiation, have been widely adopted for Energy Signature analysis. This paper presents Multivariable Energy Signatures considering outdoor temperature and solar radiation. The application on a real-world dataset of multivariable non-parametric approaches stands out from previous works in the ES sector. This led to a mean improvement of 0.768 to 0.804 of the coefficients of determination calculated over 103 real-world case studies. Moreover, Neural Networks outperformed several literature algorithms regarding accuracy, robustness, and scalability. The paper also discusses issues regarding the time resolution of input data and introduces appropriate visualization tools to employ Multivariable Energy Signatures as diagnostic tools.</div></div>","PeriodicalId":34138,"journal":{"name":"Energy and AI","volume":"22 ","pages":"Article 100641"},"PeriodicalIF":9.6,"publicationDate":"2025-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145465936","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Taming deep reinforcement learning agents with pricing mechanism: Validation in power distribution systems 用定价机制驯服深度强化学习代理:在配电系统中的验证
IF 9.6 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-10-28 DOI: 10.1016/j.egyai.2025.100635
Haoyang Zhang , Georgios Tsaousoglou , Sen Zhan , Koen Kok , Nikolaos G. Paterakis
Distributed energy resources, connected to power distribution systems, are increasingly operated by intelligent/learning agents. Such agents, looking to optimize their own payoff, can discover harmful ways to exploit the system. Hence, shielding critical systems of harmful agent behavior is of crucial importance. In this paper, the problem of designing an efficient operating mechanism for a power distribution system is taken on, considering the realistic case where the system’s resources do not possess this information and instead learn to improve their policies through experience. To that end, a multi-agent reinforcement learning algorithm is developed to model the participants’ learning-to-act process and consider the agents’ learning under different pricing schemes that shape the agents’ reward functions. Two popular pricing schemes (pay-as-bid and distribution locational marginal pricing) are presented, exposing that learning agents can discover ways to exploit them, resulting in severe dispatch inefficiency. A game-theoretic pricing scheme is presented that theoretically incentivizes truthful agent behavior, and empirically demonstrate that this property improves the efficiency of the resulting dispatch also in the presence of learning agents. In particular, the proposed scheme is able to outperform the popular distribution locational marginal pricing scheme, in terms of efficiency, by a factor of 15–17%.
连接到配电系统的分布式能源越来越多地由智能/学习代理操作。这些代理人希望优化自己的收益,可能会发现利用系统的有害方法。因此,屏蔽有害物质行为的关键系统至关重要。本文考虑了配电系统资源不具备这些信息,而是通过经验学习改进其政策的现实情况,研究了配电系统有效运行机制的设计问题。为此,开发了一种多智能体强化学习算法来模拟参与者的学习行为过程,并考虑智能体在不同定价方案下的学习情况,这些定价方案塑造了智能体的奖励函数。提出了两种流行的定价方案(按出价付费和配送地点边际定价),表明学习代理可以发现利用它们的方法,从而导致严重的调度效率低下。提出了一种博弈论定价方案,从理论上激励诚实的智能体行为,并通过经验证明,在有学习智能体存在的情况下,这一特性也提高了最终调度的效率。特别是,就效率而言,所提出的方案能够比流行的分配位置边际定价方案高出15-17%。
{"title":"Taming deep reinforcement learning agents with pricing mechanism: Validation in power distribution systems","authors":"Haoyang Zhang ,&nbsp;Georgios Tsaousoglou ,&nbsp;Sen Zhan ,&nbsp;Koen Kok ,&nbsp;Nikolaos G. Paterakis","doi":"10.1016/j.egyai.2025.100635","DOIUrl":"10.1016/j.egyai.2025.100635","url":null,"abstract":"<div><div>Distributed energy resources, connected to power distribution systems, are increasingly operated by intelligent/learning agents. Such agents, looking to optimize their own payoff, can discover harmful ways to exploit the system. Hence, shielding critical systems of harmful agent behavior is of crucial importance. In this paper, the problem of designing an efficient operating mechanism for a power distribution system is taken on, considering the realistic case where the system’s resources do not possess this information and instead learn to improve their policies through experience. To that end, a multi-agent reinforcement learning algorithm is developed to model the participants’ learning-to-act process and consider the agents’ learning under different pricing schemes that shape the agents’ reward functions. Two popular pricing schemes (pay-as-bid and distribution locational marginal pricing) are presented, exposing that learning agents can discover ways to exploit them, resulting in severe dispatch inefficiency. A game-theoretic pricing scheme is presented that theoretically incentivizes truthful agent behavior, and empirically demonstrate that this property improves the efficiency of the resulting dispatch also in the presence of learning agents. In particular, the proposed scheme is able to outperform the popular distribution locational marginal pricing scheme, in terms of efficiency, by a factor of 15–17%.</div></div>","PeriodicalId":34138,"journal":{"name":"Energy and AI","volume":"22 ","pages":"Article 100635"},"PeriodicalIF":9.6,"publicationDate":"2025-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145416161","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Smart energy strategies: Leveraging LSTM and LLMs for advanced energy management 智能能源战略:利用LSTM和llm进行先进的能源管理
IF 9.6 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-10-28 DOI: 10.1016/j.egyai.2025.100642
Fernando Almeida , Mauro Castelli , Nadine Côrte-Real , Camilla Fallarino , Luca Manzoni
Accurately predicting and optimizing heating and cooling demands in building energy management is crucial for enhancing efficiency and reducing energy consumption. Traditional methods often struggle with building energy usage patterns' nonlinear and variable nature. With the advent of advanced data collection through smart sensors, there is a growing need for intelligent systems to leverage this data to provide actionable insights. This study addresses the gap by developing a recommendation system using three machine and deep learning models, Long Short-Term Memory (LSTM), Gated Recurrent Unit (GRU), and XGBoost to predict and optimize the efficiency levels of space heating, ceiling cooling, and free cooling systems. Our proposed solution harnesses the power of these models, with LSTM performing best overall, to forecast energy consumption across hourly and daily timescales, enabling precise adjustments and efficient energy management. The methodology involves extensive data preprocessing, including hierarchical imputation of missing values and label encoding of categorical variables, followed by the transformation of raw data into efficiency levels. The deep learning model architecture, consisting of sequential layers, captures long-term dependencies in the data, while grid search-based hyperparameter tuning optimizes model performance. Results indicate high predictive accuracy, with R-squared values demonstrating the model's ability to explain up to 97.2 % of the variance in hourly space heating, 95.2 % in daily ceiling cooling, and 93 % in daily free cooling energy consumption. Additionally, we interpret graphs using OpenAI's GPT-4 model to enhance understanding and facilitate actionable insights. This interpretation enhances the clarity of the predictive results, supporting more informed decision-making in energy management. The significance of this work lies in its potential to transform energy management practices in building environments, providing a robust tool for optimizing heating and cooling operations and contributing to overall energy efficiency.
在建筑能源管理中,准确预测和优化供热和制冷需求对提高能效和降低能耗至关重要。传统的方法往往与建筑能源使用模式的非线性和可变性作斗争。随着通过智能传感器进行先进数据收集的出现,越来越需要智能系统利用这些数据来提供可操作的见解。本研究通过使用长短期记忆(LSTM)、门控循环单元(GRU)和XGBoost三种机器和深度学习模型开发推荐系统来预测和优化空间供暖、天花板冷却和自然冷却系统的效率水平,从而解决了这一差距。我们提出的解决方案利用这些模型的力量,LSTM整体表现最佳,预测每小时和每天的能源消耗,实现精确的调整和有效的能源管理。该方法涉及广泛的数据预处理,包括缺失值的分层插入和分类变量的标签编码,然后将原始数据转换为效率水平。由顺序层组成的深度学习模型架构捕获了数据中的长期依赖关系,而基于网格搜索的超参数调优优化了模型性能。结果表明预测精度很高,r平方值表明该模型能够解释高达97.2%的每小时空间供暖差异,95.2%的每日天花板冷却差异和93%的每日自由冷却能耗差异。此外,我们使用OpenAI的GPT-4模型来解释图表,以增强理解并促进可操作的见解。这种解释提高了预测结果的清晰度,支持能源管理中更明智的决策。这项工作的意义在于它有可能改变建筑环境中的能源管理实践,为优化供暖和制冷操作提供一个强大的工具,并有助于提高整体能源效率。
{"title":"Smart energy strategies: Leveraging LSTM and LLMs for advanced energy management","authors":"Fernando Almeida ,&nbsp;Mauro Castelli ,&nbsp;Nadine Côrte-Real ,&nbsp;Camilla Fallarino ,&nbsp;Luca Manzoni","doi":"10.1016/j.egyai.2025.100642","DOIUrl":"10.1016/j.egyai.2025.100642","url":null,"abstract":"<div><div>Accurately predicting and optimizing heating and cooling demands in building energy management is crucial for enhancing efficiency and reducing energy consumption. Traditional methods often struggle with building energy usage patterns' nonlinear and variable nature. With the advent of advanced data collection through smart sensors, there is a growing need for intelligent systems to leverage this data to provide actionable insights. This study addresses the gap by developing a recommendation system using three machine and deep learning models, Long Short-Term Memory (LSTM), Gated Recurrent Unit (GRU), and XGBoost to predict and optimize the efficiency levels of space heating, ceiling cooling, and free cooling systems. Our proposed solution harnesses the power of these models, with LSTM performing best overall, to forecast energy consumption across hourly and daily timescales, enabling precise adjustments and efficient energy management. The methodology involves extensive data preprocessing, including hierarchical imputation of missing values and label encoding of categorical variables, followed by the transformation of raw data into efficiency levels. The deep learning model architecture, consisting of sequential layers, captures long-term dependencies in the data, while grid search-based hyperparameter tuning optimizes model performance. Results indicate high predictive accuracy, with R-squared values demonstrating the model's ability to explain up to 97.2 % of the variance in hourly space heating, 95.2 % in daily ceiling cooling, and 93 % in daily free cooling energy consumption. Additionally, we interpret graphs using OpenAI's GPT-4 model to enhance understanding and facilitate actionable insights. This interpretation enhances the clarity of the predictive results, supporting more informed decision-making in energy management. The significance of this work lies in its potential to transform energy management practices in building environments, providing a robust tool for optimizing heating and cooling operations and contributing to overall energy efficiency.</div></div>","PeriodicalId":34138,"journal":{"name":"Energy and AI","volume":"22 ","pages":"Article 100642"},"PeriodicalIF":9.6,"publicationDate":"2025-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145416080","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ScaleONet: Scalable and control-oriented modeling of building cluster thermal dynamics using deep operator networks — A practical case study for a Belgian district ScaleONet:使用深度运营商网络的可扩展和面向控制的建筑集群热动力学建模-一个比利时地区的实际案例研究
IF 9.6 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-10-28 DOI: 10.1016/j.egyai.2025.100634
Muhammad Hafeez Saeed , Maomao Hu , Hussain Kazmi , Geert Deconinck
Delivering energy flexibility at the district scale entails coordinating control actions across many buildings to shape aggregate demand; this coordination depends on training and deploying control policies and optimization routines, which in turn require predictive models that can be queried efficiently over large building clusters. However, conventional physics-based simulators are computationally prohibitive for large-scale control training, and simple data-driven surrogates often lack the generalization needed for heterogeneous clusters. This paper introduces ScaleONet, a deep operator network framework designed for scalable, control-oriented modeling of building-cluster thermal dynamics. ScaleONet leverages the DeepONet paradigm to decouple and share learning across buildings: an LSTM-based branch network encodes outdoor climate and individual HVAC control signals, while a multilayer perceptron (MLP)-based trunk network embeds prediction timestamps, enabling fast predictions for growing clusters with negligible extra cost for each additional building or timestep. To the authors’ knowledge, this is the first operator-learning method tailored to indoor air temperature forecasting in heterogeneous building clusters. Validation on thirty Belgian buildings (GenkNet) simulated in Dymola shows that, although a non-operator-learning LSTM baseline slightly outperforms ScaleONet for single-building cases, its error grows monotonically with cluster size. In contrast, ScaleONet’s median per-building-per-day RMSE decreases from 0.59 °C at three buildings to 0.53 °C at ten and 0.47 °C at thirty, compared to 0.95 °C for the LSTM at thirty buildings — a 51% reduction in prediction error. Error analysis across envelope heat-loss coefficients (UAbuilding) further reveals that while the LSTM’s RMSE increases for high-UA structures, ScaleONet maintains uniformly low error. With millisecond-scale inference (approximately 4ms per sample for thirty buildings), ScaleONet is well suited for large-scale reinforcement learning, receding-horizon optimization, and real-time model predictive control.
在地区范围内实现能源灵活性需要协调许多建筑物的控制行动,以形成总需求;这种协调依赖于训练和部署控制策略和优化例程,这反过来又需要能够在大型建筑集群上有效查询的预测模型。然而,传统的基于物理的模拟器在计算上不适合大规模的控制训练,并且简单的数据驱动的代理通常缺乏异构集群所需的泛化。本文介绍了ScaleONet,这是一个深度算子网络框架,专为可扩展的、面向控制的建筑集群热动力学建模而设计。ScaleONet利用DeepONet范例在建筑物之间解耦和共享学习:基于lstm的分支网络编码室外气候和单独的HVAC控制信号,而基于多层感知器(MLP)的主干网络嵌入预测时间戳,能够快速预测不断增长的集群,而每个额外的建筑物或时间步长的额外成本可以忽略不计。据作者所知,这是第一个针对异质建筑群室内空气温度预测的操作员学习方法。在Dymola中模拟的30个比利时建筑(GenkNet)上的验证表明,尽管非操作员学习LSTM基线在单个建筑情况下略优于ScaleONet,但其误差随着聚类大小单调增长。相比之下,ScaleONet的每栋建筑每天RMSE的中位数从3栋建筑的0.59°C下降到10栋建筑的0.53°C和30栋建筑的0.47°C,相比之下,LSTM在30栋建筑的0.95°C -预测误差减少了51%。跨包络热损失系数(ubuilding)的误差分析进一步表明,虽然LSTM的RMSE在高ua结构中增加,但ScaleONet保持一致的低误差。ScaleONet具有毫秒级推理(30个建筑物的每个样本大约4毫秒),非常适合大规模强化学习,后退水平优化和实时模型预测控制。
{"title":"ScaleONet: Scalable and control-oriented modeling of building cluster thermal dynamics using deep operator networks — A practical case study for a Belgian district","authors":"Muhammad Hafeez Saeed ,&nbsp;Maomao Hu ,&nbsp;Hussain Kazmi ,&nbsp;Geert Deconinck","doi":"10.1016/j.egyai.2025.100634","DOIUrl":"10.1016/j.egyai.2025.100634","url":null,"abstract":"<div><div>Delivering energy flexibility at the district scale entails coordinating control actions across many buildings to shape aggregate demand; this coordination depends on training and deploying control policies and optimization routines, which in turn require predictive models that can be queried efficiently over large building clusters. However, conventional physics-based simulators are computationally prohibitive for large-scale control training, and simple data-driven surrogates often lack the generalization needed for heterogeneous clusters. This paper introduces <em>ScaleONet</em>, a deep operator network framework designed for scalable, control-oriented modeling of building-cluster thermal dynamics. ScaleONet leverages the DeepONet paradigm to decouple and share learning across buildings: an LSTM-based branch network encodes outdoor climate and individual HVAC control signals, while a multilayer perceptron (MLP)-based trunk network embeds prediction timestamps, enabling fast predictions for growing clusters with negligible extra cost for each additional building or timestep. To the authors’ knowledge, this is the first operator-learning method tailored to indoor air temperature forecasting in heterogeneous building clusters. Validation on thirty Belgian buildings (GenkNet) simulated in Dymola shows that, although a non-operator-learning LSTM baseline slightly outperforms ScaleONet for single-building cases, its error grows monotonically with cluster size. In contrast, ScaleONet’s median per-building-per-day RMSE decreases from 0.59 °C at three buildings to 0.53 °C at ten and 0.47 °C at thirty, compared to 0.95 °C for the LSTM at thirty buildings — a 51% reduction in prediction error. Error analysis across envelope heat-loss coefficients (<span><math><msub><mrow><mi>UA</mi></mrow><mrow><mi>building</mi></mrow></msub></math></span>) further reveals that while the LSTM’s RMSE increases for high-<span><math><mrow><mi>U</mi><mi>A</mi></mrow></math></span> structures, ScaleONet maintains uniformly low error. With millisecond-scale inference (approximately <span><math><mrow><mn>4</mn><mspace></mspace><mi>ms</mi></mrow></math></span> per sample for thirty buildings), ScaleONet is well suited for large-scale reinforcement learning, receding-horizon optimization, and real-time model predictive control.</div></div>","PeriodicalId":34138,"journal":{"name":"Energy and AI","volume":"22 ","pages":"Article 100634"},"PeriodicalIF":9.6,"publicationDate":"2025-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145416162","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Imputing the long-term missing heating load data using a generative network 利用生成网络对长期缺失热负荷数据进行推算
IF 9.6 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-10-28 DOI: 10.1016/j.egyai.2025.100637
Mengbo Yu, Alexander Neubauer, Pedram Babakhani, Stefan Brandt, Martin Kriegel
Accurately filling in missing heating data is essential for ensuring data quality in applications such as energy management optimization and building efficiency analysis. Traditional machine learning methods use historical heating data as an input feature to predict the following missing data. However, when the duration of missing data is long, previous estimated values are inevitably used for further imputation, leading to error accumulation and a growing deviation from true values. To overcome this problem, this paper proposes a generative network that can fill missing data solely based on weather and temporal data, without using previous imputed values for further imputation. Our method outperformed the state of the art such as Seq2seq and Transformer, achieving relative normalized root mean square error (NRMSE) reductions of 1.65 % to 41.38 %, 0.30 % to 66.43 %, and 14.84 % to 50.22 % across three different data sources. In addition, with our proposed method, the effect of selecting different weather variables on model performance, and the benefits of transfer learning under limited data were also demonstrated. The relative NRMSE reduction is between 3.88 % to 15.85 % in cold months and from 7.49 % to 12.29 % in warm months when applying transfer learning.
准确填写缺失的供暖数据对于确保能源管理优化和建筑效率分析等应用中的数据质量至关重要。传统的机器学习方法使用历史加热数据作为输入特征来预测以下缺失的数据。然而,当缺失数据的持续时间较长时,不可避免地会使用先前的估计值进行进一步的重置,导致误差累积,与真实值的偏差越来越大。为了克服这一问题,本文提出了一种生成网络,该网络可以仅根据天气和时间数据填充缺失数据,而无需使用先前的输入值进行进一步的输入。我们的方法优于Seq2seq和Transformer等最先进的方法,在三个不同的数据源中实现了相对归一化均方根误差(NRMSE)降低1.65%至41.38%,0.30%至66.43%,14.84%至50.22%。此外,利用我们提出的方法,还证明了选择不同天气变量对模型性能的影响,以及有限数据下迁移学习的好处。应用迁移学习时,冷月份的相对NRMSE降低幅度在3.88% ~ 15.85%之间,热月份的相对NRMSE降低幅度在7.49% ~ 12.29%之间。
{"title":"Imputing the long-term missing heating load data using a generative network","authors":"Mengbo Yu,&nbsp;Alexander Neubauer,&nbsp;Pedram Babakhani,&nbsp;Stefan Brandt,&nbsp;Martin Kriegel","doi":"10.1016/j.egyai.2025.100637","DOIUrl":"10.1016/j.egyai.2025.100637","url":null,"abstract":"<div><div>Accurately filling in missing heating data is essential for ensuring data quality in applications such as energy management optimization and building efficiency analysis. Traditional machine learning methods use historical heating data as an input feature to predict the following missing data. However, when the duration of missing data is long, previous estimated values are inevitably used for further imputation, leading to error accumulation and a growing deviation from true values. To overcome this problem, this paper proposes a generative network that can fill missing data solely based on weather and temporal data, without using previous imputed values for further imputation. Our method outperformed the state of the art such as Seq2seq and Transformer, achieving relative normalized root mean square error (NRMSE) reductions of 1.65<!--> <!-->% to 41.38<!--> <!-->%, 0.30<!--> <!-->% to 66.43<!--> <!-->%, and 14.84<!--> <!-->% to 50.22<!--> <!-->% across three different data sources. In addition, with our proposed method, the effect of selecting different weather variables on model performance, and the benefits of transfer learning under limited data were also demonstrated. The relative NRMSE reduction is between 3.88<!--> <!-->% to 15.85<!--> <!-->% in cold months and from 7.49<!--> <!-->% to 12.29<!--> <!-->% in warm months when applying transfer learning.</div></div>","PeriodicalId":34138,"journal":{"name":"Energy and AI","volume":"22 ","pages":"Article 100637"},"PeriodicalIF":9.6,"publicationDate":"2025-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145416081","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adaptive smoothing drift normalization for day-ahead net load forecasting in renewable power system 可再生能源系统日前净负荷预测的自适应平滑漂移归一化
IF 9.6 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-10-27 DOI: 10.1016/j.egyai.2025.100633
Bizhi Wu , Shanlin Wang
The increasing integration of renewable energy sources introduces significant variability and non-stationarity into power system, challenging accurate net load forecasting. Although net load forecasting research has devoted considerable efforts to handle non-stationarity – via normalization, incremental learning, or drift detection – existing solutions often suffer from hyperparameter tuning, threshold-based triggers, or reliance on specialized architectures. To overcome these limitations, we propose Adaptive Smoothing Drift Normalization (ASDN), a lightweight normalization layer that continuously adapts to distribution shifts without threshold tuning. ASDN effectively adapts to new data via a mechanism that combines entropy-based adjustments with a dynamic filtering approach. At the same time, it maintains stability with respect to historical patterns, allowing the method to capture both gradual and abrupt shifts in the data distribution. We provide a theoretical guarantee that the estimation error of ASDN remains bounded under piecewise-stationary drift; as incremental drift and noise decrease, this bound tightens and converges to zero. Experiments on nine forecasting models across five public datasets and four prediction horizons show that ASDN consistently outperforms traditional normalization techniques, reducing mean squared error and enhancing robustness. These results confirm ASDN’s effectiveness in handling complex temporal dynamics, making it valuable for improving forecast accuracy in dynamic renewable power systems.
可再生能源的日益整合给电力系统带来了显著的可变性和非平稳性,对准确的净负荷预测提出了挑战。尽管净负荷预测研究已经投入了相当大的努力来处理非平稳性——通过归一化、增量学习或漂移检测——现有的解决方案往往受到超参数调优、基于阈值的触发或依赖于专门架构的影响。为了克服这些限制,我们提出了自适应平滑漂移归一化(ASDN),这是一种轻量级的归一化层,可以在没有阈值调优的情况下不断适应分布变化。ASDN通过将基于熵的调整与动态过滤方法相结合的机制有效地适应新数据。同时,它保持相对于历史模式的稳定性,允许该方法捕捉数据分布中的渐变和突变。给出了分段平稳漂移下ASDN估计误差保持有界的理论保证;随着增量漂移和噪声的减小,该界收紧并收敛于零。在5个公共数据集和4个预测范围的9个预测模型上进行的实验表明,ASDN始终优于传统的归一化技术,降低了均方误差并增强了鲁棒性。这些结果证实了ASDN在处理复杂时间动态方面的有效性,使其对提高动态可再生能源系统的预测精度有价值。
{"title":"Adaptive smoothing drift normalization for day-ahead net load forecasting in renewable power system","authors":"Bizhi Wu ,&nbsp;Shanlin Wang","doi":"10.1016/j.egyai.2025.100633","DOIUrl":"10.1016/j.egyai.2025.100633","url":null,"abstract":"<div><div>The increasing integration of renewable energy sources introduces significant variability and non-stationarity into power system, challenging accurate net load forecasting. Although net load forecasting research has devoted considerable efforts to handle non-stationarity – via normalization, incremental learning, or drift detection – existing solutions often suffer from hyperparameter tuning, threshold-based triggers, or reliance on specialized architectures. To overcome these limitations, we propose Adaptive Smoothing Drift Normalization (ASDN), a lightweight normalization layer that continuously adapts to distribution shifts without threshold tuning. ASDN effectively adapts to new data via a mechanism that combines entropy-based adjustments with a dynamic filtering approach. At the same time, it maintains stability with respect to historical patterns, allowing the method to capture both gradual and abrupt shifts in the data distribution. We provide a theoretical guarantee that the estimation error of ASDN remains bounded under piecewise-stationary drift; as incremental drift and noise decrease, this bound tightens and converges to zero. Experiments on nine forecasting models across five public datasets and four prediction horizons show that ASDN consistently outperforms traditional normalization techniques, reducing mean squared error and enhancing robustness. These results confirm ASDN’s effectiveness in handling complex temporal dynamics, making it valuable for improving forecast accuracy in dynamic renewable power systems.</div></div>","PeriodicalId":34138,"journal":{"name":"Energy and AI","volume":"22 ","pages":"Article 100633"},"PeriodicalIF":9.6,"publicationDate":"2025-10-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145465938","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Data-driven machine learning model estimates efficiency gains from passive filters under variable loads 数据驱动的机器学习模型估计了可变负载下无源滤波器的效率增益
IF 9.6 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-10-17 DOI: 10.1016/j.egyai.2025.100631
Uchenna Johnpaul Aniekwensi , Dipyaman Basu , Jörg Bausch
Accurately estimating power loss reduction from passive filters before installation is challenging due to variable loads and power quality conditions across grid points. Existing studies rely on simulation or analytical models. These approaches often fail to capture real-world variability through data-driven methods. This gap limits effective, site-specific filter deployment decisions. We present a two-step machine learning approach to estimate energy efficiency gains from passive filters under variable conditions using high-resolution power analyzer data. Ridge Regression identifies key predictive variables, achieving baseline R² = 0.591. XGBoost then captures nonlinear interactions between load variability, power quality disturbances, and filter performance, improving accuracy to R² = 0.755. The methodology was validated through deployment at three industrial facilities in collaboration with Livarsa GmbH. Results demonstrate 9.9% average relative error across measured efficiency gains, confirming reliability under real-world conditions. Comprehensive validation through k-fold cross-validation, ensemble methods, and external testing quantified prediction uncertainty inherent in small industrial datasets (25 training samples). The approach offers a scalable, data-driven decision-support tool overcoming simulation-based limitations. Computational efficiency enables real-time assessment during client consultations without specialized software. Economic value derives from reduced performance guarantee margins, accelerated assessment timelines, and minimized warranty exposure. Limitations include statistical constraints from limited training data, reflected in cross-validation overfitting and wide confidence intervals. External validity requires site-specific validation for facilities with substantially different electrical characteristics. Despite these constraints, the findings provide practical value for energy professionals seeking efficient power quality solutions, enabling confident passive filter deployment decisions based on quantified performance predictions.
由于各电网点的负载和电能质量条件变化,在安装无源滤波器之前准确估计其功率损耗降低是具有挑战性的。现有的研究依赖于模拟或分析模型。这些方法通常无法通过数据驱动的方法捕获现实世界的可变性。这种差距限制了有效的、特定于站点的过滤器部署决策。我们提出了一种两步机器学习方法,使用高分辨率功率分析仪数据估计可变条件下无源滤波器的能效增益。岭回归识别关键预测变量,实现基线R²= 0.591。然后,XGBoost捕获负载可变性,电能质量干扰和滤波器性能之间的非线性相互作用,将精度提高到R²= 0.755。该方法通过与Livarsa GmbH合作在三个工业设施的部署进行了验证。结果表明,在测量的效率增益中,平均相对误差为9.9%,证实了在实际条件下的可靠性。通过k-fold交叉验证、集成方法和外部测试的综合验证量化了小型工业数据集(25个训练样本)固有的预测不确定性。该方法提供了一种可扩展的、数据驱动的决策支持工具,克服了基于仿真的限制。计算效率可以在客户咨询期间进行实时评估,而无需专门的软件。经济价值来源于减少的性能保证利润,加速的评估时间,以及最小化的保证风险。局限性包括来自有限训练数据的统计约束,反映在交叉验证过拟合和宽置信区间上。外部有效性要求对电气特性有很大不同的设施进行特定场所的验证。尽管存在这些限制,但研究结果为寻求高效电能质量解决方案的能源专业人士提供了实用价值,使他们能够根据量化的性能预测做出自信的无源滤波器部署决策。
{"title":"Data-driven machine learning model estimates efficiency gains from passive filters under variable loads","authors":"Uchenna Johnpaul Aniekwensi ,&nbsp;Dipyaman Basu ,&nbsp;Jörg Bausch","doi":"10.1016/j.egyai.2025.100631","DOIUrl":"10.1016/j.egyai.2025.100631","url":null,"abstract":"<div><div>Accurately estimating power loss reduction from passive filters before installation is challenging due to variable loads and power quality conditions across grid points. Existing studies rely on simulation or analytical models. These approaches often fail to capture real-world variability through data-driven methods. This gap limits effective, site-specific filter deployment decisions. We present a two-step machine learning approach to estimate energy efficiency gains from passive filters under variable conditions using high-resolution power analyzer data. Ridge Regression identifies key predictive variables, achieving baseline R² = 0.591. XGBoost then captures nonlinear interactions between load variability, power quality disturbances, and filter performance, improving accuracy to R² = 0.755. The methodology was validated through deployment at three industrial facilities in collaboration with Livarsa GmbH. Results demonstrate 9.9% average relative error across measured efficiency gains, confirming reliability under real-world conditions. Comprehensive validation through k-fold cross-validation, ensemble methods, and external testing quantified prediction uncertainty inherent in small industrial datasets (25 training samples). The approach offers a scalable, data-driven decision-support tool overcoming simulation-based limitations. Computational efficiency enables real-time assessment during client consultations without specialized software. Economic value derives from reduced performance guarantee margins, accelerated assessment timelines, and minimized warranty exposure. Limitations include statistical constraints from limited training data, reflected in cross-validation overfitting and wide confidence intervals. External validity requires site-specific validation for facilities with substantially different electrical characteristics. Despite these constraints, the findings provide practical value for energy professionals seeking efficient power quality solutions, enabling confident passive filter deployment decisions based on quantified performance predictions.</div></div>","PeriodicalId":34138,"journal":{"name":"Energy and AI","volume":"22 ","pages":"Article 100631"},"PeriodicalIF":9.6,"publicationDate":"2025-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145362268","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A method for lost circulation risk identification: Labeling, LSTM transfer learning recognition, and delayed matching verification 一种漏失风险识别方法:标记、LSTM迁移学习识别和延迟匹配验证
IF 9.6 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-10-17 DOI: 10.1016/j.egyai.2025.100632
Yang Bai , Chen Yang , Mao Li , Pingya Luo , Haochen Han
Accurate identification and prediction of lost circulation (LC) are critical for ensuring drilling safety and reducing production costs. Traditional LC detection methods rely heavily on expert knowledge, which inherently suffers from limitations such as subjective bias, low operational efficiency, and insufficient precision. In this context, data-driven prediction approaches have shown promising potential. This study develops a novel LC risk prediction framework that integrates transfer learning with deep learning. By leveraging the mud pit volume from acquired drilling parameters and defining three distinct step intervals, LC risk categories were efficiently labeled. An optimized LSTM-based predictive architecture was constructed using three distribution-alignment transfer learning techniques. Comparative experiments under varying numbers of input features confirmed the effectiveness of transfer learning in improving LC prediction performance. To address the class imbalance issue commonly observed in LC risk prediction, a delayed matching verification (DMV) strategy—customized for drilling operations—was introduced. This method mitigates the impact of class imbalance and enhances the evaluation of LC risk recognition capabilities. Experimental results from five test wells demonstrate that the proposed method can effectively label LC risk categories and promptly identify risk types, thereby offering valuable insights to support safe and efficient drilling operations.
准确识别和预测漏失(LC)对于确保钻井安全和降低生产成本至关重要。传统的LC检测方法严重依赖专家知识,存在主观偏差、操作效率低、精度不足等局限性。在这种情况下,数据驱动的预测方法显示出了很大的潜力。本研究开发了一种新的LC风险预测框架,将迁移学习与深度学习相结合。通过利用从获得的钻井参数中获得的泥浆坑体积,并定义三个不同的步骤间隔,可以有效地标记LC风险类别。利用三种分布对齐迁移学习技术,构建了一种优化的基于lstm的预测体系结构。不同输入特征数下的对比实验证实了迁移学习在提高LC预测性能方面的有效性。为了解决LC风险预测中常见的类不平衡问题,引入了针对钻井作业定制的延迟匹配验证(DMV)策略。该方法减轻了类不平衡的影响,增强了对LC风险识别能力的评价。五口测试井的实验结果表明,该方法可以有效地标记LC风险类别并及时识别风险类型,从而为安全高效的钻井作业提供有价值的见解。
{"title":"A method for lost circulation risk identification: Labeling, LSTM transfer learning recognition, and delayed matching verification","authors":"Yang Bai ,&nbsp;Chen Yang ,&nbsp;Mao Li ,&nbsp;Pingya Luo ,&nbsp;Haochen Han","doi":"10.1016/j.egyai.2025.100632","DOIUrl":"10.1016/j.egyai.2025.100632","url":null,"abstract":"<div><div>Accurate identification and prediction of lost circulation (LC) are critical for ensuring drilling safety and reducing production costs. Traditional LC detection methods rely heavily on expert knowledge, which inherently suffers from limitations such as subjective bias, low operational efficiency, and insufficient precision. In this context, data-driven prediction approaches have shown promising potential. This study develops a novel LC risk prediction framework that integrates transfer learning with deep learning. By leveraging the mud pit volume from acquired drilling parameters and defining three distinct step intervals, LC risk categories were efficiently labeled. An optimized LSTM-based predictive architecture was constructed using three distribution-alignment transfer learning techniques. Comparative experiments under varying numbers of input features confirmed the effectiveness of transfer learning in improving LC prediction performance. To address the class imbalance issue commonly observed in LC risk prediction, a delayed matching verification (DMV) strategy—customized for drilling operations—was introduced. This method mitigates the impact of class imbalance and enhances the evaluation of LC risk recognition capabilities. Experimental results from five test wells demonstrate that the proposed method can effectively label LC risk categories and promptly identify risk types, thereby offering valuable insights to support safe and efficient drilling operations.</div></div>","PeriodicalId":34138,"journal":{"name":"Energy and AI","volume":"22 ","pages":"Article 100632"},"PeriodicalIF":9.6,"publicationDate":"2025-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145362269","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dual attention-enhanced data augmentation for diagnosing water management faults in proton exchange membrane fuel cells using imbalanced multi-sine AC data 双注意力增强数据增强在质子交换膜燃料电池水管理故障诊断中的应用
IF 9.6 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2025-10-16 DOI: 10.1016/j.egyai.2025.100630
Abdullah Shoukat , Zhongyong Liu , Yousif Yahia Ahmed Abuker , Jingguo Li , Lei Mao
Proton exchange membrane fuel cell (PEMFC) faults, especially dehydration and flooding, cause distinct changes in electrochemical behavior. Consequently, real-time monitoring is essential for early and accurate diagnosis. However, acquiring real-world fault data is challenging, and the rarity of such faults results in severe class imbalance. This imbalance limits the performance, reliability, and practical applicability of conventional diagnostic methods. To address these limitations, this study proposes a unified diagnostic framework that integrates multi-sine AC voltage response, boundary-aware resampling, and attention-guided generative modeling. The key innovations of the proposed approach include: (1) Enhanced fault separability through the first application of multi-sine AC voltage response under data imbalance, enabling real-time extraction of critical electrochemical spectral features for early-stage diagnosis; (2) Improved data balance and clearer class boundaries using synthetic minority oversampling with Tomek links, which oversamples minority classes and removes borderline samples; (3) Realistic minority class synthesis using a dual attention Wasserstein generative adversarial networks, where channel attention focuses on diagnostically relevant spectral features and temporal attention models the dynamic evolution of PEMFC electrochemical behavior, ensuring high-quality, diagnostically informative synthetic fault data. The integrated framework achieves 99.67 % overall diagnostic accuracy and, under an extreme 1:200 class imbalance, outperforms state-of-the-art methods by 14 %. This approach enables rapid, data-efficient PEMFC fault diagnosis, strengthening fault management and advancing the performance of energy systems.
质子交换膜燃料电池(PEMFC)的故障,特别是脱水和淹水,会引起电化学行为的明显变化。因此,实时监测对于早期和准确诊断至关重要。然而,获取真实世界的故障数据是具有挑战性的,并且此类故障的稀缺性导致了严重的类不平衡。这种不平衡限制了常规诊断方法的性能、可靠性和实用性。为了解决这些限制,本研究提出了一个统一的诊断框架,该框架集成了多正弦交流电压响应、边界感知重采样和注意引导生成建模。该方法的主要创新点包括:(1)通过首次应用数据不平衡下的多正弦交流电压响应,增强了故障可分性,实现了关键电化学光谱特征的实时提取,用于早期诊断;(2)利用Tomek链接的合成少数派过采样改善了数据的平衡性和更清晰的类边界,该方法对少数派类进行过采样并去除边缘样本;(3)采用双关注Wasserstein生成对抗网络的现实少数派类合成,其中通道关注关注诊断相关的光谱特征,时间关注建模PEMFC电化学行为的动态演变,确保高质量、诊断信息丰富的合成故障数据。集成框架的总体诊断准确率达到99.67%,并且在1:200的极端分类不平衡下,比最先进的方法高出14%。这种方法可以实现快速、数据高效的PEMFC故障诊断,加强故障管理,提高能源系统的性能。
{"title":"Dual attention-enhanced data augmentation for diagnosing water management faults in proton exchange membrane fuel cells using imbalanced multi-sine AC data","authors":"Abdullah Shoukat ,&nbsp;Zhongyong Liu ,&nbsp;Yousif Yahia Ahmed Abuker ,&nbsp;Jingguo Li ,&nbsp;Lei Mao","doi":"10.1016/j.egyai.2025.100630","DOIUrl":"10.1016/j.egyai.2025.100630","url":null,"abstract":"<div><div>Proton exchange membrane fuel cell (PEMFC) faults, especially dehydration and flooding, cause distinct changes in electrochemical behavior. Consequently, real-time monitoring is essential for early and accurate diagnosis. However, acquiring real-world fault data is challenging, and the rarity of such faults results in severe class imbalance. This imbalance limits the performance, reliability, and practical applicability of conventional diagnostic methods. To address these limitations, this study proposes a unified diagnostic framework that integrates multi-sine AC voltage response, boundary-aware resampling, and attention-guided generative modeling. The key innovations of the proposed approach include: (1) Enhanced fault separability through the first application of multi-sine AC voltage response under data imbalance, enabling real-time extraction of critical electrochemical spectral features for early-stage diagnosis; (2) Improved data balance and clearer class boundaries using synthetic minority oversampling with Tomek links, which oversamples minority classes and removes borderline samples; (3) Realistic minority class synthesis using a dual attention Wasserstein generative adversarial networks, where channel attention focuses on diagnostically relevant spectral features and temporal attention models the dynamic evolution of PEMFC electrochemical behavior, ensuring high-quality, diagnostically informative synthetic fault data. The integrated framework achieves 99.67 % overall diagnostic accuracy and, under an extreme 1:200 class imbalance, outperforms state-of-the-art methods by 14 %. This approach enables rapid, data-efficient PEMFC fault diagnosis, strengthening fault management and advancing the performance of energy systems.</div></div>","PeriodicalId":34138,"journal":{"name":"Energy and AI","volume":"22 ","pages":"Article 100630"},"PeriodicalIF":9.6,"publicationDate":"2025-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145362267","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Energy and AI
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1