基于余热的建筑运行优化耦合时间尺度强化学习方法

IF 11 1区 工程技术 Q1 ENERGY & FUELS Applied Energy Pub Date : 2025-08-01 Epub Date: 2025-04-15 DOI:10.1016/j.apenergy.2025.125851
Zhe Chen , Tian Xing , Yu Wang , Yunlin Zhuang , Meng Zheng , Qianchuan Zhao , Qing-Shan Jia
{"title":"基于余热的建筑运行优化耦合时间尺度强化学习方法","authors":"Zhe Chen ,&nbsp;Tian Xing ,&nbsp;Yu Wang ,&nbsp;Yunlin Zhuang ,&nbsp;Meng Zheng ,&nbsp;Qianchuan Zhao ,&nbsp;Qing-Shan Jia","doi":"10.1016/j.apenergy.2025.125851","DOIUrl":null,"url":null,"abstract":"<div><div>This paper focuses on the joint optimization of fan coil units (FCUs) and heat pumps in HVAC systems for multi-zone building environments. The core problem involves balancing fast local control of FCUs with slower global control of heat pumps to ensure energy efficiency and indoor comfort. To tackle this, we propose a coupling time-scale reinforcement learning (RL) algorithm, specifically a Deep Q-Network (DQN)-based approach that employs a multi-task learning network to manage both FCU and heat pump control efficiently through the utilization of shared state information. Moreover, the agents are trained and make decisions collaboratively across different timescales. In addition, we develop a high-fidelity building simulation that incorporates detailed thermal models, dynamic loading, and waste heat modules to evaluate the performance of the system in real-world conditions. The experimental results show that the proposed coupling time-scale DQN algorithm improves the accuracy of temperature control by 35.4 % and 26.21 % compared to traditional DQN and the occupant-centric control (OCC) algorithms. Additionally, it reduces regional power fluctuations by 25.18 % and 56.74 % relative to these traditional algorithms. Simultaneously, the proposed algorithm achieves the lowest heat pump energy consumption (2964 W), outperforming traditional DQN (2977 W) and OCC (3051 W) respectively, while maintaining superior temperature control accuracy. These quantitative improvements collectively demonstrate the proposed algorithm’s capability to synergistically balance thermal comfort, power fluctuation, and energy consumption.</div></div>","PeriodicalId":246,"journal":{"name":"Applied Energy","volume":"391 ","pages":"Article 125851"},"PeriodicalIF":11.0000,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Coupling time-scale reinforcement learning methods for building operational optimization with waste heat\",\"authors\":\"Zhe Chen ,&nbsp;Tian Xing ,&nbsp;Yu Wang ,&nbsp;Yunlin Zhuang ,&nbsp;Meng Zheng ,&nbsp;Qianchuan Zhao ,&nbsp;Qing-Shan Jia\",\"doi\":\"10.1016/j.apenergy.2025.125851\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>This paper focuses on the joint optimization of fan coil units (FCUs) and heat pumps in HVAC systems for multi-zone building environments. The core problem involves balancing fast local control of FCUs with slower global control of heat pumps to ensure energy efficiency and indoor comfort. To tackle this, we propose a coupling time-scale reinforcement learning (RL) algorithm, specifically a Deep Q-Network (DQN)-based approach that employs a multi-task learning network to manage both FCU and heat pump control efficiently through the utilization of shared state information. Moreover, the agents are trained and make decisions collaboratively across different timescales. In addition, we develop a high-fidelity building simulation that incorporates detailed thermal models, dynamic loading, and waste heat modules to evaluate the performance of the system in real-world conditions. The experimental results show that the proposed coupling time-scale DQN algorithm improves the accuracy of temperature control by 35.4 % and 26.21 % compared to traditional DQN and the occupant-centric control (OCC) algorithms. Additionally, it reduces regional power fluctuations by 25.18 % and 56.74 % relative to these traditional algorithms. Simultaneously, the proposed algorithm achieves the lowest heat pump energy consumption (2964 W), outperforming traditional DQN (2977 W) and OCC (3051 W) respectively, while maintaining superior temperature control accuracy. These quantitative improvements collectively demonstrate the proposed algorithm’s capability to synergistically balance thermal comfort, power fluctuation, and energy consumption.</div></div>\",\"PeriodicalId\":246,\"journal\":{\"name\":\"Applied Energy\",\"volume\":\"391 \",\"pages\":\"Article 125851\"},\"PeriodicalIF\":11.0000,\"publicationDate\":\"2025-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied Energy\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0306261925005811\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/4/15 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"ENERGY & FUELS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Energy","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0306261925005811","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/4/15 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"ENERGY & FUELS","Score":null,"Total":0}
引用次数: 0

摘要

本文重点研究多区建筑环境下暖通空调系统中风机盘管机组(FCU)和热泵的联合优化问题。核心问题包括平衡 FCU 的快速局部控制和热泵的慢速全局控制,以确保能源效率和室内舒适度。为解决这一问题,我们提出了一种耦合时间尺度强化学习(RL)算法,特别是基于深度 Q 网络(DQN)的方法,该方法采用多任务学习网络,通过利用共享状态信息来有效管理 FCU 和热泵控制。此外,我们还对代理进行了培训,并在不同时间尺度上协同做出决策。此外,我们还开发了一种高保真建筑仿真,其中包含详细的热模型、动态负载和废热模块,以评估系统在实际条件下的性能。实验结果表明,与传统的 DQN 算法和以住户为中心的控制 (OCC) 算法相比,所提出的耦合时间尺度 DQN 算法分别提高了 35.4% 和 26.21% 的温度控制精确度。此外,与这些传统算法相比,它还将区域功率波动降低了 25.18% 和 56.74%。同时,所提出的算法实现了最低的热泵能耗(2964 瓦),分别优于传统的 DQN(2977 瓦)和 OCC(3051 瓦),同时保持了卓越的温度控制精确度。这些量化改进共同证明了所提出的算法能够协同平衡热舒适度、功率波动和能耗。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Coupling time-scale reinforcement learning methods for building operational optimization with waste heat
This paper focuses on the joint optimization of fan coil units (FCUs) and heat pumps in HVAC systems for multi-zone building environments. The core problem involves balancing fast local control of FCUs with slower global control of heat pumps to ensure energy efficiency and indoor comfort. To tackle this, we propose a coupling time-scale reinforcement learning (RL) algorithm, specifically a Deep Q-Network (DQN)-based approach that employs a multi-task learning network to manage both FCU and heat pump control efficiently through the utilization of shared state information. Moreover, the agents are trained and make decisions collaboratively across different timescales. In addition, we develop a high-fidelity building simulation that incorporates detailed thermal models, dynamic loading, and waste heat modules to evaluate the performance of the system in real-world conditions. The experimental results show that the proposed coupling time-scale DQN algorithm improves the accuracy of temperature control by 35.4 % and 26.21 % compared to traditional DQN and the occupant-centric control (OCC) algorithms. Additionally, it reduces regional power fluctuations by 25.18 % and 56.74 % relative to these traditional algorithms. Simultaneously, the proposed algorithm achieves the lowest heat pump energy consumption (2964 W), outperforming traditional DQN (2977 W) and OCC (3051 W) respectively, while maintaining superior temperature control accuracy. These quantitative improvements collectively demonstrate the proposed algorithm’s capability to synergistically balance thermal comfort, power fluctuation, and energy consumption.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Applied Energy
Applied Energy 工程技术-工程:化工
CiteScore
21.20
自引率
10.70%
发文量
1830
审稿时长
41 days
期刊介绍: Applied Energy serves as a platform for sharing innovations, research, development, and demonstrations in energy conversion, conservation, and sustainable energy systems. The journal covers topics such as optimal energy resource use, environmental pollutant mitigation, and energy process analysis. It welcomes original papers, review articles, technical notes, and letters to the editor. Authors are encouraged to submit manuscripts that bridge the gap between research, development, and implementation. The journal addresses a wide spectrum of topics, including fossil and renewable energy technologies, energy economics, and environmental impacts. Applied Energy also explores modeling and forecasting, conservation strategies, and the social and economic implications of energy policies, including climate change mitigation. It is complemented by the open-access journal Advances in Applied Energy.
期刊最新文献
A review of electrochemical process-driven recycling and regeneration technologies for lithium-ion batteries A curriculum-guided deep reinforcement learning framework for health-aware energy management of hybrid electric aircraft Physics-informed multi-gated convolutional recurrent network for extreme wind speed prediction A universal two-stage framework for gas diffusion layer reconstruction in proton exchange membrane fuel cells: Toward high-fidelity through-plane porosity distribution Renewable-colocated green hydrogen production: Optimal scheduling and profitability
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1