基于余热的建筑运行优化耦合时间尺度强化学习方法

IF 11 1区工程技术 Q1 ENERGY & FUELS Applied Energy Pub Date : 2025-08-01 Epub Date: 2025-04-15 DOI:10.1016/j.apenergy.2025.125851

Zhe Chen , Tian Xing , Yu Wang , Yunlin Zhuang , Meng Zheng , Qianchuan Zhao , Qing-Shan Jia

{"title":"基于余热的建筑运行优化耦合时间尺度强化学习方法","authors":"Zhe Chen , Tian Xing , Yu Wang , Yunlin Zhuang , Meng Zheng , Qianchuan Zhao , Qing-Shan Jia","doi":"10.1016/j.apenergy.2025.125851","DOIUrl":null,"url":null,"abstract":"<div><div>This paper focuses on the joint optimization of fan coil units (FCUs) and heat pumps in HVAC systems for multi-zone building environments. The core problem involves balancing fast local control of FCUs with slower global control of heat pumps to ensure energy efficiency and indoor comfort. To tackle this, we propose a coupling time-scale reinforcement learning (RL) algorithm, specifically a Deep Q-Network (DQN)-based approach that employs a multi-task learning network to manage both FCU and heat pump control efficiently through the utilization of shared state information. Moreover, the agents are trained and make decisions collaboratively across different timescales. In addition, we develop a high-fidelity building simulation that incorporates detailed thermal models, dynamic loading, and waste heat modules to evaluate the performance of the system in real-world conditions. The experimental results show that the proposed coupling time-scale DQN algorithm improves the accuracy of temperature control by 35.4 % and 26.21 % compared to traditional DQN and the occupant-centric control (OCC) algorithms. Additionally, it reduces regional power fluctuations by 25.18 % and 56.74 % relative to these traditional algorithms. Simultaneously, the proposed algorithm achieves the lowest heat pump energy consumption (2964 W), outperforming traditional DQN (2977 W) and OCC (3051 W) respectively, while maintaining superior temperature control accuracy. These quantitative improvements collectively demonstrate the proposed algorithm’s capability to synergistically balance thermal comfort, power fluctuation, and energy consumption.</div></div>","PeriodicalId":246,"journal":{"name":"Applied Energy","volume":"391 ","pages":"Article 125851"},"PeriodicalIF":11.0000,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Coupling time-scale reinforcement learning methods for building operational optimization with waste heat\",\"authors\":\"Zhe Chen , Tian Xing , Yu Wang , Yunlin Zhuang , Meng Zheng , Qianchuan Zhao , Qing-Shan Jia\",\"doi\":\"10.1016/j.apenergy.2025.125851\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>This paper focuses on the joint optimization of fan coil units (FCUs) and heat pumps in HVAC systems for multi-zone building environments. The core problem involves balancing fast local control of FCUs with slower global control of heat pumps to ensure energy efficiency and indoor comfort. To tackle this, we propose a coupling time-scale reinforcement learning (RL) algorithm, specifically a Deep Q-Network (DQN)-based approach that employs a multi-task learning network to manage both FCU and heat pump control efficiently through the utilization of shared state information. Moreover, the agents are trained and make decisions collaboratively across different timescales. In addition, we develop a high-fidelity building simulation that incorporates detailed thermal models, dynamic loading, and waste heat modules to evaluate the performance of the system in real-world conditions. The experimental results show that the proposed coupling time-scale DQN algorithm improves the accuracy of temperature control by 35.4 % and 26.21 % compared to traditional DQN and the occupant-centric control (OCC) algorithms. Additionally, it reduces regional power fluctuations by 25.18 % and 56.74 % relative to these traditional algorithms. Simultaneously, the proposed algorithm achieves the lowest heat pump energy consumption (2964 W), outperforming traditional DQN (2977 W) and OCC (3051 W) respectively, while maintaining superior temperature control accuracy. These quantitative improvements collectively demonstrate the proposed algorithm’s capability to synergistically balance thermal comfort, power fluctuation, and energy consumption.</div></div>\",\"PeriodicalId\":246,\"journal\":{\"name\":\"Applied Energy\",\"volume\":\"391 \",\"pages\":\"Article 125851\"},\"PeriodicalIF\":11.0000,\"publicationDate\":\"2025-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied Energy\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0306261925005811\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/4/15 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"ENERGY & FUELS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Energy","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0306261925005811","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/4/15 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"ENERGY & FUELS","Score":null,"Total":0}

引用次数: 0

摘要

本文重点研究多区建筑环境下暖通空调系统中风机盘管机组（FCU）和热泵的联合优化问题。核心问题包括平衡 FCU 的快速局部控制和热泵的慢速全局控制，以确保能源效率和室内舒适度。为解决这一问题，我们提出了一种耦合时间尺度强化学习（RL）算法，特别是基于深度 Q 网络（DQN）的方法，该方法采用多任务学习网络，通过利用共享状态信息来有效管理 FCU 和热泵控制。此外，我们还对代理进行了培训，并在不同时间尺度上协同做出决策。此外，我们还开发了一种高保真建筑仿真，其中包含详细的热模型、动态负载和废热模块，以评估系统在实际条件下的性能。实验结果表明，与传统的 DQN 算法和以住户为中心的控制 (OCC) 算法相比，所提出的耦合时间尺度 DQN 算法分别提高了 35.4% 和 26.21% 的温度控制精确度。此外，与这些传统算法相比，它还将区域功率波动降低了 25.18% 和 56.74%。同时，所提出的算法实现了最低的热泵能耗（2964 瓦），分别优于传统的 DQN（2977 瓦）和 OCC（3051 瓦），同时保持了卓越的温度控制精确度。这些量化改进共同证明了所提出的算法能够协同平衡热舒适度、功率波动和能耗。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Coupling time-scale reinforcement learning methods for building operational optimization with waste heat

This paper focuses on the joint optimization of fan coil units (FCUs) and heat pumps in HVAC systems for multi-zone building environments. The core problem involves balancing fast local control of FCUs with slower global control of heat pumps to ensure energy efficiency and indoor comfort. To tackle this, we propose a coupling time-scale reinforcement learning (RL) algorithm, specifically a Deep Q-Network (DQN)-based approach that employs a multi-task learning network to manage both FCU and heat pump control efficiently through the utilization of shared state information. Moreover, the agents are trained and make decisions collaboratively across different timescales. In addition, we develop a high-fidelity building simulation that incorporates detailed thermal models, dynamic loading, and waste heat modules to evaluate the performance of the system in real-world conditions. The experimental results show that the proposed coupling time-scale DQN algorithm improves the accuracy of temperature control by 35.4 % and 26.21 % compared to traditional DQN and the occupant-centric control (OCC) algorithms. Additionally, it reduces regional power fluctuations by 25.18 % and 56.74 % relative to these traditional algorithms. Simultaneously, the proposed algorithm achieves the lowest heat pump energy consumption (2964 W), outperforming traditional DQN (2977 W) and OCC (3051 W) respectively, while maintaining superior temperature control accuracy. These quantitative improvements collectively demonstrate the proposed algorithm’s capability to synergistically balance thermal comfort, power fluctuation, and energy consumption.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Applied Energy 工程技术-工程：化工

CiteScore

21.20

自引率

10.70%

发文量

1830

审稿时长

41 days

期刊介绍： Applied Energy serves as a platform for sharing innovations, research, development, and demonstrations in energy conversion, conservation, and sustainable energy systems. The journal covers topics such as optimal energy resource use, environmental pollutant mitigation, and energy process analysis. It welcomes original papers, review articles, technical notes, and letters to the editor. Authors are encouraged to submit manuscripts that bridge the gap between research, development, and implementation. The journal addresses a wide spectrum of topics, including fossil and renewable energy technologies, energy economics, and environmental impacts. Applied Energy also explores modeling and forecasting, conservation strategies, and the social and economic implications of energy policies, including climate change mitigation. It is complemented by the open-access journal Advances in Applied Energy.