Towards various occupants with different thermal comfort requirements: A deep reinforcement learning approach combined with a dynamic PMV model for HVAC control in buildings
{"title":"Towards various occupants with different thermal comfort requirements: A deep reinforcement learning approach combined with a dynamic PMV model for HVAC control in buildings","authors":"","doi":"10.1016/j.enconman.2024.118995","DOIUrl":null,"url":null,"abstract":"<div><p>Reinforcement learning (RL) has great potential in achieving energy-efficient, comfortable and intelligent control of heating, ventilation and air conditioning (HVAC) systems. Although research on RL-based HVAC control has attracted increasing interest, current studies generally use simple building simulation as the environment to train agents, and the definition of thermal comfort is limited to a wide temperature range, which cannot meet the different thermal comfort requirements of various occupants. This study proposes a deep reinforcement learning (DRL) control framework based on the Dueling Deep Q-network (DQN) algorithm, combined with a self-designed environmental model and reward function, for HVAC control meeting different thermal comfort requirements. Specifically, based on the theory of building thermal dynamics, a nonlinear equation modified by experimental data is used for the environmental model that reflects the actual thermal change of building. Different thermal comfort requirements are considered and analysed through a dynamic predicted mean vote (PMV) model that focuses on the metabolic rate and clothing level of occupants. By systematically exploring different heating modes for occupants and control time intervals, the proposed framework demonstrates that heating energy consumption can be reduced by 4.8%-39.58% under various conditions compared to rule-based control. In addition, the study found that the HVAC control based on DRL has greater potential in saving energy when the heating demand of building is higher. Our study is helpful for researchers to make HVAC control more energy-efficient and user-friendly with the help of artificial intelligence.</p></div>","PeriodicalId":11664,"journal":{"name":"Energy Conversion and Management","volume":null,"pages":null},"PeriodicalIF":9.9000,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Energy Conversion and Management","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0196890424009361","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENERGY & FUELS","Score":null,"Total":0}
引用次数: 0
Abstract
Reinforcement learning (RL) has great potential in achieving energy-efficient, comfortable and intelligent control of heating, ventilation and air conditioning (HVAC) systems. Although research on RL-based HVAC control has attracted increasing interest, current studies generally use simple building simulation as the environment to train agents, and the definition of thermal comfort is limited to a wide temperature range, which cannot meet the different thermal comfort requirements of various occupants. This study proposes a deep reinforcement learning (DRL) control framework based on the Dueling Deep Q-network (DQN) algorithm, combined with a self-designed environmental model and reward function, for HVAC control meeting different thermal comfort requirements. Specifically, based on the theory of building thermal dynamics, a nonlinear equation modified by experimental data is used for the environmental model that reflects the actual thermal change of building. Different thermal comfort requirements are considered and analysed through a dynamic predicted mean vote (PMV) model that focuses on the metabolic rate and clothing level of occupants. By systematically exploring different heating modes for occupants and control time intervals, the proposed framework demonstrates that heating energy consumption can be reduced by 4.8%-39.58% under various conditions compared to rule-based control. In addition, the study found that the HVAC control based on DRL has greater potential in saving energy when the heating demand of building is higher. Our study is helpful for researchers to make HVAC control more energy-efficient and user-friendly with the help of artificial intelligence.
期刊介绍:
The journal Energy Conversion and Management provides a forum for publishing original contributions and comprehensive technical review articles of interdisciplinary and original research on all important energy topics.
The topics considered include energy generation, utilization, conversion, storage, transmission, conservation, management and sustainability. These topics typically involve various types of energy such as mechanical, thermal, nuclear, chemical, electromagnetic, magnetic and electric. These energy types cover all known energy resources, including renewable resources (e.g., solar, bio, hydro, wind, geothermal and ocean energy), fossil fuels and nuclear resources.