{"title":"End-to-end data-driven modeling framework for automated and trustworthy short-term building energy load forecasting","authors":"Chaobo Zhang, Jie Lu, Jiahua Huang, Yang Zhao","doi":"10.1007/s12273-024-1149-y","DOIUrl":null,"url":null,"abstract":"<p>Conventional automated machine learning (AutoML) technologies fall short in preprocessing low-quality raw data and adapting to varying indoor and outdoor environments, leading to accuracy reduction in forecasting short-term building energy loads. Moreover, their predictions are not transparent because of their black box nature. Hence, the building field currently lacks an AutoML framework capable of data quality enhancement, environment self-adaptation, and model interpretation. To address this research gap, an improved AutoML-based end-to-end data-driven modeling framework is proposed. Bayesian optimization is applied by this framework to find an optimal data preprocessing process for quality improvement of raw data. It bridges the gap where conventional AutoML technologies cannot automatically handle missing data and outliers. A sliding window-based model retraining strategy is utilized to achieve environment self-adaptation, contributing to the accuracy enhancement of AutoML technologies. Moreover, a local interpretable model-agnostic explanations-based approach is developed to interpret predictions made by the improved framework. It overcomes the poor interpretability of conventional AutoML technologies. The performance of the improved framework in forecasting one-hour ahead cooling loads is evaluated using two-year operational data from a real building. It is discovered that the accuracy of the improved framework increases by 4.24%–8.79% compared with four conventional frameworks for buildings with not only high-quality but also low-quality operational data. Furthermore, it is demonstrated that the developed model interpretation approach can effectively explain the predictions of the improved framework. The improved framework offers a novel perspective on creating accurate and reliable AutoML frameworks tailored to building energy load prediction tasks and other similar tasks.</p>","PeriodicalId":49226,"journal":{"name":"Building Simulation","volume":"46 1","pages":""},"PeriodicalIF":6.1000,"publicationDate":"2024-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Building Simulation","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1007/s12273-024-1149-y","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CONSTRUCTION & BUILDING TECHNOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Conventional automated machine learning (AutoML) technologies fall short in preprocessing low-quality raw data and adapting to varying indoor and outdoor environments, leading to accuracy reduction in forecasting short-term building energy loads. Moreover, their predictions are not transparent because of their black box nature. Hence, the building field currently lacks an AutoML framework capable of data quality enhancement, environment self-adaptation, and model interpretation. To address this research gap, an improved AutoML-based end-to-end data-driven modeling framework is proposed. Bayesian optimization is applied by this framework to find an optimal data preprocessing process for quality improvement of raw data. It bridges the gap where conventional AutoML technologies cannot automatically handle missing data and outliers. A sliding window-based model retraining strategy is utilized to achieve environment self-adaptation, contributing to the accuracy enhancement of AutoML technologies. Moreover, a local interpretable model-agnostic explanations-based approach is developed to interpret predictions made by the improved framework. It overcomes the poor interpretability of conventional AutoML technologies. The performance of the improved framework in forecasting one-hour ahead cooling loads is evaluated using two-year operational data from a real building. It is discovered that the accuracy of the improved framework increases by 4.24%–8.79% compared with four conventional frameworks for buildings with not only high-quality but also low-quality operational data. Furthermore, it is demonstrated that the developed model interpretation approach can effectively explain the predictions of the improved framework. The improved framework offers a novel perspective on creating accurate and reliable AutoML frameworks tailored to building energy load prediction tasks and other similar tasks.
期刊介绍:
Building Simulation: An International Journal publishes original, high quality, peer-reviewed research papers and review articles dealing with modeling and simulation of buildings including their systems. The goal is to promote the field of building science and technology to such a level that modeling will eventually be used in every aspect of building construction as a routine instead of an exception. Of particular interest are papers that reflect recent developments and applications of modeling tools and their impact on advances of building science and technology.