{"title":"Enhancing source domain availability through data and feature transfer learning for building power load forecasting","authors":"Fanyue Qian, Yingjun Ruan, Huiming Lu, Hua Meng, Tingting Xu","doi":"10.1007/s12273-023-1087-0","DOIUrl":null,"url":null,"abstract":"<p>During the initial phases of operation following the construction or renovation of existing buildings, the availability of historical power usage data is limited, which leads to lower accuracy in load forecasting and hinders normal usage. Fortunately, by transferring load data from similar buildings, it is possible to enhance forecasting accuracy. However, indiscriminately expanding all source domain data to the target domain is highly likely to result in negative transfer learning. This study explores the feasibility of utilizing similar buildings (source domains) in transfer learning by implementing and comparing two distinct forms of multi-source transfer learning. Firstly, this study focuses on the Higashita area in Kitakyushu City, Japan, as the research object. Four buildings that exhibit the highest similarity to the target buildings within this area were selected for analysis. Next, the two-stage TrAdaBoost.R2 algorithm is used for multi-source transfer learning, and its transfer effect is analyzed. Finally, the application effects of instance-based (IBMTL) and feature-based (FBMTL) multi-source transfer learning are compared, which explained the effect of the source domain data on the forecasting accuracy in different transfer modes. The results show that combining the two-stage TrAdaBoost.R2 algorithm with multi-source data can reduce the CV-RMSE by 7.23% compared to a single-source domain, and the accuracy improvement is significant. At the same time, multi-source transfer learning, which is based on instance, can better supplement the integrity of the target domain data and has a higher forecasting accuracy. Overall, IBMTL tends to retain effective data associations and FBMTL shows higher forecasting stability. The findings of this study, which include the verification of real-life algorithm application and source domain availability, can serve as a theoretical reference for implementing transfer learning in load forecasting.</p>","PeriodicalId":49226,"journal":{"name":"Building Simulation","volume":"54 1","pages":""},"PeriodicalIF":6.1000,"publicationDate":"2024-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Building Simulation","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1007/s12273-023-1087-0","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CONSTRUCTION & BUILDING TECHNOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
During the initial phases of operation following the construction or renovation of existing buildings, the availability of historical power usage data is limited, which leads to lower accuracy in load forecasting and hinders normal usage. Fortunately, by transferring load data from similar buildings, it is possible to enhance forecasting accuracy. However, indiscriminately expanding all source domain data to the target domain is highly likely to result in negative transfer learning. This study explores the feasibility of utilizing similar buildings (source domains) in transfer learning by implementing and comparing two distinct forms of multi-source transfer learning. Firstly, this study focuses on the Higashita area in Kitakyushu City, Japan, as the research object. Four buildings that exhibit the highest similarity to the target buildings within this area were selected for analysis. Next, the two-stage TrAdaBoost.R2 algorithm is used for multi-source transfer learning, and its transfer effect is analyzed. Finally, the application effects of instance-based (IBMTL) and feature-based (FBMTL) multi-source transfer learning are compared, which explained the effect of the source domain data on the forecasting accuracy in different transfer modes. The results show that combining the two-stage TrAdaBoost.R2 algorithm with multi-source data can reduce the CV-RMSE by 7.23% compared to a single-source domain, and the accuracy improvement is significant. At the same time, multi-source transfer learning, which is based on instance, can better supplement the integrity of the target domain data and has a higher forecasting accuracy. Overall, IBMTL tends to retain effective data associations and FBMTL shows higher forecasting stability. The findings of this study, which include the verification of real-life algorithm application and source domain availability, can serve as a theoretical reference for implementing transfer learning in load forecasting.
期刊介绍:
Building Simulation: An International Journal publishes original, high quality, peer-reviewed research papers and review articles dealing with modeling and simulation of buildings including their systems. The goal is to promote the field of building science and technology to such a level that modeling will eventually be used in every aspect of building construction as a routine instead of an exception. Of particular interest are papers that reflect recent developments and applications of modeling tools and their impact on advances of building science and technology.