Training data selection using information entropy: Application to heating load modeling of rural residence in northern China

IF 0.7 4区 地球科学 Q4 GEOCHEMISTRY & GEOPHYSICS Applied Geophysics Pub Date : 2024-07-10 DOI:10.1007/s11770-024-1120-9
Li-gai Kang, Hao Li, Zhi-chao Wang, Dong-xiang Sun, Jin-zhu Wang, Yang Yang, Xu Zhang
{"title":"Training data selection using information entropy: Application to heating load modeling of rural residence in northern China","authors":"Li-gai Kang, Hao Li, Zhi-chao Wang, Dong-xiang Sun, Jin-zhu Wang, Yang Yang, Xu Zhang","doi":"10.1007/s11770-024-1120-9","DOIUrl":null,"url":null,"abstract":"<p>The selection of input variables and their amount has been an important issue in big data load forecasting. Taking heating load forecasting as an example, this paper proposed a method for data filtering based on information entropy. First, the heating data from an air source heat pump system adopted by a rural residence in northern China were employed. Moreover, the training data were classified based on linear or nonlinear variations of outdoor temperature and its changing ranges, while the validation data included three different types of weather conditions, namely, cold, cool, and mild. Then, the information entropy under 2-h, 4-h, 6-h and 8-h training window was quantified to be 1.811, 1.839, 1.877 and 1.856, respectively. For the employed rural residence, an equivalent three-resistance and two-capacity model was established to validate the effectiveness of the training window. Using the derived optimal thermal resistance and capacity, the various selection of outdoor temperature variation trend and range were compared and optimized. Results showed that 6 h of training data had the maximum information entropy and the most abundant information, the minimum errors between actual and forecasting data were observed under 6 h of training data, linear change, and lower outdoor temperature. The mean absolute percentage errors for the load forecasting of three typical days were 5.63%, 8.46%, and 12.10%, respectively.</p>","PeriodicalId":55500,"journal":{"name":"Applied Geophysics","volume":"18 1","pages":""},"PeriodicalIF":0.7000,"publicationDate":"2024-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Geophysics","FirstCategoryId":"89","ListUrlMain":"https://doi.org/10.1007/s11770-024-1120-9","RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"GEOCHEMISTRY & GEOPHYSICS","Score":null,"Total":0}
引用次数: 0

Abstract

The selection of input variables and their amount has been an important issue in big data load forecasting. Taking heating load forecasting as an example, this paper proposed a method for data filtering based on information entropy. First, the heating data from an air source heat pump system adopted by a rural residence in northern China were employed. Moreover, the training data were classified based on linear or nonlinear variations of outdoor temperature and its changing ranges, while the validation data included three different types of weather conditions, namely, cold, cool, and mild. Then, the information entropy under 2-h, 4-h, 6-h and 8-h training window was quantified to be 1.811, 1.839, 1.877 and 1.856, respectively. For the employed rural residence, an equivalent three-resistance and two-capacity model was established to validate the effectiveness of the training window. Using the derived optimal thermal resistance and capacity, the various selection of outdoor temperature variation trend and range were compared and optimized. Results showed that 6 h of training data had the maximum information entropy and the most abundant information, the minimum errors between actual and forecasting data were observed under 6 h of training data, linear change, and lower outdoor temperature. The mean absolute percentage errors for the load forecasting of three typical days were 5.63%, 8.46%, and 12.10%, respectively.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用信息熵选择训练数据:应用于中国北方农村居民采暖负荷建模
输入变量及其数量的选择一直是大数据负荷预测中的一个重要问题。本文以采暖负荷预测为例,提出了一种基于信息熵的数据过滤方法。首先,采用了中国北方农村居民采用的空气源热泵系统的采暖数据。此外,训练数据根据室外温度的线性或非线性变化及其变化范围进行分类,而验证数据则包括三种不同的天气条件,即寒冷、凉爽和温和。然后,将 2 小时、4 小时、6 小时和 8 小时训练窗口下的信息熵分别量化为 1.811、1.839、1.877 和 1.856。针对所采用的农村住宅,建立了一个等效的三阻力和两容量模型,以验证训练窗口的有效性。利用得出的最佳热阻和热容量,对室外温度变化趋势和范围的各种选择进行了比较和优化。结果表明,6 小时的训练数据具有最大的信息熵和最丰富的信息,在 6 小时训练数据、线性变化和室外温度较低的情况下,实际数据与预测数据之间的误差最小。三个典型日负荷预测的平均绝对百分比误差分别为 5.63%、8.46% 和 12.10%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Applied Geophysics
Applied Geophysics 地学-地球化学与地球物理
CiteScore
1.50
自引率
14.30%
发文量
912
审稿时长
2 months
期刊介绍: The journal is designed to provide an academic realm for a broad blend of academic and industry papers to promote rapid communication and exchange of ideas between Chinese and world-wide geophysicists. The publication covers the applications of geoscience, geophysics, and related disciplines in the fields of energy, resources, environment, disaster, engineering, information, military, and surveying.
期刊最新文献
Earthquake detection probabilities and completeness magnitude in the northern margin of the Ordos Block Multi-well wavelet-synchronized inversion based on particle swarm optimization Low-Frequency Sweep Design—A Case Study in Middle East Desert Environments Research on Paleoearthquake and Recurrence Characteristics of Strong Earthquakes in Active Faults of Mainland China Capacity matching and optimization of solar-ground source heat pump coupling systems
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1