Improving Model Generalization for Short-Term Customer Load Forecasting With Causal Inference

IF 9.8 1区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC IEEE Transactions on Smart Grid Pub Date : 2024-08-30 DOI:10.1109/TSG.2024.3452490
Zhenyi Wang;Hongcai Zhang;Ruixiong Yang;Yong Chen
{"title":"Improving Model Generalization for Short-Term Customer Load Forecasting With Causal Inference","authors":"Zhenyi Wang;Hongcai Zhang;Ruixiong Yang;Yong Chen","doi":"10.1109/TSG.2024.3452490","DOIUrl":null,"url":null,"abstract":"Short-term customer load forecasting is vital for the normal operation of power systems. Unfortunately, conventional machine learning-based forecasting methods are susceptible to generalization issues (e.g., the customer heterogeneity and distribution drift of load data), manifested in model performance degradation. In recent years, some studies have employed the advanced deep learning technology, such as online learning, to overcome the aforesaid problems. However, these methods can only alleviate the adverse impacts of generalization problems on model performance, because they are inherently built on unstable relationships (i.e., correlations). In this paper, we propose a novel causal inference-based method to improve the generalization for short-term customer load forecasting models. Specifically, we first investigate the causal relations between input features and the output in existing methods, and introduce the load characteristics as an extra model input to enhance the causality. Then, we closely inspect the causality in models by using the causal graph to distinguish the confounder, followed by employing the causal intervention with do-calculus to eliminate the spurious correlations caused by the confounder. Moreover, we propose a novel load forecasting framework with the load characteristic extraction, characteristic pool approximation and characteristic-injected model to realize the causal intervention in an efficient and fidelity way. Finally, the effectiveness and superiority of our proposed method are validated on a public dataset.","PeriodicalId":13331,"journal":{"name":"IEEE Transactions on Smart Grid","volume":"16 1","pages":"424-436"},"PeriodicalIF":9.8000,"publicationDate":"2024-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Smart Grid","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10660529/","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

Short-term customer load forecasting is vital for the normal operation of power systems. Unfortunately, conventional machine learning-based forecasting methods are susceptible to generalization issues (e.g., the customer heterogeneity and distribution drift of load data), manifested in model performance degradation. In recent years, some studies have employed the advanced deep learning technology, such as online learning, to overcome the aforesaid problems. However, these methods can only alleviate the adverse impacts of generalization problems on model performance, because they are inherently built on unstable relationships (i.e., correlations). In this paper, we propose a novel causal inference-based method to improve the generalization for short-term customer load forecasting models. Specifically, we first investigate the causal relations between input features and the output in existing methods, and introduce the load characteristics as an extra model input to enhance the causality. Then, we closely inspect the causality in models by using the causal graph to distinguish the confounder, followed by employing the causal intervention with do-calculus to eliminate the spurious correlations caused by the confounder. Moreover, we propose a novel load forecasting framework with the load characteristic extraction, characteristic pool approximation and characteristic-injected model to realize the causal intervention in an efficient and fidelity way. Finally, the effectiveness and superiority of our proposed method are validated on a public dataset.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用因果推理改进短期客户负荷预测的模型泛化
短期负荷预测对电力系统的正常运行至关重要。不幸的是,传统的基于机器学习的预测方法容易受到泛化问题的影响(例如,客户异质性和负载数据的分布漂移),表现在模型性能下降。近年来,一些研究采用了先进的深度学习技术,如在线学习来克服上述问题。然而,这些方法只能减轻泛化问题对模型性能的不利影响,因为它们本质上是建立在不稳定的关系(即相关性)上的。本文提出了一种新的基于因果推理的方法来提高短期客户负荷预测模型的泛化能力。具体而言,我们首先研究了现有方法中输入特征与输出之间的因果关系,并引入负载特征作为额外的模型输入来增强因果关系。然后,我们通过使用因果图来区分混杂因素来仔细检查模型中的因果关系,然后使用带有do-calculus的因果干预来消除由混杂因素引起的虚假相关。在此基础上,提出了一种基于负荷特征提取、特征池逼近和特征注入模型的负荷预测框架,实现了高效、逼真的因果干预。最后,在一个公共数据集上验证了该方法的有效性和优越性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IEEE Transactions on Smart Grid
IEEE Transactions on Smart Grid ENGINEERING, ELECTRICAL & ELECTRONIC-
CiteScore
22.10
自引率
9.40%
发文量
526
审稿时长
6 months
期刊介绍: The IEEE Transactions on Smart Grid is a multidisciplinary journal that focuses on research and development in the field of smart grid technology. It covers various aspects of the smart grid, including energy networks, prosumers (consumers who also produce energy), electric transportation, distributed energy resources, and communications. The journal also addresses the integration of microgrids and active distribution networks with transmission systems. It publishes original research on smart grid theories and principles, including technologies and systems for demand response, Advance Metering Infrastructure, cyber-physical systems, multi-energy systems, transactive energy, data analytics, and electric vehicle integration. Additionally, the journal considers surveys of existing work on the smart grid that propose new perspectives on the history and future of intelligent and active grids.
期刊最新文献
Fixed-Time Privacy-Preserving Distributed Coordinated Control for Grid-Forming and Grid-Following Inverters in Microgrids Real-Time Operational Optimization of Microgrid via Piecewise Linear Basis Function Based Galerkin Approximation Unified Model Aggregation of Active Distribution Network With Heterogeneous Uncertain Distributed Energy Resources Data-Driven Privacy-Preserving Modeling and Frequency Regulation with Aggregated Electric Vehicles via Bilinear Hidden Markov Model Distributed Location Method for SPG Faults in Distribution Networks Combining Zero-Sequence and Phase-to-Phase Current Fault Components
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1