Investigating the effect of training time for machine learning based photovoltaic power forecasting

Dávid Markovics, M. J. Mayer
{"title":"Investigating the effect of training time for machine learning based photovoltaic power forecasting","authors":"Dávid Markovics, M. J. Mayer","doi":"10.1109/IYCE54153.2022.9857544","DOIUrl":null,"url":null,"abstract":"The built-in capacity and therefore the importance of photovoltaic (PV) power generation is inevitably increasing in the past years resulting that it is less and less possible to turn a blind eye to the challenging problems that come with it. This article focuses on a practical issue regarding numerical weather prediction (NWP) based PV forecasting with machine learning, which is the effect of the training time. Most of the studies propose new high-level sophisticated algorithms which can reduce the errors with a given percentage, however few of them deals with practical questions regarding the training time therefore the operating time needed to reach the accuracy of physical models, and the optimal length of the training dataset is still generally unclear. This study is supposed to provide some suggestions introducing the test method and the results of the simulations that were done on 2-years data including spatial weather forecast and the power production of a Hungarian PV plant. Three fast but powerful ensemble methods, XGBoost, LightGBM and ExtraTrees regression were applied using different train length cases in the range of 2,4,6… 102 weeks, for all the 52 two-week intervals. This simulation allows to investigate not just the previously mentioned aspects, but also the seasonal difference and sensitivity for the forecasting accuracy. The results are shown in both day-ahead and intraday time horizons, where in case of latter, also some measured power values of the close-past has been added to the input variables.","PeriodicalId":248738,"journal":{"name":"2022 8th International Youth Conference on Energy (IYCE)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 8th International Youth Conference on Energy (IYCE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IYCE54153.2022.9857544","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

The built-in capacity and therefore the importance of photovoltaic (PV) power generation is inevitably increasing in the past years resulting that it is less and less possible to turn a blind eye to the challenging problems that come with it. This article focuses on a practical issue regarding numerical weather prediction (NWP) based PV forecasting with machine learning, which is the effect of the training time. Most of the studies propose new high-level sophisticated algorithms which can reduce the errors with a given percentage, however few of them deals with practical questions regarding the training time therefore the operating time needed to reach the accuracy of physical models, and the optimal length of the training dataset is still generally unclear. This study is supposed to provide some suggestions introducing the test method and the results of the simulations that were done on 2-years data including spatial weather forecast and the power production of a Hungarian PV plant. Three fast but powerful ensemble methods, XGBoost, LightGBM and ExtraTrees regression were applied using different train length cases in the range of 2,4,6… 102 weeks, for all the 52 two-week intervals. This simulation allows to investigate not just the previously mentioned aspects, but also the seasonal difference and sensitivity for the forecasting accuracy. The results are shown in both day-ahead and intraday time horizons, where in case of latter, also some measured power values of the close-past has been added to the input variables.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
研究训练时间对基于机器学习的光伏发电预测的影响
在过去的几年里,内置容量和光伏发电的重要性不可避免地增加,导致越来越不可能对随之而来的具有挑战性的问题视而不见。本文主要研究基于数值天气预报(NWP)的PV预测与机器学习的一个实际问题,即训练时间的影响。大多数研究都提出了新的高级复杂算法,可以将误差降低到给定的百分比,但是很少有研究涉及到训练时间的实际问题,因此达到物理模型精度所需的操作时间,以及训练数据集的最佳长度仍然普遍不清楚。本研究通过对匈牙利某光伏电站2年的空间天气预报和发电数据进行模拟,并介绍了测试方法和模拟结果,提出了一些建议。在所有52周的时间间隔中,采用了XGBoost、LightGBM和ExtraTrees三种快速但功能强大的集成方法,在2、4、6…102周的不同训练长度的情况下应用。该模拟不仅可以研究前面提到的方面,还可以研究季节差异和预测精度的敏感性。结果显示在前一天和当天的时间范围,在后者的情况下,也有一些测量功率值的近过去已被添加到输入变量。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Numerical study of a duct-type ESP with W-type collecting electrodes and different circular corona electrodes radius Examination of electric field distribution in a laboratory-built electrospinning device Overview of the EROI, a tool to measure energy availability through the energy transition Developing a residential consumption forecast method Utilization of Additive Manufacturing in Nuclear Power Industry
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1