{"title":"Averaged Errors as a Risk Factor for Intelligent Forecasting Systems Operation in the Power Industry","authors":"A. Khalyasmaa, P. Matrenin, S. Eroshenko","doi":"10.1109/USSEC53120.2021.9655742","DOIUrl":null,"url":null,"abstract":"The paper discusses the operational risk in intelligent systems for forecasting time series. Typically, when developing and testing regression models based on machine learning, their accuracy is calculated over a long time interval, from several months to several years, and then is averaged. However, in the real-life operation of such systems, the customer is likely to draw a conclusion about the system efficiency based on the results of the first 2–4 weeks of operation. If one or several large errors appear on this short interval, they will not be averaged as it happens over a long one. As a result, there is a risk of failure in the intelligent forecasting system implementation due to the discrepancy between the calculated mean error and that obtained over a short time period at the start of operation. This study considers the problem of solar power plant generation short-term forecasting, analyzes the distribution of errors over short time periods, and substantiates the need to use more detailed accuracy metrics of machine learning models than the error values averaged over a long interval.","PeriodicalId":260032,"journal":{"name":"2021 Ural-Siberian Smart Energy Conference (USSEC)","volume":"458 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 Ural-Siberian Smart Energy Conference (USSEC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/USSEC53120.2021.9655742","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
The paper discusses the operational risk in intelligent systems for forecasting time series. Typically, when developing and testing regression models based on machine learning, their accuracy is calculated over a long time interval, from several months to several years, and then is averaged. However, in the real-life operation of such systems, the customer is likely to draw a conclusion about the system efficiency based on the results of the first 2–4 weeks of operation. If one or several large errors appear on this short interval, they will not be averaged as it happens over a long one. As a result, there is a risk of failure in the intelligent forecasting system implementation due to the discrepancy between the calculated mean error and that obtained over a short time period at the start of operation. This study considers the problem of solar power plant generation short-term forecasting, analyzes the distribution of errors over short time periods, and substantiates the need to use more detailed accuracy metrics of machine learning models than the error values averaged over a long interval.