A novel hybrid solar radiation forecasting algorithm based on discrete wavelet transform and multivariate machine learning models integrated with clearness index clusters
{"title":"A novel hybrid solar radiation forecasting algorithm based on discrete wavelet transform and multivariate machine learning models integrated with clearness index clusters","authors":"Burak Arseven, Said Mahmut Çınar","doi":"10.1016/j.jastp.2025.106417","DOIUrl":null,"url":null,"abstract":"<div><div>This study presents an innovative forecasting algorithm that combines multivariate regression (MR) and discrete wavelet transform (DWT) techniques with clearness index (CI)-based clustering methods to enhance short-term (1 h ahead) solar radiation forecasting. The proposed algorithm consists of two main steps: the first involves forecasting processes using DWT and MR methods, while the second includes clustering processes determined based on CI values. In the forecasting process, the data has been decomposed into sub-signals at different levels using DWT first. Multivariate ridge regression (MRR) and lasso regression (MLR) models for the sub-signals have been determined based on input training data sets created from three different combinations of these sub-signals. Sub-forecast signals have been obtained using models that were determined in different formats. The sub-forecast signals obtained have been recombined using the DWT reconstruction to produce the final forecasts. In the clustering process, clusters have been formed based on CI values using the Kernel k-means algorithm, which has been identified as the most effective among three different algorithms. The effectiveness of forecasts generated using DWT-MRR and DWT-MLR models for all input data set versions has been evaluated within the CI-based clusters.</div><div>The study's key findings have revealed that decomposition at the first level of DWT is sufficient to achieve optimal forecasting performance. Furthermore, the input variables yielding the best results have differed across clusters: radiation and relative humidity for the mostly cloudy cluster, radiation, temperature, and relative humidity for the cloudy cluster, and radiation and temperature for the slightly cloudy cluster. The results have demonstrated that the proposed algorithm achieves a 17% improvement in root mean square error (RMSE) compared to the best-performing model developed without CI clustering. The proposed approach significantly contributes to the literature by optimizing DWT decomposition levels, adapting data modeling to cloudiness conditions, and integrating multiple forecasting techniques to improve performance.</div></div>","PeriodicalId":15096,"journal":{"name":"Journal of Atmospheric and Solar-Terrestrial Physics","volume":"267 ","pages":"Article 106417"},"PeriodicalIF":1.8000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Atmospheric and Solar-Terrestrial Physics","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S136468262500001X","RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"GEOCHEMISTRY & GEOPHYSICS","Score":null,"Total":0}
引用次数: 0
Abstract
This study presents an innovative forecasting algorithm that combines multivariate regression (MR) and discrete wavelet transform (DWT) techniques with clearness index (CI)-based clustering methods to enhance short-term (1 h ahead) solar radiation forecasting. The proposed algorithm consists of two main steps: the first involves forecasting processes using DWT and MR methods, while the second includes clustering processes determined based on CI values. In the forecasting process, the data has been decomposed into sub-signals at different levels using DWT first. Multivariate ridge regression (MRR) and lasso regression (MLR) models for the sub-signals have been determined based on input training data sets created from three different combinations of these sub-signals. Sub-forecast signals have been obtained using models that were determined in different formats. The sub-forecast signals obtained have been recombined using the DWT reconstruction to produce the final forecasts. In the clustering process, clusters have been formed based on CI values using the Kernel k-means algorithm, which has been identified as the most effective among three different algorithms. The effectiveness of forecasts generated using DWT-MRR and DWT-MLR models for all input data set versions has been evaluated within the CI-based clusters.
The study's key findings have revealed that decomposition at the first level of DWT is sufficient to achieve optimal forecasting performance. Furthermore, the input variables yielding the best results have differed across clusters: radiation and relative humidity for the mostly cloudy cluster, radiation, temperature, and relative humidity for the cloudy cluster, and radiation and temperature for the slightly cloudy cluster. The results have demonstrated that the proposed algorithm achieves a 17% improvement in root mean square error (RMSE) compared to the best-performing model developed without CI clustering. The proposed approach significantly contributes to the literature by optimizing DWT decomposition levels, adapting data modeling to cloudiness conditions, and integrating multiple forecasting techniques to improve performance.
期刊介绍:
The Journal of Atmospheric and Solar-Terrestrial Physics (JASTP) is an international journal concerned with the inter-disciplinary science of the Earth''s atmospheric and space environment, especially the highly varied and highly variable physical phenomena that occur in this natural laboratory and the processes that couple them.
The journal covers the physical processes operating in the troposphere, stratosphere, mesosphere, thermosphere, ionosphere, magnetosphere, the Sun, interplanetary medium, and heliosphere. Phenomena occurring in other "spheres", solar influences on climate, and supporting laboratory measurements are also considered. The journal deals especially with the coupling between the different regions.
Solar flares, coronal mass ejections, and other energetic events on the Sun create interesting and important perturbations in the near-Earth space environment. The physics of such "space weather" is central to the Journal of Atmospheric and Solar-Terrestrial Physics and the journal welcomes papers that lead in the direction of a predictive understanding of the coupled system. Regarding the upper atmosphere, the subjects of aeronomy, geomagnetism and geoelectricity, auroral phenomena, radio wave propagation, and plasma instabilities, are examples within the broad field of solar-terrestrial physics which emphasise the energy exchange between the solar wind, the magnetospheric and ionospheric plasmas, and the neutral gas. In the lower atmosphere, topics covered range from mesoscale to global scale dynamics, to atmospheric electricity, lightning and its effects, and to anthropogenic changes.