A robust error correction method for numerical weather prediction wind speed based on Bayesian optimization, variational mode decomposition, principal component analysis, and random forest: VMD-PCA-RF (version 1.0.0)
{"title":"A robust error correction method for numerical weather prediction wind speed based on Bayesian optimization, variational mode decomposition, principal component analysis, and random forest: VMD-PCA-RF (version 1.0.0)","authors":"Shaohui Zhou, Chloe Yuchao Gao, Zexia Duan, Xingya Xi, Yubin Li","doi":"10.5194/gmd-16-6247-2023","DOIUrl":null,"url":null,"abstract":"Abstract. Accurate wind speed prediction is crucial for the safe and efficient utilization of wind resources. However, current single-value deterministic numerical weather prediction methods employed by wind farms do not adequately meet the actual needs of power grid dispatching. In this study, we propose a new hybrid forecasting method for correcting 10 m wind speed predictions made by the Weather Research and Forecasting (WRF) model. Our approach incorporates variational mode decomposition (VMD), principal component analysis (PCA), and five artificial intelligence algorithms: deep belief network (DBN), multilayer perceptron (MLP), random forest (RF), eXtreme gradient boosting (XGBoost), light gradient boosting machine (lightGBM), and the Bayesian optimization algorithm (BOA). We first predict wind speeds using the WRF model, with initial and lateral boundary conditions from the Global Forecast System (GFS). We then perform two sets of experiments with different input factors and apply BOA optimization to tune the four artificial intelligence models, ultimately building the final models. Furthermore, we compare the aforementioned five optimal artificial intelligence models suitable for five provinces in southern China in the wintertime: VMD-PCA-RF in December 2021 and VMD-PCA-lightGBM in January 2022. We find that the VMD-PCA-RF evaluation indices exhibit relative stability over nearly a year: the correlation coefficient (R) is above 0.6, forecasting accuracy (FA) is above 85 %, mean absolute error (MAE) is below 0.6 m s−1, root mean square error (RMSE) is below 0.8 m s−1, relative mean absolute error (rMAE) is below 60 %, and relative root mean square error (rRMSE) is below 75 %. Thus, for its promising performance and excellent year-round robustness, we recommend adopting the proposed VMD-PCA-RF method for improved wind speed prediction in models.","PeriodicalId":12799,"journal":{"name":"Geoscientific Model Development","volume":"16 3","pages":"0"},"PeriodicalIF":4.0000,"publicationDate":"2023-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Geoscientific Model Development","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5194/gmd-16-6247-2023","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GEOSCIENCES, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Abstract. Accurate wind speed prediction is crucial for the safe and efficient utilization of wind resources. However, current single-value deterministic numerical weather prediction methods employed by wind farms do not adequately meet the actual needs of power grid dispatching. In this study, we propose a new hybrid forecasting method for correcting 10 m wind speed predictions made by the Weather Research and Forecasting (WRF) model. Our approach incorporates variational mode decomposition (VMD), principal component analysis (PCA), and five artificial intelligence algorithms: deep belief network (DBN), multilayer perceptron (MLP), random forest (RF), eXtreme gradient boosting (XGBoost), light gradient boosting machine (lightGBM), and the Bayesian optimization algorithm (BOA). We first predict wind speeds using the WRF model, with initial and lateral boundary conditions from the Global Forecast System (GFS). We then perform two sets of experiments with different input factors and apply BOA optimization to tune the four artificial intelligence models, ultimately building the final models. Furthermore, we compare the aforementioned five optimal artificial intelligence models suitable for five provinces in southern China in the wintertime: VMD-PCA-RF in December 2021 and VMD-PCA-lightGBM in January 2022. We find that the VMD-PCA-RF evaluation indices exhibit relative stability over nearly a year: the correlation coefficient (R) is above 0.6, forecasting accuracy (FA) is above 85 %, mean absolute error (MAE) is below 0.6 m s−1, root mean square error (RMSE) is below 0.8 m s−1, relative mean absolute error (rMAE) is below 60 %, and relative root mean square error (rRMSE) is below 75 %. Thus, for its promising performance and excellent year-round robustness, we recommend adopting the proposed VMD-PCA-RF method for improved wind speed prediction in models.
摘要准确的风速预测对风资源的安全高效利用至关重要。然而,目前风电场采用的单值确定性数值天气预报方法不能很好地满足电网调度的实际需要。在这项研究中,我们提出了一种新的混合预报方法,用于校正由天气研究与预报(WRF)模式预测的10 m风速。我们的方法结合了变分模态分解(VMD)、主成分分析(PCA)和五种人工智能算法:深度信念网络(DBN)、多层感知器(MLP)、随机森林(RF)、极限梯度增强(XGBoost)、轻梯度增强机(lightGBM)和贝叶斯优化算法(BOA)。我们首先利用全球预报系统(GFS)的初始和横向边界条件,利用WRF模式预测风速。然后,我们使用不同的输入因素进行两组实验,并应用BOA优化来调整四个人工智能模型,最终构建最终模型。此外,我们还比较了上述适用于中国南方五省冬季的五种最优人工智能模型:2021年12月的VMD-PCA-RF和2022年1月的VMD-PCA-lightGBM。研究发现,近一年来,VMD-PCA-RF评价指标表现出相对稳定性:相关系数(R)在0.6以上,预测精度(FA)在85%以上,平均绝对误差(MAE)在0.6 m s−1以下,均方根误差(RMSE)在0.8 m s−1以下,相对平均绝对误差(rMAE)在60%以下,相对均方根误差(rRMSE)在75%以下。因此,由于其良好的性能和出色的全年鲁棒性,我们建议采用所提出的VMD-PCA-RF方法来改进模型中的风速预测。
期刊介绍:
Geoscientific Model Development (GMD) is an international scientific journal dedicated to the publication and public discussion of the description, development, and evaluation of numerical models of the Earth system and its components. The following manuscript types can be considered for peer-reviewed publication:
* geoscientific model descriptions, from statistical models to box models to GCMs;
* development and technical papers, describing developments such as new parameterizations or technical aspects of running models such as the reproducibility of results;
* new methods for assessment of models, including work on developing new metrics for assessing model performance and novel ways of comparing model results with observational data;
* papers describing new standard experiments for assessing model performance or novel ways of comparing model results with observational data;
* model experiment descriptions, including experimental details and project protocols;
* full evaluations of previously published models.