{"title":"Impact of NWP Model Configuration and Training Sample Characteristics on Random Forest-Based Day-1 Convective Outlook Guidance","authors":"Aaron Johnson, Xuguang Wang","doi":"10.1029/2024EA003822","DOIUrl":null,"url":null,"abstract":"<p>This study aims to quantify and better understand the impact of an upgrade to the configuration of an FV3 (Finite Volume cubed-sphere) LAM (Limited Area Model) convection-allowing ensemble on the skill of the RF models trained on cases before the upgrade and forecast on cases after the upgrade. Specifically, Random Forest (RF) models were used to produce probabilistic forecasts of severe weather, significant severe weather, and individual hazards of wind, hail, and tornado for the purpose of day-1 convective outlook guidance. The RF models are trained and forecast on different subsets of the available data set of forecast cases from the spring seasons of 2019 and 2021 (before the FV3 LAM upgrade) and 2022 (after the upgrade) and evaluated both quantitatively and qualitatively. It is found for most predictands that the RF models forecasting 2022 (2019/2021) cases are statistically significantly more skillful when trained on other cases from the 2022 (2019/2021) data set using a leave-one-out approach. However, within the 2019/2021 data set, training on cases from a different year than the year being forecast also leads to statistically significant degradations of skill, apparently at least in part due to the different sample climate between 2019 and 2021. For this particular NWP (Numerical Weather Prediction) model configuration change, the consistency in sample climate between training and forecast cases is at least as important as consistency in model configuration. Finally, increases in skill resulting from increasing the number of forecast cases used to train the RF levels off around 30 forecast cases.</p>","PeriodicalId":54286,"journal":{"name":"Earth and Space Science","volume":"12 2","pages":""},"PeriodicalIF":2.9000,"publicationDate":"2025-02-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1029/2024EA003822","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Earth and Space Science","FirstCategoryId":"89","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1029/2024EA003822","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ASTRONOMY & ASTROPHYSICS","Score":null,"Total":0}
引用次数: 0
Abstract
This study aims to quantify and better understand the impact of an upgrade to the configuration of an FV3 (Finite Volume cubed-sphere) LAM (Limited Area Model) convection-allowing ensemble on the skill of the RF models trained on cases before the upgrade and forecast on cases after the upgrade. Specifically, Random Forest (RF) models were used to produce probabilistic forecasts of severe weather, significant severe weather, and individual hazards of wind, hail, and tornado for the purpose of day-1 convective outlook guidance. The RF models are trained and forecast on different subsets of the available data set of forecast cases from the spring seasons of 2019 and 2021 (before the FV3 LAM upgrade) and 2022 (after the upgrade) and evaluated both quantitatively and qualitatively. It is found for most predictands that the RF models forecasting 2022 (2019/2021) cases are statistically significantly more skillful when trained on other cases from the 2022 (2019/2021) data set using a leave-one-out approach. However, within the 2019/2021 data set, training on cases from a different year than the year being forecast also leads to statistically significant degradations of skill, apparently at least in part due to the different sample climate between 2019 and 2021. For this particular NWP (Numerical Weather Prediction) model configuration change, the consistency in sample climate between training and forecast cases is at least as important as consistency in model configuration. Finally, increases in skill resulting from increasing the number of forecast cases used to train the RF levels off around 30 forecast cases.
期刊介绍:
Marking AGU’s second new open access journal in the last 12 months, Earth and Space Science is the only journal that reflects the expansive range of science represented by AGU’s 62,000 members, including all of the Earth, planetary, and space sciences, and related fields in environmental science, geoengineering, space engineering, and biogeochemistry.