Ankur Mahesh, William Collins, Boris Bonev, Noah Brenowitz, Yair Cohen, Joshua Elms, Peter Harrington, Karthik Kashinath, Thorsten Kurth, Joshua North, Travis OBrien, Michael Pritchard, David Pruitt, Mark Risser, Shashank Subramanian, Jared Willard
{"title":"Huge Ensembles Part I: Design of Ensemble Weather Forecasts using Spherical Fourier Neural Operators","authors":"Ankur Mahesh, William Collins, Boris Bonev, Noah Brenowitz, Yair Cohen, Joshua Elms, Peter Harrington, Karthik Kashinath, Thorsten Kurth, Joshua North, Travis OBrien, Michael Pritchard, David Pruitt, Mark Risser, Shashank Subramanian, Jared Willard","doi":"arxiv-2408.03100","DOIUrl":null,"url":null,"abstract":"Studying low-likelihood high-impact extreme weather events in a warming world\nis a significant and challenging task for current ensemble forecasting systems.\nWhile these systems presently use up to 100 members, larger ensembles could\nenrich the sampling of internal variability. They may capture the long tails\nassociated with climate hazards better than traditional ensemble sizes. Due to\ncomputational constraints, it is infeasible to generate huge ensembles\n(comprised of 1,000-10,000 members) with traditional, physics-based numerical\nmodels. In this two-part paper, we replace traditional numerical simulations\nwith machine learning (ML) to generate hindcasts of huge ensembles. In Part I,\nwe construct an ensemble weather forecasting system based on Spherical Fourier\nNeural Operators (SFNO), and we discuss important design decisions for\nconstructing such an ensemble. The ensemble represents model uncertainty\nthrough perturbed-parameter techniques, and it represents initial condition\nuncertainty through bred vectors, which sample the fastest growing modes of the\nforecast. Using the European Centre for Medium-Range Weather Forecasts\nIntegrated Forecasting System (IFS) as a baseline, we develop an evaluation\npipeline composed of mean, spectral, and extreme diagnostics. Using\nlarge-scale, distributed SFNOs with 1.1 billion learned parameters, we achieve\ncalibrated probabilistic forecasts. As the trajectories of the individual\nmembers diverge, the ML ensemble mean spectra degrade with lead time,\nconsistent with physical expectations. However, the individual ensemble\nmembers' spectra stay constant with lead time. Therefore, these members\nsimulate realistic weather states, and the ML ensemble thus passes a crucial\nspectral test in the literature. The IFS and ML ensembles have similar Extreme\nForecast Indices, and we show that the ML extreme weather forecasts are\nreliable and discriminating.","PeriodicalId":501166,"journal":{"name":"arXiv - PHYS - Atmospheric and Oceanic Physics","volume":"22 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - PHYS - Atmospheric and Oceanic Physics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.03100","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Studying low-likelihood high-impact extreme weather events in a warming world
is a significant and challenging task for current ensemble forecasting systems.
While these systems presently use up to 100 members, larger ensembles could
enrich the sampling of internal variability. They may capture the long tails
associated with climate hazards better than traditional ensemble sizes. Due to
computational constraints, it is infeasible to generate huge ensembles
(comprised of 1,000-10,000 members) with traditional, physics-based numerical
models. In this two-part paper, we replace traditional numerical simulations
with machine learning (ML) to generate hindcasts of huge ensembles. In Part I,
we construct an ensemble weather forecasting system based on Spherical Fourier
Neural Operators (SFNO), and we discuss important design decisions for
constructing such an ensemble. The ensemble represents model uncertainty
through perturbed-parameter techniques, and it represents initial condition
uncertainty through bred vectors, which sample the fastest growing modes of the
forecast. Using the European Centre for Medium-Range Weather Forecasts
Integrated Forecasting System (IFS) as a baseline, we develop an evaluation
pipeline composed of mean, spectral, and extreme diagnostics. Using
large-scale, distributed SFNOs with 1.1 billion learned parameters, we achieve
calibrated probabilistic forecasts. As the trajectories of the individual
members diverge, the ML ensemble mean spectra degrade with lead time,
consistent with physical expectations. However, the individual ensemble
members' spectra stay constant with lead time. Therefore, these members
simulate realistic weather states, and the ML ensemble thus passes a crucial
spectral test in the literature. The IFS and ML ensembles have similar Extreme
Forecast Indices, and we show that the ML extreme weather forecasts are
reliable and discriminating.