Haoyu Qin, Yungang Chen, Qianchuan Jiang, Pengchao Sun, Xiancai Ye, Chao Lin
{"title":"MetMamba: Regional Weather Forecasting with Spatial-Temporal Mamba Model","authors":"Haoyu Qin, Yungang Chen, Qianchuan Jiang, Pengchao Sun, Xiancai Ye, Chao Lin","doi":"arxiv-2408.06400","DOIUrl":null,"url":null,"abstract":"Deep Learning based Weather Prediction (DLWP) models have been improving\nrapidly over the last few years, surpassing state of the art numerical weather\nforecasts by significant margins. While much of the optimization effort is\nfocused on training curriculum to extend forecast range in the global context,\ntwo aspects remains less explored: limited area modeling and better backbones\nfor weather forecasting. We show in this paper that MetMamba, a DLWP model\nbuilt on a state-of-the-art state-space model, Mamba, offers notable\nperformance gains and unique advantages over other popular backbones using\ntraditional attention mechanisms and neural operators. We also demonstrate the\nfeasibility of deep learning based limited area modeling via coupled training\nwith a global host model.","PeriodicalId":501166,"journal":{"name":"arXiv - PHYS - Atmospheric and Oceanic Physics","volume":"21 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - PHYS - Atmospheric and Oceanic Physics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2408.06400","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Deep Learning based Weather Prediction (DLWP) models have been improving
rapidly over the last few years, surpassing state of the art numerical weather
forecasts by significant margins. While much of the optimization effort is
focused on training curriculum to extend forecast range in the global context,
two aspects remains less explored: limited area modeling and better backbones
for weather forecasting. We show in this paper that MetMamba, a DLWP model
built on a state-of-the-art state-space model, Mamba, offers notable
performance gains and unique advantages over other popular backbones using
traditional attention mechanisms and neural operators. We also demonstrate the
feasibility of deep learning based limited area modeling via coupled training
with a global host model.