Shengchen Zhu, Yiming Chen, Peiying Yu, Xiang Qu, Yuxiao Zhou, Yiming Ma, Zhizhan Zhao, Yukai Liu, Hao Mi, Bin Wang
{"title":"普云利用大核注意力卷积网络进行中程全球天气预报","authors":"Shengchen Zhu, Yiming Chen, Peiying Yu, Xiang Qu, Yuxiao Zhou, Yiming Ma, Zhizhan Zhao, Yukai Liu, Hao Mi, Bin Wang","doi":"arxiv-2409.02123","DOIUrl":null,"url":null,"abstract":"Accurate weather forecasting is essential for understanding and mitigating\nweather-related impacts. In this paper, we present PuYun, an autoregressive\ncascade model that leverages large kernel attention convolutional networks. The\nmodel's design inherently supports extended weather prediction horizons while\nbroadening the effective receptive field. The integration of large kernel\nattention mechanisms within the convolutional layers enhances the model's\ncapacity to capture fine-grained spatial details, thereby improving its\npredictive accuracy for meteorological phenomena. We introduce PuYun, comprising PuYun-Short for 0-5 day forecasts and\nPuYun-Medium for 5-10 day predictions. This approach enhances the accuracy of\n10-day weather forecasting. Through evaluation, we demonstrate that PuYun-Short\nalone surpasses the performance of both GraphCast and FuXi-Short in generating\naccurate 10-day forecasts. Specifically, on the 10th day, PuYun-Short reduces\nthe RMSE for Z500 to 720 $m^2/s^2$, compared to 732 $m^2/s^2$ for GraphCast and\n740 $m^2/s^2$ for FuXi-Short. Additionally, the RMSE for T2M is reduced to 2.60\nK, compared to 2.63 K for GraphCast and 2.65 K for FuXi-Short. Furthermore,\nwhen employing a cascaded approach by integrating PuYun-Short and PuYun-Medium,\nour method achieves superior results compared to the combined performance of\nFuXi-Short and FuXi-Medium. On the 10th day, the RMSE for Z500 is further\nreduced to 638 $m^2/s^2$, compared to 641 $m^2/s^2$ for FuXi. These findings\nunderscore the effectiveness of our model ensemble in advancing medium-range\nweather prediction. Our training code and model will be open-sourced.","PeriodicalId":501166,"journal":{"name":"arXiv - PHYS - Atmospheric and Oceanic Physics","volume":"18 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"PuYun: Medium-Range Global Weather Forecasting Using Large Kernel Attention Convolutional Networks\",\"authors\":\"Shengchen Zhu, Yiming Chen, Peiying Yu, Xiang Qu, Yuxiao Zhou, Yiming Ma, Zhizhan Zhao, Yukai Liu, Hao Mi, Bin Wang\",\"doi\":\"arxiv-2409.02123\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Accurate weather forecasting is essential for understanding and mitigating\\nweather-related impacts. In this paper, we present PuYun, an autoregressive\\ncascade model that leverages large kernel attention convolutional networks. The\\nmodel's design inherently supports extended weather prediction horizons while\\nbroadening the effective receptive field. The integration of large kernel\\nattention mechanisms within the convolutional layers enhances the model's\\ncapacity to capture fine-grained spatial details, thereby improving its\\npredictive accuracy for meteorological phenomena. We introduce PuYun, comprising PuYun-Short for 0-5 day forecasts and\\nPuYun-Medium for 5-10 day predictions. This approach enhances the accuracy of\\n10-day weather forecasting. Through evaluation, we demonstrate that PuYun-Short\\nalone surpasses the performance of both GraphCast and FuXi-Short in generating\\naccurate 10-day forecasts. Specifically, on the 10th day, PuYun-Short reduces\\nthe RMSE for Z500 to 720 $m^2/s^2$, compared to 732 $m^2/s^2$ for GraphCast and\\n740 $m^2/s^2$ for FuXi-Short. Additionally, the RMSE for T2M is reduced to 2.60\\nK, compared to 2.63 K for GraphCast and 2.65 K for FuXi-Short. Furthermore,\\nwhen employing a cascaded approach by integrating PuYun-Short and PuYun-Medium,\\nour method achieves superior results compared to the combined performance of\\nFuXi-Short and FuXi-Medium. On the 10th day, the RMSE for Z500 is further\\nreduced to 638 $m^2/s^2$, compared to 641 $m^2/s^2$ for FuXi. These findings\\nunderscore the effectiveness of our model ensemble in advancing medium-range\\nweather prediction. Our training code and model will be open-sourced.\",\"PeriodicalId\":501166,\"journal\":{\"name\":\"arXiv - PHYS - Atmospheric and Oceanic Physics\",\"volume\":\"18 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - PHYS - Atmospheric and Oceanic Physics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.02123\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - PHYS - Atmospheric and Oceanic Physics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.02123","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
PuYun: Medium-Range Global Weather Forecasting Using Large Kernel Attention Convolutional Networks
Accurate weather forecasting is essential for understanding and mitigating
weather-related impacts. In this paper, we present PuYun, an autoregressive
cascade model that leverages large kernel attention convolutional networks. The
model's design inherently supports extended weather prediction horizons while
broadening the effective receptive field. The integration of large kernel
attention mechanisms within the convolutional layers enhances the model's
capacity to capture fine-grained spatial details, thereby improving its
predictive accuracy for meteorological phenomena. We introduce PuYun, comprising PuYun-Short for 0-5 day forecasts and
PuYun-Medium for 5-10 day predictions. This approach enhances the accuracy of
10-day weather forecasting. Through evaluation, we demonstrate that PuYun-Short
alone surpasses the performance of both GraphCast and FuXi-Short in generating
accurate 10-day forecasts. Specifically, on the 10th day, PuYun-Short reduces
the RMSE for Z500 to 720 $m^2/s^2$, compared to 732 $m^2/s^2$ for GraphCast and
740 $m^2/s^2$ for FuXi-Short. Additionally, the RMSE for T2M is reduced to 2.60
K, compared to 2.63 K for GraphCast and 2.65 K for FuXi-Short. Furthermore,
when employing a cascaded approach by integrating PuYun-Short and PuYun-Medium,
our method achieves superior results compared to the combined performance of
FuXi-Short and FuXi-Medium. On the 10th day, the RMSE for Z500 is further
reduced to 638 $m^2/s^2$, compared to 641 $m^2/s^2$ for FuXi. These findings
underscore the effectiveness of our model ensemble in advancing medium-range
weather prediction. Our training code and model will be open-sourced.