P. Vourlioti, S. Kotsopoulos, Theano Mamouka, Apostolos Agrafiotis, Francisco Javier Nieto, Carlos Fernández Sánchez, Cecilia Grela Llerena, Sergio García González
{"title":"最大化数值天气预报模式的潜力:结合高性能计算和云计算的经验教训","authors":"P. Vourlioti, S. Kotsopoulos, Theano Mamouka, Apostolos Agrafiotis, Francisco Javier Nieto, Carlos Fernández Sánchez, Cecilia Grela Llerena, Sergio García González","doi":"10.5194/asr-20-1-2023","DOIUrl":null,"url":null,"abstract":"Abstract. To promote cloud and HPC computing, GRAPEVINE* project objectives include using these tools along with open data sources to provide a reusable IT service. In this service a predictive model based on Machine learning (ML) techniques is created with the aim of preventing and\ncontrolling grape vine diseases in the wine cultivation sector. Aside from\nthe predictive ML, meteorological forecasts are crucial input to train the\nML models and on a second step to be used as input for the operational\nprediction of grapevine diseases. To this end, the Weather and Research\nForecasting model (WRF) has been deployed in CESGA's HPC infrastructure to\nproduce medium-range and sub-seasonal forecasts for the targeted pilot areas (Greece and Spain). The data assimilation component of WRF – WRFDA – has been also introduced for improving the initial conditions of the WRF model by assimilating observations from weather stations and satellite\nprecipitation products (Integrated Multi-satellitE Retrieval for GPM – IMERG). This methodology for assimilation was developed during STARGATE* project, allowing the testing of the methodology in the operational service of GRAPEVINE. The operational production of the forecasts is achieved by the cloudify orchestrator on a Kubernetes cluster. The connections between the Kubernetes cluster and the HPC infrastructure, where the model resides, is achieved with the croupier plugin of cloudify. Blueprints that encapsule the workflows of the meteorological model and its dependencies were created. The instances of the blueprints (deployments) were created automatically to produce operationally weather forecasts and they were made available to the ML models via a THREDDS server. Valuable lessons were learned with regards the automation of the process and the coupling with the HPC in terms of reservations and operational production.\n","PeriodicalId":30081,"journal":{"name":"Advances in Science and Research","volume":"28 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Maximizing the potential of numerical weather prediction models: lessons learned from combining high-performance computing and cloud computing\",\"authors\":\"P. Vourlioti, S. Kotsopoulos, Theano Mamouka, Apostolos Agrafiotis, Francisco Javier Nieto, Carlos Fernández Sánchez, Cecilia Grela Llerena, Sergio García González\",\"doi\":\"10.5194/asr-20-1-2023\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Abstract. To promote cloud and HPC computing, GRAPEVINE* project objectives include using these tools along with open data sources to provide a reusable IT service. In this service a predictive model based on Machine learning (ML) techniques is created with the aim of preventing and\\ncontrolling grape vine diseases in the wine cultivation sector. Aside from\\nthe predictive ML, meteorological forecasts are crucial input to train the\\nML models and on a second step to be used as input for the operational\\nprediction of grapevine diseases. To this end, the Weather and Research\\nForecasting model (WRF) has been deployed in CESGA's HPC infrastructure to\\nproduce medium-range and sub-seasonal forecasts for the targeted pilot areas (Greece and Spain). The data assimilation component of WRF – WRFDA – has been also introduced for improving the initial conditions of the WRF model by assimilating observations from weather stations and satellite\\nprecipitation products (Integrated Multi-satellitE Retrieval for GPM – IMERG). This methodology for assimilation was developed during STARGATE* project, allowing the testing of the methodology in the operational service of GRAPEVINE. The operational production of the forecasts is achieved by the cloudify orchestrator on a Kubernetes cluster. The connections between the Kubernetes cluster and the HPC infrastructure, where the model resides, is achieved with the croupier plugin of cloudify. Blueprints that encapsule the workflows of the meteorological model and its dependencies were created. The instances of the blueprints (deployments) were created automatically to produce operationally weather forecasts and they were made available to the ML models via a THREDDS server. Valuable lessons were learned with regards the automation of the process and the coupling with the HPC in terms of reservations and operational production.\\n\",\"PeriodicalId\":30081,\"journal\":{\"name\":\"Advances in Science and Research\",\"volume\":\"28 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-03-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Advances in Science and Research\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.5194/asr-20-1-2023\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"Earth and Planetary Sciences\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advances in Science and Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5194/asr-20-1-2023","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Earth and Planetary Sciences","Score":null,"Total":0}
引用次数: 1
摘要
摘要为了促进云计算和高性能计算,GRAPEVINE*项目的目标包括使用这些工具和开放数据源来提供可重用的IT服务。在这项服务中,基于机器学习(ML)技术的预测模型被创建,目的是预防和控制葡萄酒种植部门的葡萄藤疾病。除了预测性机器学习,气象预报是训练机器学习模型的关键输入,第二步是用作葡萄藤病害操作预测的输入。为此,天气和研究预报模型(WRF)已部署在CESGA的高性能计算基础设施中,为目标试点地区(希腊和西班牙)提供中期和分季节预报。为了通过同化来自气象站和卫星降水产品的观测数据来改善WRF模式的初始条件,还引入了WRF的数据同化成分WRFDA (Integrated Multi-satellitE Retrieval for GPM - IMERG)。这种同化方法是在STARGATE*项目期间开发的,允许在GRAPEVINE的操作服务中测试该方法。预测的操作生产是由Kubernetes集群上的cloudify编排器实现的。Kubernetes集群和模型所在的HPC基础设施之间的连接是通过cloudify的croupier插件实现的。创建了封装气象模型及其依赖关系的工作流的蓝图。蓝图(部署)的实例被自动创建,以生成可操作的天气预报,并通过THREDDS服务器提供给ML模型。在过程自动化方面,以及在保留和运营生产方面与HPC的耦合方面,吸取了宝贵的经验教训。
Maximizing the potential of numerical weather prediction models: lessons learned from combining high-performance computing and cloud computing
Abstract. To promote cloud and HPC computing, GRAPEVINE* project objectives include using these tools along with open data sources to provide a reusable IT service. In this service a predictive model based on Machine learning (ML) techniques is created with the aim of preventing and
controlling grape vine diseases in the wine cultivation sector. Aside from
the predictive ML, meteorological forecasts are crucial input to train the
ML models and on a second step to be used as input for the operational
prediction of grapevine diseases. To this end, the Weather and Research
Forecasting model (WRF) has been deployed in CESGA's HPC infrastructure to
produce medium-range and sub-seasonal forecasts for the targeted pilot areas (Greece and Spain). The data assimilation component of WRF – WRFDA – has been also introduced for improving the initial conditions of the WRF model by assimilating observations from weather stations and satellite
precipitation products (Integrated Multi-satellitE Retrieval for GPM – IMERG). This methodology for assimilation was developed during STARGATE* project, allowing the testing of the methodology in the operational service of GRAPEVINE. The operational production of the forecasts is achieved by the cloudify orchestrator on a Kubernetes cluster. The connections between the Kubernetes cluster and the HPC infrastructure, where the model resides, is achieved with the croupier plugin of cloudify. Blueprints that encapsule the workflows of the meteorological model and its dependencies were created. The instances of the blueprints (deployments) were created automatically to produce operationally weather forecasts and they were made available to the ML models via a THREDDS server. Valuable lessons were learned with regards the automation of the process and the coupling with the HPC in terms of reservations and operational production.