Artificial intelligence for the earth systems最新文献_第9页

Strictly Enforcing Invertibility and Conservation in CNN-Based Super Resolution for Scientific Datasets 基于cnn的科学数据集超分辨率严格执行可逆性和守恒

Artificial intelligence for the earth systems

Pub Date : 2023-01-01 DOI: 10.1175/aies-d-21-0012.1

A. Geiss, Joseph C. Hardin

Recently, deep convolutional neural networks (CNNs) have revolutionized image “super resolution” (SR), dramatically outperforming past methods for enhancing image resolution. They could be a boon for the many scientific fields that involve imaging or any regularly gridded datasets: satellite remote sensing, radar meteorology, medical imaging, numerical modeling, and so on. Unfortunately, while SR-CNNs produce visually compelling results, they do not necessarily conserve physical quantities between their low-resolution inputs and high-resolution outputs when applied to scientific datasets. Here, a method for “downsampling enforcement” in SR-CNNs is proposed. A differentiable operator is derived that, when applied as the final transfer function of a CNN, ensures the high-resolution outputs exactly reproduce the low-resolution inputs under 2D-average downsampling while improving performance of the SR schemes. The method is demonstrated across seven modern CNN-based SR schemes on several benchmark image datasets, and applications to weather radar, satellite imager, and climate model data are shown. The approach improves training time and performance while ensuring physical consistency between the super-resolved and low-resolution data.Recent advancements in using deep learning to increase the resolution of images have substantial potential across the many scientific fields that use images and image-like data. Most image super-resolution research has focused on the visual quality of outputs, however, and is not necessarily well suited for use with scientific data where known physics constraints may need to be enforced. Here, we introduce a method to modify existing deep neural network architectures so that they strictly conserve physical quantities in the input field when “super resolving” scientific data and find that the method can improve performance across a wide range of datasets and neural networks. Integration of known physics and adherence to established physical constraints into deep neural networks will be a critical step before their potential can be fully realized in the physical sciences.

最近，深度卷积神经网络(cnn)彻底改变了图像“超分辨率”(SR)，显著优于过去增强图像分辨率的方法。对于许多涉及成像或任何常规网格数据集的科学领域来说，它们可能是一个福音:卫星遥感、雷达气象学、医学成像、数值模拟等等。不幸的是，虽然sr - cnn产生了视觉上引人注目的结果，但当应用于科学数据集时，它们不一定能在低分辨率输入和高分辨率输出之间保留物理量。本文提出了一种sr - cnn的“降采样强制”方法。推导出一个可微算子，当作为CNN的最终传递函数时，确保高分辨率输出准确地再现2d平均下采样下的低分辨率输入，同时提高SR方案的性能。该方法在几个基准图像数据集上演示了7种基于cnn的现代SR方案，并展示了在天气雷达、卫星成像仪和气候模式数据上的应用。该方法提高了训练时间和性能，同时保证了超分辨率和低分辨率数据之间的物理一致性。最近在使用深度学习来提高图像分辨率方面的进展在许多使用图像和类图像数据的科学领域具有巨大的潜力。然而，大多数图像超分辨率研究都集中在输出的视觉质量上，并且不一定很适合用于可能需要强制执行已知物理约束的科学数据。在这里，我们引入了一种方法来修改现有的深度神经网络架构，使它们在“超解析”科学数据时严格保留输入字段中的物理量，并发现该方法可以提高各种数据集和神经网络的性能。在深度神经网络的潜力在物理科学中得到充分实现之前，将已知的物理学和对既定物理约束的遵守整合到深度神经网络中将是关键的一步。

{"title":"Strictly Enforcing Invertibility and Conservation in CNN-Based Super Resolution for Scientific Datasets","authors":"A. Geiss, Joseph C. Hardin","doi":"10.1175/aies-d-21-0012.1","DOIUrl":"https://doi.org/10.1175/aies-d-21-0012.1","url":null,"abstract":"\u0000Recently, deep convolutional neural networks (CNNs) have revolutionized image “super resolution” (SR), dramatically outperforming past methods for enhancing image resolution. They could be a boon for the many scientific fields that involve imaging or any regularly gridded datasets: satellite remote sensing, radar meteorology, medical imaging, numerical modeling, and so on. Unfortunately, while SR-CNNs produce visually compelling results, they do not necessarily conserve physical quantities between their low-resolution inputs and high-resolution outputs when applied to scientific datasets. Here, a method for “downsampling enforcement” in SR-CNNs is proposed. A differentiable operator is derived that, when applied as the final transfer function of a CNN, ensures the high-resolution outputs exactly reproduce the low-resolution inputs under 2D-average downsampling while improving performance of the SR schemes. The method is demonstrated across seven modern CNN-based SR schemes on several benchmark image datasets, and applications to weather radar, satellite imager, and climate model data are shown. The approach improves training time and performance while ensuring physical consistency between the super-resolved and low-resolution data.\u0000\u0000\u0000Recent advancements in using deep learning to increase the resolution of images have substantial potential across the many scientific fields that use images and image-like data. Most image super-resolution research has focused on the visual quality of outputs, however, and is not necessarily well suited for use with scientific data where known physics constraints may need to be enforced. Here, we introduce a method to modify existing deep neural network architectures so that they strictly conserve physical quantities in the input field when “super resolving” scientific data and find that the method can improve performance across a wide range of datasets and neural networks. Integration of known physics and adherence to established physical constraints into deep neural networks will be a critical step before their potential can be fully realized in the physical sciences.\u0000","PeriodicalId":94369,"journal":{"name":"Artificial intelligence for the earth systems","volume":"140 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73273330","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Emulating the adaptation of wind fields to complex terrain with deep-learning 用深度学习模拟风场对复杂地形的适应

Artificial intelligence for the earth systems

Pub Date : 2022-12-27 DOI: 10.1175/aies-d-22-0034.1

L. Le Toumelin, I. Gouttevin, N. Helbig, C. Galiez, Mathis Roux, F. Karbou

Estimating the impact of wind-driven snow transport requires modeling wind fields with a lower grid spacing than the spacing on the order of one or a few kilometers used in the current numerical weather prediction (NWP) systems. In this context, we introduce a new strategy to downscale wind fields from NWP systems to decametric scales, using high resolution (30m) topographic information. Our method (named DEVINE) leverage on a convolutional neural network (CNN), trained to replicate the behaviour of the complex atmospheric model ARPS, previously run on a large number (7279) of synthetic Gaussian topographies under controlled weather conditions. A 10-fold cross validation reveals that our CNN is able to accurately emulate the behavior of ARPS (mean absolute error for wind speed = 0.16 m/s). We then apply DEVINE to real cases in the Alps, i.e. downscaling wind fields forecasted by AROME NWP system using information from real alpine topographies. DEVINE proved able to reproduce main features of wind fields in complex terrain (acceleration on ridges, leeward deceleration, deviations around obstacles). Furthermore, an evaluation on quality checked observations acquired at 61 sites in the French Alps reveals an improved behaviour of the downscaled winds (AROME wind speed mean bias is reduced by 27% with DEVINE), especially at the most elevated and exposed stations. Wind direction is however only slightly modified. Hence, despite some current limitations inherited from the ARPS simulations setup, DEVINE appears as an efficient downscaling tool whose minimalist architecture, low input data requirements (NWP wind fields and high-resolution topography) and competitive computing times may be attractive for operational applications.

估计风驱动雪运输的影响需要用比当前数值天气预报(NWP)系统中使用的一公里或几公里的间距更小的网格间距来模拟风场。在这种情况下，我们引入了一种新的策略，利用高分辨率(30米)的地形信息，将NWP系统的风场缩小到10米尺度。我们的方法(名为DEVINE)利用卷积神经网络(CNN)，训练来复制复杂大气模型ARPS的行为，之前在控制天气条件下在大量(7279)合成高斯地形上运行。10倍交叉验证表明，我们的CNN能够准确地模拟ARPS的行为(风速的平均绝对误差= 0.16 m/s)。然后，我们将DEVINE应用于阿尔卑斯山脉的实际情况，即AROME NWP系统利用真实阿尔卑斯地形信息预测的风场降尺度。事实证明，DEVINE能够重现复杂地形中风场的主要特征(山脊上的加速、背风减速、障碍物周围的偏差)。此外，对法国阿尔卑斯山61个站点获得的质量检查观测结果的评估显示，低尺度风的行为有所改善(AROME风速平均偏差与DEVINE一起减少了27%)，特别是在最高和暴露的站点。然而，风向只有轻微的改变。因此，尽管目前从ARPS模拟设置中继承了一些限制，DEVINE作为一种高效的缩小工具，其极简的架构，低输入数据要求(NWP风场和高分辨率地形)和具有竞争力的计算时间可能对运营应用具有吸引力。

{"title":"Emulating the adaptation of wind fields to complex terrain with deep-learning","authors":"L. Le Toumelin, I. Gouttevin, N. Helbig, C. Galiez, Mathis Roux, F. Karbou","doi":"10.1175/aies-d-22-0034.1","DOIUrl":"https://doi.org/10.1175/aies-d-22-0034.1","url":null,"abstract":"\u0000Estimating the impact of wind-driven snow transport requires modeling wind fields with a lower grid spacing than the spacing on the order of one or a few kilometers used in the current numerical weather prediction (NWP) systems. In this context, we introduce a new strategy to downscale wind fields from NWP systems to decametric scales, using high resolution (30m) topographic information. Our method (named DEVINE) leverage on a convolutional neural network (CNN), trained to replicate the behaviour of the complex atmospheric model ARPS, previously run on a large number (7279) of synthetic Gaussian topographies under controlled weather conditions. A 10-fold cross validation reveals that our CNN is able to accurately emulate the behavior of ARPS (mean absolute error for wind speed = 0.16 m/s). We then apply DEVINE to real cases in the Alps, i.e. downscaling wind fields forecasted by AROME NWP system using information from real alpine topographies. DEVINE proved able to reproduce main features of wind fields in complex terrain (acceleration on ridges, leeward deceleration, deviations around obstacles). Furthermore, an evaluation on quality checked observations acquired at 61 sites in the French Alps reveals an improved behaviour of the downscaled winds (AROME wind speed mean bias is reduced by 27% with DEVINE), especially at the most elevated and exposed stations. Wind direction is however only slightly modified. Hence, despite some current limitations inherited from the ARPS simulations setup, DEVINE appears as an efficient downscaling tool whose minimalist architecture, low input data requirements (NWP wind fields and high-resolution topography) and competitive computing times may be attractive for operational applications.","PeriodicalId":94369,"journal":{"name":"Artificial intelligence for the earth systems","volume":"2 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"74154222","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 5

A Real-Time Spatio-Temporal Machine Learning Framework for the Prediction of Nearshore Wave Conditions 近岸波浪条件预测的实时时空机器学习框架

Artificial intelligence for the earth systems

Pub Date : 2022-12-13 DOI: 10.1175/aies-d-22-0033.1

Jiaxin Chen, I. Ashton, E. Steele, A. Pillai

The safe and successful operation of offshore infrastructure relies on a detailed awareness of ocean wave conditions. Ongoing growth in offshore wind energy is focused on very large-scale projects, deployed in ever-more challenging environments. This inherently increases both cost and complexity, and therefore the requirement for efficient operational planning. To support this, we propose a new machine learning framework for the short-term forecasting of ocean wave conditions, to support critical decision-making associated with marine operations. Here, an attention-based Long Short-Term Memory (LSTM) neural network approach is used to learn the short-term temporal patterns from in-situ observations. This is then integrated with an existing, low-computational cost spatial nowcasting model to develop a complete framework for spatio-temporal forecasting. The framework addresses the challenge of filling gaps in the in-situ observations, and undertakes feature selection, with seasonal training datasets embedded. The full spatio-temporal forecasting system is demonstrated using a case study based on independent observation locations near the southwest coast of the United Kingdom. Results are validated against in-situ data from two wave buoy locations within the domain and compared to operational physics-based wave forecasts from the Met Office (the UK’s national weather service). For these two example locations, the spatio-temporal forecast is found to have the accuracy of R2 0.9083 and 0.7409 in forecasting 1 hour ahead significant wave height, and R2 0.8581 and 0.6978 in 12 hour ahead forecasts, respectively. Importantly, this represents respectable levels of accuracy, comparable to traditional physics-based forecast products, but requires only a fraction of the computational resources.

海上基础设施的安全和成功运行依赖于对海浪状况的详细了解。海上风能的持续增长主要集中在非常大规模的项目上，这些项目部署在越来越具有挑战性的环境中。这本质上增加了成本和复杂性，因此需要有效的操作计划。为了支持这一点，我们提出了一个新的机器学习框架，用于海浪条件的短期预测，以支持与海洋作业相关的关键决策。本文采用基于注意的长短期记忆(LSTM)神经网络方法从现场观测数据中学习短期时间模式。然后将其与现有的低计算成本空间临近预报模型集成，以开发一个完整的时空预测框架。该框架解决了在现场观测中填补空白的挑战，并通过嵌入季节性训练数据集进行特征选择。利用基于英国西南海岸附近独立观测点的案例研究，演示了完整的时空预报系统。结果与区域内两个波浪浮标位置的现场数据进行了验证，并与英国气象局(英国国家气象局)基于业务物理的波浪预报进行了比较。对于这两个样点，预测1 h前有效波高的时空预报精度R2分别为0.9083和0.7409，预测12 h前有效波高的时空预报精度R2分别为0.8581和0.6978。重要的是，这代表了相当的精度水平，与传统的基于物理的预测产品相当，但只需要一小部分计算资源。

{"title":"A Real-Time Spatio-Temporal Machine Learning Framework for the Prediction of Nearshore Wave Conditions","authors":"Jiaxin Chen, I. Ashton, E. Steele, A. Pillai","doi":"10.1175/aies-d-22-0033.1","DOIUrl":"https://doi.org/10.1175/aies-d-22-0033.1","url":null,"abstract":"\u0000The safe and successful operation of offshore infrastructure relies on a detailed awareness of ocean wave conditions. Ongoing growth in offshore wind energy is focused on very large-scale projects, deployed in ever-more challenging environments. This inherently increases both cost and complexity, and therefore the requirement for efficient operational planning. To support this, we propose a new machine learning framework for the short-term forecasting of ocean wave conditions, to support critical decision-making associated with marine operations. Here, an attention-based Long Short-Term Memory (LSTM) neural network approach is used to learn the short-term temporal patterns from in-situ observations. This is then integrated with an existing, low-computational cost spatial nowcasting model to develop a complete framework for spatio-temporal forecasting. The framework addresses the challenge of filling gaps in the in-situ observations, and undertakes feature selection, with seasonal training datasets embedded. The full spatio-temporal forecasting system is demonstrated using a case study based on independent observation locations near the southwest coast of the United Kingdom. Results are validated against in-situ data from two wave buoy locations within the domain and compared to operational physics-based wave forecasts from the Met Office (the UK’s national weather service). For these two example locations, the spatio-temporal forecast is found to have the accuracy of R2 0.9083 and 0.7409 in forecasting 1 hour ahead significant wave height, and R2 0.8581 and 0.6978 in 12 hour ahead forecasts, respectively. Importantly, this represents respectable levels of accuracy, comparable to traditional physics-based forecast products, but requires only a fraction of the computational resources.","PeriodicalId":94369,"journal":{"name":"Artificial intelligence for the earth systems","volume":"35 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86555421","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Can a Machine-Learning-Enabled Numerical Model Help Extend Effective Forecast Range through Consistently Trained Subgrid-Scale Models? 支持机器学习的数值模型能否通过持续训练的亚网格尺度模型帮助扩展有效的预测范围?

Artificial intelligence for the earth systems

Pub Date : 2022-11-30 DOI: 10.1175/aies-d-22-0050.1

Yongquan Qu, X. Shi

The development of machine learning (ML) techniques enables data-driven parameterizations, which have been investigated in many recent studies. Some investigations suggest that a priori trained ML models exhibit satisfying accuracy during training but poor performance when coupled to dynamical cores and tested. Here we use the evolution of the barotropic vorticity equation (BVE) with periodically reinforced shear instability as a prototype problem to develop and evaluate a model-consistent training strategy, which employs a numerical solver supporting automatic differentiation and includes the solver in the loss function for training ML-based subgrid-scale (SGS) turbulence models. This approach enables the interaction between the dynamical core and the ML-based parameterization during the model training phase. The BVE model was run at low, high, and ultra-high (truth) resolutions. Our training dataset contains only a short period of coarsened high-resolution simulations. However, given initial conditions long after the training dataset time, the trained SGS model can still significantly increase the effective lead time of the BVE model running at the low resolution by up to 50% compared to the BVE simulation without an SGS model. We also tested using a covariance matrix to normalize the loss function and found it can notably boost the performance of the ML parameterization. The SGS model’s performance is further improved by conducting transfer learning using a limited number of discontinuous observations, increasing the forecast lead time improvement to 73%. This study demonstrates a potential pathway to using machine learning to enhance the prediction skills of our climate and weather models.

机器学习(ML)技术的发展使数据驱动的参数化成为可能，这在最近的许多研究中得到了研究。一些研究表明，先验训练的机器学习模型在训练过程中表现出令人满意的准确性，但在与动态核心耦合和测试时表现不佳。本文以具有周期性增强剪切不稳定性的正压涡度方程(BVE)的演化为原型问题，开发并评估了一种模型一致性训练策略，该策略采用支持自动微分的数值求解器，并将求解器包含在损失函数中，用于训练基于ml的亚网格尺度(SGS)湍流模型。该方法在模型训练阶段实现了动态核心和基于ml的参数化之间的交互。BVE模型在低、高和超高(真值)分辨率下运行。我们的训练数据集只包含短时间的粗化高分辨率模拟。然而，在训练数据集时间很长之后的初始条件下，与不使用SGS模型的BVE模拟相比，训练后的SGS模型仍然可以显着提高BVE模型在低分辨率下运行的有效提前期，最高可提高50%。我们还测试了使用协方差矩阵来规范化损失函数，发现它可以显著提高机器学习参数化的性能。通过使用有限数量的不连续观测进行迁移学习，SGS模型的性能得到进一步改善，将预测提前期提高到73%。这项研究展示了使用机器学习来提高气候和天气模型预测技能的潜在途径。

{"title":"Can a Machine-Learning-Enabled Numerical Model Help Extend Effective Forecast Range through Consistently Trained Subgrid-Scale Models?","authors":"Yongquan Qu, X. Shi","doi":"10.1175/aies-d-22-0050.1","DOIUrl":"https://doi.org/10.1175/aies-d-22-0050.1","url":null,"abstract":"\u0000The development of machine learning (ML) techniques enables data-driven parameterizations, which have been investigated in many recent studies. Some investigations suggest that a priori trained ML models exhibit satisfying accuracy during training but poor performance when coupled to dynamical cores and tested. Here we use the evolution of the barotropic vorticity equation (BVE) with periodically reinforced shear instability as a prototype problem to develop and evaluate a model-consistent training strategy, which employs a numerical solver supporting automatic differentiation and includes the solver in the loss function for training ML-based subgrid-scale (SGS) turbulence models. This approach enables the interaction between the dynamical core and the ML-based parameterization during the model training phase. The BVE model was run at low, high, and ultra-high (truth) resolutions. Our training dataset contains only a short period of coarsened high-resolution simulations. However, given initial conditions long after the training dataset time, the trained SGS model can still significantly increase the effective lead time of the BVE model running at the low resolution by up to 50% compared to the BVE simulation without an SGS model. We also tested using a covariance matrix to normalize the loss function and found it can notably boost the performance of the ML parameterization. The SGS model’s performance is further improved by conducting transfer learning using a limited number of discontinuous observations, increasing the forecast lead time improvement to 73%. This study demonstrates a potential pathway to using machine learning to enhance the prediction skills of our climate and weather models.","PeriodicalId":94369,"journal":{"name":"Artificial intelligence for the earth systems","volume":"39 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79719859","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A Deep Learning-based Velocity Dealiasing Algorithm Derived from the WSR-88D Open Radar Product Generator 基于WSR-88D开放式雷达产品发生器的深度学习速度处理算法

Artificial intelligence for the earth systems

Pub Date : 2022-11-23 DOI: 10.1175/aies-d-22-0084.1

M. Veillette, J. Kurdzo, P. Stepanian, Joseph McDonald, S. Samsi, John Y. N. Cho

Radial velocity estimates provided by Doppler weather radar are critical measurements used by operational forecasters for the detection and monitoring of life-impacting storms. The sampling methods used to produce these measurements are inherently susceptible to aliasing, which produces ambiguous velocity values in regions with high winds, and needs to be corrected using a velocity dealiasing algorithm (VDA). In the US, the Weather Surveillance Radar – 1988 Doppler (WSR-88D) Open Radar Product Generator (ORPG) is a processing environment that provides a world-class VDA; however, this algorithm is complex and can be difficult to port to other radar systems outside of the WSR-88D network. In this work, a Deep Neural Network (DNN) is used to emulate the 2-dimensionalWSR-88D ORPG dealiasing algorithm. It is shown that a DNN, specifically a customized U-Net, is highly effective for building VDAs that are accurate, fast, and portable to multiple radar types. To train the DNN model, a large dataset is generated containing aligned samples of folded and dealiased velocity pairs. This dataset contains samples collected from WSR-88D Level-II and Level-III archives, and uses the ORPG dealiasing algorithm output as a source of truth. Using this dataset, a U-Net is trained to produce the number of folds at each point of a velocity image. Several performance metrics are presented using WSR-88D data. The algorithm is also applied to other non-WSR-88D radar systems to demonstrate portability to other hardware/software interfaces. A discussion of the broad applicability of this method is presented, including how other Level-III algorithms may benefit from this approach.

多普勒天气雷达提供的径向速度估计是业务预报员用于探测和监测影响生命的风暴的关键测量。用于产生这些测量的采样方法本质上容易受到混叠的影响，在大风地区会产生模糊的速度值，需要使用速度去混叠算法(VDA)进行校正。在美国，天气监视雷达- 1988多普勒(WSR-88D)开放式雷达产品发生器(ORPG)是一个提供世界级VDA的处理环境;然而，这种算法很复杂，很难移植到WSR-88D网络之外的其他雷达系统上。在这项工作中，使用深度神经网络(DNN)来模拟二维wsr - 88d ORPG去噪算法。研究表明，DNN，特别是定制的U-Net，对于构建精确、快速和可移植到多种雷达类型的vda非常有效。为了训练深度神经网络模型，生成了一个大型数据集，其中包含折叠和去锯齿速度对的对齐样本。该数据集包含从WSR-88D Level-II和Level-III档案中收集的样本，并使用ORPG去噪算法输出作为事实来源。使用这个数据集，U-Net被训练在速度图像的每个点上产生折叠的数量。使用WSR-88D数据给出了几个性能指标。该算法也应用于其他非wsr - 88d雷达系统，以演示可移植性到其他硬件/软件接口。讨论了该方法的广泛适用性，包括其他iii级算法如何从该方法中受益。

{"title":"A Deep Learning-based Velocity Dealiasing Algorithm Derived from the WSR-88D Open Radar Product Generator","authors":"M. Veillette, J. Kurdzo, P. Stepanian, Joseph McDonald, S. Samsi, John Y. N. Cho","doi":"10.1175/aies-d-22-0084.1","DOIUrl":"https://doi.org/10.1175/aies-d-22-0084.1","url":null,"abstract":"\u0000Radial velocity estimates provided by Doppler weather radar are critical measurements used by operational forecasters for the detection and monitoring of life-impacting storms. The sampling methods used to produce these measurements are inherently susceptible to aliasing, which produces ambiguous velocity values in regions with high winds, and needs to be corrected using a velocity dealiasing algorithm (VDA). In the US, the Weather Surveillance Radar – 1988 Doppler (WSR-88D) Open Radar Product Generator (ORPG) is a processing environment that provides a world-class VDA; however, this algorithm is complex and can be difficult to port to other radar systems outside of the WSR-88D network. In this work, a Deep Neural Network (DNN) is used to emulate the 2-dimensionalWSR-88D ORPG dealiasing algorithm. It is shown that a DNN, specifically a customized U-Net, is highly effective for building VDAs that are accurate, fast, and portable to multiple radar types. To train the DNN model, a large dataset is generated containing aligned samples of folded and dealiased velocity pairs. This dataset contains samples collected from WSR-88D Level-II and Level-III archives, and uses the ORPG dealiasing algorithm output as a source of truth. Using this dataset, a U-Net is trained to produce the number of folds at each point of a velocity image. Several performance metrics are presented using WSR-88D data. The algorithm is also applied to other non-WSR-88D radar systems to demonstrate portability to other hardware/software interfaces. A discussion of the broad applicability of this method is presented, including how other Level-III algorithms may benefit from this approach.","PeriodicalId":94369,"journal":{"name":"Artificial intelligence for the earth systems","volume":"249 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-11-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"80018186","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Skillful US Soy-yield Forecasts at Pre-sowing Lead-times 熟练的美国大豆播种前产量预测

Artificial intelligence for the earth systems

Pub Date : 2022-11-23 DOI: 10.1175/aies-d-21-0009.1

S. Vijverberg, Raed Hamed, D. Coumou

Soy harvest failure events can severely impact farmers, insurance companies and raise global prices. Reliable seasonal forecasts of mis-harvests would allow stakeholders to prepare and take appropriate early action. However, especially for farmers, the reliability and lead-time of current prediction systems provide insufficient information to justify within-season adaptation measures. Recent innovations increased our ability to generate reliable statistical seasonal forecasts. Here, we combine these innovations to predict the 1-3 poor soy harvest years in eastern US. We first use a clustering algorithm to spatially aggregate crop producing regions within the eastern US that are particularly sensitive to hot-dry weather conditions. Next, we use observational climate variables (sea surface temperature (SST) and soil moisture) to extract precursor timeseries at multiple lags. This allows the machine learning model to learn the low-frequency evolution, which carries important information for predictability. A selection based on causal inference allows for physically interpretable precursors. We show that the robust selected predictors are associated with the evolution of the horseshoe Pacific SST pattern, in line with previous research. We use the state of the horseshoe Pacific to identify years with enhanced predictability. We achieve very high forecast skill of poor harvests events, even 3 months prior to sowing, using a strict one-step-ahead train-test splitting. Over the last 25 years, 82% of the in February predicted poor harvests were correct. When operational, this forecast would enable farmers (and insurance/trading companies) to make informed decisions on adaption measures, e.g., selecting more drought-resistant cultivars, invest in insurance, change planting management.

大豆歉收事件会严重影响农民、保险公司，并提高全球价格。对收成不佳的可靠季节性预测将使利益相关者能够做好准备并采取适当的早期行动。然而，特别是对农民来说，当前预测系统的可靠性和提前期提供的信息不足以证明采取季内适应措施的合理性。最近的创新提高了我们产生可靠的季节性统计预报的能力。在这里，我们结合这些创新来预测美国东部1-3年的大豆歉收。我们首先使用聚类算法对美国东部对干热天气条件特别敏感的作物产区进行空间聚集。其次，我们利用观测气候变量(海表温度和土壤湿度)提取多滞后的前兆时间序列。这使得机器学习模型能够学习低频进化，这为可预测性提供了重要的信息。基于因果推理的选择允许物理上可解释的前体。我们发现，稳健的预测因子与马蹄形太平洋海温模式的演变有关，与先前的研究一致。我们使用马蹄形太平洋的状态来确定具有增强可预测性的年份。我们对收成不好的事件的预测能力非常高，甚至在播种前3个月，使用严格的一步前训练测试分割。在过去的25年里，二月份预测歉收的人有82%是正确的。一旦投入使用，这一预测将使农民(以及保险/贸易公司)能够就适应措施做出明智的决定，例如选择更抗旱的品种、投资保险、改变种植管理。

{"title":"Skillful US Soy-yield Forecasts at Pre-sowing Lead-times","authors":"S. Vijverberg, Raed Hamed, D. Coumou","doi":"10.1175/aies-d-21-0009.1","DOIUrl":"https://doi.org/10.1175/aies-d-21-0009.1","url":null,"abstract":"\u0000Soy harvest failure events can severely impact farmers, insurance companies and raise global prices. Reliable seasonal forecasts of mis-harvests would allow stakeholders to prepare and take appropriate early action. However, especially for farmers, the reliability and lead-time of current prediction systems provide insufficient information to justify within-season adaptation measures. Recent innovations increased our ability to generate reliable statistical seasonal forecasts. Here, we combine these innovations to predict the 1-3 poor soy harvest years in eastern US. We first use a clustering algorithm to spatially aggregate crop producing regions within the eastern US that are particularly sensitive to hot-dry weather conditions. Next, we use observational climate variables (sea surface temperature (SST) and soil moisture) to extract precursor timeseries at multiple lags. This allows the machine learning model to learn the low-frequency evolution, which carries important information for predictability. A selection based on causal inference allows for physically interpretable precursors. We show that the robust selected predictors are associated with the evolution of the horseshoe Pacific SST pattern, in line with previous research. We use the state of the horseshoe Pacific to identify years with enhanced predictability. We achieve very high forecast skill of poor harvests events, even 3 months prior to sowing, using a strict one-step-ahead train-test splitting. Over the last 25 years, 82% of the in February predicted poor harvests were correct. When operational, this forecast would enable farmers (and insurance/trading companies) to make informed decisions on adaption measures, e.g., selecting more drought-resistant cultivars, invest in insurance, change planting management.","PeriodicalId":94369,"journal":{"name":"Artificial intelligence for the earth systems","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-11-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87408958","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A Hybrid Physics-AI Model to Improve Hydrological Forecasts 改进水文预报的混合物理-人工智能模型

Artificial intelligence for the earth systems

Pub Date : 2022-11-01 DOI: 10.1175/aies-d-22-0023.1

Yanan Duan, S. Akula, Sanjiv Kumar, Wonjun Lee, Sepideh Khajehei

The National Oceanic and Atmospheric Administration have developed a very high-resolution streamflow forecast using National Water Model (NWM) for 2.7 million stream locations in the United States. However, considerable challenges exist for quantifying uncertainty at ungauged locations and forecast reliability. A data science approach is presented to address the challenge. The long-range daily streamflow forecasts are analyzed from Dec. 2018 to Aug. 2021 for Alabama and Georgia. The forecast is evaluated at 389 observed USGS stream gauging locations using standard deterministic metrics. Next, the forecast errors are grouped using watersheds’ biophysical characteristics, including drainage area, land use, soil type, and topographic index. The NWM forecasts are more skillful for larger and forested watersheds than smaller and urban watersheds. The NWM forecast considerably overestimates the streamflow in the urban watersheds. The classification and regression tree analysis confirm the dependency of the forecast errors on the biophysical characteristics.A densely connected neural network model consisting of 6 layers (Deep Learning, DL) is developed using biophysical characteristics, NWM forecast as inputs, and the forecast errors as outputs. The DL model successfully learns location invariant transferrable knowledge from the domain trained in the gauged locations and applies the learned model to estimate forecast errors at the ungauged locations. A temporal and spatial split of the gauged data shows that the probability of capturing the observations in the forecast range improved significantly in the hybrid NWM-DL model (82±3 %) than in the NWM-only forecast (21±1 %). A tradeoff between overly constrained NWM forecast and increased forecast uncertainty range in the DL model is noted.

美国国家海洋和大气管理局利用国家水模型(NWM)开发了一种高分辨率的流量预测系统，预测了美国270万条河流的位置。然而，在未测量位置量化不确定性和预测可靠性方面存在相当大的挑战。提出了一种数据科学方法来应对这一挑战。对阿拉巴马州和佐治亚州2018年12月至2021年8月的长期每日流量预测进行了分析。预测是在389个观测到的USGS流量测量地点使用标准确定性度量进行评估的。其次，利用流域的生物物理特征(包括流域面积、土地利用、土壤类型和地形指数)对预测误差进行分组。NWM对大流域和森林流域的预报比小流域和城市流域的预报更准确。NWM的预报大大高估了城市流域的流量。分类和回归树分析证实了预测误差与生物物理特性的相关性。利用生物物理特征、NWM预测作为输入，预测误差作为输出，建立了一个由6层组成的密集连接神经网络模型(Deep Learning, DL)。深度学习模型成功地从测量位置的域训练中学习到位置不变的可转移知识，并将学习到的模型应用于估计未测量位置的预测误差。测量数据的时空分裂表明，NWM-DL混合模式捕获预测范围内观测值的概率(82±3%)比NWM-DL混合模式(21±1%)显著提高。注意到DL模型中过度约束的NWM预测和增加的预测不确定性范围之间的权衡。

{"title":"A Hybrid Physics-AI Model to Improve Hydrological Forecasts","authors":"Yanan Duan, S. Akula, Sanjiv Kumar, Wonjun Lee, Sepideh Khajehei","doi":"10.1175/aies-d-22-0023.1","DOIUrl":"https://doi.org/10.1175/aies-d-22-0023.1","url":null,"abstract":"\u0000The National Oceanic and Atmospheric Administration have developed a very high-resolution streamflow forecast using National Water Model (NWM) for 2.7 million stream locations in the United States. However, considerable challenges exist for quantifying uncertainty at ungauged locations and forecast reliability. A data science approach is presented to address the challenge. The long-range daily streamflow forecasts are analyzed from Dec. 2018 to Aug. 2021 for Alabama and Georgia. The forecast is evaluated at 389 observed USGS stream gauging locations using standard deterministic metrics. Next, the forecast errors are grouped using watersheds’ biophysical characteristics, including drainage area, land use, soil type, and topographic index. The NWM forecasts are more skillful for larger and forested watersheds than smaller and urban watersheds. The NWM forecast considerably overestimates the streamflow in the urban watersheds. The classification and regression tree analysis confirm the dependency of the forecast errors on the biophysical characteristics.\u0000A densely connected neural network model consisting of 6 layers (Deep Learning, DL) is developed using biophysical characteristics, NWM forecast as inputs, and the forecast errors as outputs. The DL model successfully learns location invariant transferrable knowledge from the domain trained in the gauged locations and applies the learned model to estimate forecast errors at the ungauged locations. A temporal and spatial split of the gauged data shows that the probability of capturing the observations in the forecast range improved significantly in the hybrid NWM-DL model (82±3 %) than in the NWM-only forecast (21±1 %). A tradeoff between overly constrained NWM forecast and increased forecast uncertainty range in the DL model is noted.","PeriodicalId":94369,"journal":{"name":"Artificial intelligence for the earth systems","volume":"10 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88778582","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

On Variability due to Local Minima and K-fold Cross-validation 局部最小值与K-fold交叉验证的可变性

Artificial intelligence for the earth systems

Pub Date : 2022-10-06 DOI: 10.1175/aies-d-21-0004.1

C. Marzban, Jueyi Liu, P. Tissot

Resampling methods such as cross-validation or bootstrap are often employed to estimate the uncertainty in a loss function due to sampling variability, usually for the purpose of model selection. But in models that require nonlinear optimization, the existence of local minima in the loss function landscape introduces an additional source of variability which is confounded with sampling variability. In other words, some portion of the variability in the loss function across different resamples is due to local minima. Given that statistically-sound model selection is based on an examination of variance, it is important to disentangle these two sources of variability. To that end, a methodology is developed for estimating each, specifically in the context of K-fold cross-validation, and Neural Networks (NN) whose training leads to different local minima. Random effects models are used to estimate the two variance components - due to sampling and due to local minima. The results are examined as a function of the number of hidden nodes, and the variance of the initial weights, with the latter controlling the “depth” of local minima. The main goal of the methodology is to increase statistical power in model selection and/or model comparison. Using both simulated and realistic data it is shown that the two sources of variability can be comparable, casting doubt on model selection methods that ignore the variability due to local minima. Furthermore, the methodology is sufficiently flexible so as to allow assessment of the effect of other/any NN parameters on variability.

交叉验证或自举等重采样方法通常用于估计由于采样可变性而导致的损失函数中的不确定性，通常用于模型选择。但是在需要非线性优化的模型中，损失函数中局部极小值的存在引入了一个额外的可变性源，它与采样可变性相混淆。换句话说，损失函数中跨不同样本的可变性的一部分是由于局部最小值。考虑到统计上合理的模型选择是基于对方差的检查，区分这两个变异性来源是很重要的。为此，开发了一种方法来估计每个，特别是在K-fold交叉验证和神经网络(NN)的背景下，其训练导致不同的局部最小值。随机效应模型用于估计两个方差成分-由于抽样和由于局部最小值。将结果作为隐藏节点数和初始权重方差的函数进行检验，后者控制局部最小值的“深度”。该方法的主要目标是提高模型选择和/或模型比较的统计能力。模拟数据和实际数据表明，两种变异性来源可以比较，这对忽略局部极小值引起的变异性的模型选择方法提出了质疑。此外，该方法具有足够的灵活性，可以评估其他/任何神经网络参数对变异性的影响。

{"title":"On Variability due to Local Minima and K-fold Cross-validation","authors":"C. Marzban, Jueyi Liu, P. Tissot","doi":"10.1175/aies-d-21-0004.1","DOIUrl":"https://doi.org/10.1175/aies-d-21-0004.1","url":null,"abstract":"\u0000Resampling methods such as cross-validation or bootstrap are often employed to estimate the uncertainty in a loss function due to sampling variability, usually for the purpose of model selection. But in models that require nonlinear optimization, the existence of local minima in the loss function landscape introduces an additional source of variability which is confounded with sampling variability. In other words, some portion of the variability in the loss function across different resamples is due to local minima. Given that statistically-sound model selection is based on an examination of variance, it is important to disentangle these two sources of variability. To that end, a methodology is developed for estimating each, specifically in the context of K-fold cross-validation, and Neural Networks (NN) whose training leads to different local minima. Random effects models are used to estimate the two variance components - due to sampling and due to local minima. The results are examined as a function of the number of hidden nodes, and the variance of the initial weights, with the latter controlling the “depth” of local minima. The main goal of the methodology is to increase statistical power in model selection and/or model comparison. Using both simulated and realistic data it is shown that the two sources of variability can be comparable, casting doubt on model selection methods that ignore the variability due to local minima. Furthermore, the methodology is sufficiently flexible so as to allow assessment of the effect of other/any NN parameters on variability.","PeriodicalId":94369,"journal":{"name":"Artificial intelligence for the earth systems","volume":"83 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79008232","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Downscaling of Historical Wind Fields over Switzerland using Generative Adversarial Networks 使用生成对抗网络缩小瑞士历史风场的规模

Artificial intelligence for the earth systems

Pub Date : 2022-10-06 DOI: 10.1175/aies-d-22-0018.1

Ophélia Miralles, Daniel Steinfield, O. Martius, A. Davison

Near-surface wind is difficult to estimate using global numerical weather and climate models, as airflow is strongly modified by underlying topography, especially that of a country such as Switzerland. In this article, we use a statistical approach based on deep learning and a high-resolution Digital Elevation Model to spatially downscale hourly near-surface wind fields at coarse resolution from ERA5 reanalysis from their original 25 km to a 1.1 km grid. A 1.1 km resolution wind dataset for 2016–2020 from the operational numerical weather prediction model COSMO-1 of the national weather service, MeteoSwiss, is used to train and validate our model, a generative adversarial network (GAN) with gradient penalized Wasserstein loss aided by transfer learning. The results are realistic-looking high-resolution historical maps of gridded hourly wind fields over Switzerland and very good and robust predictions of the aggregated wind speed distribution. Regionally averaged image-specific metrics show a clear improvement in prediction compared to ERA5, with skill measures generally better for locations over the flatter Swiss Plateau than for Alpine regions. The downscaled wind fields demonstrate higher-resolution, physically plausible orographic effects, such as ridge acceleration and sheltering, which are not resolved in the original ERA5 fields.

近地面风很难用全球数值天气和气候模式来估计，因为气流受到下层地形的强烈影响，尤其是像瑞士这样的国家。在本文中，我们使用基于深度学习和高分辨率数字高程模型的统计方法，将ERA5再分析的粗分辨率每小时近地面风场的空间尺度从原始的25公里降至1.1公里网格。使用来自国家气象局MeteoSwiss的运行数值天气预报模型cosmos -1的2016-2020年1.1公里分辨率的风数据集来训练和验证我们的模型，这是一个在迁移学习辅助下具有梯度惩罚Wasserstein损失的生成对抗网络(GAN)。结果是真实的高分辨率历史地图，网格每小时风场在瑞士和非常好的和可靠的预测汇总风速分布。与ERA5相比，区域平均图像特定指标在预测方面有明显改善，瑞士高原较平坦地区的技能测量通常优于阿尔卑斯地区。缩小尺度的风场表现出更高分辨率的、物理上合理的地形效应，如脊加速和遮蔽，这些在原始的ERA5场中没有解决。

{"title":"Downscaling of Historical Wind Fields over Switzerland using Generative Adversarial Networks","authors":"Ophélia Miralles, Daniel Steinfield, O. Martius, A. Davison","doi":"10.1175/aies-d-22-0018.1","DOIUrl":"https://doi.org/10.1175/aies-d-22-0018.1","url":null,"abstract":"\u0000Near-surface wind is difficult to estimate using global numerical weather and climate models, as airflow is strongly modified by underlying topography, especially that of a country such as Switzerland. In this article, we use a statistical approach based on deep learning and a high-resolution Digital Elevation Model to spatially downscale hourly near-surface wind fields at coarse resolution from ERA5 reanalysis from their original 25 km to a 1.1 km grid. A 1.1 km resolution wind dataset for 2016–2020 from the operational numerical weather prediction model COSMO-1 of the national weather service, MeteoSwiss, is used to train and validate our model, a generative adversarial network (GAN) with gradient penalized Wasserstein loss aided by transfer learning. The results are realistic-looking high-resolution historical maps of gridded hourly wind fields over Switzerland and very good and robust predictions of the aggregated wind speed distribution. Regionally averaged image-specific metrics show a clear improvement in prediction compared to ERA5, with skill measures generally better for locations over the flatter Swiss Plateau than for Alpine regions. The downscaled wind fields demonstrate higher-resolution, physically plausible orographic effects, such as ridge acceleration and sheltering, which are not resolved in the original ERA5 fields.","PeriodicalId":94369,"journal":{"name":"Artificial intelligence for the earth systems","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82954801","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 4

Understanding Predictability of Daily Southeast US Precipitation using Explainable Machine Learning 利用可解释的机器学习了解美国东南部每日降水的可预测性

Artificial intelligence for the earth systems

Pub Date : 2022-10-06 DOI: 10.1175/aies-d-22-0011.1

K. Pegion, E. Becker, B. Kirtman

We investigate the predictability of the sign of daily South-East US (SEUS) precipitation anomalies associated with simultaneous predictors of large-scale climate variability using machine learning models. Models using index-based climate predictors and gridded fields of large-scale circulation as predictors are utilized. Logistic regression (LR) and fully connected neural networks using indices of climate phenomena as predictors produce neither accurate nor reliable predictions, indicating that the indices themselves are not good predictors. Using gridded fields as predictors, a LR and convolutional neural network (CNN) are more accurate than the index based models. However, only the CNN can produce reliable predictions which can be used to identify forecasts of opportunity. Using explainable machine learning we identify which variables and gridpoints of the input fields are most relevant for confident and correct predictions in the CNN. Our results show that the local circulation is most important as represented by maximum relevance of 850 hPa geopotential heights and zonal winds to making skillful, high probability predictions. Corresponding composite anomalies identify connections with the El-Niño Southern Oscillation during winter and the Atlantic Multidecadal Oscillation and North Atlantic Subtropical High during summer.

我们利用机器学习模型研究了与大尺度气候变率同时预测因子相关的美国东南部(SEUS)每日降水异常符号的可预测性。利用基于指数的气候预测因子和大尺度环流网格场作为预测因子的模式。使用气候现象指数作为预测因子的逻辑回归(LR)和全连接神经网络的预测结果既不准确也不可靠，这表明这些指数本身并不是很好的预测因子。使用网格域作为预测因子，LR和卷积神经网络(CNN)比基于指数的模型更准确。然而，只有CNN可以产生可靠的预测，可以用来识别机会的预测。使用可解释的机器学习，我们确定哪些变量和输入字段的网格点与CNN中自信和正确的预测最相关。结果表明，850 hPa位势高度和纬向风的最大相关度代表的局地环流对于做出熟练的高概率预报最为重要。冬季与El-Niño南方涛动有关，夏季与大西洋多年代际涛动和北大西洋副热带高压有关。

{"title":"Understanding Predictability of Daily Southeast US Precipitation using Explainable Machine Learning","authors":"K. Pegion, E. Becker, B. Kirtman","doi":"10.1175/aies-d-22-0011.1","DOIUrl":"https://doi.org/10.1175/aies-d-22-0011.1","url":null,"abstract":"\u0000We investigate the predictability of the sign of daily South-East US (SEUS) precipitation anomalies associated with simultaneous predictors of large-scale climate variability using machine learning models. Models using index-based climate predictors and gridded fields of large-scale circulation as predictors are utilized. Logistic regression (LR) and fully connected neural networks using indices of climate phenomena as predictors produce neither accurate nor reliable predictions, indicating that the indices themselves are not good predictors. Using gridded fields as predictors, a LR and convolutional neural network (CNN) are more accurate than the index based models. However, only the CNN can produce reliable predictions which can be used to identify forecasts of opportunity. Using explainable machine learning we identify which variables and gridpoints of the input fields are most relevant for confident and correct predictions in the CNN. Our results show that the local circulation is most important as represented by maximum relevance of 850 hPa geopotential heights and zonal winds to making skillful, high probability predictions. Corresponding composite anomalies identify connections with the El-Niño Southern Oscillation during winter and the Atlantic Multidecadal Oscillation and North Atlantic Subtropical High during summer.","PeriodicalId":94369,"journal":{"name":"Artificial intelligence for the earth systems","volume":"5 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90994690","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 4