Artificial intelligence for the earth systems最新文献_第10页

Improvements to the Land Surface Air Temperature Reconstruction in NOAAGlobalTemp: An Artificial Neural Network Approach 基于NOAAGlobalTemp的地表气温重建改进:人工神经网络方法

Artificial intelligence for the earth systems

Pub Date : 2022-09-23 DOI: 10.1175/aies-d-22-0032.1

Boyin Huang, Xungang Yin, M. Menne, R. Vose, Huai-min Zhang

NOAAGlobalTemp is NOAA’s operational global surface temperature product, which has been widely used in the Earth’s climate assessment and monitoring. To improve the spatial interpolation of monthly land surface air temperatures (LSATs) in NOAAGlobalTemp from 1850 to 2020, a three-layer artificial neural network (ANN) system was designed. The ANN system was trained by repeatedly randomly selecting 90% of the LSATs from ERA5 (1950–2019) and validating with the remaining 10%. Validations show clear improvements of ANN over the original empirical orthogonal teleconnection (EOT) method: The global spatial correlation coefficient (SCC) increases from 65% to 80%, and the global root-mean-square-difference (RMSD) decreases from 0.99°C to 0.57°C during 1850–2020. The improvements of SCCs and RMSDs are larger in the Southern Hemisphere (SH) than in the Northern Hemisphere (NH), and are larger before the 1950s and where observations are sparse.The ANN system was finally fed in observed LSATs, and its output over the global land surface was compared with those from the EOT method. Comparisons demonstrate similar improvements by ANN over the EOT method: The global SCC increased from 78% to 89%, the global RMSD decreased from 0.93°C to 0.68°C, and the LSAT variability quantified by the monthly standard deviation (STD) increases from 1.16°C to 1.41°C during 1850–2020. While the SCC, RMSD, and STD at the monthly timescale have been improved, long-term trends remain largely unchanged because the low-frequency component of LSAT in ANN is identical to that in the EOT approach.

NOAAGlobalTemp是美国国家海洋和大气管理局的全球表面温度产品，已广泛应用于地球气候评估和监测。为了改进1850 - 2020年NOAAGlobalTemp月月地表气温的空间插值，设计了一个三层人工神经网络(ANN)系统。人工神经网络系统通过反复随机选择ERA5(1950-2019)中90%的lsat，并对剩余的10%进行验证来训练。验证结果表明，与原始的经验正交远距连接(EOT)方法相比，人工神经网络有了明显的改进:1850-2020年间，全球空间相关系数(SCC)从65%增加到80%，全球均方根差(RMSD)从0.99°C降低到0.57°C。SCCs和rmsd的改善在南半球(SH)比在北半球(NH)更大，在1950年代以前和观测稀疏的地方更大。最后将人工神经网络系统输入到观测到的lsat中，并将其在全球陆地表面的输出与EOT方法的输出进行了比较。与EOT方法相比，ANN方法也有类似的改进:1850-2020年间，全球SCC从78%增加到89%，全球RMSD从0.93°C下降到0.68°C，月标准差(STD)量化的LSAT变率从1.16°C增加到1.41°C。虽然SCC、RMSD和STD在月时间尺度上有所改善，但长期趋势基本保持不变，因为人工神经网络中LSAT的低频成分与EOT方法中的相同。

{"title":"Improvements to the Land Surface Air Temperature Reconstruction in NOAAGlobalTemp: An Artificial Neural Network Approach","authors":"Boyin Huang, Xungang Yin, M. Menne, R. Vose, Huai-min Zhang","doi":"10.1175/aies-d-22-0032.1","DOIUrl":"https://doi.org/10.1175/aies-d-22-0032.1","url":null,"abstract":"\u0000NOAAGlobalTemp is NOAA’s operational global surface temperature product, which has been widely used in the Earth’s climate assessment and monitoring. To improve the spatial interpolation of monthly land surface air temperatures (LSATs) in NOAAGlobalTemp from 1850 to 2020, a three-layer artificial neural network (ANN) system was designed. The ANN system was trained by repeatedly randomly selecting 90% of the LSATs from ERA5 (1950–2019) and validating with the remaining 10%. Validations show clear improvements of ANN over the original empirical orthogonal teleconnection (EOT) method: The global spatial correlation coefficient (SCC) increases from 65% to 80%, and the global root-mean-square-difference (RMSD) decreases from 0.99°C to 0.57°C during 1850–2020. The improvements of SCCs and RMSDs are larger in the Southern Hemisphere (SH) than in the Northern Hemisphere (NH), and are larger before the 1950s and where observations are sparse.\u0000The ANN system was finally fed in observed LSATs, and its output over the global land surface was compared with those from the EOT method. Comparisons demonstrate similar improvements by ANN over the EOT method: The global SCC increased from 78% to 89%, the global RMSD decreased from 0.93°C to 0.68°C, and the LSAT variability quantified by the monthly standard deviation (STD) increases from 1.16°C to 1.41°C during 1850–2020. While the SCC, RMSD, and STD at the monthly timescale have been improved, long-term trends remain largely unchanged because the low-frequency component of LSAT in ANN is identical to that in the EOT approach.","PeriodicalId":94369,"journal":{"name":"Artificial intelligence for the earth systems","volume":"17 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89107682","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

Modeling Spatial Distribution of Snow Water Equivalent by Combining Meteorological and Satellite Data with Lidar Maps 结合气象和卫星数据与激光雷达地图模拟雪水当量的空间分布

Artificial intelligence for the earth systems

Pub Date : 2022-09-06 DOI: 10.1175/aies-d-22-0010.1

U. Mital, D. Dwivedi, Ilhan Özgen-Xian, J. B. Brown, C. Steefel

An accurate characterization of the water content of snowpack, or snow water equivalent (SWE), is necessary to quantify water availability and constrain hydrologic and land-surface models. Recently, airborne observations (e.g., lidar) have emerged as a promising method to accurately quantify SWE at high resolutions (scales of ∼100m and finer). However, the frequency of these observations is very low, typically once or twice per season in Rocky Mountains, Colorado. Here, we present a machine learning framework based on Random Forests to model temporally sparse lidar-derived SWE, enabling estimation of SWE at unmapped time points. We approximated the physical processes governing snow accumulation and melt as well as snow characteristics by obtaining fifteen different variables from gridded estimates of precipitation, temperature, surface reflectance, elevation, and canopy. Results showed that in the Rocky Mountains of Colorado, our framework is capable of modeling SWE with a higher accuracy when compared with estimates generated by the Snow Data Assimilation System (SNODAS). The mean value of the coefficient of determination (R2) using our approach was 0.57 and the root mean squared error (RMSE) was 13 cm, which was a significant improvement over SNODAS (mean R2 = 0.13, RMSE = 20 cm). We explored the relative importance of the input variables, and observed that at the spatial resolution of 800 m, meteorological variables are more important drivers of predictive accuracy than surface variables which characterize the properties of snow on the ground. This research provides a framework to expand the applicability of lidar-derived SWE to unmapped time points.

对积雪含水量或雪水当量(SWE)的准确描述对于量化水分有效性和约束水文和陆地表面模型是必要的。最近，航空观测(例如激光雷达)已经成为一种有前途的方法，可以在高分辨率(~ 100米及更小的尺度)下准确量化SWE。然而，这些观测的频率非常低，通常在科罗拉多州的落基山脉每个季节一次或两次。在这里，我们提出了一个基于随机森林的机器学习框架来建模时间稀疏激光雷达衍生的SWE，从而能够在未映射的时间点估计SWE。我们通过从降水、温度、地表反射率、海拔和冠层的网格估计中获得15个不同的变量，近似地模拟了控制积雪和融化的物理过程以及雪的特征。结果表明，与SNODAS (Snow Data Assimilation System)产生的估算值相比，我们的框架能够以更高的精度模拟科罗拉多州落基山脉的SWE。该方法的决定系数(R2)均值为0.57，均方根误差(RMSE)为13 cm，较SNODAS(均方根误差R2 = 0.13, RMSE = 20 cm)有显著改善。我们探讨了输入变量的相对重要性，发现在800 m的空间分辨率下，气象变量比表征地面雪特性的地表变量对预测精度的影响更重要。本研究提供了一个框架，将激光雷达衍生的SWE扩展到未映射的时间点。

{"title":"Modeling Spatial Distribution of Snow Water Equivalent by Combining Meteorological and Satellite Data with Lidar Maps","authors":"U. Mital, D. Dwivedi, Ilhan Özgen-Xian, J. B. Brown, C. Steefel","doi":"10.1175/aies-d-22-0010.1","DOIUrl":"https://doi.org/10.1175/aies-d-22-0010.1","url":null,"abstract":"\u0000An accurate characterization of the water content of snowpack, or snow water equivalent (SWE), is necessary to quantify water availability and constrain hydrologic and land-surface models. Recently, airborne observations (e.g., lidar) have emerged as a promising method to accurately quantify SWE at high resolutions (scales of ∼100m and finer). However, the frequency of these observations is very low, typically once or twice per season in Rocky Mountains, Colorado. Here, we present a machine learning framework based on Random Forests to model temporally sparse lidar-derived SWE, enabling estimation of SWE at unmapped time points. We approximated the physical processes governing snow accumulation and melt as well as snow characteristics by obtaining fifteen different variables from gridded estimates of precipitation, temperature, surface reflectance, elevation, and canopy. Results showed that in the Rocky Mountains of Colorado, our framework is capable of modeling SWE with a higher accuracy when compared with estimates generated by the Snow Data Assimilation System (SNODAS). The mean value of the coefficient of determination (R2) using our approach was 0.57 and the root mean squared error (RMSE) was 13 cm, which was a significant improvement over SNODAS (mean R2 = 0.13, RMSE = 20 cm). We explored the relative importance of the input variables, and observed that at the spatial resolution of 800 m, meteorological variables are more important drivers of predictive accuracy than surface variables which characterize the properties of snow on the ground. This research provides a framework to expand the applicability of lidar-derived SWE to unmapped time points.","PeriodicalId":94369,"journal":{"name":"Artificial intelligence for the earth systems","volume":"43 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77165135","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Hybrid neural network models for postprocessing medium-range forecasts of tropical cyclone tracks over the western North Pacific 北太平洋西部热带气旋路径中期后期预报的混合神经网络模型

Artificial intelligence for the earth systems

Pub Date : 2022-08-31 DOI: 10.1175/aies-d-21-0003.1

H. Cheung, Chang‐Hoi Ho, Minhee Chang

Tropical cyclone (TC) track forecasts derived from dynamical models inherit their errors. In this study, a neural network (NN) algorithm was proposed for postprocessing TC tracks predicted by the Global Ensemble Forecast System (GEFS) for lead times of 2, 4, 5, and 6 days over the western North Pacific. The hybrid NN is a combination of three NN classes: (1) convolutional NN that extracts spatial features from GEFS fields; (2) multilayer perceptron which processes TC positions predicted by GEFS; and (3) recurrent NN that handles information from previous time steps. A dataset of 204 TCs (6744 samples), which were formed from 1985 to 2019 (June through October) and survived for at least six days, was separated into various track patterns. TCs in each track pattern were distributed uniformly to validation and test dataset, in which each contained 10% TCs of the entire dataset, and the remaining 80% were allocated to the training dataset. Two NN architectures were developed, with and without a shortcut connection. Feature selection and hyperparameter tuning were performed to improve model performance. The results present that mean track error and dispersion could be reduced, particularly with the shortcut connection, which also corrected the systematic speed and direction bias of GEFS. Although a reduction in mean track error was not achieved by the NNs for every forecast lead time, improvement can be foreseen upon calibration for reducing overfitting, and the performance encourages further development in the present application.

基于动力模式的热带气旋路径预报继承了它们的误差。本文提出了一种神经网络(NN)算法，用于对全球综合预报系统(GEFS)预测的北太平洋西部地区提前期为2、4、5和6天的TC轨迹进行后处理。混合神经网络是三种神经网络的组合:(1)从GEFS字段中提取空间特征的卷积神经网络;(2)多层感知器，对GEFS预测的TC位置进行处理;(3)处理来自前一个时间步长的信息的递归神经网络。204个tc(6744个样本)的数据集形成于1985年至2019年(6月至10月)，并存活了至少6天，被分成不同的轨迹模式。每个轨迹模式的tc均匀分布到验证和测试数据集，每个数据集占整个数据集的10%，其余80%分配给训练数据集。开发了两种具有和不具有快捷连接的神经网络架构。通过特征选择和超参数调优来提高模型性能。结果表明，采用快捷连接可以减小平均航迹误差和频散，同时还可以纠正GEFS的系统速度和方向偏差。虽然神经网络并没有在每个预测提前期都能减少平均跟踪误差，但在校正后可以预见到改善，以减少过拟合，并且性能鼓励在当前应用中进一步发展。

{"title":"Hybrid neural network models for postprocessing medium-range forecasts of tropical cyclone tracks over the western North Pacific","authors":"H. Cheung, Chang‐Hoi Ho, Minhee Chang","doi":"10.1175/aies-d-21-0003.1","DOIUrl":"https://doi.org/10.1175/aies-d-21-0003.1","url":null,"abstract":"\u0000Tropical cyclone (TC) track forecasts derived from dynamical models inherit their errors. In this study, a neural network (NN) algorithm was proposed for postprocessing TC tracks predicted by the Global Ensemble Forecast System (GEFS) for lead times of 2, 4, 5, and 6 days over the western North Pacific. The hybrid NN is a combination of three NN classes: (1) convolutional NN that extracts spatial features from GEFS fields; (2) multilayer perceptron which processes TC positions predicted by GEFS; and (3) recurrent NN that handles information from previous time steps. A dataset of 204 TCs (6744 samples), which were formed from 1985 to 2019 (June through October) and survived for at least six days, was separated into various track patterns. TCs in each track pattern were distributed uniformly to validation and test dataset, in which each contained 10% TCs of the entire dataset, and the remaining 80% were allocated to the training dataset. Two NN architectures were developed, with and without a shortcut connection. Feature selection and hyperparameter tuning were performed to improve model performance. The results present that mean track error and dispersion could be reduced, particularly with the shortcut connection, which also corrected the systematic speed and direction bias of GEFS. Although a reduction in mean track error was not achieved by the NNs for every forecast lead time, improvement can be foreseen upon calibration for reducing overfitting, and the performance encourages further development in the present application.","PeriodicalId":94369,"journal":{"name":"Artificial intelligence for the earth systems","volume":"44 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87320352","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Automated Identification of ‘Dunkelflaute’ Events: A Convolutional Neural Network-Based Autoencoder Approach “Dunkelflaute”事件的自动识别:基于卷积神经网络的自动编码器方法

Artificial intelligence for the earth systems

Pub Date : 2022-08-29 DOI: 10.1175/aies-d-22-0015.1

Bowen Li, S. Basu, S. Watson

As wind and solar power play increasingly important roles in the European energy system, unfavourable weather conditions, such as ‘Dunkelflaute’ (extended calm and cloudy periods), will pose ever greater challenges to transmission system operators. Thus, accurate identification and characterization of such events from open data streams (e.g., reanalysis, numerical weather prediction, and climate projection) are going to be crucial. In this study, we propose a two-step, unsupervised deep learning framework (named WISRnet) to automatically encode spatial patterns of wind speed and insolation, and subsequently, identify Dunkelflaute periods from the encoded patterns. Specifically, a deep convolutional neural network (CNN)-based autoencoder (AE) is first employed for feature extraction from the spatial patterns. These two-dimensional CNN-AE patterns encapsulate both amplitude and spatial information in a parsimonious way. In the second step of the WISRnet framework, a variant of the well-known k-means algorithm is used to divide the CNN-AE patterns in region-dependent meteorological clusters. For the validation of the WISRnet framework, aggregated wind and solar power production data from Belgium are used. Using a simple criterion from published literature, all the Dunkelflaute periods are directly identified from this six year-long dataset. Next, each of these periods is associated with a WISRnet-derived cluster. Interestingly, we find that the majority of these Dunkelflaute periods are part of only five clusters (out of twenty five). We show that in lieu of proprietary power production data, the WISRnet framework can identify Dunkelflaute periods from public-domain meteorological data. To further demonstrate the prowess of this framework, it is deployed to identify and characterize Dunkelflaute events in Denmark, Sweden, and the UK.

随着风能和太阳能在欧洲能源系统中发挥越来越重要的作用，不利的天气条件，如“Dunkelflaute”(长时间无风多云)，将给输电系统运营商带来更大的挑战。因此，从开放数据流(例如，再分析、数值天气预报和气候预测)中准确识别和描述此类事件将是至关重要的。在这项研究中，我们提出了一个两步无监督深度学习框架(名为WISRnet)来自动编码风速和日照的空间模式，并随后从编码模式中识别邓克尔弗劳特周期。具体而言，首先采用基于深度卷积神经网络(CNN)的自编码器(AE)从空间模式中提取特征。这些二维CNN-AE模式以一种简洁的方式封装了幅度和空间信息。在WISRnet框架的第二步中，使用了一种众所周知的k-means算法的变体来划分区域相关气象聚类中的CNN-AE模式。为了验证WISRnet框架，使用了来自比利时的汇总风能和太阳能生产数据。使用一个来自已发表文献的简单标准，所有邓克尔弗劳特时期都直接从这个六年的数据集中确定出来。接下来，每个周期都与wisrnet派生的集群相关联。有趣的是，我们发现大多数Dunkelflaute时期只属于5个集群(25个集群中)。我们表明，WISRnet框架可以从公共领域的气象数据中识别邓克尔弗劳特时期，而不是专有的电力生产数据。为了进一步证明这一框架的威力，我们将其用于识别和描述丹麦、瑞典和英国的邓克尔弗劳特事件。

{"title":"Automated Identification of ‘Dunkelflaute’ Events: A Convolutional Neural Network-Based Autoencoder Approach","authors":"Bowen Li, S. Basu, S. Watson","doi":"10.1175/aies-d-22-0015.1","DOIUrl":"https://doi.org/10.1175/aies-d-22-0015.1","url":null,"abstract":"\u0000As wind and solar power play increasingly important roles in the European energy system, unfavourable weather conditions, such as ‘Dunkelflaute’ (extended calm and cloudy periods), will pose ever greater challenges to transmission system operators. Thus, accurate identification and characterization of such events from open data streams (e.g., reanalysis, numerical weather prediction, and climate projection) are going to be crucial. In this study, we propose a two-step, unsupervised deep learning framework (named WISRnet) to automatically encode spatial patterns of wind speed and insolation, and subsequently, identify Dunkelflaute periods from the encoded patterns. Specifically, a deep convolutional neural network (CNN)-based autoencoder (AE) is first employed for feature extraction from the spatial patterns. These two-dimensional CNN-AE patterns encapsulate both amplitude and spatial information in a parsimonious way. In the second step of the WISRnet framework, a variant of the well-known k-means algorithm is used to divide the CNN-AE patterns in region-dependent meteorological clusters. For the validation of the WISRnet framework, aggregated wind and solar power production data from Belgium are used. Using a simple criterion from published literature, all the Dunkelflaute periods are directly identified from this six year-long dataset. Next, each of these periods is associated with a WISRnet-derived cluster. Interestingly, we find that the majority of these Dunkelflaute periods are part of only five clusters (out of twenty five). We show that in lieu of proprietary power production data, the WISRnet framework can identify Dunkelflaute periods from public-domain meteorological data. To further demonstrate the prowess of this framework, it is deployed to identify and characterize Dunkelflaute events in Denmark, Sweden, and the UK.","PeriodicalId":94369,"journal":{"name":"Artificial intelligence for the earth systems","volume":"30 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77990196","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Machine Learning Crop Yield Models Based on Meteorological Features and Comparison with a Process-Based Model 基于气象特征的机器学习作物产量模型及其与基于过程模型的比较

Artificial intelligence for the earth systems

Pub Date : 2022-08-26 DOI: 10.1175/aies-d-22-0002.1

Qinqing Liu, Meijian Yang, Koushan Mohammadi, Dongjin Song, J. Bi, Guiling Wang

A major challenge for food security worldwide is the large inter-annual variability of crop yield, and climate change is expected to further exacerbate this volatility. Accurate prediction of the crop response to climate variability and change is critical for short-term management and long-term planning in multiple sectors. In this study, using maize in the U.S. Corn Belt as an example, we train and validate multiple machine learning (ML) models predicting crop yield based on meteorological variables and soil properties using the leaving-one-year-out approach, and compare their performance with that of a widely used process-based crop model (PBM). Our proposed Long Short-Term Memory model with attention (LSTMatt) outperforms other ML models (including other variations of LSTM developed in this study), and explains 73% of the spatiotemporal variance of the observed maize yield, in contrast to 16% explained by the regionally calibrated PBM; the magnitude of yield prediction errors in LSTMatt is about one-third of that in the PBM. When applied to the extreme drought year 2012 that has no counterpart in the training data, the LSTMatt performance drops but still shows advantage over the PBM. Findings from this study suggest a great potential for out-of-sample application of the 𝐿𝑆𝑇𝑀𝑎𝑡𝑡 model to predict crop yieldunder a changing climate.

全球粮食安全面临的一项重大挑战是作物产量的巨大年际变化，而气候变化预计将进一步加剧这种波动。准确预测作物对气候变率和变化的响应对于多个部门的短期管理和长期规划至关重要。在本研究中，以美国玉米带的玉米为例，我们训练并验证了多个机器学习(ML)模型，该模型基于气象变量和土壤特性，使用留年方法预测作物产量，并将其与广泛使用的基于过程的作物模型(PBM)的性能进行了比较。我们提出的具有注意的长短期记忆模型(LSTMatt)优于其他ML模型(包括本研究开发的LSTM的其他变体)，并解释了观测玉米产量的73%的时空方差，而区域校准的PBM解释了16%;LSTMatt的产量预测误差约为PBM的三分之一。当应用于训练数据中没有对应的2012年极端干旱时，LSTMatt的性能下降，但仍优于PBM。这项研究的结果表明，𝐿𝑆𝑇𝑀𝑎𝑡𝑡模型在预测气候变化下的作物产量方面具有很大的样本外应用潜力。

{"title":"Machine Learning Crop Yield Models Based on Meteorological Features and Comparison with a Process-Based Model","authors":"Qinqing Liu, Meijian Yang, Koushan Mohammadi, Dongjin Song, J. Bi, Guiling Wang","doi":"10.1175/aies-d-22-0002.1","DOIUrl":"https://doi.org/10.1175/aies-d-22-0002.1","url":null,"abstract":"\u0000A major challenge for food security worldwide is the large inter-annual variability of crop yield, and climate change is expected to further exacerbate this volatility. Accurate prediction of the crop response to climate variability and change is critical for short-term management and long-term planning in multiple sectors. In this study, using maize in the U.S. Corn Belt as an example, we train and validate multiple machine learning (ML) models predicting crop yield based on meteorological variables and soil properties using the leaving-one-year-out approach, and compare their performance with that of a widely used process-based crop model (PBM). Our proposed Long Short-Term Memory model with attention (LSTMatt) outperforms other ML models (including other variations of LSTM developed in this study), and explains 73% of the spatiotemporal variance of the observed maize yield, in contrast to 16% explained by the regionally calibrated PBM; the magnitude of yield prediction errors in LSTMatt is about one-third of that in the PBM. When applied to the extreme drought year 2012 that has no counterpart in the training data, the LSTMatt performance drops but still shows advantage over the PBM. Findings from this study suggest a great potential for out-of-sample application of the 𝐿𝑆𝑇𝑀𝑎𝑡𝑡 model to predict crop yieldunder a changing climate.","PeriodicalId":94369,"journal":{"name":"Artificial intelligence for the earth systems","volume":"451 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79706635","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 4

Probing the Explainability of Neural Network Cloud-Top Pressure Models for LEO and GEO Imagers 神经网络云顶压力模型对LEO和GEO成像仪的可解释性探讨

Artificial intelligence for the earth systems

Pub Date : 2022-08-26 DOI: 10.1175/aies-d-21-0001.1

C. White, A. Heidinger, S. Ackerman

Satellite imager estimates of cloud-top pressure (CTP) have many applications in both operations and in studying long-term variations in cloud properties. Recently, machine learning (ML) approaches have shown improvement upon physically-based algorithms. However, ML approaches, and especially neural networks, can suffer from a lack of interpretability making it difficult to understand what information is most useful for accurate predictions of cloud properties. We trained several neural networks to estimate CTP from the infrared channels of the Visible Infrared Imaging Radiometer Suite (VIIRS) and the Advanced Baseline Imager (ABI). The main focus of this work is assessing the relative importance of each instrument’s infrared channels in neural networks trained to estimate CTP. We use several ML explainability methods to offer different perspectives on feature importance. These methods show many differences in the relative feature importance depending on the exact method used, but most agree on a few points. Overall, the 8.4- and 8.6-μm channels appear to be the most useful for CTP estimation on ABI and VIIRS, respectively, with other native infrared window channels and the 13.3-μm channel playing a moderate role. Furthermore, we find that the neural networks learn relationships that may account for properties of clouds such as opacity and cloud-top phase that otherwise complicate the estimation of CTP.

卫星成像仪对云顶压力(CTP)的估计在作业和研究云性质的长期变化方面有许多应用。最近，机器学习(ML)方法已经显示出对基于物理的算法的改进。然而，机器学习方法，尤其是神经网络，可能缺乏可解释性，这使得很难理解哪些信息对准确预测云属性最有用。我们训练了几个神经网络，从可见光红外成像辐射计套件(VIIRS)和高级基线成像仪(ABI)的红外通道估计CTP。这项工作的主要焦点是评估每个仪器的红外通道在神经网络中用于估计CTP的相对重要性。我们使用几种机器学习可解释性方法来提供不同的特征重要性视角。根据所使用的确切方法，这些方法在相对特征重要性方面表现出许多差异，但大多数方法在几个点上是一致的。总体而言，8.4 μm通道和8.6 μm通道分别对ABI和VIIRS的CTP估计最有用，其他原生红外窗口通道和13.3 μm通道的作用中等。此外，我们发现神经网络学习的关系可以解释云的性质，如不透明度和云顶相位，否则会使CTP的估计复杂化。

{"title":"Probing the Explainability of Neural Network Cloud-Top Pressure Models for LEO and GEO Imagers","authors":"C. White, A. Heidinger, S. Ackerman","doi":"10.1175/aies-d-21-0001.1","DOIUrl":"https://doi.org/10.1175/aies-d-21-0001.1","url":null,"abstract":"\u0000Satellite imager estimates of cloud-top pressure (CTP) have many applications in both operations and in studying long-term variations in cloud properties. Recently, machine learning (ML) approaches have shown improvement upon physically-based algorithms. However, ML approaches, and especially neural networks, can suffer from a lack of interpretability making it difficult to understand what information is most useful for accurate predictions of cloud properties. We trained several neural networks to estimate CTP from the infrared channels of the Visible Infrared Imaging Radiometer Suite (VIIRS) and the Advanced Baseline Imager (ABI). The main focus of this work is assessing the relative importance of each instrument’s infrared channels in neural networks trained to estimate CTP. We use several ML explainability methods to offer different perspectives on feature importance. These methods show many differences in the relative feature importance depending on the exact method used, but most agree on a few points. Overall, the 8.4- and 8.6-μm channels appear to be the most useful for CTP estimation on ABI and VIIRS, respectively, with other native infrared window channels and the 13.3-μm channel playing a moderate role. Furthermore, we find that the neural networks learn relationships that may account for properties of clouds such as opacity and cloud-top phase that otherwise complicate the estimation of CTP.","PeriodicalId":94369,"journal":{"name":"Artificial intelligence for the earth systems","volume":"7 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"73796852","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Artificial intelligence for the earth systems

Pub Date : 2022-08-26 DOI: 10.1175/aies-d-22-0005.1

G. Petty

A simple yet flexible and robust algorithm is described for fully partitioning an arbitrary dataset into compact, non-overlapping groups or classes, sorted by size, based entirely on a pairwise similarity matrix and a user-specified similarity threshold. Unlike many clustering algorithms, there is no assumption that natural clusters exist in the dataset, though clusters, when present, may be preferentially assigned to one or more classes. The method also does not require data objects to be compared within any coordinate system but rather permits the user to define pairwise similarity using almost any conceivable criterion. The method therefore lends itself to certain geoscientific applications for which conventional clustering methods are unsuited, including two non-trivial and distinctly different datasets presented as examples. In addition to identifying large classes containing numerous similar dataset members, it is also well-suited for isolating rare or anomalous members of a dataset. The method is inductive, in that prototypes identified in representative subset of a larger dataset can be used to classify the remainder.

本文描述了一种简单而灵活且鲁棒的算法，用于将任意数据集完全划分为紧凑的、不重叠的组或类，并根据大小进行排序，完全基于成对相似矩阵和用户指定的相似阈值。与许多聚类算法不同，它不假设数据集中存在自然聚类，尽管当存在聚类时，可能优先分配给一个或多个类。该方法也不需要在任何坐标系内比较数据对象，而是允许用户使用几乎任何可以想到的标准定义成对相似性。因此，该方法适合于传统聚类方法不适合的某些地球科学应用，包括作为示例的两个不同的数据集。除了识别包含许多相似数据集成员的大型类之外，它还非常适合于隔离数据集的罕见或异常成员。该方法是归纳的，因为在较大数据集的代表性子集中识别的原型可用于对其余部分进行分类。

{"title":"The Pairwise Similarity Partitioning algorithm: a method for unsupervised partitioning of geoscientific and other datasets using arbitrary similarity metrics","authors":"G. Petty","doi":"10.1175/aies-d-22-0005.1","DOIUrl":"https://doi.org/10.1175/aies-d-22-0005.1","url":null,"abstract":"\u0000A simple yet flexible and robust algorithm is described for fully partitioning an arbitrary dataset into compact, non-overlapping groups or classes, sorted by size, based entirely on a pairwise similarity matrix and a user-specified similarity threshold. Unlike many clustering algorithms, there is no assumption that natural clusters exist in the dataset, though clusters, when present, may be preferentially assigned to one or more classes. The method also does not require data objects to be compared within any coordinate system but rather permits the user to define pairwise similarity using almost any conceivable criterion. The method therefore lends itself to certain geoscientific applications for which conventional clustering methods are unsuited, including two non-trivial and distinctly different datasets presented as examples. In addition to identifying large classes containing numerous similar dataset members, it is also well-suited for isolating rare or anomalous members of a dataset. The method is inductive, in that prototypes identified in representative subset of a larger dataset can be used to classify the remainder.","PeriodicalId":94369,"journal":{"name":"Artificial intelligence for the earth systems","volume":"25 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85911109","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Adaptive Blending of Probabilistic Precipitation Forecasts with Emphasis on Calibration and Temporal Forecast Consistency 基于校准和时间一致性的概率降水预报自适应混合

Artificial intelligence for the earth systems

Pub Date : 2022-08-03 DOI: 10.1175/aies-d-22-0020.1

M. Rempel, P. Schaumann, R. Hess, V. Schmidt, U. Blahak

A wealth of forecasting models is available for operational weather forecasting. Their strengths often depend on the lead time considered, which generates the need for a seamless combination of different forecast methods. The combined and continuous products are made in order to retain or even enhance the forecast quality of the individual forecasts and to extend the lead time to potentially hazardous weather events. In this study, we further improve an artificial neural network based combination model that was recently proposed in a previous paper. This model combines two initial precipitation ensemble forecasts and produces exceedance probabilities for a set of thresholds for hourly precipitation amounts. Both initial forecasts perform differently well for different lead times, whereas the combined forecast is calibrated and outperforms both initial forecasts with respect to various validation scores and for all considered lead times (+1h to +6h). Moreover, the robustness of the combination model is tested by applying it to a new dataset and by evaluating the spatial and temporal consistency of its forecasts. The changes proposed further improve the forecast quality and make it more useful for practical applications. Temporal consistency of the combined product is evaluated using a flip-flop index. It is shown that the combination provides a higher persistence with decreasing lead times compared to both input systems.

有大量的预报模型可用于业务天气预报。它们的优势往往取决于所考虑的提前期，这就需要不同预测方法的无缝结合。这些组合和连续的预报产品是为了保持甚至提高个别预报的预报质量，并延长对潜在危险天气事件的预警时间。在本研究中，我们进一步改进了先前论文中提出的基于人工神经网络的组合模型。该模式结合了两个初始降水集合预报，并产生了一组每小时降水量阈值的超出概率。两种初始预测对于不同的提前期表现不同，而组合预测是经过校准的，并且在各种验证分数和所有考虑的提前期(+1小时到+6小时)方面优于两种初始预测。此外，通过将组合模型应用于新数据集，并通过评估其预测的时空一致性来检验组合模型的稳健性。提出的修改进一步提高了预报质量，使其更适合实际应用。使用触发器指数评估组合产品的时间一致性。结果表明，与两种输入系统相比，这种组合提供了更高的持久性，并且交货时间缩短。

{"title":"Adaptive Blending of Probabilistic Precipitation Forecasts with Emphasis on Calibration and Temporal Forecast Consistency","authors":"M. Rempel, P. Schaumann, R. Hess, V. Schmidt, U. Blahak","doi":"10.1175/aies-d-22-0020.1","DOIUrl":"https://doi.org/10.1175/aies-d-22-0020.1","url":null,"abstract":"\u0000A wealth of forecasting models is available for operational weather forecasting. Their strengths often depend on the lead time considered, which generates the need for a seamless combination of different forecast methods. The combined and continuous products are made in order to retain or even enhance the forecast quality of the individual forecasts and to extend the lead time to potentially hazardous weather events. In this study, we further improve an artificial neural network based combination model that was recently proposed in a previous paper. This model combines two initial precipitation ensemble forecasts and produces exceedance probabilities for a set of thresholds for hourly precipitation amounts. Both initial forecasts perform differently well for different lead times, whereas the combined forecast is calibrated and outperforms both initial forecasts with respect to various validation scores and for all considered lead times (+1h to +6h). Moreover, the robustness of the combination model is tested by applying it to a new dataset and by evaluating the spatial and temporal consistency of its forecasts. The changes proposed further improve the forecast quality and make it more useful for practical applications. Temporal consistency of the combined product is evaluated using a flip-flop index. It is shown that the combination provides a higher persistence with decreasing lead times compared to both input systems.","PeriodicalId":94369,"journal":{"name":"Artificial intelligence for the earth systems","volume":"8 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87629003","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Application of deep learning to understanding ENSO dynamics 应用深度学习来理解ENSO动力学

Artificial intelligence for the earth systems

Pub Date : 2022-08-02 DOI: 10.1175/aies-d-21-0011.1

Na-Yeon Shin, Y. Ham, Jeong-Hwan Kim, M. Cho, J. Kug

Many deep learning technologies have been applied to the Earth sciences. Nonetheless, the difficulty in interpreting deep learning results still prevents their applications to studies on climate dynamics. Here, we applied a convolutional neural network to understand El Niño–Southern Oscillation (ENSO) dynamics from long-term climate model simulations. The deep learning algorithm successfully predicted ENSO events with a high correlation skill (∼0.82) for a 9-month lead. For interpreting deep learning results beyond the prediction, we present a “contribution map” to estimate how much the grid box and variable contribute to the output and “contribution sensitivity” to estimate how much the output variable is changed to the small perturbation of the input variables. The contribution map and sensitivity are calculated by modifying the input variables to the pre-trained deep learning, which is quite similar to the occlusion sensitivity. Based on the two methods, we identified three precursors of ENSO and investigated their physical processes with El Niño and La Niña development. In particular, it is suggested here that the roles of each precursor are asymmetric between El Niño and La Niña. Our results suggest that the contribution map and sensitivity are simple approaches but can be a powerful tool in understanding ENSO dynamics and they might be also applied to other climate phenomena.

许多深度学习技术已经应用于地球科学。尽管如此，解释深度学习结果的困难仍然阻碍了它们在气候动力学研究中的应用。在这里，我们应用卷积神经网络从长期气候模式模拟中理解El Niño-Southern振荡(ENSO)动力学。深度学习算法以高相关技能(~ 0.82)成功预测ENSO事件，领先9个月。为了解释预测之外的深度学习结果，我们提出了一个“贡献图”来估计网格框和变量对输出的贡献程度，以及“贡献灵敏度”来估计输出变量被输入变量的小扰动改变的程度。通过修改预训练深度学习的输入变量来计算贡献图和灵敏度，这与遮挡灵敏度非常相似。基于这两种方法，我们确定了三种ENSO前体，并通过El Niño和La Niña的发育研究了它们的物理过程。特别指出的是，在El Niño和La Niña之间，每个前驱体的作用是不对称的。我们的研究结果表明，贡献图和敏感性是一种简单的方法，但可以成为理解ENSO动力学的有力工具，它们也可以应用于其他气候现象。

{"title":"Application of deep learning to understanding ENSO dynamics","authors":"Na-Yeon Shin, Y. Ham, Jeong-Hwan Kim, M. Cho, J. Kug","doi":"10.1175/aies-d-21-0011.1","DOIUrl":"https://doi.org/10.1175/aies-d-21-0011.1","url":null,"abstract":"\u0000Many deep learning technologies have been applied to the Earth sciences. Nonetheless, the difficulty in interpreting deep learning results still prevents their applications to studies on climate dynamics. Here, we applied a convolutional neural network to understand El Niño–Southern Oscillation (ENSO) dynamics from long-term climate model simulations. The deep learning algorithm successfully predicted ENSO events with a high correlation skill (∼0.82) for a 9-month lead. For interpreting deep learning results beyond the prediction, we present a “contribution map” to estimate how much the grid box and variable contribute to the output and “contribution sensitivity” to estimate how much the output variable is changed to the small perturbation of the input variables. The contribution map and sensitivity are calculated by modifying the input variables to the pre-trained deep learning, which is quite similar to the occlusion sensitivity. Based on the two methods, we identified three precursors of ENSO and investigated their physical processes with El Niño and La Niña development. In particular, it is suggested here that the roles of each precursor are asymmetric between El Niño and La Niña. Our results suggest that the contribution map and sensitivity are simple approaches but can be a powerful tool in understanding ENSO dynamics and they might be also applied to other climate phenomena.","PeriodicalId":94369,"journal":{"name":"Artificial intelligence for the earth systems","volume":"122 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"91454379","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

Archetypal Analysis of Geophysical Data illustrated by Sea Surface Temperature 以海表温度为例的地球物理资料的原型分析

Artificial intelligence for the earth systems

Pub Date : 2022-07-25 DOI: 10.1175/aies-d-21-0007.1

A. Black, D. Monselesan, J. Risbey, B. Sloyan, C. Chapman, A. Hannachi, D. Richardson, D. Squire, C. Tozer, Nikolay Trendafilov

The ability to find and recognize patterns in high-dimensional geophysical data is fundamental to climate science and critical for meaningful interpretation of weather and climate processes. Archetypal analysis (AA) is one technique that has recently gained traction in the geophysical science community for its ability to find patterns based on extreme conditions. While traditional empirical orthogonal function (EOF) analysis can reveal patterns based on data covariance, AA seeks patterns from the points located at the edges of the data distribution. The utility of any objective pattern method depends on the properties of the data to which it is applied and the choices made in implementing the method. Given the relative novelty of the application of AA in geophysics it is important to develop experience in applying the method. We provide an assessment of the method, implementation, sensitivity, and interpretation of AA with respect to geophysical data. As an example for demonstration, we apply AA to a 39-year sea surface temperature (SST) reanalysis data set. We show that the decisions made to implement AA can significantly affect the interpretation of results, but also, in the case of SST, that the analysis is exceptionally robust under both spatial and temporal coarse-graining.

在高维地球物理数据中发现和识别模式的能力是气候科学的基础，对于有意义地解释天气和气候过程至关重要。原型分析(AA)是最近在地球物理科学界获得关注的一种技术，因为它能够发现基于极端条件的模式。传统的经验正交函数(EOF)分析可以根据数据协方差揭示模式，而AA从位于数据分布边缘的点寻找模式。任何客观模式方法的效用取决于应用该方法的数据的属性以及在实现该方法时所做的选择。鉴于AA在地球物理中的应用相对新颖，因此积累应用该方法的经验是很重要的。我们对地球物理数据方面的AA方法、实施、灵敏度和解释进行了评估。作为示例，我们将AA应用于39年的海温(SST)再分析数据集。我们发现，实施AA的决策可以显著影响结果的解释，而且，在海表温度的情况下，分析在空间和时间粗粒度下都非常稳健。

{"title":"Archetypal Analysis of Geophysical Data illustrated by Sea Surface Temperature","authors":"A. Black, D. Monselesan, J. Risbey, B. Sloyan, C. Chapman, A. Hannachi, D. Richardson, D. Squire, C. Tozer, Nikolay Trendafilov","doi":"10.1175/aies-d-21-0007.1","DOIUrl":"https://doi.org/10.1175/aies-d-21-0007.1","url":null,"abstract":"\u0000The ability to find and recognize patterns in high-dimensional geophysical data is fundamental to climate science and critical for meaningful interpretation of weather and climate processes. Archetypal analysis (AA) is one technique that has recently gained traction in the geophysical science community for its ability to find patterns based on extreme conditions. While traditional empirical orthogonal function (EOF) analysis can reveal patterns based on data covariance, AA seeks patterns from the points located at the edges of the data distribution. The utility of any objective pattern method depends on the properties of the data to which it is applied and the choices made in implementing the method. Given the relative novelty of the application of AA in geophysics it is important to develop experience in applying the method. We provide an assessment of the method, implementation, sensitivity, and interpretation of AA with respect to geophysical data. As an example for demonstration, we apply AA to a 39-year sea surface temperature (SST) reanalysis data set. We show that the decisions made to implement AA can significantly affect the interpretation of results, but also, in the case of SST, that the analysis is exceptionally robust under both spatial and temporal coarse-graining.","PeriodicalId":94369,"journal":{"name":"Artificial intelligence for the earth systems","volume":"30 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2022-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"86750173","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1