首页 > 最新文献

Artificial intelligence for the earth systems最新文献

英文 中文
Can Machine Learning Models be a Suitable Tool for Predicting Central European Cold Winter Weather on Subseasonal to Seasonal Timescales? 机器学习模型能否成为预测中欧亚季节到季节时间尺度上寒冷冬季天气的合适工具?
Pub Date : 2023-07-28 DOI: 10.1175/aies-d-23-0020.1
S. Kiefer, Sebastian Lerch, P. Ludwig, J. Pinto
Skillful weather prediction on subseasonal to seasonal timescales is crucial for many socio-economic ventures. But forecasting, especially extremes, on these timescales is very challenging as the information from initial conditions is gradually lost. Therefore, data-driven methods are discussed as an alternative to numerical weather prediction models. Here, Quantile Regression Forests (QRFs) and Random Forest Classifiers (RFCs) are used for probabilistic forecasting of Central European wintertime mean 2-meter temperatures and cold wave days at lead times of 14, 21 and 28 days. ERA5-reanalysis meteorological predictors are used as input data for the machine learning models. The predictions are compared for the winters 2000/2001 to 2019/2020 to a climatological ensemble obtained from E-OBS observational data. The evaluation is performed as full distribution predictions for continuous values using the Continuous Ranked Probability Skill Score and as binary categorical forecasts using the Brier Skill Score. We find skill at lead times up to 28 days in the 20-winter mean and for individual winters. Case studies show that all used machine learning models are able to learn pattern in the data beyond climatology. A more detailed analysis using Shapley Additive Explanations suggest, that both Random-Forest (RF) based models are able to learn physically known relationships in the data. This underlines that RF-based data-driven models can be a suitable tool for forecasting Central European wintertime 2-meter temperatures and the occurrence of cold wave days.
熟练的亚季节到季节时间尺度的天气预报对许多社会经济企业至关重要。但是,在这些时间尺度上进行预测,尤其是极端情况下的预测,是非常具有挑战性的,因为来自初始条件的信息会逐渐丢失。因此,讨论了数据驱动方法作为数值天气预报模式的替代方法。本文利用分位数回归森林(qrf)和随机森林分类器(rfc)对中欧冬季平均2米温度和提前14、21和28天的寒潮天数进行了概率预测。era5再分析气象预测用作机器学习模型的输入数据。将2000/2001年至2019/2020年冬季的预测与从E-OBS观测数据获得的气候集合进行了比较。评估是作为连续值的完整分布预测来执行的,使用连续排名概率技能分数,作为二元分类预测使用Brier技能分数。我们发现,在平均20个冬季和个别冬季中,交货时间最长可达28天。案例研究表明,所有使用的机器学习模型都能够学习气候学以外的数据模式。使用Shapley加性解释的更详细分析表明,基于随机森林(RF)的模型都能够学习数据中物理已知的关系。这强调了基于射频的数据驱动模型可以成为预测中欧冬季2米温度和寒潮日发生的合适工具。
{"title":"Can Machine Learning Models be a Suitable Tool for Predicting Central European Cold Winter Weather on Subseasonal to Seasonal Timescales?","authors":"S. Kiefer, Sebastian Lerch, P. Ludwig, J. Pinto","doi":"10.1175/aies-d-23-0020.1","DOIUrl":"https://doi.org/10.1175/aies-d-23-0020.1","url":null,"abstract":"\u0000Skillful weather prediction on subseasonal to seasonal timescales is crucial for many socio-economic ventures. But forecasting, especially extremes, on these timescales is very challenging as the information from initial conditions is gradually lost. Therefore, data-driven methods are discussed as an alternative to numerical weather prediction models. Here, Quantile Regression Forests (QRFs) and Random Forest Classifiers (RFCs) are used for probabilistic forecasting of Central European wintertime mean 2-meter temperatures and cold wave days at lead times of 14, 21 and 28 days. ERA5-reanalysis meteorological predictors are used as input data for the machine learning models. The predictions are compared for the winters 2000/2001 to 2019/2020 to a climatological ensemble obtained from E-OBS observational data. The evaluation is performed as full distribution predictions for continuous values using the Continuous Ranked Probability Skill Score and as binary categorical forecasts using the Brier Skill Score. We find skill at lead times up to 28 days in the 20-winter mean and for individual winters. Case studies show that all used machine learning models are able to learn pattern in the data beyond climatology. A more detailed analysis using Shapley Additive Explanations suggest, that both Random-Forest (RF) based models are able to learn physically known relationships in the data. This underlines that RF-based data-driven models can be a suitable tool for forecasting Central European wintertime 2-meter temperatures and the occurrence of cold wave days.","PeriodicalId":94369,"journal":{"name":"Artificial intelligence for the earth systems","volume":"76 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"79200401","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Uncertainty Calibration of Passive Microwave Brightness Temperatures Predicted by Bayesian Deep Learning Models 贝叶斯深度学习模型预测被动微波亮度温度的不确定度标定
Pub Date : 2023-07-24 DOI: 10.1175/aies-d-22-0056.1
P. Ortiz, Eleanor Casas, M. Orescanin, S. Powell, V. Petković, Micky Hall
Visible and infrared radiance products of geostationary orbiting platforms provide virtually continuous observations of Earth. In contrast, low Earth orbiters observe passive microwave (PMW) radiances at any location much less frequently. Prior literature demonstrates the ability of a Machine Learning (ML) approach to build a link between these two complementary radiance spectra by predicting PMW observations using infrared and visible products collected from geostationary instruments, which could potentially deliver a highly-desirable synthetic PMW product with nearly continuous spatio-temporal coverage. However, current ML models lack the ability to provide a measure of uncertainty of such a product, significantly limiting its applications. In this work, Bayesian Deep Learning is employed to generate synthetic Global Precipitation Measurement (GPM) mission Microwave Imager (GMI) data from Advanced Baseline Imager (ABI) observations with attached uncertainties over the ocean. The study first uses deterministic Residual Networks (ResNets) to generate synthetic GMI brightness temperatures with as little mean absolute error as 1.72 K at the ABI spatio-temporal resolution. Then, for the same task, we use three Bayesian ResNet models to produce a comparable amount of error while providing previously unavailable predictive variance (i.e. uncertainty) for each synthetic data point. We find that the Flipout configuration provides the most robust calibration between uncertainty and error across GMI frequencies, and then demonstrate how this additional information is useful for discarding high-error synthetic data points prior to use by downstream applications.
地球静止轨道平台的可见光和红外辐射产品提供了几乎连续的地球观测。相比之下,近地轨道飞行器在任何位置观测被动微波辐射的频率要低得多。先前的文献表明,机器学习(ML)方法能够通过使用从地球静止仪器收集的红外和可见光产品预测PMW观测结果,从而在这两个互补的辐射光谱之间建立联系,这可能提供具有几乎连续时空覆盖的高度理想的合成PMW产品。然而,目前的机器学习模型缺乏提供此类产品不确定性度量的能力,这极大地限制了其应用。在这项工作中,贝叶斯深度学习被用于生成合成的全球降水测量(GPM)任务微波成像仪(GMI)数据,这些数据来自高级基线成像仪(ABI)对海洋的附加不确定性观测。该研究首先使用确定性残差网络(ResNets)在ABI时空分辨率下生成平均绝对误差为1.72 K的合成GMI亮度温度。然后,对于相同的任务,我们使用三个贝叶斯ResNet模型来产生相当数量的误差,同时为每个合成数据点提供以前不可用的预测方差(即不确定性)。我们发现Flipout配置在GMI频率的不确定性和误差之间提供了最稳健的校准,然后演示了这些附加信息如何在下游应用程序使用之前丢弃高误差合成数据点。
{"title":"Uncertainty Calibration of Passive Microwave Brightness Temperatures Predicted by Bayesian Deep Learning Models","authors":"P. Ortiz, Eleanor Casas, M. Orescanin, S. Powell, V. Petković, Micky Hall","doi":"10.1175/aies-d-22-0056.1","DOIUrl":"https://doi.org/10.1175/aies-d-22-0056.1","url":null,"abstract":"\u0000Visible and infrared radiance products of geostationary orbiting platforms provide virtually continuous observations of Earth. In contrast, low Earth orbiters observe passive microwave (PMW) radiances at any location much less frequently. Prior literature demonstrates the ability of a Machine Learning (ML) approach to build a link between these two complementary radiance spectra by predicting PMW observations using infrared and visible products collected from geostationary instruments, which could potentially deliver a highly-desirable synthetic PMW product with nearly continuous spatio-temporal coverage. However, current ML models lack the ability to provide a measure of uncertainty of such a product, significantly limiting its applications. In this work, Bayesian Deep Learning is employed to generate synthetic Global Precipitation Measurement (GPM) mission Microwave Imager (GMI) data from Advanced Baseline Imager (ABI) observations with attached uncertainties over the ocean. The study first uses deterministic Residual Networks (ResNets) to generate synthetic GMI brightness temperatures with as little mean absolute error as 1.72 K at the ABI spatio-temporal resolution. Then, for the same task, we use three Bayesian ResNet models to produce a comparable amount of error while providing previously unavailable predictive variance (i.e. uncertainty) for each synthetic data point. We find that the Flipout configuration provides the most robust calibration between uncertainty and error across GMI frequencies, and then demonstrate how this additional information is useful for discarding high-error synthetic data points prior to use by downstream applications.","PeriodicalId":94369,"journal":{"name":"Artificial intelligence for the earth systems","volume":"40 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87552945","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Assessing Tropical Pacific-induced Predictability of Southern California Precipitation Using a Novel Multi-input Multi-output Autoencoder 利用新型多输入多输出自编码器评估热带太平洋对南加州降水的可预测性
Pub Date : 2023-07-17 DOI: 10.1175/aies-d-23-0003.1
L. Passarella, S. Mahajan
We construct a novel Multi-Input Multi-Output Autoencoder-decoder (MIMO-AE) to capture the non-linear relationship of Southern California precipitation and tropical Pacific Ocean sea surface temperature. The MIMO-AE is trained on both monthly TP-SST and SC-PRECIP anomalies simultaneously. The co-variability of the two fields in the MIMO-AE shared nonlinear latent space can be condensed into an index, termed the MIMO-AE index. We use a transfer learning approach to train a MIMO-AE on the combined dataset of 100 years of output from a historical simulation with the Energy Exascale Earth Systems Model version 1 and a segment of observational data. We further use Long Short-Term Memory networks to assess sub-seasonal predictability of SC-PRECIP using the MIMO-AE index. We find that the MIMO-AE index provides enhanced predictability of SC-PRECIP for a lead-time of up-to four months as compared to Niño 3.4 index and the El Niño Southern Oscillation Longitudinal Index.
本文构建了一种新型的多输入多输出自编解码器(MIMO-AE),用于捕获南加州降水与热带太平洋海表温度的非线性关系。MIMO-AE同时训练每月TP-SST和sc - precp异常。在MIMO-AE共享的非线性潜在空间中,两个场的共变率可以浓缩成一个指数,称为MIMO-AE指数。我们使用迁移学习方法,在Energy Exascale地球系统模型版本1和一段观测数据的100年历史模拟输出的组合数据集上训练MIMO-AE。我们进一步利用长短期记忆网络利用MIMO-AE指数评估sc - precp的分季节可预测性。我们发现,与Niño 3.4指数和El Niño南方涛动纵向指数相比,MIMO-AE指数提供了sc - precp长达4个月的预估时间。
{"title":"Assessing Tropical Pacific-induced Predictability of Southern California Precipitation Using a Novel Multi-input Multi-output Autoencoder","authors":"L. Passarella, S. Mahajan","doi":"10.1175/aies-d-23-0003.1","DOIUrl":"https://doi.org/10.1175/aies-d-23-0003.1","url":null,"abstract":"\u0000We construct a novel Multi-Input Multi-Output Autoencoder-decoder (MIMO-AE) to capture the non-linear relationship of Southern California precipitation and tropical Pacific Ocean sea surface temperature. The MIMO-AE is trained on both monthly TP-SST and SC-PRECIP anomalies simultaneously. The co-variability of the two fields in the MIMO-AE shared nonlinear latent space can be condensed into an index, termed the MIMO-AE index. We use a transfer learning approach to train a MIMO-AE on the combined dataset of 100 years of output from a historical simulation with the Energy Exascale Earth Systems Model version 1 and a segment of observational data. We further use Long Short-Term Memory networks to assess sub-seasonal predictability of SC-PRECIP using the MIMO-AE index. We find that the MIMO-AE index provides enhanced predictability of SC-PRECIP for a lead-time of up-to four months as compared to Niño 3.4 index and the El Niño Southern Oscillation Longitudinal Index.","PeriodicalId":94369,"journal":{"name":"Artificial intelligence for the earth systems","volume":"11 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75259421","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cross-validation strategy impacts the performance and interpretation of machine learning models 交叉验证策略影响机器学习模型的性能和解释
Pub Date : 2023-07-10 DOI: 10.1175/aies-d-23-0026.1
Lily-belle Sweet, Christoph Müller, Mohit Anand, J. Zscheischler
Machine learning algorithms are able to capture complex, nonlinear interacting relationships and are increasingly used to predict yield variability at regional and national scales. Using explainable artificial intelligence (XAI) methods applied to such algorithms may enable better scientific understanding of drivers of yield variability. However, XAI methods may provide misleading results when applied to spatiotemporal correlated datasets. In this study, machine learning models are trained to predict simulated crop yield from climate indices, and the impact of model evaluation strategy on the interpretation and performance of the resulting models is assessed. Using data from a process-based crop model allows us to then comment on the plausibility of the ‘explanations’ provided by XAI methods. Our results show that the choice of evaluation strategy has an impact on (i) interpretations of the model and (ii) model skill on heldout years and regions, after the evaluation strategy is used for hyperparameter-tuning and feature-selection. We find that use of a cross-validation strategy based on clustering in feature-space achieves the most plausible interpretations as well as the best model performance on heldout years and regions. Our results provide first steps towards identifying domain-specific ‘best practices’ for the use of XAI tools on spatiotemporal agricultural or climatic data.
机器学习算法能够捕捉复杂的非线性相互作用关系,并越来越多地用于预测区域和国家尺度上的产量变化。将可解释的人工智能(XAI)方法应用于此类算法,可以更好地科学理解产量变化的驱动因素。然而,当应用于时空相关数据集时,XAI方法可能会提供误导性的结果。在本研究中,通过训练机器学习模型来根据气候指数预测模拟作物产量,并评估模型评估策略对结果模型的解释和性能的影响。使用来自基于过程的裁剪模型的数据,我们可以对XAI方法提供的“解释”的合理性进行评论。我们的研究结果表明,在使用评估策略进行超参数调整和特征选择后,评估策略的选择会影响(i)模型的解释和(ii)模型技能对保留年份和地区的影响。我们发现,在特征空间中使用基于聚类的交叉验证策略可以获得最合理的解释,以及在滞留年份和区域上的最佳模型性能。我们的结果为在时空农业或气候数据上使用XAI工具确定特定领域的“最佳实践”提供了第一步。
{"title":"Cross-validation strategy impacts the performance and interpretation of machine learning models","authors":"Lily-belle Sweet, Christoph Müller, Mohit Anand, J. Zscheischler","doi":"10.1175/aies-d-23-0026.1","DOIUrl":"https://doi.org/10.1175/aies-d-23-0026.1","url":null,"abstract":"\u0000Machine learning algorithms are able to capture complex, nonlinear interacting relationships and are increasingly used to predict yield variability at regional and national scales. Using explainable artificial intelligence (XAI) methods applied to such algorithms may enable better scientific understanding of drivers of yield variability. However, XAI methods may provide misleading results when applied to spatiotemporal correlated datasets. In this study, machine learning models are trained to predict simulated crop yield from climate indices, and the impact of model evaluation strategy on the interpretation and performance of the resulting models is assessed. Using data from a process-based crop model allows us to then comment on the plausibility of the ‘explanations’ provided by XAI methods. Our results show that the choice of evaluation strategy has an impact on (i) interpretations of the model and (ii) model skill on heldout years and regions, after the evaluation strategy is used for hyperparameter-tuning and feature-selection. We find that use of a cross-validation strategy based on clustering in feature-space achieves the most plausible interpretations as well as the best model performance on heldout years and regions. Our results provide first steps towards identifying domain-specific ‘best practices’ for the use of XAI tools on spatiotemporal agricultural or climatic data.","PeriodicalId":94369,"journal":{"name":"Artificial intelligence for the earth systems","volume":"32 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87621783","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A deep learning filter for the intraseasonal variability of the tropics 热带季节内变化的深度学习过滤器
Pub Date : 2023-07-06 DOI: 10.1175/aies-d-22-0079.1
C. Stan, Rama Sesha Sridhar Mantripragada
This paper presents a novel application of convolutional neural network (CNN) models for filtering the intraseasonal variability of the tropical atmosphere. In this deep learning filter, two convolutional layers are applied sequentially in a supervised machine learning framework to extract the intraseasonal signal from the total daily anomalies. The CNN-based filter can be tailored for each field similarly to fast Fourier transform filtering methods. When applied to two different fields (zonal wind stress and outgoing longwave radiation), the index of agreement between the filtered signal obtained using the CNN-based filter and a conventional weight-based filter is between 95 – 99%. The advantage of the CNN-based filter over the conventional filters is its applicability to time series with the length comparable to the period of the signal being extracted.
本文提出了卷积神经网络(CNN)模型在热带大气季节内变率滤波中的新应用。在这个深度学习滤波器中,在监督机器学习框架中依次应用两个卷积层,从总日异常中提取季节内信号。基于cnn的滤波器可以针对每个字段进行定制,类似于快速傅立叶变换滤波方法。当应用于两个不同的场(纬向风应力和向外长波辐射)时,使用基于cnn的滤波器获得的滤波信号与传统的基于权重的滤波器的一致性指数在95 - 99%之间。与传统滤波器相比,基于cnn的滤波器的优点是它适用于长度与被提取信号周期相当的时间序列。
{"title":"A deep learning filter for the intraseasonal variability of the tropics","authors":"C. Stan, Rama Sesha Sridhar Mantripragada","doi":"10.1175/aies-d-22-0079.1","DOIUrl":"https://doi.org/10.1175/aies-d-22-0079.1","url":null,"abstract":"\u0000This paper presents a novel application of convolutional neural network (CNN) models for filtering the intraseasonal variability of the tropical atmosphere. In this deep learning filter, two convolutional layers are applied sequentially in a supervised machine learning framework to extract the intraseasonal signal from the total daily anomalies. The CNN-based filter can be tailored for each field similarly to fast Fourier transform filtering methods. When applied to two different fields (zonal wind stress and outgoing longwave radiation), the index of agreement between the filtered signal obtained using the CNN-based filter and a conventional weight-based filter is between 95 – 99%. The advantage of the CNN-based filter over the conventional filters is its applicability to time series with the length comparable to the period of the signal being extracted.","PeriodicalId":94369,"journal":{"name":"Artificial intelligence for the earth systems","volume":"26 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85258036","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
TCDetect: A New Method of Detecting the Presence of Tropical Cyclones Using Deep Learning TCDetect:一种利用深度学习检测热带气旋存在的新方法
Pub Date : 2023-07-01 DOI: 10.1175/aies-d-22-0045.1
Daniel Galea, Julian Kunkel, B. Lawrence
Tropical cyclones are high-impact weather events that have large human and economic effects, so it is important to be able to understand how their location, frequency, and structure might change in a future climate. Here, a lightweight deep learning model is presented that is intended for detecting the presence or absence of tropical cyclones during the execution of numerical simulations for use in an online data reduction method. This will help to avoid saving vast amounts of data for analysis after the simulation is complete. With run-time detection, it might be possible to reduce the need for some of the high-frequency high-resolution output that would otherwise be required. The model was trained on ERA-Interim reanalysis data from 1979 to 2017, and the training was concentrated on delivering the highest possible recall rate (successful detection of cyclones) while rejecting enough data to make a difference in outputs. When tested using data from the two subsequent years, the recall or probability of detection rate was 92%. The precision rate or success ratio obtained was that of 36%. For the desired data reduction application, if the desired target included all tropical cyclone events, even those that did not obtain hurricane-strength status, the effective precision was 85%. The recall rate and the area under curve for the precision–recall (AUC-PR) compare favorably with other methods of cyclone identification while using the smallest number of parameters for both training and inference.
热带气旋是影响很大的天气事件,对人类和经济都有很大的影响,因此了解它们的位置、频率和结构在未来气候中如何变化是很重要的。在这里,提出了一个轻量级的深度学习模型,用于在执行数值模拟期间检测热带气旋的存在或不存在,用于在线数据简化方法。这将有助于避免在模拟完成后为分析保存大量数据。有了运行时检测,就有可能减少对高频高分辨率输出的需求。该模型是在1979年至2017年的ERA-Interim再分析数据上进行训练的,训练的重点是提供尽可能高的召回率(成功检测到气旋),同时拒绝足够的数据以产生输出差异。当使用随后两年的数据进行测试时,召回率或检测率的概率为92%。获得的准确率或成功率为36%。对于期望的数据约简应用,如果期望的目标包括所有热带气旋事件,甚至那些没有获得飓风强度状态的热带气旋事件,则有效精度为85%。精密度召回(AUC-PR)的召回率和曲线下面积与其他旋风识别方法相比具有优势,同时使用最小数量的参数进行训练和推理。
{"title":"TCDetect: A New Method of Detecting the Presence of Tropical Cyclones Using Deep Learning","authors":"Daniel Galea, Julian Kunkel, B. Lawrence","doi":"10.1175/aies-d-22-0045.1","DOIUrl":"https://doi.org/10.1175/aies-d-22-0045.1","url":null,"abstract":"\u0000Tropical cyclones are high-impact weather events that have large human and economic effects, so it is important to be able to understand how their location, frequency, and structure might change in a future climate. Here, a lightweight deep learning model is presented that is intended for detecting the presence or absence of tropical cyclones during the execution of numerical simulations for use in an online data reduction method. This will help to avoid saving vast amounts of data for analysis after the simulation is complete. With run-time detection, it might be possible to reduce the need for some of the high-frequency high-resolution output that would otherwise be required. The model was trained on ERA-Interim reanalysis data from 1979 to 2017, and the training was concentrated on delivering the highest possible recall rate (successful detection of cyclones) while rejecting enough data to make a difference in outputs. When tested using data from the two subsequent years, the recall or probability of detection rate was 92%. The precision rate or success ratio obtained was that of 36%. For the desired data reduction application, if the desired target included all tropical cyclone events, even those that did not obtain hurricane-strength status, the effective precision was 85%. The recall rate and the area under curve for the precision–recall (AUC-PR) compare favorably with other methods of cyclone identification while using the smallest number of parameters for both training and inference.","PeriodicalId":94369,"journal":{"name":"Artificial intelligence for the earth systems","volume":"46 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77081320","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A New Approach to Predict Tributary Phosphorus Loads Using Machine Learning- and Physics-Based Modeling Systems. 一种使用基于机器学习和物理的建模系统预测支流磷负荷的新方法。
Pub Date : 2023-07-01 DOI: 10.1175/aies-d-22-0049.1
Christina Feng Chang, Marina Astitha, Yongping Yuan, Chunling Tang, Penny Vlahos, Valerie Garcia, Ummul Khaira

Tributary phosphorus (P) loads are one of the main drivers of eutrophication problems in freshwater lakes. Being able to predict P loads can aid in understanding subsequent load patterns and elucidate potential degraded water quality conditions in downstream surface waters. We demonstrate the development and performance of an integrated multimedia modeling system that uses machine learning (ML) to assess and predict monthly total P (TP) and dissolved reactive P (DRP) loads. Meteorological variables from the Weather Research and Forecasting (WRF) Model, hydrologic variables from the Variable Infiltration Capacity model, and agricultural management practice variables from the Environmental Policy Integrated Climate agroecosystem model are utilized to train the ML models to predict P loads. Our study presents a new modeling methodology using as testbeds the Maumee, Sandusky, Portage, and Raisin watersheds, which discharge into Lake Erie and contribute to significant P loads to the lake. Two models were built, one for TP loads using 10 environmental variables and one for DRP loads using nine environmental variables. Both models ranked streamflow as the most important predictive variable. In comparison with observations, TP and DRP loads were predicted very well temporally and spatially. Modeling results of TP loads are within the ranges of those obtained from other studies and on some occasions more accurate. Modeling results of DRP loads exceed performance measures from other studies. We explore the ability of both ML-based models to further improve as more data become available over time. This integrated multimedia approach is recommended for studying other freshwater systems and water quality variables using available decadal data from physics-based model simulations.

支流磷负荷是淡水湖富营养化问题的主要驱动因素之一。能够预测P负荷有助于理解后续负荷模式,并阐明下游地表水潜在的水质退化条件。我们展示了一个集成多媒体建模系统的开发和性能,该系统使用机器学习(ML)来评估和预测每月总P(TP)和溶解无功P(DRP)负荷。利用来自天气研究和预测(WRF)模型的气象变量、来自可变渗透能力模型的水文变量和来自环境政策综合气候农业生态系统模型的农业管理实践变量来训练ML模型来预测磷负荷。我们的研究提出了一种新的建模方法,使用Maumee、Sandusky、Portage和Raisin流域作为试验台,这些流域排入伊利湖,并对该湖产生显著的磷负荷。建立了两个模型,一个模型用于使用10个环境变量的TP负荷,另一个模型使用9个环境变量用于DRP负荷。两个模型都将流量列为最重要的预测变量。与观测结果相比,TP和DRP负荷在时间和空间上都得到了很好的预测。TP负荷的建模结果在其他研究的范围内,在某些情况下更准确。DRP负载的建模结果超过了其他研究的性能指标。我们探索了随着时间的推移,随着更多数据的可用性,这两个基于ML的模型进一步改进的能力。建议采用这种综合多媒体方法,利用基于物理的模型模拟的可用十年数据来研究其他淡水系统和水质变量。
{"title":"A New Approach to Predict Tributary Phosphorus Loads Using Machine Learning- and Physics-Based Modeling Systems.","authors":"Christina Feng Chang, Marina Astitha, Yongping Yuan, Chunling Tang, Penny Vlahos, Valerie Garcia, Ummul Khaira","doi":"10.1175/aies-d-22-0049.1","DOIUrl":"10.1175/aies-d-22-0049.1","url":null,"abstract":"<p><p>Tributary phosphorus (P) loads are one of the main drivers of eutrophication problems in freshwater lakes. Being able to predict P loads can aid in understanding subsequent load patterns and elucidate potential degraded water quality conditions in downstream surface waters. We demonstrate the development and performance of an integrated multimedia modeling system that uses machine learning (ML) to assess and predict monthly total P (TP) and dissolved reactive P (DRP) loads. Meteorological variables from the Weather Research and Forecasting (WRF) Model, hydrologic variables from the Variable Infiltration Capacity model, and agricultural management practice variables from the Environmental Policy Integrated Climate agroecosystem model are utilized to train the ML models to predict P loads. Our study presents a new modeling methodology using as testbeds the Maumee, Sandusky, Portage, and Raisin watersheds, which discharge into Lake Erie and contribute to significant P loads to the lake. Two models were built, one for TP loads using 10 environmental variables and one for DRP loads using nine environmental variables. Both models ranked streamflow as the most important predictive variable. In comparison with observations, TP and DRP loads were predicted very well temporally and spatially. Modeling results of TP loads are within the ranges of those obtained from other studies and on some occasions more accurate. Modeling results of DRP loads exceed performance measures from other studies. We explore the ability of both ML-based models to further improve as more data become available over time. This integrated multimedia approach is recommended for studying other freshwater systems and water quality variables using available decadal data from physics-based model simulations.</p>","PeriodicalId":94369,"journal":{"name":"Artificial intelligence for the earth systems","volume":"2 3","pages":"1-20"},"PeriodicalIF":0.0,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10569129/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41242940","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automatic detection of rainfall at hourly time scales from mooring near-surface salinity in the eastern tropical Pacific 从东热带太平洋系泊近地表盐度自动探测每小时降雨量
Pub Date : 2023-06-30 DOI: 10.1175/aies-d-22-0009.1
O. Chkrebtii, F. Bingham
We explore the use of ocean near-surface salinity (NSS), i.e. salinity at 1 m depth, as a rainfall occurrence detector for hourly precipitation using data from the SPURS-2 (Salinity Processes in the Upper-ocean Regional Studies - 2) mooring at 10°N,125°W. Our proposed unsupervised learning algorithm consisting of two stages. First, an empirical quantile-based identification of dips in NSS enables us to capture most events with hourly averaged rainfall rate > 5 mm/hr. Over-estimation of precipitation duration is then corrected locally by fitting a parametric model based on the salinity balance equation. We propose a local precipitation model composed of a small number of calibration parameters representing individual rainfall events and their location in time. We show that unsupervised rainfall detection can be formulated as a statistical problem of predicting these variables from NSS data. We present our results and provide a validation technique based on data collected at the SPURS-2 mooring.
我们利用sprs -2(上层海洋区域研究中的盐度过程-2)系泊在10°N,125°W的数据,探索使用海洋近地表盐度(NSS),即1米深度的盐度,作为每小时降水的降雨发生探测器。我们提出的无监督学习算法包括两个阶段。首先,基于经验分位数的NSS下降识别使我们能够捕获每小时平均降雨量> 5毫米/小时的大多数事件。然后,通过拟合基于盐度平衡方程的参数模型,局部校正降水持续时间的高估。我们提出了一个由代表个别降雨事件及其时间位置的少量校准参数组成的局地降水模型。我们表明,无监督降雨检测可以表述为从NSS数据预测这些变量的统计问题。我们展示了我们的结果,并提供了一种基于在sprs -2系泊处收集的数据的验证技术。
{"title":"Automatic detection of rainfall at hourly time scales from mooring near-surface salinity in the eastern tropical Pacific","authors":"O. Chkrebtii, F. Bingham","doi":"10.1175/aies-d-22-0009.1","DOIUrl":"https://doi.org/10.1175/aies-d-22-0009.1","url":null,"abstract":"\u0000We explore the use of ocean near-surface salinity (NSS), i.e. salinity at 1 m depth, as a rainfall occurrence detector for hourly precipitation using data from the SPURS-2 (Salinity Processes in the Upper-ocean Regional Studies - 2) mooring at 10°N,125°W. Our proposed unsupervised learning algorithm consisting of two stages. First, an empirical quantile-based identification of dips in NSS enables us to capture most events with hourly averaged rainfall rate > 5 mm/hr. Over-estimation of precipitation duration is then corrected locally by fitting a parametric model based on the salinity balance equation. We propose a local precipitation model composed of a small number of calibration parameters representing individual rainfall events and their location in time. We show that unsupervised rainfall detection can be formulated as a statistical problem of predicting these variables from NSS data. We present our results and provide a validation technique based on data collected at the SPURS-2 mooring.","PeriodicalId":94369,"journal":{"name":"Artificial intelligence for the earth systems","volume":"27 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89674925","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deriving Severe Hail Likelihood from Satellite Observations and Model Reanalysis Parameters using a Deep Neural Network 利用深度神经网络从卫星观测和模式再分析参数推导强冰雹可能性
Pub Date : 2023-06-23 DOI: 10.1175/aies-d-22-0042.1
B. Scarino, K. Itterly, Kristopher Bedka, C. Homeyer, J. Allen, S. Bang, Daniel J. Cecil
Geostationary satellite imagers provide historical and near-real-time observations of cloud top patterns that are commonly associated with severe convection. Environmental conditions favorable for severe weather are thought to be represented well by reanalyses. Predicting exactly where convection and costly storm hazards like hail will occur using models or satellite imagery alone, however, is extremely challenging. The multivariate combination of satellite-observed cloud patterns with reanalysis environmental parameters, linked to Next Generation Weather Radar- (NEXRAD-) estimated Maximum Expected Size of Hail (MESH) using a deep neural network (DNN), enables estimation of potentially severe hail likelihood for any observed storm cell. These estimates are made where satellites observe cold clouds, indicative of convection, located in favorable storm environments. We seek an approach that can be used to estimate climatological hailstorm frequency and risk throughout the historical satellite data record.Statistical distributions of convective parameters from satellite and reanalysis show separation between non-severe/severe hailstorm classes for predictors including overshooting cloud top temperature and area characteristics, vertical wind shear, and convective inhibition. These complex, multivariate predictor relationships are exploited within a DNN to produce a likelihood estimate with a critical success index of 0.511 and Heidke skill score of 0.407, which is exceptional among analogous hail studies. Furthermore, applications of the DNN to case studies demonstrate good qualitative agreement between hail likelihood and MESH. These hail classifications are aggregated across an 11-year GOES-12/13 image database to derive a hail frequency and severity climatology, which denotes the Central Plains, the Midwest, and northwestern Mexico as being the most hail-prone regions within the domain studied.
地球同步卫星成像仪提供了通常与强对流有关的云顶模式的历史和近实时观测。人们认为重新分析很好地反映了有利于恶劣天气的环境条件。然而,仅使用模型或卫星图像准确预测对流和冰雹等代价高昂的风暴灾害将发生的地方是极具挑战性的。卫星观测到的云型与再分析环境参数的多元组合,与下一代天气雷达(NEXRAD)结合,使用深度神经网络(DNN)估计最大预期冰雹大小(MESH),可以估计任何观测到的风暴单体的潜在严重冰雹可能性。这些估计是在卫星观测到位于有利的风暴环境中指示对流的冷云时作出的。我们寻求一种方法,可用于估计整个历史卫星数据记录的气候冰雹频率和风险。来自卫星和再分析的对流参数的统计分布显示非严重/严重冰雹等级之间的分离,包括超调云顶温度和面积特征、垂直风切变和对流抑制。在DNN中利用这些复杂的多变量预测关系来产生具有0.511关键成功指数和0.407 Heidke技能分数的可能性估计,这在类似的冰雹研究中是例外的。此外,DNN在案例研究中的应用表明,冰雹可能性和MESH之间具有良好的定性一致性。这些冰雹分类是在一个11年的GOES-12/13图像数据库中汇总的,以得出冰雹频率和严重程度气候学,其中表明中部平原,中西部和墨西哥西北部是研究领域内最容易发生冰雹的地区。
{"title":"Deriving Severe Hail Likelihood from Satellite Observations and Model Reanalysis Parameters using a Deep Neural Network","authors":"B. Scarino, K. Itterly, Kristopher Bedka, C. Homeyer, J. Allen, S. Bang, Daniel J. Cecil","doi":"10.1175/aies-d-22-0042.1","DOIUrl":"https://doi.org/10.1175/aies-d-22-0042.1","url":null,"abstract":"\u0000Geostationary satellite imagers provide historical and near-real-time observations of cloud top patterns that are commonly associated with severe convection. Environmental conditions favorable for severe weather are thought to be represented well by reanalyses. Predicting exactly where convection and costly storm hazards like hail will occur using models or satellite imagery alone, however, is extremely challenging. The multivariate combination of satellite-observed cloud patterns with reanalysis environmental parameters, linked to Next Generation Weather Radar- (NEXRAD-) estimated Maximum Expected Size of Hail (MESH) using a deep neural network (DNN), enables estimation of potentially severe hail likelihood for any observed storm cell. These estimates are made where satellites observe cold clouds, indicative of convection, located in favorable storm environments. We seek an approach that can be used to estimate climatological hailstorm frequency and risk throughout the historical satellite data record.\u0000Statistical distributions of convective parameters from satellite and reanalysis show separation between non-severe/severe hailstorm classes for predictors including overshooting cloud top temperature and area characteristics, vertical wind shear, and convective inhibition. These complex, multivariate predictor relationships are exploited within a DNN to produce a likelihood estimate with a critical success index of 0.511 and Heidke skill score of 0.407, which is exceptional among analogous hail studies. Furthermore, applications of the DNN to case studies demonstrate good qualitative agreement between hail likelihood and MESH. These hail classifications are aggregated across an 11-year GOES-12/13 image database to derive a hail frequency and severity climatology, which denotes the Central Plains, the Midwest, and northwestern Mexico as being the most hail-prone regions within the domain studied.","PeriodicalId":94369,"journal":{"name":"Artificial intelligence for the earth systems","volume":"6 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84471686","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Probabilistic Forecasting Methods of Winter Mixed Precipitation Events in New York State Utilizing a Random Forest 基于随机森林的纽约州冬季混合降水事件概率预测方法
Pub Date : 2023-06-23 DOI: 10.1175/aies-d-22-0080.1
Brian C. Filipiak, N. Bassill, Kristen Corbosiero, A. Lang, Ross A. Lazear
Winter mixed precipitation events are associated with multiple hazards and create forecast challenges due to the difficulty in determining the timing and amount of each precipitation type. In New York State, complex terrain enhances these forecast challenges. Machine learning is a relatively nascent tool that can help improve forecasting by synthesizing large amounts of data and finding underlying relationships. This study uses a random forest machine learning algorithm that generates probabilistic winter precipitation type forecasts. Random forest configuration, testing, and development methods are presented to show how this tool can be applied to operational forecasting. Dataset generation and variation are also explained due to their essential nature in the random forest. Lastly, the methodology of transitioning a machine learning algorithm from research to operations is discussed.
冬季混合降水事件与多种灾害有关,由于难以确定每种降水类型的时间和数量,因此给预报带来了挑战。在纽约州,复杂的地形增加了这些预测的挑战。机器学习是一种相对新兴的工具,它可以通过综合大量数据和发现潜在关系来帮助改进预测。本研究使用随机森林机器学习算法生成概率冬季降水类型预测。随机森林配置、测试和开发方法展示了如何将此工具应用于操作预测。由于数据集的生成和变化在随机森林中的本质,也解释了它们的生成和变化。最后,讨论了机器学习算法从研究到操作的过渡方法。
{"title":"Probabilistic Forecasting Methods of Winter Mixed Precipitation Events in New York State Utilizing a Random Forest","authors":"Brian C. Filipiak, N. Bassill, Kristen Corbosiero, A. Lang, Ross A. Lazear","doi":"10.1175/aies-d-22-0080.1","DOIUrl":"https://doi.org/10.1175/aies-d-22-0080.1","url":null,"abstract":"\u0000Winter mixed precipitation events are associated with multiple hazards and create forecast challenges due to the difficulty in determining the timing and amount of each precipitation type. In New York State, complex terrain enhances these forecast challenges. Machine learning is a relatively nascent tool that can help improve forecasting by synthesizing large amounts of data and finding underlying relationships. This study uses a random forest machine learning algorithm that generates probabilistic winter precipitation type forecasts. Random forest configuration, testing, and development methods are presented to show how this tool can be applied to operational forecasting. Dataset generation and variation are also explained due to their essential nature in the random forest. Lastly, the methodology of transitioning a machine learning algorithm from research to operations is discussed.","PeriodicalId":94369,"journal":{"name":"Artificial intelligence for the earth systems","volume":"19 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"75719090","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Artificial intelligence for the earth systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1