首页 > 最新文献

Artificial intelligence for the earth systems最新文献

英文 中文
Subseasonal Representation and Predictability of North American Weather Regimes using Cluster Analysis 利用聚类分析的北美天气状况的亚季节表征和可预测性
Pub Date : 2023-01-30 DOI: 10.1175/aies-d-22-0051.1
M. Molina, J. Richter, A. Glanville, K. Dagon, J. Berner, A. Hu, G. Meehl
This study focuses on assessing the representation and predictability of North American weather regimes, which are persistent large-scale atmospheric patterns, in a set of initialized subseasonal reforecasts created using the Community Earth System Model version 2 (CESM2). K-means clustering was used to extract four key North American (10-70°N, 150-40°W) weather regimes within ERA5 reanalysis, which were used to interpret CESM2 subseasonal forecast performance. Results show that CESM2 can recreate the climatology of the four main North American weather regimes with skill, but exhibits biases during later lead times with over occurrence of the West Coast High regime and under occurrence of the Greenland High and Alaskan Ridge regimes. Overall, the West Coast High and Pacific Trough regimes exhibited higher predictability within CESM2, partly related to El Niño. Despite biases, several reforecasts were skillful and exhibited high predictability during later lead times, which could be partly attributed to skillful representation of the atmosphere from the tropics to extratropics upstream of North America. The high predictability at the subseasonal time scale of these case study examples was manifested as an “ensemble realignment,” in which most ensemble members agreed on a prediction despite ensemble trajectory dispersion during earlier lead times. Weather regimes were also shown to project distinct temperature and precipitation anomalies across North America that largely agree with observational products. This study further demonstrates that unsupervised learning methods can be used to uncover sources and limits of subseasonal predictability, along with systematic biases present in numerical prediction systems.
本研究的重点是评估北美天气状况的代表性和可预测性,这些天气状况是持续的大尺度大气模式,在使用社区地球系统模式第2版(CESM2)创建的一组初始化的亚季节再预报中。在ERA5再分析中,使用K-means聚类提取了四个关键的北美(10-70°N, 150-40°W)天气状态,用于解释CESM2的亚季节预报性能。结果表明,CESM2可以较好地再现北美4种主要天气状态的气候学,但在较晚的预期表现出偏差,西海岸高压状态出现偏多,格陵兰高压和阿拉斯加脊状态出现偏少。总体而言,西海岸高压和太平洋槽在CESM2内表现出更高的可预测性,部分与El Niño有关。尽管存在偏差,但几次重新预报是熟练的,并且在后期预估时间内表现出很高的可预测性,这可能部分归因于从热带到北美上游温带地区的大气的熟练表现。这些案例研究示例在亚季节时间尺度上的高可预测性表现为“集合调整”,其中大多数集合成员同意预测,尽管集合轨迹在早期提前期分散。天气状况也显示出北美各地不同的温度和降水异常,这在很大程度上与观测产品一致。本研究进一步表明,无监督学习方法可用于揭示亚季节可预测性的来源和限制,以及数值预测系统中存在的系统偏差。
{"title":"Subseasonal Representation and Predictability of North American Weather Regimes using Cluster Analysis","authors":"M. Molina, J. Richter, A. Glanville, K. Dagon, J. Berner, A. Hu, G. Meehl","doi":"10.1175/aies-d-22-0051.1","DOIUrl":"https://doi.org/10.1175/aies-d-22-0051.1","url":null,"abstract":"\u0000This study focuses on assessing the representation and predictability of North American weather regimes, which are persistent large-scale atmospheric patterns, in a set of initialized subseasonal reforecasts created using the Community Earth System Model version 2 (CESM2). K-means clustering was used to extract four key North American (10-70°N, 150-40°W) weather regimes within ERA5 reanalysis, which were used to interpret CESM2 subseasonal forecast performance. Results show that CESM2 can recreate the climatology of the four main North American weather regimes with skill, but exhibits biases during later lead times with over occurrence of the West Coast High regime and under occurrence of the Greenland High and Alaskan Ridge regimes. Overall, the West Coast High and Pacific Trough regimes exhibited higher predictability within CESM2, partly related to El Niño. Despite biases, several reforecasts were skillful and exhibited high predictability during later lead times, which could be partly attributed to skillful representation of the atmosphere from the tropics to extratropics upstream of North America. The high predictability at the subseasonal time scale of these case study examples was manifested as an “ensemble realignment,” in which most ensemble members agreed on a prediction despite ensemble trajectory dispersion during earlier lead times. Weather regimes were also shown to project distinct temperature and precipitation anomalies across North America that largely agree with observational products. This study further demonstrates that unsupervised learning methods can be used to uncover sources and limits of subseasonal predictability, along with systematic biases present in numerical prediction systems.","PeriodicalId":94369,"journal":{"name":"Artificial intelligence for the earth systems","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90264225","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Convective-scale Assimilation of Cloud Cover from Photographs using a Machine Learning Forward Operator 基于机器学习正演算子的照片云量对流尺度同化
Pub Date : 2023-01-19 DOI: 10.1175/aies-d-22-0025.1
Maria Reinhardt, S. Schoger, Frederik Kurzrock, R. Potthast
This paper presents an innovational way of assimilating observations of clouds into the ICOsahedral Nonhydrostatic weather forecasting model for regional scale, ICON-D2, which is operated by Deutscher Wetterdienst (DWD). A convolutional neural network (CNN) is trained to detect clouds in camera photographs. The network’s output is a greyscale picture, in which each pixel has a value between 0 and 1, describing the probability of the pixel belonging to a cloud (1) or not (0). By averaging over a certain box of the picture a value for the cloud cover of that region is obtained. A forward operator is built to map an ICON model state into the observation space. A three dimensional grid in the space of the camera’s perspective is constructed and the ICON model variable cloud cover (CLC) is interpolated onto that grid. The maximum CLC along the rays that fabricate the camera grid, is taken as a model equivalent for each pixel. After superobbing, monitoring experiments have been conducted to compare the observations and model equivalents over a longer time period, yielding promising results. Further we show the performance of a single assimilation step as well as a longer assimilation experiment over a time period of six days which also yields good results. These findings are a proof of concept and further research has to be invested before these new innovational observations can be assimilated operationally in any numerical weather prediction (NWP) model.
本文提出了一种将云观测资料同化到区域尺度icosterdral non - hydro静力天气预报模式ICON-D2中的创新方法,该模式由Deutscher weterdienst (DWD)运行。训练卷积神经网络(CNN)来检测相机照片中的云。网络的输出是一幅灰度图像,其中每个像素的值在0到1之间,描述了像素属于云(1)或不属于云(0)的概率。通过对图像的某一框进行平均,得到该区域的云覆盖值。建立了一个前向算子,将ICON模型状态映射到观测空间。在摄像机视角空间中构造一个三维网格,并将ICON模型变量云量(CLC)插值到该网格中。沿构成相机网格的射线的最大CLC被作为每个像素的模型等效。在超级取样之后,进行了监测实验,以比较较长时间内的观测结果和模型当量,得出了有希望的结果。此外,我们还展示了单一同化步骤的性能以及为期六天的较长同化实验,这也产生了良好的结果。这些发现是概念的证明,在这些新的创新观测能够在任何数值天气预报(NWP)模式中进行业务吸收之前,必须进行进一步的研究。
{"title":"Convective-scale Assimilation of Cloud Cover from Photographs using a Machine Learning Forward Operator","authors":"Maria Reinhardt, S. Schoger, Frederik Kurzrock, R. Potthast","doi":"10.1175/aies-d-22-0025.1","DOIUrl":"https://doi.org/10.1175/aies-d-22-0025.1","url":null,"abstract":"\u0000This paper presents an innovational way of assimilating observations of clouds into the ICOsahedral Nonhydrostatic weather forecasting model for regional scale, ICON-D2, which is operated by Deutscher Wetterdienst (DWD). A convolutional neural network (CNN) is trained to detect clouds in camera photographs. The network’s output is a greyscale picture, in which each pixel has a value between 0 and 1, describing the probability of the pixel belonging to a cloud (1) or not (0). By averaging over a certain box of the picture a value for the cloud cover of that region is obtained. A forward operator is built to map an ICON model state into the observation space. A three dimensional grid in the space of the camera’s perspective is constructed and the ICON model variable cloud cover (CLC) is interpolated onto that grid. The maximum CLC along the rays that fabricate the camera grid, is taken as a model equivalent for each pixel. After superobbing, monitoring experiments have been conducted to compare the observations and model equivalents over a longer time period, yielding promising results. Further we show the performance of a single assimilation step as well as a longer assimilation experiment over a time period of six days which also yields good results. These findings are a proof of concept and further research has to be invested before these new innovational observations can be assimilated operationally in any numerical weather prediction (NWP) model.","PeriodicalId":94369,"journal":{"name":"Artificial intelligence for the earth systems","volume":"62 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"83939235","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Detail Enhancement of AIRS/AMSU Temperature and Moisture Profiles Using a 3D Deep Neural Network 使用3D深度神经网络的AIRS/AMSU温度和湿度剖面的细节增强
Pub Date : 2023-01-10 DOI: 10.1175/aies-d-22-0037.1
A. Milstein, J. Santanello, W. Blackwell
In recent decades, spaceborne microwave and hyperspectral infrared sounding instruments have significantly benefited weather forecasting and climate science. However, existing retrievals of lower troposphere temperature and humidity profiles have limitations in vertical resolution, and often cannot accurately represent key features such as the mixed layer thermodynamic structure and the inversion at the planetary boundary layer (PBL) top. Because of the existing limitations in PBL remote sensing from space, there is a compelling need to improve routine, global observations of the PBL and enable advances in scientific understanding and weather and climate prediction. To address this, we have developed a new 3D deep neural network (DNN) which enhances detail and reduces noise in Level 2 granules of temperature and humidity profiles from the Atmospheric Infrared Sounder (AIRS)/Advanced Microwave Sounding Unit (AMSU) sounder instruments aboard NASA’s Aqua spacecraft. We show that the enhancement improves accuracy and detail including key features such as capping inversions at the top of the PBL over land, resulting in improved accuracy in estimations of PBL height.
近几十年来,星载微波和高光谱红外探测仪器为天气预报和气候科学带来了巨大的好处。然而,现有的对流层低层温度和湿度廓线反演在垂直分辨率上存在局限性,而且往往不能准确反映混合层热力结构和行星边界层(PBL)顶部逆温等关键特征。由于目前空间PBL遥感的局限性,迫切需要改进对PBL的常规全球观测,并使科学认识和天气和气候预测取得进展。为了解决这个问题,我们开发了一种新的3D深度神经网络(DNN),可以增强细节并降低来自美国宇航局Aqua航天器上的大气红外探测仪(AIRS)/高级微波探测仪(AMSU)探测仪仪器的2级温度和湿度剖面的噪音。我们发现,这种增强提高了精度和细节,包括陆地上PBL顶部的封顶反演等关键特征,从而提高了PBL高度估计的精度。
{"title":"Detail Enhancement of AIRS/AMSU Temperature and Moisture Profiles Using a 3D Deep Neural Network","authors":"A. Milstein, J. Santanello, W. Blackwell","doi":"10.1175/aies-d-22-0037.1","DOIUrl":"https://doi.org/10.1175/aies-d-22-0037.1","url":null,"abstract":"\u0000In recent decades, spaceborne microwave and hyperspectral infrared sounding instruments have significantly benefited weather forecasting and climate science. However, existing retrievals of lower troposphere temperature and humidity profiles have limitations in vertical resolution, and often cannot accurately represent key features such as the mixed layer thermodynamic structure and the inversion at the planetary boundary layer (PBL) top. Because of the existing limitations in PBL remote sensing from space, there is a compelling need to improve routine, global observations of the PBL and enable advances in scientific understanding and weather and climate prediction. To address this, we have developed a new 3D deep neural network (DNN) which enhances detail and reduces noise in Level 2 granules of temperature and humidity profiles from the Atmospheric Infrared Sounder (AIRS)/Advanced Microwave Sounding Unit (AMSU) sounder instruments aboard NASA’s Aqua spacecraft. We show that the enhancement improves accuracy and detail including key features such as capping inversions at the top of the PBL over land, resulting in improved accuracy in estimations of PBL height.","PeriodicalId":94369,"journal":{"name":"Artificial intelligence for the earth systems","volume":"28 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82176067","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Emulating Rainfall-Runoff-Inundation Model using Deep Neural Network with Dimensionality Reduction 基于降维深度神经网络的降雨-径流-淹没模型模拟
Pub Date : 2023-01-10 DOI: 10.1175/aies-d-22-0036.1
M. Momoi, S. Kotsuki, Ryota Kikuchi, Satoshi Watanabe, Masafumi Yamada, Shiori Abe
Predicting the spatial distribution of maximum inundation depth (depth-MAP) is important for the mitigation of hydrological disasters induced by extreme precipitation. However, physics-based rainfall-runoff-inundation (RRI) models, which are used operationally to predict hydrological disasters in Japan, require massive computational resources for numerical simulations. Here, we aimed at developing a computationally inexpensive deep learning model (Rain2Depth) that emulates an RRI model. Our study focused on the Omono River (Akita Prefecture, Japan) and predicted the depth-MAP from spatial and temporal rainfall data for individual events.Rain2Depth was developed based on a convolutional neural network (CNN), and predicts depth-MAP from 7-day successive hourly rainfall at 13 rain gauge stations in the basin. For training the Rain2Depth, we simulated the depth-MAP by the RRI model forced by 50-ensembles of 30-year data from large-ensemble weather/climate predictions. Instead of using the input and output data directly, we extracted important features from input and output data with two dimensionality reduction techniques (principal component analysis (PCA) and the CNN approach) prior to training the network. This dimensionality reduction aimed to avoid overfitting caused by insufficient training data. The nonlinear CNN approach was superior to the linear PCA for extracting features. Finally, Rain2Depth was architected by connecting the extracted features between input and output data through a neural network.Rain2Depth-based predictions were more accurate than predictions from our previous model (K20), which used ensemble learning of multiple regularized regressions for a specific station. Whereas the K20 can predict maximum inundation depth only at stations, our study achieved depth-MAP prediction by training only the single model Rain2Depth.
最大淹没深度(deep - map)的空间分布预测对于缓解极端降水引起的水文灾害具有重要意义。然而,基于物理的降雨-径流-淹没(RRI)模型在日本用于实际预测水文灾害,需要大量的计算资源进行数值模拟。在这里,我们的目标是开发一种模拟RRI模型的计算成本低廉的深度学习模型(Rain2Depth)。本研究以日本秋田县小野河为研究对象,利用单个事件的时空降水数据预测深度- map。Rain2Depth是基于卷积神经网络(CNN)开发的,并根据盆地13个雨量站连续7天的每小时降雨量预测深度- map。为了训练Rain2Depth,我们使用RRI模式模拟深度- map,该模式由50个大集合天气/气候预测的30年数据组成。我们没有直接使用输入和输出数据,而是在训练网络之前使用两维降维技术(主成分分析(PCA)和CNN方法)从输入和输出数据中提取重要特征。这种降维的目的是为了避免训练数据不足导致的过拟合。在特征提取方面,非线性CNN方法优于线性PCA方法。最后,通过神经网络连接提取的输入和输出数据之间的特征来构建Rain2Depth。基于rain2depth的预测比我们之前的模型(K20)的预测更准确,后者使用了特定站点的多个正则化回归的集成学习。K20只能在站点上预测最大淹没深度,而我们的研究仅通过训练单一模式Rain2Depth来实现深度- map预测。
{"title":"Emulating Rainfall-Runoff-Inundation Model using Deep Neural Network with Dimensionality Reduction","authors":"M. Momoi, S. Kotsuki, Ryota Kikuchi, Satoshi Watanabe, Masafumi Yamada, Shiori Abe","doi":"10.1175/aies-d-22-0036.1","DOIUrl":"https://doi.org/10.1175/aies-d-22-0036.1","url":null,"abstract":"\u0000Predicting the spatial distribution of maximum inundation depth (depth-MAP) is important for the mitigation of hydrological disasters induced by extreme precipitation. However, physics-based rainfall-runoff-inundation (RRI) models, which are used operationally to predict hydrological disasters in Japan, require massive computational resources for numerical simulations. Here, we aimed at developing a computationally inexpensive deep learning model (Rain2Depth) that emulates an RRI model. Our study focused on the Omono River (Akita Prefecture, Japan) and predicted the depth-MAP from spatial and temporal rainfall data for individual events.\u0000Rain2Depth was developed based on a convolutional neural network (CNN), and predicts depth-MAP from 7-day successive hourly rainfall at 13 rain gauge stations in the basin. For training the Rain2Depth, we simulated the depth-MAP by the RRI model forced by 50-ensembles of 30-year data from large-ensemble weather/climate predictions. Instead of using the input and output data directly, we extracted important features from input and output data with two dimensionality reduction techniques (principal component analysis (PCA) and the CNN approach) prior to training the network. This dimensionality reduction aimed to avoid overfitting caused by insufficient training data. The nonlinear CNN approach was superior to the linear PCA for extracting features. Finally, Rain2Depth was architected by connecting the extracted features between input and output data through a neural network.\u0000Rain2Depth-based predictions were more accurate than predictions from our previous model (K20), which used ensemble learning of multiple regularized regressions for a specific station. Whereas the K20 can predict maximum inundation depth only at stations, our study achieved depth-MAP prediction by training only the single model Rain2Depth.","PeriodicalId":94369,"journal":{"name":"Artificial intelligence for the earth systems","volume":"1 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89724844","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Creating and Evaluating Uncertainty Estimates with Neural Networks for Environmental-Science Applications 用神经网络创建和评估环境科学应用中的不确定性估计
Pub Date : 2023-01-09 DOI: 10.1175/aies-d-22-0061.1
Katherine Haynes, Ryan Lagerquist, M. McGraw, K. Musgrave, I. Ebert‐Uphoff
Neural networks (NN) have become an important tool for prediction tasks – both regression and classification – in environmental science. Since many environmental-science problems involve life-or-death decisions and policy-making, it is crucial to provide not only predictions but also an estimate of the uncertainty in the predictions. Until recently, very few tools were available to provide uncertainty quantification (UQ) for NN predictions. However, in recent years the computer-science field has developed numerous UQ approaches, and several research groups are exploring how to apply these approaches in environmental science. We provide an accessible introduction to six of these UQ approaches, then focus on tools for the next step, namely to answer the question: Once we obtain an uncertainty estimate (using any approach), how do we know whether it is good or bad? To answer this question, we highlight four evaluation graphics and eight evaluation scores that are well suited for evaluating and comparing uncertainty estimates (NN-based or otherwise) for environmental-science applications. We demonstrate the UQ approaches and UQ-evaluation methods for two real-world problems: (1) estimating vertical profiles of atmospheric dewpoint (a regression task) and (2) predicting convection over Taiwan based on Himawari-8 satellite imagery (a classification task). We also provide Jupyter notebooks with Python code for implementing the UQ approaches and UQ-evaluation methods discussed herein. This article provides the environmental-science community with the knowledge and tools to start incorporating the large number of emerging UQ methods into their research.
神经网络(NN)已经成为环境科学预测任务(回归和分类)的重要工具。由于许多环境科学问题涉及生死攸关的决策和政策制定,因此不仅要提供预测,还要提供预测中不确定性的估计,这一点至关重要。直到最近,很少有工具可以为神经网络预测提供不确定性量化(UQ)。然而,近年来计算机科学领域已经发展了许多UQ方法,一些研究小组正在探索如何将这些方法应用于环境科学。我们为这些UQ方法中的六种提供了一个可访问的介绍,然后将重点放在下一步的工具上,即回答这个问题:一旦我们获得了不确定性估计(使用任何方法),我们如何知道它是好是坏?为了回答这个问题,我们强调了四个评估图形和八个评估分数,它们非常适合评估和比较环境科学应用的不确定性估计(基于神经网络或其他)。我们展示了UQ方法和UQ评估方法,用于两个现实问题:(1)估计大气露点的垂直剖面(回归任务)和(2)基于Himawari-8卫星图像的台湾对流预测(分类任务)。我们还提供了带有Python代码的Jupyter笔记本,用于实现本文讨论的UQ方法和UQ评估方法。本文为环境科学界提供了知识和工具,以开始将大量新兴的UQ方法纳入他们的研究中。
{"title":"Creating and Evaluating Uncertainty Estimates with Neural Networks for Environmental-Science Applications","authors":"Katherine Haynes, Ryan Lagerquist, M. McGraw, K. Musgrave, I. Ebert‐Uphoff","doi":"10.1175/aies-d-22-0061.1","DOIUrl":"https://doi.org/10.1175/aies-d-22-0061.1","url":null,"abstract":"\u0000Neural networks (NN) have become an important tool for prediction tasks – both regression and classification – in environmental science. Since many environmental-science problems involve life-or-death decisions and policy-making, it is crucial to provide not only predictions but also an estimate of the uncertainty in the predictions. Until recently, very few tools were available to provide uncertainty quantification (UQ) for NN predictions. However, in recent years the computer-science field has developed numerous UQ approaches, and several research groups are exploring how to apply these approaches in environmental science. We provide an accessible introduction to six of these UQ approaches, then focus on tools for the next step, namely to answer the question: Once we obtain an uncertainty estimate (using any approach), how do we know whether it is good or bad? To answer this question, we highlight four evaluation graphics and eight evaluation scores that are well suited for evaluating and comparing uncertainty estimates (NN-based or otherwise) for environmental-science applications. We demonstrate the UQ approaches and UQ-evaluation methods for two real-world problems: (1) estimating vertical profiles of atmospheric dewpoint (a regression task) and (2) predicting convection over Taiwan based on Himawari-8 satellite imagery (a classification task). We also provide Jupyter notebooks with Python code for implementing the UQ approaches and UQ-evaluation methods discussed herein. This article provides the environmental-science community with the knowledge and tools to start incorporating the large number of emerging UQ methods into their research.","PeriodicalId":94369,"journal":{"name":"Artificial intelligence for the earth systems","volume":"10 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87885477","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Sub-seasonal Prediction of Central European Summer Heatwaves with Linear and Random Forest Machine Learning Models 基于线性和随机森林机器学习模型的中欧夏季热浪分季节预测
Pub Date : 2023-01-09 DOI: 10.1175/aies-d-22-0038.1
Elizabeth Weirich Benet, M. Pyrina, B. Jiménez-Esteve, E. Fraenkel, J. Cohen, D. Domeisen
Heatwaves are extreme near-surface temperature events that can have substantial impacts on ecosystems and society. EarlyWarning Systems help to reduce these impacts by helping communities prepare for hazardous climate-related events. However, state-of-the-art prediction systems can often not make accurate forecasts of heatwaves more than two weeks in advance, which are required for advance warnings. We therefore investigate the potential of statistical and machine learning methods to understand and predict central European summer heatwaves on timescales of several weeks. As a first step, we identify the most important regional atmospheric and surface predictors based on previous studies and supported by a correlation analysis: 2-m air temperature, 500-hPa geopotential, precipitation, and soil moisture in central Europe, as well as Mediterranean and North Atlantic sea surface temperatures, and the North Atlantic jet stream. Based on these predictors, we apply machine learning methods to forecast two targets: summer temperature anomalies and the probability of heatwaves for 1–6 weeks lead time at weekly resolution. For each of these two target variables, we use both a linear and a random forest model. The performance of these statistical models decays with lead time, as expected, but outperforms persistence and climatology at all lead times. For lead times longer than two weeks, our machine learning models compete with the ensemble mean of the European Centre for Medium-Range Weather Forecasts’ hindcast system. We thus show that machine learning can help improve sub-seasonal forecasts of summer temperature anomalies and heatwaves.
热浪是极端的近地表温度事件,可对生态系统和社会产生重大影响。预警系统通过帮助社区为与气候有关的危险事件做好准备,有助于减少这些影响。然而,最先进的预报系统往往不能提前两周以上准确预报热浪,而这是预警所必需的。因此,我们研究了统计和机器学习方法在几周时间尺度上理解和预测中欧夏季热浪的潜力。首先,我们在前人研究的基础上,通过相关分析确定了最重要的区域大气和地面预测因子:中欧2米气温、500 hpa位势、降水和土壤湿度,以及地中海和北大西洋海面温度和北大西洋急流。基于这些预测因子,我们应用机器学习方法来预测两个目标:夏季温度异常和以周分辨率提前1-6周的热浪概率。对于这两个目标变量中的每一个,我们都使用线性和随机森林模型。正如预期的那样,这些统计模型的性能随着前置时间的推移而衰减,但在所有前置时间内都优于持久性和气候学。如果提前期超过两周,我们的机器学习模型将与欧洲中期天气预报中心的后置系统的整体平均值竞争。因此,我们表明机器学习可以帮助改进夏季温度异常和热浪的分季节预测。
{"title":"Sub-seasonal Prediction of Central European Summer Heatwaves with Linear and Random Forest Machine Learning Models","authors":"Elizabeth Weirich Benet, M. Pyrina, B. Jiménez-Esteve, E. Fraenkel, J. Cohen, D. Domeisen","doi":"10.1175/aies-d-22-0038.1","DOIUrl":"https://doi.org/10.1175/aies-d-22-0038.1","url":null,"abstract":"\u0000Heatwaves are extreme near-surface temperature events that can have substantial impacts on ecosystems and society. EarlyWarning Systems help to reduce these impacts by helping communities prepare for hazardous climate-related events. However, state-of-the-art prediction systems can often not make accurate forecasts of heatwaves more than two weeks in advance, which are required for advance warnings. We therefore investigate the potential of statistical and machine learning methods to understand and predict central European summer heatwaves on timescales of several weeks. As a first step, we identify the most important regional atmospheric and surface predictors based on previous studies and supported by a correlation analysis: 2-m air temperature, 500-hPa geopotential, precipitation, and soil moisture in central Europe, as well as Mediterranean and North Atlantic sea surface temperatures, and the North Atlantic jet stream. Based on these predictors, we apply machine learning methods to forecast two targets: summer temperature anomalies and the probability of heatwaves for 1–6 weeks lead time at weekly resolution. For each of these two target variables, we use both a linear and a random forest model. The performance of these statistical models decays with lead time, as expected, but outperforms persistence and climatology at all lead times. For lead times longer than two weeks, our machine learning models compete with the ensemble mean of the European Centre for Medium-Range Weather Forecasts’ hindcast system. We thus show that machine learning can help improve sub-seasonal forecasts of summer temperature anomalies and heatwaves.","PeriodicalId":94369,"journal":{"name":"Artificial intelligence for the earth systems","volume":"68 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87197752","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Using Neural Networks to Learn the Jet Stream Forced Response from Natural Variability 利用神经网络从自然变率中学习急流强迫响应
Pub Date : 2023-01-02 DOI: 10.1175/aies-d-22-0094.1
Charlotte Connolly, E. Barnes, P. Hassanzadeh, M. Pritchard
Two distinct features of anthropogenic climate change, warming in the tropical upper troposphere and warming at the Arctic surface, have competing effects on the mid-latitude jet stream’s latitudinal position, often referred to as a “tug-of-war”. Studies that investigate the jet’s response to these thermal forcings show that it is sensitive to model type, season, initial atmospheric conditions, and the shape and magnitude of the forcing. Much of this past work focuses on studying a simulation’s response to external manipulation. In contrast, we explore the potential to train a convolutional neural network (CNN) on internal variability alone and then use it to examine possible nonlinear responses of the jet to tropospheric thermal forcing that more closely resemble anthropogenic climate change. Our approach leverages the idea behind the fluctuationdissipation theorem, which relates the internal variability of a system to its forced response but so far has been only used to quantify linear responses. We train a CNN on data from a long control run of the CESM dry dynamical core and show that it is able to skillfully predict the nonlinear response of the jet to sustained external forcing. The trained CNN provides a quick method for exploring the jet stream sensitivity to a wide range of tropospheric temperature tendencies and, considering that this method can likely be applied to any model with a long control run, could lend itself useful for early stage experiment design.
人为气候变化的两个明显特征,热带对流层上层的变暖和北极表面的变暖,对中纬度急流的纬度位置产生了相互竞争的影响,通常被称为“拔河”。研究喷气机对这些热强迫的反应表明,它对模式类型、季节、初始大气条件以及强迫的形状和大小都很敏感。过去的大部分工作都集中在研究模拟对外部操纵的反应上。相比之下,我们探索了在内部变率上单独训练卷积神经网络(CNN)的潜力,然后使用它来检查射流对对流层热强迫的可能的非线性响应,这种响应更接近于人为气候变化。我们的方法利用了波动耗散定理背后的思想,该定理将系统的内部可变性与其强迫响应联系起来,但迄今为止仅用于量化线性响应。我们在CESM干动力核心的长期控制运行数据上训练了一个CNN,并表明它能够巧妙地预测射流对持续外力的非线性响应。训练后的CNN提供了一种快速的方法来探索急流对大范围对流层温度趋势的敏感性,并且考虑到该方法可能适用于任何具有长期控制运行的模型,可以为早期实验设计提供有用的方法。
{"title":"Using Neural Networks to Learn the Jet Stream Forced Response from Natural Variability","authors":"Charlotte Connolly, E. Barnes, P. Hassanzadeh, M. Pritchard","doi":"10.1175/aies-d-22-0094.1","DOIUrl":"https://doi.org/10.1175/aies-d-22-0094.1","url":null,"abstract":"\u0000Two distinct features of anthropogenic climate change, warming in the tropical upper troposphere and warming at the Arctic surface, have competing effects on the mid-latitude jet stream’s latitudinal position, often referred to as a “tug-of-war”. Studies that investigate the jet’s response to these thermal forcings show that it is sensitive to model type, season, initial atmospheric conditions, and the shape and magnitude of the forcing. Much of this past work focuses on studying a simulation’s response to external manipulation. In contrast, we explore the potential to train a convolutional neural network (CNN) on internal variability alone and then use it to examine possible nonlinear responses of the jet to tropospheric thermal forcing that more closely resemble anthropogenic climate change. Our approach leverages the idea behind the fluctuationdissipation theorem, which relates the internal variability of a system to its forced response but so far has been only used to quantify linear responses. We train a CNN on data from a long control run of the CESM dry dynamical core and show that it is able to skillfully predict the nonlinear response of the jet to sustained external forcing. The trained CNN provides a quick method for exploring the jet stream sensitivity to a wide range of tropospheric temperature tendencies and, considering that this method can likely be applied to any model with a long control run, could lend itself useful for early stage experiment design.","PeriodicalId":94369,"journal":{"name":"Artificial intelligence for the earth systems","volume":"6 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"82488604","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Deep Learning–Based Parameter Transfer in Meteorological Data 基于深度学习的气象数据参数传递
Pub Date : 2023-01-01 DOI: 10.1175/aies-d-22-0024.1
Fatemeh Farokhmanesh, Kevin Höhlein, R. Westermann
Numerical simulations in Earth-system sciences consider a multitude of physical parameters in space and time, leading to severe input/output (I/O) bandwidth requirements and challenges in subsequent data analysis tasks. Deep learning–based identification of redundant parameters and prediction of those from other parameters, that is, variable-to-variable (V2V) transfer, has been proposed as an approach to lessening the bandwidth requirements and streamlining subsequent data analysis. In this paper, we examine the applicability of V2V to meteorological reanalysis data. We find that redundancies within pairs of parameter fields are limited, which hinders application of the original V2V algorithm. Therefore, we assess the predictive strength of reanalysis parameters by analyzing the learning behavior of V2V reconstruction networks in an ablation study. We demonstrate that efficient V2V transfer becomes possible when considering groups of parameter fields for transfer and propose an algorithm to implement this. We investigate further whether the neural networks trained in the V2V process can yield insightful representations of recurring patterns in the data. The interpretability of these representations is assessed via layerwise relevance propagation that highlights field areas and parameters of high importance for the reconstruction model. Applied to reanalysis data, this allows for uncovering mutual relationships between landscape orography and different regional weather situations. We see our approach as an effective means to reduce bandwidth requirements in numerical weather simulations, which can be used on top of conventional data compression schemes. The proposed identification of multiparameter features can spawn further research on the importance of regional weather situations for parameter prediction and also in other kinds of simulation data.
地球系统科学中的数值模拟考虑了空间和时间上的大量物理参数,导致了对输入/输出(I/O)带宽的严格要求和后续数据分析任务的挑战。基于深度学习的冗余参数识别和从其他参数中预测冗余参数,即变量到变量(V2V)传递,已被提出作为一种减少带宽需求和简化后续数据分析的方法。本文探讨了V2V在气象再分析数据中的适用性。我们发现参数字段对之间的冗余是有限的,这阻碍了原始V2V算法的应用。因此,我们通过分析消融研究中V2V重建网络的学习行为来评估再分析参数的预测强度。我们证明了当考虑传输的参数字段组时,有效的V2V传输成为可能,并提出了实现这一目标的算法。我们进一步研究了在V2V过程中训练的神经网络是否能够对数据中重复出现的模式产生有洞察力的表示。这些表示的可解释性通过分层相关性传播来评估,该传播突出了对重建模型高度重要的领域和参数。应用于再分析数据,可以揭示景观地形和不同区域天气状况之间的相互关系。我们认为我们的方法是减少数值天气模拟中带宽需求的有效手段,可以在传统的数据压缩方案之上使用。提出的多参数特征识别可以进一步研究区域天气状况对参数预测的重要性,也可以在其他类型的模拟数据中进行研究。
{"title":"Deep Learning–Based Parameter Transfer in Meteorological Data","authors":"Fatemeh Farokhmanesh, Kevin Höhlein, R. Westermann","doi":"10.1175/aies-d-22-0024.1","DOIUrl":"https://doi.org/10.1175/aies-d-22-0024.1","url":null,"abstract":"\u0000Numerical simulations in Earth-system sciences consider a multitude of physical parameters in space and time, leading to severe input/output (I/O) bandwidth requirements and challenges in subsequent data analysis tasks. Deep learning–based identification of redundant parameters and prediction of those from other parameters, that is, variable-to-variable (V2V) transfer, has been proposed as an approach to lessening the bandwidth requirements and streamlining subsequent data analysis. In this paper, we examine the applicability of V2V to meteorological reanalysis data. We find that redundancies within pairs of parameter fields are limited, which hinders application of the original V2V algorithm. Therefore, we assess the predictive strength of reanalysis parameters by analyzing the learning behavior of V2V reconstruction networks in an ablation study. We demonstrate that efficient V2V transfer becomes possible when considering groups of parameter fields for transfer and propose an algorithm to implement this. We investigate further whether the neural networks trained in the V2V process can yield insightful representations of recurring patterns in the data. The interpretability of these representations is assessed via layerwise relevance propagation that highlights field areas and parameters of high importance for the reconstruction model. Applied to reanalysis data, this allows for uncovering mutual relationships between landscape orography and different regional weather situations. We see our approach as an effective means to reduce bandwidth requirements in numerical weather simulations, which can be used on top of conventional data compression schemes. The proposed identification of multiparameter features can spawn further research on the importance of regional weather situations for parameter prediction and also in other kinds of simulation data.","PeriodicalId":94369,"journal":{"name":"Artificial intelligence for the earth systems","volume":"2 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"81152718","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A Primer on Topological Data Analysis to Support Image Analysis Tasks in Environmental Science 支持环境科学中图像分析任务的拓扑数据分析入门
Pub Date : 2023-01-01 DOI: 10.1175/aies-d-22-0039.1
Lander Ver Hoef, Henry Adams, Emily J. King, Imme Ebert-Uphoff
Abstract Topological data analysis (TDA) is a tool from data science and mathematics that is beginning to make waves in environmental science. In this work, we seek to provide an intuitive and understandable introduction to a tool from TDA that is particularly useful for the analysis of imagery, namely, persistent homology. We briefly discuss the theoretical background but focus primarily on understanding the output of this tool and discussing what information it can glean. To this end, we frame our discussion around a guiding example of classifying satellite images from the sugar, fish, flower, and gravel dataset produced for the study of mesoscale organization of clouds by Rasp et al. We demonstrate how persistent homology and its vectorization, persistence landscapes, can be used in a workflow with a simple machine learning algorithm to obtain good results, and we explore in detail how we can explain this behavior in terms of image-level features. One of the core strengths of persistent homology is how interpretable it can be, so throughout this paper we discuss not just the patterns we find but why those results are to be expected given what we know about the theory of persistent homology. Our goal is that readers of this paper will leave with a better understanding of TDA and persistent homology, will be able to identify problems and datasets of their own for which persistent homology could be helpful, and will gain an understanding of the results they obtain from applying the included GitHub example code. Significance Statement Information such as the geometric structure and texture of image data can greatly support the inference of the physical state of an observed Earth system, for example, in remote sensing to determine whether wildfires are active or to identify local climate zones. Persistent homology is a branch of topological data analysis that allows one to extract such information in an interpretable way—unlike black-box methods like deep neural networks. The purpose of this paper is to explain in an intuitive manner what persistent homology is and how researchers in environmental science can use it to create interpretable models. We demonstrate the approach to identify certain cloud patterns from satellite imagery and find that the resulting model is indeed interpretable.
拓扑数据分析(TDA)是一种来自数据科学和数学的工具,它开始在环境科学中掀起波澜。在这项工作中,我们试图提供一个直观和可理解的介绍,从TDA的工具,是特别有用的分析图像,即持久同源。我们简要地讨论了理论背景,但主要集中在理解这个工具的输出和讨论它可以收集什么信息。为此,我们将围绕Rasp等人为研究云的中尺度组织而制作的糖、鱼、花和砾石数据集的卫星图像分类的指导性示例进行讨论。我们演示了如何在一个简单的机器学习算法的工作流中使用持久同构及其矢量化,持久景观,以获得良好的结果,我们详细探讨了如何在图像级特征方面解释这种行为。持久同调的核心优势之一是它的可解释性,因此在本文中,我们不仅讨论了我们发现的模式,还讨论了为什么我们知道关于持久同调理论的这些结果是可以预期的。我们的目标是,本文的读者将更好地理解TDA和持久同源性,将能够识别持久同源性可能有帮助的问题和数据集,并将了解他们从应用所包含的GitHub示例代码中获得的结果。图像数据的几何结构和纹理等信息可以极大地支持对被观测地球系统物理状态的推断,例如在遥感中确定野火是否活跃或确定当地气候带。持久同调是拓扑数据分析的一个分支,它允许人们以一种可解释的方式提取这些信息——不像深度神经网络这样的黑箱方法。本文的目的是以一种直观的方式解释什么是持久同源性,以及环境科学研究人员如何使用它来创建可解释的模型。我们演示了从卫星图像中识别某些云模式的方法,并发现所得模型确实是可解释的。
{"title":"A Primer on Topological Data Analysis to Support Image Analysis Tasks in Environmental Science","authors":"Lander Ver Hoef, Henry Adams, Emily J. King, Imme Ebert-Uphoff","doi":"10.1175/aies-d-22-0039.1","DOIUrl":"https://doi.org/10.1175/aies-d-22-0039.1","url":null,"abstract":"Abstract Topological data analysis (TDA) is a tool from data science and mathematics that is beginning to make waves in environmental science. In this work, we seek to provide an intuitive and understandable introduction to a tool from TDA that is particularly useful for the analysis of imagery, namely, persistent homology. We briefly discuss the theoretical background but focus primarily on understanding the output of this tool and discussing what information it can glean. To this end, we frame our discussion around a guiding example of classifying satellite images from the sugar, fish, flower, and gravel dataset produced for the study of mesoscale organization of clouds by Rasp et al. We demonstrate how persistent homology and its vectorization, persistence landscapes, can be used in a workflow with a simple machine learning algorithm to obtain good results, and we explore in detail how we can explain this behavior in terms of image-level features. One of the core strengths of persistent homology is how interpretable it can be, so throughout this paper we discuss not just the patterns we find but why those results are to be expected given what we know about the theory of persistent homology. Our goal is that readers of this paper will leave with a better understanding of TDA and persistent homology, will be able to identify problems and datasets of their own for which persistent homology could be helpful, and will gain an understanding of the results they obtain from applying the included GitHub example code. Significance Statement Information such as the geometric structure and texture of image data can greatly support the inference of the physical state of an observed Earth system, for example, in remote sensing to determine whether wildfires are active or to identify local climate zones. Persistent homology is a branch of topological data analysis that allows one to extract such information in an interpretable way—unlike black-box methods like deep neural networks. The purpose of this paper is to explain in an intuitive manner what persistent homology is and how researchers in environmental science can use it to create interpretable models. We demonstrate the approach to identify certain cloud patterns from satellite imagery and find that the resulting model is indeed interpretable.","PeriodicalId":94369,"journal":{"name":"Artificial intelligence for the earth systems","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134956073","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Carefully Choose the Baseline: Lessons Learned from Applying XAI Attribution Methods for Regression Tasks in Geoscience 谨慎选择基线:应用XAI归因方法进行地球科学回归任务的经验教训
Pub Date : 2023-01-01 DOI: 10.1175/aies-d-22-0058.1
Antonios Mamalakis, Elizabeth A. Barnes, Imme Ebert-Uphoff
Abstract Methods of explainable artificial intelligence (XAI) are used in geoscientific applications to gain insights into the decision-making strategy of neural networks (NNs), highlighting which features in the input contribute the most to a NN prediction. Here, we discuss our “lesson learned” that the task of attributing a prediction to the input does not have a single solution. Instead, the attribution results depend greatly on the considered baseline that the XAI method utilizes—a fact that has been overlooked in the geoscientific literature. The baseline is a reference point to which the prediction is compared so that the prediction can be understood. This baseline can be chosen by the user or is set by construction in the method’s algorithm—often without the user being aware of that choice. We highlight that different baselines can lead to different insights for different science questions and, thus, should be chosen accordingly. To illustrate the impact of the baseline, we use a large ensemble of historical and future climate simulations forced with the shared socioeconomic pathway 3-7.0 (SSP3-7.0) scenario and train a fully connected NN to predict the ensemble- and global-mean temperature (i.e., the forced global warming signal) given an annual temperature map from an individual ensemble member. We then use various XAI methods and different baselines to attribute the network predictions to the input. We show that attributions differ substantially when considering different baselines, because they correspond to answering different science questions. We conclude by discussing important implications and considerations about the use of baselines in XAI research. Significance Statement In recent years, methods of explainable artificial intelligence (XAI) have found great application in geoscientific applications, because they can be used to attribute the predictions of neural networks (NNs) to the input and interpret them physically. Here, we highlight that the attributions—and the physical interpretation—depend greatly on the choice of the baseline—a fact that has been overlooked in the geoscientific literature. We illustrate this dependence for a specific climate task, in which a NN is trained to predict the ensemble- and global-mean temperature (i.e., the forced global warming signal) given an annual temperature map from an individual ensemble member. We show that attributions differ substantially when considering different baselines, because they correspond to answering different science questions.
可解释人工智能(XAI)方法用于地球科学应用,以深入了解神经网络(NN)的决策策略,突出输入中的哪些特征对神经网络预测贡献最大。在这里,我们讨论我们的“经验教训”,即将预测归因于输入的任务没有单一的解决方案。相反,归因结果在很大程度上取决于XAI方法所使用的考虑基线——这是地球科学文献中被忽视的一个事实。基线是对预测进行比较的参考点,以便可以理解预测。这个基线可以由用户选择,也可以由方法算法中的构造来设置——通常用户不会意识到这个选择。我们强调,不同的基线可以导致对不同科学问题的不同见解,因此应该相应地选择。为了说明基线的影响,我们使用了共享社会经济路径3-7.0 (SSP3-7.0)情景强制的历史和未来气候模拟的大型集合,并训练了一个完全连接的神经网络来预测集合和全球平均温度(即强制的全球变暖信号),给出了来自单个集合成员的年温度图。然后,我们使用各种XAI方法和不同的基线将网络预测归因于输入。我们表明,当考虑不同的基线时,归因有很大的不同,因为它们对应于回答不同的科学问题。最后,我们讨论了在XAI研究中使用基线的重要含义和注意事项。近年来,可解释人工智能(XAI)的方法在地球科学应用中得到了很大的应用,因为它们可以用来将神经网络(nn)的预测归因于输入并对其进行物理解释。在这里,我们强调归因和物理解释在很大程度上取决于基线的选择,这是一个在地球科学文献中被忽视的事实。我们在一个特定的气候任务中说明了这种依赖性,在这个任务中,一个神经网络被训练来预测总体和全球平均温度(即,给定单个总体成员的年温度图的强迫全球变暖信号)。我们表明,当考虑不同的基线时,归因有很大的不同,因为它们对应于回答不同的科学问题。
{"title":"Carefully Choose the Baseline: Lessons Learned from Applying XAI Attribution Methods for Regression Tasks in Geoscience","authors":"Antonios Mamalakis, Elizabeth A. Barnes, Imme Ebert-Uphoff","doi":"10.1175/aies-d-22-0058.1","DOIUrl":"https://doi.org/10.1175/aies-d-22-0058.1","url":null,"abstract":"Abstract Methods of explainable artificial intelligence (XAI) are used in geoscientific applications to gain insights into the decision-making strategy of neural networks (NNs), highlighting which features in the input contribute the most to a NN prediction. Here, we discuss our “lesson learned” that the task of attributing a prediction to the input does not have a single solution. Instead, the attribution results depend greatly on the considered baseline that the XAI method utilizes—a fact that has been overlooked in the geoscientific literature. The baseline is a reference point to which the prediction is compared so that the prediction can be understood. This baseline can be chosen by the user or is set by construction in the method’s algorithm—often without the user being aware of that choice. We highlight that different baselines can lead to different insights for different science questions and, thus, should be chosen accordingly. To illustrate the impact of the baseline, we use a large ensemble of historical and future climate simulations forced with the shared socioeconomic pathway 3-7.0 (SSP3-7.0) scenario and train a fully connected NN to predict the ensemble- and global-mean temperature (i.e., the forced global warming signal) given an annual temperature map from an individual ensemble member. We then use various XAI methods and different baselines to attribute the network predictions to the input. We show that attributions differ substantially when considering different baselines, because they correspond to answering different science questions. We conclude by discussing important implications and considerations about the use of baselines in XAI research. Significance Statement In recent years, methods of explainable artificial intelligence (XAI) have found great application in geoscientific applications, because they can be used to attribute the predictions of neural networks (NNs) to the input and interpret them physically. Here, we highlight that the attributions—and the physical interpretation—depend greatly on the choice of the baseline—a fact that has been overlooked in the geoscientific literature. We illustrate this dependence for a specific climate task, in which a NN is trained to predict the ensemble- and global-mean temperature (i.e., the forced global warming signal) given an annual temperature map from an individual ensemble member. We show that attributions differ substantially when considering different baselines, because they correspond to answering different science questions.","PeriodicalId":94369,"journal":{"name":"Artificial intelligence for the earth systems","volume":"67 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136052395","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
期刊
Artificial intelligence for the earth systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1