A comparison of model validation approaches for echo state networks using climate model replicates

IF 2.1 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Spatial Statistics Pub Date : 2024-01-17 DOI:10.1016/j.spasta.2024.100813
Kellie McClernon, Katherine Goode, Daniel Ries
{"title":"A comparison of model validation approaches for echo state networks using climate model replicates","authors":"Kellie McClernon,&nbsp;Katherine Goode,&nbsp;Daniel Ries","doi":"10.1016/j.spasta.2024.100813","DOIUrl":null,"url":null,"abstract":"<div><p>As global temperatures continue to rise, climate mitigation strategies such as stratospheric aerosol injections (SAI) are increasingly discussed, but the downstream effects of these strategies are not well understood. As such, there is interest in developing statistical methods to quantify the evolution of climate variable relationships during the time period surrounding an SAI. Feature importance applied to echo state network (ESN) models has been proposed as a way to understand the effects of SAI using a data-driven model. This approach depends on the ESN fitting the data well. If not, the feature importance may place importance on features that are not representative of the underlying relationships. Typically, time series prediction models such as ESNs are assessed using out-of-sample performance metrics that divide the times series into separate training and testing sets. However, this model assessment approach is geared towards forecasting applications and not scenarios such as the motivating SAI example where the objective is using a data driven model to capture variable relationships. In this paper, we demonstrate a novel use of climate model replicates to investigate the applicability of the commonly used repeated hold-out model assessment approach for the SAI application. Simulations of an SAI are generated using a simplified climate model, and different initialization conditions are used to provide independent training and testing sets containing the same SAI event. The climate model replicates enable out-of-sample measures of model performance, which are compared to the single time series hold-out validation approach. For our case study, it is found that the repeated hold-out sample performance is comparable, but conservative, to the replicate out-of-sample performance when the training set contains enough time after the aerosol injection.</p></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":null,"pages":null},"PeriodicalIF":2.1000,"publicationDate":"2024-01-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2211675324000046/pdfft?md5=a6ba0350eda3c86948baceceabdf144c&pid=1-s2.0-S2211675324000046-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Spatial Statistics","FirstCategoryId":"100","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2211675324000046","RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"GEOSCIENCES, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

As global temperatures continue to rise, climate mitigation strategies such as stratospheric aerosol injections (SAI) are increasingly discussed, but the downstream effects of these strategies are not well understood. As such, there is interest in developing statistical methods to quantify the evolution of climate variable relationships during the time period surrounding an SAI. Feature importance applied to echo state network (ESN) models has been proposed as a way to understand the effects of SAI using a data-driven model. This approach depends on the ESN fitting the data well. If not, the feature importance may place importance on features that are not representative of the underlying relationships. Typically, time series prediction models such as ESNs are assessed using out-of-sample performance metrics that divide the times series into separate training and testing sets. However, this model assessment approach is geared towards forecasting applications and not scenarios such as the motivating SAI example where the objective is using a data driven model to capture variable relationships. In this paper, we demonstrate a novel use of climate model replicates to investigate the applicability of the commonly used repeated hold-out model assessment approach for the SAI application. Simulations of an SAI are generated using a simplified climate model, and different initialization conditions are used to provide independent training and testing sets containing the same SAI event. The climate model replicates enable out-of-sample measures of model performance, which are compared to the single time series hold-out validation approach. For our case study, it is found that the repeated hold-out sample performance is comparable, but conservative, to the replicate out-of-sample performance when the training set contains enough time after the aerosol injection.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用气候模型副本对回波状态网络的模型验证方法进行比较
随着全球气温的持续上升,平流层气溶胶注入(SAI)等气候减缓战略越来越多地被讨论,但人们对这些战略的下游影响却不甚了解。因此,人们有兴趣开发统计方法来量化 SAI 期间气候变量关系的演变。有人提出将特征重要性应用于回波状态网络(ESN)模型,作为利用数据驱动模型了解 SAI 影响的一种方法。这种方法依赖于 ESN 与数据的良好拟合。否则,特征重要性可能会重视那些不能代表潜在关系的特征。通常情况下,时间序列预测模型(如 ESN)使用样本外性能指标进行评估,该指标将时间序列分为单独的训练集和测试集。然而,这种模型评估方法针对的是预测应用,而不是像激励性 SAI 示例这样的场景,其目标是使用数据驱动模型来捕捉变量关系。在本文中,我们展示了一种利用气候模型副本的新方法,以研究常用的重复保持模型评估方法在 SAI 应用中的适用性。使用简化的气候模式生成 SAI 模拟,并使用不同的初始化条件提供包含相同 SAI 事件的独立训练集和测试集。通过气候模型复制,可以对模型性能进行样本外测量,并与单一时间序列保持验证方法进行比较。对于我们的案例研究,当训练集包含气溶胶注入后的足够时间时,我们发现重复保持样本的性能与样本外复制性能相当,但比较保守。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Spatial Statistics
Spatial Statistics GEOSCIENCES, MULTIDISCIPLINARY-MATHEMATICS, INTERDISCIPLINARY APPLICATIONS
CiteScore
4.00
自引率
21.70%
发文量
89
审稿时长
55 days
期刊介绍: Spatial Statistics publishes articles on the theory and application of spatial and spatio-temporal statistics. It favours manuscripts that present theory generated by new applications, or in which new theory is applied to an important practical case. A purely theoretical study will only rarely be accepted. Pure case studies without methodological development are not acceptable for publication. Spatial statistics concerns the quantitative analysis of spatial and spatio-temporal data, including their statistical dependencies, accuracy and uncertainties. Methodology for spatial statistics is typically found in probability theory, stochastic modelling and mathematical statistics as well as in information science. Spatial statistics is used in mapping, assessing spatial data quality, sampling design optimisation, modelling of dependence structures, and drawing of valid inference from a limited set of spatio-temporal data.
期刊最新文献
Uncovering hidden alignments in two-dimensional point fields Spatio-temporal data fusion for the analysis of in situ and remote sensing data using the INLA-SPDE approach Exploiting nearest-neighbour maps for estimating the variance of sample mean in equal-probability systematic sampling of spatial populations Variable selection of nonparametric spatial autoregressive models via deep learning Estimation and inference of multi-effect generalized geographically and temporally weighted regression models
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1