首页 > 最新文献

Environmetrics最新文献

英文 中文
An illustration of model agnostic explainability methods applied to environmental data 应用于环境数据的模型不可知可解释性方法的实例
IF 1.7 3区 环境科学与生态学 Q4 ENVIRONMENTAL SCIENCES Pub Date : 2022-10-25 DOI: 10.1002/env.2772
Christopher K. Wikle, Abhirup Datta, Bhava Vyasa Hari, Edward L. Boone, Indranil Sahoo, Indulekha Kavila, Stefano Castruccio, Susan J. Simmons, Wesley S. Burr, Won Chang

Historically, two primary criticisms statisticians have of machine learning and deep neural models is their lack of uncertainty quantification and the inability to do inference (i.e., to explain what inputs are important). Explainable AI has developed in the last few years as a sub-discipline of computer science and machine learning to mitigate these concerns (as well as concerns of fairness and transparency in deep modeling). In this article, our focus is on explaining which inputs are important in models for predicting environmental data. In particular, we focus on three general methods for explainability that are model agnostic and thus applicable across a breadth of models without internal explainability: “feature shuffling”, “interpretable local surrogates”, and “occlusion analysis”. We describe particular implementations of each of these and illustrate their use with a variety of models, all applied to the problem of long-lead forecasting monthly soil moisture in the North American corn belt given sea surface temperature anomalies in the Pacific Ocean.

从历史上看,统计学家对机器学习和深度神经模型的两个主要批评是它们缺乏不确定性量化和无法进行推理(即解释哪些输入是重要的)。在过去的几年里,可解释的人工智能作为计算机科学和机器学习的一个分支学科得到了发展,以减轻这些担忧(以及对深度建模公平性和透明度的担忧)。在本文中,我们的重点是解释在预测环境数据的模型中哪些输入是重要的。我们特别关注三种一般的可解释性方法,这些方法与模型无关,因此适用于大量没有内部可解释性的模型:“特征洗牌”、“可解释的局部代理”和“遮挡分析”。我们描述了这些方法的具体实现,并举例说明了它们与各种模型的使用,所有这些模型都适用于在太平洋海面温度异常的情况下长期预测北美玉米带每月土壤湿度的问题。
{"title":"An illustration of model agnostic explainability methods applied to environmental data","authors":"Christopher K. Wikle,&nbsp;Abhirup Datta,&nbsp;Bhava Vyasa Hari,&nbsp;Edward L. Boone,&nbsp;Indranil Sahoo,&nbsp;Indulekha Kavila,&nbsp;Stefano Castruccio,&nbsp;Susan J. Simmons,&nbsp;Wesley S. Burr,&nbsp;Won Chang","doi":"10.1002/env.2772","DOIUrl":"10.1002/env.2772","url":null,"abstract":"<p>Historically, two primary criticisms statisticians have of machine learning and deep neural models is their lack of uncertainty quantification and the inability to do inference (i.e., to explain what inputs are important). Explainable AI has developed in the last few years as a sub-discipline of computer science and machine learning to mitigate these concerns (as well as concerns of fairness and transparency in deep modeling). In this article, our focus is on explaining which inputs are important in models for predicting environmental data. In particular, we focus on three general methods for explainability that are model agnostic and thus applicable across a breadth of models without internal explainability: “feature shuffling”, “interpretable local surrogates”, and “occlusion analysis”. We describe particular implementations of each of these and illustrate their use with a variety of models, all applied to the problem of long-lead forecasting monthly soil moisture in the North American corn belt given sea surface temperature anomalies in the Pacific Ocean.</p>","PeriodicalId":50512,"journal":{"name":"Environmetrics","volume":"34 1","pages":""},"PeriodicalIF":1.7,"publicationDate":"2022-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/env.2772","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9495377","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Modeling the spatial evolution wildfires using random spread process 基于随机蔓延过程的野火空间演化模型
IF 1.7 3区 环境科学与生态学 Q4 ENVIRONMENTAL SCIENCES Pub Date : 2022-10-24 DOI: 10.1002/env.2774
Carlos Díaz-Avalos, Pablo Juan

The study of wildfire spread and the growth of the area burned is an important task in ecological studies and in other contexts. In this work we present a model for fire spread and show the results obtained from simulations of burned areas. The model is based on probabilities of fire at different locations. Such probabilities are obtained from the intensity function of a spatial point process model fitted to the observed pattern of fires in the Valencian Community for the years 1993–2015. The models, applied to different wildfires in Spain, including the different temporal states, combines the features of a network model with those of a quasi-physical model of the interaction between burning and nonburning cells, which strongly depends on covariates. The results of the simulated wildfire burned areas resemble the burned areas observed in real cases, suggesting that the model proposed, based on a Markov process called Random Spread Process, works adequately. The model can be extended to simulate other random spread processes such as epidemics.

研究野火的蔓延和燃烧面积的增长是生态学研究和其他领域的一项重要任务。在这项工作中,我们提出了一个火灾蔓延的模型,并展示了从燃烧区域模拟得到的结果。该模型基于不同地点发生火灾的概率。这些概率是由空间点过程模型的强度函数得到的,该模型拟合了1993-2015年瓦伦西亚社区观测到的火灾模式。这些模型应用于西班牙不同的野火,包括不同的时间状态,将网络模型的特征与燃烧和非燃烧细胞之间相互作用的准物理模型的特征结合起来,这些模型强烈依赖于协变量。模拟的野火燃烧区域的结果与在真实情况下观察到的燃烧区域相似,这表明基于称为随机扩散过程的马尔可夫过程的模型是有效的。该模型可以扩展到模拟其他随机传播过程,如流行病。
{"title":"Modeling the spatial evolution wildfires using random spread process","authors":"Carlos Díaz-Avalos,&nbsp;Pablo Juan","doi":"10.1002/env.2774","DOIUrl":"10.1002/env.2774","url":null,"abstract":"<p>The study of wildfire spread and the growth of the area burned is an important task in ecological studies and in other contexts. In this work we present a model for fire spread and show the results obtained from simulations of burned areas. The model is based on probabilities of fire at different locations. Such probabilities are obtained from the intensity function of a spatial point process model fitted to the observed pattern of fires in the Valencian Community for the years 1993–2015. The models, applied to different wildfires in Spain, including the different temporal states, combines the features of a network model with those of a quasi-physical model of the interaction between burning and nonburning cells, which strongly depends on covariates. The results of the simulated wildfire burned areas resemble the burned areas observed in real cases, suggesting that the model proposed, based on a Markov process called Random Spread Process, works adequately. The model can be extended to simulate other random spread processes such as epidemics.</p>","PeriodicalId":50512,"journal":{"name":"Environmetrics","volume":"33 8","pages":""},"PeriodicalIF":1.7,"publicationDate":"2022-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/env.2774","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84482208","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bayesian multiple changepoint detection with missing data and its application to the magnitude-frequency distributions 具有缺失数据的贝叶斯多变点检测及其在幅频分布中的应用
IF 1.7 3区 环境科学与生态学 Q4 ENVIRONMENTAL SCIENCES Pub Date : 2022-10-22 DOI: 10.1002/env.2775
Shaochuan Lu

The detection of abrupt changes in an evolving pattern of time series in the presence of missing data still poses a challenge to real applications. We formulate the multiple changepoint problem into a latent Markov model on a countably infinite state space. For efficiency-enhancing, we propose a partially collapsed Gibbs sampler for the inference of the joint posterior of the number of changepoints and their locations. Variants of Viterbi algorithms are suggested for obtaining the MAP estimates of random changepoints in the presence of missing data, which provides better performances in these varying-dimensional problems. The method is generally applicable for multiple changepoint detection under a variety of missing data mechanism. The method is applied to a case study of the magnitude-frequency distribution of the 2010 Darfield M7.1 earthquake sequence in New Zealand. We find out some unusual features of the seismic b-value in the Darfield earthquake sequence. It is noted that two changepoints are detected and in contrast to the background seismic b-value, relatively low b-values in the early aftershock propagation period are identified. We suggest that this might be a forewarning of potentially devastatingly strong aftershocks. The advance in the method of b-value changepoint detection will enhance our understanding of earthquake occurrence and potentially lead to improved risk forecasting.

在存在缺失数据的情况下,检测时间序列演变模式的突然变化仍然对实际应用构成挑战。我们将多变点问题公式化为可数无限状态空间上的潜在马尔可夫模型。为了提高效率,我们提出了一个部分塌陷的吉布斯采样器,用于推断变化点数量及其位置的联合后验。Viterbi算法的变体被建议用于在存在缺失数据的情况下获得随机变化点的MAP估计,这在这些变维问题中提供了更好的性能。该方法通常适用于各种缺失数据机制下的多个变化点检测。该方法应用于2010年新西兰达菲尔德7.1级地震序列震级频率分布的实例研究。我们发现了达菲尔德地震序列中地震b值的一些异常特征。值得注意的是,检测到了两个变化点,与背景地震b值相比,确定了余震传播早期相对较低的b值。我们认为,这可能是一个潜在的毁灭性强余震的预警。b值变化点检测方法的进步将增强我们对地震发生的理解,并有可能改进风险预测。
{"title":"Bayesian multiple changepoint detection with missing data and its application to the magnitude-frequency distributions","authors":"Shaochuan Lu","doi":"10.1002/env.2775","DOIUrl":"https://doi.org/10.1002/env.2775","url":null,"abstract":"<p>The detection of abrupt changes in an evolving pattern of time series in the presence of missing data still poses a challenge to real applications. We formulate the multiple changepoint problem into a latent Markov model on a countably infinite state space. For efficiency-enhancing, we propose a partially collapsed Gibbs sampler for the inference of the joint posterior of the number of changepoints and their locations. Variants of Viterbi algorithms are suggested for obtaining the MAP estimates of random changepoints in the presence of missing data, which provides better performances in these varying-dimensional problems. The method is generally applicable for multiple changepoint detection under a variety of missing data mechanism. The method is applied to a case study of the magnitude-frequency distribution of the 2010 Darfield M7.1 earthquake sequence in New Zealand. We find out some unusual features of the seismic <i>b</i>-value in the Darfield earthquake sequence. It is noted that two changepoints are detected and in contrast to the background seismic <i>b</i>-value, relatively low <i>b</i>-values in the early aftershock propagation period are identified. We suggest that this might be a forewarning of potentially devastatingly strong aftershocks. The advance in the method of <i>b</i>-value changepoint detection will enhance our understanding of earthquake occurrence and potentially lead to improved risk forecasting.</p>","PeriodicalId":50512,"journal":{"name":"Environmetrics","volume":"34 4","pages":""},"PeriodicalIF":1.7,"publicationDate":"2022-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50141295","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Flood hazard model calibration using multiresolution model output 使用多分辨率模型输出的洪水灾害模型校准
IF 1.7 3区 环境科学与生态学 Q4 ENVIRONMENTAL SCIENCES Pub Date : 2022-10-17 DOI: 10.1002/env.2769
Samantha M. Roth, Ben Seiyon Lee, Sanjib Sharma, Iman Hosseini-Shakib, Klaus Keller, Murali Haran

Riverine floods pose a considerable risk to many communities. Improving flood hazard projections has the potential to inform the design and implementation of flood risk management strategies. Current flood hazard projections are uncertain, especially due to uncertain model parameters. Calibration methods use observations to quantify model parameter uncertainty. With limited computational resources, researchers typically calibrate models using either relatively few expensive model runs at high spatial resolutions or many cheaper runs at lower spatial resolutions. This leads to an open question: is it possible to effectively combine information from the high and low resolution model runs? We propose a Bayesian emulation–calibration approach that assimilates model outputs and observations at multiple resolutions. As a case study for a riverine community in Pennsylvania, we demonstrate our approach using the LISFLOOD-FP flood hazard model. The multiresolution approach results in improved parameter inference over the single resolution approach in multiple scenarios. Results vary based on the parameter values and the number of available models runs. Our method is general and can be used to calibrate other high dimensional computer models to improve projections.

河流洪水对许多社区构成相当大的风险。改进洪水灾害预测有可能为洪水风险管理战略的设计和实施提供信息。目前的洪水灾害预测是不确定的,特别是由于模型参数不确定。校准方法使用观测值来量化模型参数的不确定性。由于计算资源有限,研究人员通常使用相对较少的高空间分辨率的昂贵模型运行或较低空间分辨率的许多较便宜的运行来校准模型。这就引出了一个悬而未决的问题:是否有可能有效地组合来自高分辨率和低分辨率模型运行的信息?我们提出了一种贝叶斯模拟-校准方法,该方法吸收了多分辨率的模型输出和观测结果。作为宾夕法尼亚州河流社区的案例研究,我们使用LISFLOOD-FP洪水灾害模型演示了我们的方法。在多个场景中,多分辨率方法比单分辨率方法改进了参数推断。结果因参数值和可用模型运行的数量而异。我们的方法是通用的,可以用于校准其他高维计算机模型以改进投影。
{"title":"Flood hazard model calibration using multiresolution model output","authors":"Samantha M. Roth,&nbsp;Ben Seiyon Lee,&nbsp;Sanjib Sharma,&nbsp;Iman Hosseini-Shakib,&nbsp;Klaus Keller,&nbsp;Murali Haran","doi":"10.1002/env.2769","DOIUrl":"https://doi.org/10.1002/env.2769","url":null,"abstract":"<p>Riverine floods pose a considerable risk to many communities. Improving flood hazard projections has the potential to inform the design and implementation of flood risk management strategies. Current flood hazard projections are uncertain, especially due to uncertain model parameters. Calibration methods use observations to quantify model parameter uncertainty. With limited computational resources, researchers typically calibrate models using either relatively few expensive model runs at high spatial resolutions or many cheaper runs at lower spatial resolutions. This leads to an open question: is it possible to effectively combine information from the high and low resolution model runs? We propose a Bayesian emulation–calibration approach that assimilates model outputs and observations at multiple resolutions. As a case study for a riverine community in Pennsylvania, we demonstrate our approach using the LISFLOOD-FP flood hazard model. The multiresolution approach results in improved parameter inference over the single resolution approach in multiple scenarios. Results vary based on the parameter values and the number of available models runs. Our method is general and can be used to calibrate other high dimensional computer models to improve projections.</p>","PeriodicalId":50512,"journal":{"name":"Environmetrics","volume":"34 2","pages":""},"PeriodicalIF":1.7,"publicationDate":"2022-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50136284","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A double fixed rank kriging approach to spatial regression models with covariate measurement error 具有协变量测量误差的空间回归模型的双定秩克里格方法
IF 1.7 3区 环境科学与生态学 Q4 ENVIRONMENTAL SCIENCES Pub Date : 2022-10-17 DOI: 10.1002/env.2771
Xu Ning, Francis K. C. Hui, Alan H. Welsh

In many applications of spatial regression modeling, the spatially indexed covariates are measured with error, and it is known that ignoring this measurement error can lead to attenuation of the estimated regression coefficients. Classical measurement error techniques may not be appropriate in the spatial setting, due to the lack of validation data and the presence of (residual) spatial correlation among the responses. In this article, we propose a double fixed rank kriging (FRK) approach to obtain bias-corrected estimates of and inference on coefficients in spatial regression models, where the covariates are spatially indexed and subject to measurement error. Assuming they vary smoothly in space, the proposed method first fits an FRK model regressing the covariates against spatial basis functions to obtain predictions of the error-free covariates. These are then passed into a second FRK model, where the response is regressed against the predicted covariates plus another set of spatial basis functions to account for spatial correlation. A simulation study and an application to presence–absence records of Carolina wren from the North American Breeding Bird Survey demonstrate that the proposed double FRK approach can be effective in adjusting for measurement error in spatially correlated data.

在空间回归建模的许多应用中,空间索引的协变量是带误差测量的,并且已知忽略该测量误差会导致估计的回归系数的衰减。由于缺乏验证数据以及响应之间存在(残差)空间相关性,经典的测量误差技术在空间环境中可能不合适。在本文中,我们提出了一种双固定秩克里格(FRK)方法,以获得空间回归模型中系数的偏差校正估计和推断,其中协变量是空间索引的,并受到测量误差的影响。假设它们在空间中平滑变化,所提出的方法首先拟合FRK模型,将协变量与空间基函数回归,以获得无误差协变量的预测。然后将其传递到第二个FRK模型中,在该模型中,根据预测的协变量加上另一组空间基函数对响应进行回归,以考虑空间相关性。一项模拟研究和对北美繁殖鸟类调查中卡罗莱纳莺存在-不存在记录的应用表明,所提出的双FRK方法可以有效地调整空间相关数据中的测量误差。
{"title":"A double fixed rank kriging approach to spatial regression models with covariate measurement error","authors":"Xu Ning,&nbsp;Francis K. C. Hui,&nbsp;Alan H. Welsh","doi":"10.1002/env.2771","DOIUrl":"https://doi.org/10.1002/env.2771","url":null,"abstract":"<p>In many applications of spatial regression modeling, the spatially indexed covariates are measured with error, and it is known that ignoring this measurement error can lead to attenuation of the estimated regression coefficients. Classical measurement error techniques may not be appropriate in the spatial setting, due to the lack of validation data and the presence of (residual) spatial correlation among the responses. In this article, we propose a double fixed rank kriging (FRK) approach to obtain bias-corrected estimates of and inference on coefficients in spatial regression models, where the covariates are spatially indexed and subject to measurement error. Assuming they vary smoothly in space, the proposed method first fits an FRK model regressing the covariates against spatial basis functions to obtain predictions of the error-free covariates. These are then passed into a second FRK model, where the response is regressed against the predicted covariates plus another set of spatial basis functions to account for spatial correlation. A simulation study and an application to presence–absence records of Carolina wren from the North American Breeding Bird Survey demonstrate that the proposed double FRK approach can be effective in adjusting for measurement error in spatially correlated data.</p>","PeriodicalId":50512,"journal":{"name":"Environmetrics","volume":"34 1","pages":""},"PeriodicalIF":1.7,"publicationDate":"2022-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/env.2771","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50145274","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Decisions, decisions, decisions in an uncertain environment 不确定环境中的决策
IF 1.7 3区 环境科学与生态学 Q4 ENVIRONMENTAL SCIENCES Pub Date : 2022-10-17 DOI: 10.1002/env.2767
Noel Cressie

Decision-makers abhor uncertainty, and it is certainly true that the less there is of it the better. However, recognizing that uncertainty is part of the equation, particularly for deciding on environmental policy, is a prerequisite for making wise decisions. Even making no decision is a decision that has consequences, and using the presence of uncertainty as the reason for failing to act is a poor excuse. Statistical science is the science of uncertainty, and it should play a critical role in the decision-making process. This opinion piece focuses on the summit of the knowledge pyramid that starts from data and rises in steps from data to information, from information to knowledge, and finally from knowledge to decisions. Enormous advances have been made in the last 100 years ascending the pyramid, with deviations that have followed different routes. There has generally been a healthy supply of uncertainty quantification along the way but, in a rush to the top, where the decisions are made, uncertainty is often left behind. In my opinion, statistical science needs to be much more pro-active in evolving classical decision theory into a relevant and practical area of decision applications. This article follows several threads, building on the decision-theoretic foundations of loss functions and Bayesian uncertainty.

决策者痛恨不确定性,当然不确定性越少越好。然而,认识到不确定性是等式的一部分,特别是在决定环境政策时,这是做出明智决定的先决条件。即使不做决定也是有后果的决定,以不确定性为理由不采取行动是一个糟糕的借口。统计科学是一门不确定性科学,它应该在决策过程中发挥关键作用。这篇观点文章聚焦于知识金字塔的顶峰,知识金字塔从数据开始,从数据到信息,从信息到知识,最后从知识到决策,逐级上升。在过去的100年里,攀登金字塔取得了巨大的进步,但也有不同的偏差。在这一过程中,总体上存在着健康的不确定性量化供应,但在决策的高峰期,不确定性往往会被抛在后面。在我看来,统计科学需要更加积极地将经典决策理论发展成为决策应用的相关和实用领域。本文遵循了几个线索,建立在损失函数和贝叶斯不确定性的决策理论基础上。
{"title":"Decisions, decisions, decisions in an uncertain environment","authors":"Noel Cressie","doi":"10.1002/env.2767","DOIUrl":"https://doi.org/10.1002/env.2767","url":null,"abstract":"<p>Decision-makers abhor uncertainty, and it is certainly true that the less there is of it the better. However, recognizing that uncertainty is part of the equation, particularly for deciding on environmental policy, is a prerequisite for making wise decisions. Even making no decision is a decision that has consequences, and using the presence of uncertainty as the reason for failing to act is a poor excuse. Statistical science is the science of uncertainty, and it should play a critical role in the decision-making process. This opinion piece focuses on the summit of the knowledge pyramid that starts from data and rises in steps from data to information, from information to knowledge, and finally from knowledge to decisions. Enormous advances have been made in the last 100 years ascending the pyramid, with deviations that have followed different routes. There has generally been a healthy supply of uncertainty quantification along the way but, in a rush to the top, where the decisions are made, uncertainty is often left behind. In my opinion, statistical science needs to be much more pro-active in evolving classical decision theory into a relevant and practical area of decision applications. This article follows several threads, building on the decision-theoretic foundations of loss functions and Bayesian uncertainty.</p>","PeriodicalId":50512,"journal":{"name":"Environmetrics","volume":"34 1","pages":""},"PeriodicalIF":1.7,"publicationDate":"2022-10-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/env.2767","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50145275","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Stochastic tropical cyclone precipitation field generation 随机热带气旋降水场生成
IF 1.7 3区 环境科学与生态学 Q4 ENVIRONMENTAL SCIENCES Pub Date : 2022-10-06 DOI: 10.1002/env.2766
William Kleiber, Stephan Sain, Luke Madaus, Patrick Harr

Tropical cyclones are important drivers of coastal flooding which have severe negative public safety and economic consequences. Due to the rare occurrence of such events, high spatial and temporal resolution historical storm precipitation data are limited in availability. This article introduces a statistical tropical cyclone space-time precipitation generator given limited information from storm track datasets. Given a handful of predictor variables that are common in either historical or simulated storm track ensembles such as pressure deficit at the storm's center, radius of maximal winds, storm center and direction, and distance to coast, the proposed stochastic model generates space-time fields of quantitative precipitation over the study domain. Statistically novel aspects include that the model is developed in Lagrangian coordinates with respect to the dynamic storm center that uses ideas from low-rank representations along with circular process models. The model is trained on a set of tropical cyclone data from an advanced weather forecasting model over the Gulf of Mexico and southern United States, and is validated by cross-validation. Results show the model appropriately captures spatial asymmetry of cyclone precipitation patterns, total precipitation as well as the local distribution of precipitation at a set of case study locations along the coast. We additionally compare our model against a widely-used statistical forecast, and illustrate that our approach better captures uncertainty, as well as storm characteristics such as asymmetry.

热带气旋是沿海洪水的重要驱动因素,对公共安全和经济造成严重的负面影响。由于此类事件的罕见发生,高时空分辨率的历史风暴降水数据的可用性有限。本文介绍了一种统计热带气旋时空降水生成器,该生成器的风暴轨迹数据集信息有限。给定历史或模拟风暴路径集合中常见的少数预测变量,如风暴中心的压力不足、最大风半径、风暴中心和方向以及到海岸的距离,所提出的随机模型生成了研究领域内定量降水的时空场。统计上新颖的方面包括,该模型是在关于动态风暴中心的拉格朗日坐标系中开发的,使用了低阶表示的思想以及圆形过程模型。该模型基于墨西哥湾和美国南部高级天气预报模型的一组热带气旋数据进行训练,并通过交叉验证进行验证。结果表明,该模型适当地捕捉了沿海一组案例研究地点的气旋降水模式、总降水量以及局部降水分布的空间不对称性。此外,我们将我们的模型与广泛使用的统计预测进行了比较,并说明我们的方法更好地捕捉了不确定性以及不对称等风暴特征。
{"title":"Stochastic tropical cyclone precipitation field generation","authors":"William Kleiber,&nbsp;Stephan Sain,&nbsp;Luke Madaus,&nbsp;Patrick Harr","doi":"10.1002/env.2766","DOIUrl":"https://doi.org/10.1002/env.2766","url":null,"abstract":"<p>Tropical cyclones are important drivers of coastal flooding which have severe negative public safety and economic consequences. Due to the rare occurrence of such events, high spatial and temporal resolution historical storm precipitation data are limited in availability. This article introduces a statistical tropical cyclone space-time precipitation generator given limited information from storm track datasets. Given a handful of predictor variables that are common in either historical or simulated storm track ensembles such as pressure deficit at the storm's center, radius of maximal winds, storm center and direction, and distance to coast, the proposed stochastic model generates space-time fields of quantitative precipitation over the study domain. Statistically novel aspects include that the model is developed in Lagrangian coordinates with respect to the dynamic storm center that uses ideas from low-rank representations along with circular process models. The model is trained on a set of tropical cyclone data from an advanced weather forecasting model over the Gulf of Mexico and southern United States, and is validated by cross-validation. Results show the model appropriately captures spatial asymmetry of cyclone precipitation patterns, total precipitation as well as the local distribution of precipitation at a set of case study locations along the coast. We additionally compare our model against a widely-used statistical forecast, and illustrate that our approach better captures uncertainty, as well as storm characteristics such as asymmetry.</p>","PeriodicalId":50512,"journal":{"name":"Environmetrics","volume":"34 1","pages":""},"PeriodicalIF":1.7,"publicationDate":"2022-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50123086","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Two years of COVID-19 pandemic: The Italian experience of Statgroup-19 COVID-19大流行的两年:意大利19国集团的经验
IF 1.7 3区 环境科学与生态学 Q4 ENVIRONMENTAL SCIENCES Pub Date : 2022-10-04 DOI: 10.1002/env.2768
Giovanna Jona Lasinio, Fabio Divino, Gianfranco Lovison, Marco Mingione, Pierfrancesco Alaimo Di Loro, Alessio Farcomeni, Antonello Maruotti

The amount and poor quality of available data and the need of appropriate modeling of the main epidemic indicators require specific skills. In this context, the statistician plays a key role in the process that leads to policy decisions, starting with monitoring changes and evaluating risks. The “what” and the “why” of these changes represent fundamental research questions to provide timely and effective tools to manage the evolution of the epidemic. Answers to such questions need appropriate statistical models and visualization tools. Here, we give an overview of the role played by Statgroup-19, an independent Italian research group born in March 2020. The group includes seven statisticians from different Italian universities, each with different backgrounds but with a shared interest in data analysis, statistical modeling, and biostatistics. Since the beginning of the COVID-19 pandemic the group has interacted with authorities and journalists to support policy decisions and inform the general public about the evolution of the epidemic. This collaboration led to several scientific papers and an accrued visibility across various media, all made possible by the continuous interaction across the group members that shared their unique expertise.

现有数据的数量和质量都很差,需要对主要流行病指标进行适当的建模,这些都需要特殊的技能。在这种情况下,统计学家在导致政策决策的过程中起着关键作用,从监测变化和评估风险开始。这些变化的“是什么”和“为什么”是基本的研究问题,为管理这一流行病的演变提供了及时和有效的工具。这些问题的答案需要适当的统计模型和可视化工具。在这里,我们概述了Statgroup-19所扮演的角色,这是一个成立于2020年3月的独立意大利研究小组。该小组包括来自意大利不同大学的七名统计学家,每个人都有不同的背景,但对数据分析、统计建模和生物统计学有共同的兴趣。自2019冠状病毒病大流行开始以来,该小组一直与当局和记者进行互动,以支持政策决定,并向公众通报疫情的演变。这次合作产生了几篇科学论文,并在各种媒体上积累了知名度,所有这些都是通过小组成员之间的持续互动来实现的,他们分享了自己独特的专业知识。
{"title":"Two years of COVID-19 pandemic: The Italian experience of Statgroup-19","authors":"Giovanna Jona Lasinio,&nbsp;Fabio Divino,&nbsp;Gianfranco Lovison,&nbsp;Marco Mingione,&nbsp;Pierfrancesco Alaimo Di Loro,&nbsp;Alessio Farcomeni,&nbsp;Antonello Maruotti","doi":"10.1002/env.2768","DOIUrl":"10.1002/env.2768","url":null,"abstract":"<p>The amount and poor quality of available data and the need of appropriate modeling of the main epidemic indicators require specific skills. In this context, the statistician plays a key role in the process that leads to policy decisions, starting with monitoring changes and evaluating risks. The “what” and the “why” of these changes represent fundamental research questions to provide timely and effective tools to manage the evolution of the epidemic. Answers to such questions need appropriate statistical models and visualization tools. Here, we give an overview of the role played by Statgroup-19, an independent Italian research group born in March 2020. The group includes seven statisticians from different Italian universities, each with different backgrounds but with a shared interest in data analysis, statistical modeling, and biostatistics. Since the beginning of the COVID-19 pandemic the group has interacted with authorities and journalists to support policy decisions and inform the general public about the evolution of the epidemic. This collaboration led to several scientific papers and an accrued visibility across various media, all made possible by the continuous interaction across the group members that shared their unique expertise.</p>","PeriodicalId":50512,"journal":{"name":"Environmetrics","volume":"33 8","pages":""},"PeriodicalIF":1.7,"publicationDate":"2022-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9136278","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Detecting changes in mixed-sampling rate data sequences 检测混合采样率数据序列的变化
IF 1.7 3区 环境科学与生态学 Q4 ENVIRONMENTAL SCIENCES Pub Date : 2022-09-26 DOI: 10.1002/env.2762
Aaron Paul Lowther, Rebecca Killick, Idris Arthur Eckley

Different environmental variables are often monitored using different sampling rates; examples include half-hourly weather station measurements, daily CO2$$ {mathrm{CO}}_2 $$ data, and six-day satellite data. Further when researchers want to combine the data into a single analysis this often requires data aggregation or down-scaling. When one is seeking to identify changes within multivariate data, the aggregation and/or down-scaling processes obscure the changes we seek. In this article, we propose a novel changepoint detection algorithm which can analyze multiple time series for co-occurring changepoints with potentially different sampling rates, without requiring preprocessing to a standard sampling scale. We demonstrate the algorithm on synthetic data before providing an example identifying simultaneous changes in multiple variables at a location on the Greenland ice sheet using synthetic aperture radar and weather station data.

经常使用不同的采样率来监测不同的环境变量;示例包括半小时气象站测量、每日CO2$${mathrm{CO}}_2$$数据,以及六天卫星数据。此外,当研究人员想将数据组合成一个单一的分析时,这通常需要数据聚合或缩小规模。当人们试图识别多元数据中的变化时,聚合和/或缩小过程会掩盖我们所寻求的变化。在本文中,我们提出了一种新的变化点检测算法,该算法可以分析多个时间序列中具有潜在不同采样率的同时发生的变化点,而不需要对标准采样尺度进行预处理。我们在合成数据上演示了算法,然后提供了一个使用合成孔径雷达和气象站数据识别格陵兰冰盖某个位置多个变量同时变化的例子。
{"title":"Detecting changes in mixed-sampling rate data sequences","authors":"Aaron Paul Lowther,&nbsp;Rebecca Killick,&nbsp;Idris Arthur Eckley","doi":"10.1002/env.2762","DOIUrl":"https://doi.org/10.1002/env.2762","url":null,"abstract":"<p>Different environmental variables are often monitored using different sampling rates; examples include half-hourly weather station measurements, daily <math>\u0000 <semantics>\u0000 <mrow>\u0000 <msub>\u0000 <mrow>\u0000 <mtext>CO</mtext>\u0000 </mrow>\u0000 <mrow>\u0000 <mn>2</mn>\u0000 </mrow>\u0000 </msub>\u0000 </mrow>\u0000 <annotation>$$ {mathrm{CO}}_2 $$</annotation>\u0000 </semantics></math> data, and six-day satellite data. Further when researchers want to combine the data into a single analysis this often requires data aggregation or down-scaling. When one is seeking to identify changes within multivariate data, the aggregation and/or down-scaling processes obscure the changes we seek. In this article, we propose a novel changepoint detection algorithm which can analyze multiple time series for co-occurring changepoints with potentially different sampling rates, without requiring preprocessing to a standard sampling scale. We demonstrate the algorithm on synthetic data before providing an example identifying simultaneous changes in multiple variables at a location on the Greenland ice sheet using synthetic aperture radar and weather station data.</p>","PeriodicalId":50512,"journal":{"name":"Environmetrics","volume":"34 1","pages":""},"PeriodicalIF":1.7,"publicationDate":"2022-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/env.2762","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50154749","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Functional forecasting of dissolved oxygen in high-frequency vertical lake profiles 高频垂直湖泊剖面中溶解氧的函数预测
IF 1.7 3区 环境科学与生态学 Q4 ENVIRONMENTAL SCIENCES Pub Date : 2022-09-23 DOI: 10.1002/env.2765
Luke Durell, J. Thad Scott, Douglas Nychka, Amanda S. Hering

Predicting dissolved oxygen (DO) in lakes is important for assessing environmental conditions as well as reducing water treatment costs. High levels of DO often precede toxic algal blooms, and low DO causes carcinogenic metals to precipitate during water treatment. Typically, DO is predicted from limited data sets using hydrodynamic modeling or data-driven approaches like neural networks. However, functional data analysis (FDA) is also an appropriate modeling paradigm for measurements of DO taken vertically through the water column. In this analysis, we build FDA models for a set of profiles measured every 2 hours and forecast the entire DO percent saturation profile from 2 to 24 hours ahead. Functional smoothing and functional principal component analysis are applied first, followed by a vector autoregressive model to forecast the empirical functional principal component (FPC) scores. Rolling training windows adapt to seasonality, and multiple combinations of window sizes, model variables, and parameter specifications are compared using both functional and direct root mean squared error metrics. The FPC method outperforms a suite of comparison models, and including functional pH, temperature, and conductivity variables improves the longer forecasts. Finally, the FDA approach is useful for identifying unusual observations.

预测湖泊中的溶解氧(DO)对于评估环境条件和降低水处理成本非常重要。高DO水平通常先于有毒藻类水华,低DO会导致致癌金属在水处理过程中沉淀。通常,DO是使用流体动力学建模或神经网络等数据驱动方法从有限的数据集预测的。然而,功能数据分析(FDA)也是通过水柱垂直测量DO的合适建模范例。在这项分析中,我们为每2小时测量一次的一组剖面建立了FDA模型,并预测了未来2至24小时的整个DO百分比饱和度剖面。首先应用函数平滑和函数主成分分析,然后使用向量自回归模型预测经验函数主成分(FPC)得分。滚动训练窗口适应季节性,并且使用函数和直接均方根误差度量来比较窗口大小、模型变量和参数规范的多种组合。FPC方法优于一套比较模型,包括功能pH、温度和电导率变量可以改进更长的预测。最后,美国食品药品监督管理局的方法有助于识别不寻常的观察结果。
{"title":"Functional forecasting of dissolved oxygen in high-frequency vertical lake profiles","authors":"Luke Durell,&nbsp;J. Thad Scott,&nbsp;Douglas Nychka,&nbsp;Amanda S. Hering","doi":"10.1002/env.2765","DOIUrl":"https://doi.org/10.1002/env.2765","url":null,"abstract":"<p>Predicting dissolved oxygen (DO) in lakes is important for assessing environmental conditions as well as reducing water treatment costs. High levels of DO often precede toxic algal blooms, and low DO causes carcinogenic metals to precipitate during water treatment. Typically, DO is predicted from limited data sets using hydrodynamic modeling or data-driven approaches like neural networks. However, functional data analysis (FDA) is also an appropriate modeling paradigm for measurements of DO taken vertically through the water column. In this analysis, we build FDA models for a set of profiles measured every 2 hours and forecast the entire DO percent saturation profile from 2 to 24 hours ahead. Functional smoothing and functional principal component analysis are applied first, followed by a vector autoregressive model to forecast the empirical functional principal component (FPC) scores. Rolling training windows adapt to seasonality, and multiple combinations of window sizes, model variables, and parameter specifications are compared using both functional and direct root mean squared error metrics. The FPC method outperforms a suite of comparison models, and including functional pH, temperature, and conductivity variables improves the longer forecasts. Finally, the FDA approach is useful for identifying unusual observations.</p>","PeriodicalId":50512,"journal":{"name":"Environmetrics","volume":"34 4","pages":""},"PeriodicalIF":1.7,"publicationDate":"2022-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/env.2765","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50142445","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
期刊
Environmetrics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1