首页 > 最新文献

Environmetrics最新文献

英文 中文
A zero-inflated Poisson spatial model with misreporting for wildfire occurrences in southern Italian municipalities 意大利南部城市野火发生率的零膨胀泊松空间模型与误报问题
IF 1.5 3区 环境科学与生态学 Q4 ENVIRONMENTAL SCIENCES Pub Date : 2024-05-02 DOI: 10.1002/env.2853
Serena Arima, Crescenza Calculli, Alessio Pollice

We propose a Poisson model for zero-inflated spatial counts contaminated by measurement error: we accommodate the excess of zeroes in the counts, consider the possible under/over reporting of the response and account for the neighboring structure of spatial areal units. Bayesian inferences are provided by MCMC implementation through the R package NIMBLE. To evaluate the model performance, a simulation study is carried out under configurations that allow for structured and unstructured spatial random effects. The proposed model is applied to investigate the distribution of the counts of wildfire occurrences in the municipal areas of two neighboring Italian regions for the summer season 2021. Fire counts are obtained by processing MODIS satellite data, while several socio-economic and environmental-driven potential risk factors are also considered in the model formulation. Data from multiple sources with different spatial support are processed in order to comply with the municipal units. Results suggest the appropriateness of the approach and provide some insights on the features of wildfire occurrences.

我们为受测量误差污染的零膨胀空间计数提出了一个泊松模型:我们考虑了计数中过多的零,考虑了可能存在的反应不足/过多的报告,并考虑了空间区域单位的邻近结构。贝叶斯推论是通过 R 软件包 NIMBLE 的 MCMC 实现的。为了评估模型的性能,在允许结构化和非结构化空间随机效应的配置下进行了模拟研究。提出的模型被用于研究 2021 年夏季意大利两个相邻大区市镇地区野火发生次数的分布情况。火灾次数是通过处理 MODIS 卫星数据获得的,同时,在建立模型时还考虑了一些由社会经济和环境驱动的潜在风险因素。对来自不同空间支持的多个来源的数据进行了处理,以符合市政单位的要求。结果表明该方法是适当的,并对野火发生的特点提供了一些启示。
{"title":"A zero-inflated Poisson spatial model with misreporting for wildfire occurrences in southern Italian municipalities","authors":"Serena Arima,&nbsp;Crescenza Calculli,&nbsp;Alessio Pollice","doi":"10.1002/env.2853","DOIUrl":"10.1002/env.2853","url":null,"abstract":"<p>We propose a Poisson model for zero-inflated spatial counts contaminated by measurement error: we accommodate the excess of zeroes in the counts, consider the possible under/over reporting of the response and account for the neighboring structure of spatial areal units. Bayesian inferences are provided by MCMC implementation through the R package NIMBLE. To evaluate the model performance, a simulation study is carried out under configurations that allow for structured and unstructured spatial random effects. The proposed model is applied to investigate the distribution of the counts of wildfire occurrences in the municipal areas of two neighboring Italian regions for the summer season 2021. Fire counts are obtained by processing MODIS satellite data, while several socio-economic and environmental-driven potential risk factors are also considered in the model formulation. Data from multiple sources with different spatial support are processed in order to comply with the municipal units. Results suggest the appropriateness of the approach and provide some insights on the features of wildfire occurrences.</p>","PeriodicalId":50512,"journal":{"name":"Environmetrics","volume":"36 1","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-05-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140834422","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Pointwise data depth for univariate and multivariate functional outlier detection 用于单变量和多变量异常值功能检测的点式数据深度
IF 1.5 3区 环境科学与生态学 Q4 ENVIRONMENTAL SCIENCES Pub Date : 2024-04-20 DOI: 10.1002/env.2851
Cristian F. Jiménez-Varón, Fouzi Harrou, Ying Sun

Data depth is an efficient tool for robustly summarizing the distribution of functional data and detecting potential magnitude and shape outliers. Commonly used functional data depth notions, such as the modified band depth and extremal depth, are estimated from pointwise depth for each observed functional observation. However, these techniques require calculating one single depth value for each functional observation, which may not be sufficient to characterize the distribution of the functional data and detect potential outliers. This article presents an innovative approach to make the best use of pointwise depth. We propose using the pointwise depth distribution for magnitude outlier visualization and the correlation between pairwise depth for shape outlier detection. Furthermore, a bootstrap-based testing procedure has been introduced for the correlation to test whether there is any shape outlier. The proposed univariate methods are then extended to bivariate functional data. The performance of the proposed methods is examined and compared to conventional outlier detection techniques by intensive simulation studies. In addition, the developed methods are applied to simulated solar energy datasets from a photovoltaic system. Results revealed that the proposed method offers superior detection performance over conventional techniques. These findings will benefit engineers and practitioners in monitoring photovoltaic systems by detecting unnoticed anomalies and outliers.

数据深度是一种有效的工具,可用于稳健地总结功能数据的分布,并检测潜在的幅度和形状异常值。常用的功能数据深度概念,如修正带深度和极值深度,是根据每个功能观测点的点深度估算的。然而,这些技术需要为每个功能观测值计算一个单一的深度值,这可能不足以描述功能数据的分布特征和检测潜在的异常值。本文提出了一种充分利用点深度的创新方法。我们建议将点深度分布用于幅度离群值的可视化,而将成对深度之间的相关性用于形状离群值的检测。此外,我们还为相关性引入了基于引导的测试程序,以测试是否存在任何形状离群点。然后,将提出的单变量方法扩展到双变量函数数据。通过深入的模拟研究,对所提出方法的性能进行了检验,并与传统的离群值检测技术进行了比较。此外,还将所开发的方法应用于光伏系统的模拟太阳能数据集。结果表明,与传统技术相比,所提出的方法具有更优越的检测性能。这些发现将有利于工程师和从业人员通过检测未被发现的异常和离群值来监控光伏系统。
{"title":"Pointwise data depth for univariate and multivariate functional outlier detection","authors":"Cristian F. Jiménez-Varón,&nbsp;Fouzi Harrou,&nbsp;Ying Sun","doi":"10.1002/env.2851","DOIUrl":"10.1002/env.2851","url":null,"abstract":"<p>Data depth is an efficient tool for robustly summarizing the distribution of functional data and detecting potential magnitude and shape outliers. Commonly used functional data depth notions, such as the modified band depth and extremal depth, are estimated from pointwise depth for each observed functional observation. However, these techniques require calculating one single depth value for each functional observation, which may not be sufficient to characterize the distribution of the functional data and detect potential outliers. This article presents an innovative approach to make the best use of pointwise depth. We propose using the pointwise depth distribution for magnitude outlier visualization and the correlation between pairwise depth for shape outlier detection. Furthermore, a bootstrap-based testing procedure has been introduced for the correlation to test whether there is any shape outlier. The proposed univariate methods are then extended to bivariate functional data. The performance of the proposed methods is examined and compared to conventional outlier detection techniques by intensive simulation studies. In addition, the developed methods are applied to simulated solar energy datasets from a photovoltaic system. Results revealed that the proposed method offers superior detection performance over conventional techniques. These findings will benefit engineers and practitioners in monitoring photovoltaic systems by detecting unnoticed anomalies and outliers.</p>","PeriodicalId":50512,"journal":{"name":"Environmetrics","volume":"35 5","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-04-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140626408","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Scanner : Simultaneously temporal trend and spatial cluster detection for spatial-temporal data 扫描仪同时检测时空数据的时间趋势和空间聚类
IF 1.5 3区 环境科学与生态学 Q4 ENVIRONMENTAL SCIENCES Pub Date : 2024-04-17 DOI: 10.1002/env.2849
Xin Wang, Xin Zhang

Identifying the underlying trajectory pattern in the spatial-temporal data analysis is a fundamental but challenging task. In this paper, we study the problem of simultaneously identifying temporal trends and spatial clusters of spatial-temporal trajectories. To achieve this goal, we propose a novel method named spatial clustered and sparse nonparametric regression (Scanner$$ mathsf{Scanner} $$). Our method leverages the B-spline model to fit the temporal data and penalty terms on spline coefficients to reveal the underlying spatial-temporal patterns. In particular, our method estimates the model by solving a doubly-penalized least square problem, in which we use a group sparse penalty for trend detection and a spanning tree-based fusion penalty for spatial cluster recovery. We also develop an algorithm based on the alternating direction method of multipliers (ADMM) algorithm to efficiently minimize the penalized least square loss. The statistical consistency properties of Scanner$$ mathsf{Scanner} $$ estimator are established in our work. In the end, we conduct thorough numerical experiments to verify our theoretical findings and validate that our method outperforms the existing competitive approaches.

在时空数据分析中识别潜在的轨迹模式是一项基本但具有挑战性的任务。在本文中,我们研究了同时识别时空轨迹的时间趋势和空间聚类的问题。为了实现这一目标,我们提出了一种名为空间聚类和稀疏非参数回归()的新方法。我们的方法利用 B 样条模型来拟合时空数据,并利用样条系数上的惩罚项来揭示潜在的时空模式。特别是,我们的方法通过求解双重惩罚最小平方问题来估计模型,其中,我们使用组稀疏惩罚来检测趋势,使用基于生成树的融合惩罚来恢复空间聚类。我们还开发了一种基于交替方向乘法(ADMM)算法的算法,以有效地最小化惩罚性最小平方损失。我们的工作建立了估计器的统计一致性特性。最后,我们进行了全面的数值实验来验证我们的理论发现,并验证了我们的方法优于现有的竞争方法。
{"title":"Scanner\u0000 : Simultaneously temporal trend and spatial cluster detection for spatial-temporal data","authors":"Xin Wang,&nbsp;Xin Zhang","doi":"10.1002/env.2849","DOIUrl":"10.1002/env.2849","url":null,"abstract":"<p>Identifying the underlying trajectory pattern in the spatial-temporal data analysis is a fundamental but challenging task. In this paper, we study the problem of simultaneously identifying temporal trends and spatial clusters of spatial-temporal trajectories. To achieve this goal, we propose a novel method named spatial clustered and sparse nonparametric regression (<span></span><math>\u0000 <semantics>\u0000 <mrow>\u0000 <mi>Scanner</mi>\u0000 </mrow>\u0000 <annotation>$$ mathsf{Scanner} $$</annotation>\u0000 </semantics></math>). Our method leverages the B-spline model to fit the temporal data and penalty terms on spline coefficients to reveal the underlying spatial-temporal patterns. In particular, our method estimates the model by solving a doubly-penalized least square problem, in which we use a group sparse penalty for trend detection and a spanning tree-based fusion penalty for spatial cluster recovery. We also develop an algorithm based on the alternating direction method of multipliers (ADMM) algorithm to efficiently minimize the penalized least square loss. The statistical consistency properties of <span></span><math>\u0000 <semantics>\u0000 <mrow>\u0000 <mi>Scanner</mi>\u0000 </mrow>\u0000 <annotation>$$ mathsf{Scanner} $$</annotation>\u0000 </semantics></math> estimator are established in our work. In the end, we conduct thorough numerical experiments to verify our theoretical findings and validate that our method outperforms the existing competitive approaches.</p>","PeriodicalId":50512,"journal":{"name":"Environmetrics","volume":"35 5","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140615343","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Contamination severity index: An analysis of Bangladesh groundwater arsenic 污染严重程度指数:孟加拉国地下水砷分析
IF 1.5 3区 环境科学与生态学 Q4 ENVIRONMENTAL SCIENCES Pub Date : 2024-04-16 DOI: 10.1002/env.2850
Yogendra P. Chaubey, Qi Zhang

This article deals with the measurement of groundwater arsenic (As$$ As $$) contamination. The focus is on using a proper index for the severity of contamination, rather than just using the proportion of observations above a threshold level. We specifically focus on the contamination severity index (CSI) proposed by Sen (2016. Sankhya B, 78B(2), 341–361.). An alternative estimator in contrast to the one given by Sen (2016. Sankhya B, 78B(2), 341–361.) is used here which is useful for a small number of observations. The data used is that collected by the British Geological Society and the Bangladesh Department of Public Health Engineering during 1997–2001. Their analysis was based on the simple proportion of the observations above a threshold level, whereas the CSI measure adequately takes into account the severity of the observations. It is emphasized in this article that the comparison of areas with average arsenic (As$$ As $$) levels to determine arsenic severity is not appropriate in general due to a large variation in the sample values due to the depth of wells. However, an alternative to the CSI proposed in Sen (2016. Sankhya B, 78B(2), 341–361.) has been given in this article that takes into account the depth of wells corresponding to the As$$ As $$ samples. This article also uses the bootstrap methodology in assessing the bias and standard errors of the estimators, and the corresponding bias-corrected and accelerated confidence intervals.

本文涉及地下水砷()污染的测量。重点在于使用适当的污染严重程度指数,而不仅仅是使用超过阈值水平的观测值比例。我们特别关注 Sen(2016 年)提出的污染严重程度指数(CSI)。Sankhya B,78B(2),341-361)。与 Sen(2016.Sankhya B,78B(2),341-361.)不同的另一种估计方法,该方法适用于少量观测数据。所使用的数据是英国地质学会和孟加拉国公共卫生工程部在 1997-2001 年期间收集的数据。他们的分析是基于超过临界值的观测值的简单比例,而 CSI 测量则充分考虑了观测值的严重程度。本文强调,由于水井深度不同,样本值差异很大,因此一般来说,比较砷()平均水平的地区来确定砷严重程度是不合适的。不过,Sen(2016.Sankhya B, 78B(2), 341-361.)中提出的 CSI 的替代方法,该方法考虑了与样本相对应的水井深度。本文还使用引导法评估了估计值的偏差和标准误差,以及相应的偏差校正置信区间和加速置信区间。
{"title":"Contamination severity index: An analysis of Bangladesh groundwater arsenic","authors":"Yogendra P. Chaubey,&nbsp;Qi Zhang","doi":"10.1002/env.2850","DOIUrl":"10.1002/env.2850","url":null,"abstract":"<p>This article deals with the measurement of groundwater arsenic (<span></span><math>\u0000 <semantics>\u0000 <mrow>\u0000 <mi>A</mi>\u0000 <mi>s</mi>\u0000 </mrow>\u0000 <annotation>$$ As $$</annotation>\u0000 </semantics></math>) contamination. The focus is on using a proper index for the severity of contamination, rather than just using the proportion of observations above a threshold level. We specifically focus on the contamination severity index (CSI) proposed by Sen (2016. <i>Sankhya B</i>, 78B(2), 341–361.). An alternative estimator in contrast to the one given by Sen (2016. <i>Sankhya B</i>, 78B(2), 341–361.) is used here which is useful for a small number of observations. The data used is that collected by the British Geological Society and the Bangladesh Department of Public Health Engineering during 1997–2001. Their analysis was based on the simple proportion of the observations above a threshold level, whereas the CSI measure adequately takes into account the severity of the observations. It is emphasized in this article that the comparison of areas with average arsenic (<span></span><math>\u0000 <semantics>\u0000 <mrow>\u0000 <mi>A</mi>\u0000 <mi>s</mi>\u0000 </mrow>\u0000 <annotation>$$ As $$</annotation>\u0000 </semantics></math>) levels to determine arsenic severity is not appropriate in general due to a large variation in the sample values due to the depth of wells. However, an alternative to the CSI proposed in Sen (2016. <i>Sankhya B</i>, 78B(2), 341–361.) has been given in this article that takes into account the depth of wells corresponding to the <span></span><math>\u0000 <semantics>\u0000 <mrow>\u0000 <mi>A</mi>\u0000 <mi>s</mi>\u0000 </mrow>\u0000 <annotation>$$ As $$</annotation>\u0000 </semantics></math> samples. This article also uses the bootstrap methodology in assessing the bias and standard errors of the estimators, and the corresponding <i>bias-corrected and accelerated</i> confidence intervals.</p>","PeriodicalId":50512,"journal":{"name":"Environmetrics","volume":"35 5","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/env.2850","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140615298","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automatic deforestation detectors based on frequentist statistics and their extensions for other spatial objects 基于频数统计的自动毁林检测器及其对其他空间物体的扩展
IF 1.5 3区 环境科学与生态学 Q4 ENVIRONMENTAL SCIENCES Pub Date : 2024-04-16 DOI: 10.1002/env.2848
Jesper Muren, Vilhelm Niklasson, Dmitry Otryakhin, Maxim Romashin

This article is devoted to the problem of detection of forest and nonforest areas on Earth images. We propose two statistical methods to tackle this problem: one based on multiple hypothesis testing with parametric distribution families, another one—on nonparametric tests. The parametric approach is novel in the literature and relevant to a larger class of problems—detection of natural objects, as well as anomaly detection. We develop mathematical background for each of the two methods, build self-sufficient detection algorithms using them and discuss practical aspects of their implementation. We also compare our algorithms with each other and with those from standard machine learning using satellite data.

本文主要讨论地球图像上森林和非森林区域的检测问题。我们提出了两种统计方法来解决这个问题:一种是基于参数分布族的多重假设检验,另一种是非参数检验。参数方法在文献中很新颖,与更多问题--自然物体检测和异常检测--相关。我们分别介绍了这两种方法的数学背景,利用它们建立了自给自足的检测算法,并讨论了其实现的实际问题。我们还将我们的算法与其他算法以及使用卫星数据的标准机器学习算法进行了比较。
{"title":"Automatic deforestation detectors based on frequentist statistics and their extensions for other spatial objects","authors":"Jesper Muren,&nbsp;Vilhelm Niklasson,&nbsp;Dmitry Otryakhin,&nbsp;Maxim Romashin","doi":"10.1002/env.2848","DOIUrl":"10.1002/env.2848","url":null,"abstract":"<p>This article is devoted to the problem of detection of forest and nonforest areas on Earth images. We propose two statistical methods to tackle this problem: one based on multiple hypothesis testing with parametric distribution families, another one—on nonparametric tests. The parametric approach is novel in the literature and relevant to a larger class of problems—detection of natural objects, as well as anomaly detection. We develop mathematical background for each of the two methods, build self-sufficient detection algorithms using them and discuss practical aspects of their implementation. We also compare our algorithms with each other and with those from standard machine learning using satellite data.</p>","PeriodicalId":50512,"journal":{"name":"Environmetrics","volume":"35 5","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/env.2848","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140599481","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Estimation and selection for spatial zero-inflated count models 空间零膨胀计数模型的估计和选择
IF 1.7 3区 环境科学与生态学 Q4 ENVIRONMENTAL SCIENCES Pub Date : 2024-04-05 DOI: 10.1002/env.2847
Chung-Wei Shen, Chun-Shu Chen

The count data arise in many scientific areas. Our concerns here focus on spatial count responses with an excessive number of zeros and a set of available covariates. Estimating model parameters and selecting important covariates for spatial zero-inflated count models are both essential. Importantly, to alleviate deviations from model assumptions, we propose a spatial zero-inflated Poisson-like methodology to model this type of data, which relies only on assumptions for the first two moments of spatial count responses. We then design an effective iterative estimation procedure between the generalized estimating equation and the weighted least squares method to respectively estimate the regression coefficients and the variogram of the data model. Moreover, the stabilization of estimators is evaluated via a block jackknife technique. Furthermore, a distribution-free model selection criterion based on an estimate of the mean squared error of the estimated mean structure is proposed to select the best subset of covariates. The effectiveness of the proposed methodology is demonstrated by simulation studies under various scenarios, and a real dataset regarding the number of maternal deaths in Mozambique is analyzed for illustration.

计数数据出现在许多科学领域。我们在此关注的重点是具有过多零点的空间计数响应和一组可用的协变量。为空间零膨胀计数模型估计模型参数和选择重要的协变量都是至关重要的。重要的是,为了减少对模型假设的偏差,我们提出了一种类似于空间零膨胀泊松的方法来为这类数据建模,它只依赖于空间计数响应的前两个矩的假设。然后,我们在广义估计方程和加权最小二乘法之间设计了一个有效的迭代估计程序,以分别估计数据模型的回归系数和变异图。此外,还通过分块千刀技术评估了估计器的稳定性。此外,还提出了一种基于估计均值结构均方误差的无分布模型选择标准,以选择最佳协变量子集。通过在各种情况下进行模拟研究,证明了所提方法的有效性,并分析了莫桑比克孕产妇死亡人数的真实数据集,以资说明。
{"title":"Estimation and selection for spatial zero-inflated count models","authors":"Chung-Wei Shen,&nbsp;Chun-Shu Chen","doi":"10.1002/env.2847","DOIUrl":"10.1002/env.2847","url":null,"abstract":"<p>The count data arise in many scientific areas. Our concerns here focus on spatial count responses with an excessive number of zeros and a set of available covariates. Estimating model parameters and selecting important covariates for spatial zero-inflated count models are both essential. Importantly, to alleviate deviations from model assumptions, we propose a spatial zero-inflated Poisson-like methodology to model this type of data, which relies only on assumptions for the first two moments of spatial count responses. We then design an effective iterative estimation procedure between the generalized estimating equation and the weighted least squares method to respectively estimate the regression coefficients and the variogram of the data model. Moreover, the stabilization of estimators is evaluated via a block jackknife technique. Furthermore, a distribution-free model selection criterion based on an estimate of the mean squared error of the estimated mean structure is proposed to select the best subset of covariates. The effectiveness of the proposed methodology is demonstrated by simulation studies under various scenarios, and a real dataset regarding the number of maternal deaths in Mozambique is analyzed for illustration.</p>","PeriodicalId":50512,"journal":{"name":"Environmetrics","volume":"35 4","pages":""},"PeriodicalIF":1.7,"publicationDate":"2024-04-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140599396","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Testing for galactic cosmic ray warming hypothesis using the notion of block-exogeneity 利用块状异质性概念检验银河宇宙射线变暖假说
IF 1.5 3区 环境科学与生态学 Q4 ENVIRONMENTAL SCIENCES Pub Date : 2024-03-31 DOI: 10.1002/env.2846
Umberto Triacca

In this article, we consider the notion of block-exogeneity and establish a characterization of it. We use this characterization to propose a procedure to test for block-exogeneity in a trivariate system. The proposed procedure has been applied to test the so-called galactic cosmic ray warming hypothesis. The galactic cosmic ray warming hypothesis suggests the existence of an indirect solar influence on Earth's climate. Our results seem to imply that this hypothesis does not hold. In particular, we find that the global temperature is block-exogenous with respect to both sunspot numbers (a measure of the solar activity) and galactic cosmic rays. This implies that the supposed indirect causal link from solar activity to temperature (through cosmic rays), postulated by the galactic cosmic ray warming hypothesis, does not appear to exist.

在本文中,我们考虑了块异质性的概念,并对其进行了表征。我们利用这一表征提出了一种在三变量系统中检验块状异质性的程序。提出的程序已被用于检验所谓的银河宇宙射线变暖假说。银河宇宙射线变暖假说认为太阳对地球气候存在间接影响。我们的结果似乎暗示这一假说并不成立。特别是,我们发现全球温度与太阳黑子数量(太阳活动的测量指标)和银河宇宙射线都是块状外生的。这意味着银河宇宙射线变暖假说所假定的太阳活动与温度(通过宇宙射线)之间的间接因果关系似乎并不存在。
{"title":"Testing for galactic cosmic ray warming hypothesis using the notion of block-exogeneity","authors":"Umberto Triacca","doi":"10.1002/env.2846","DOIUrl":"10.1002/env.2846","url":null,"abstract":"<p>In this article, we consider the notion of block-exogeneity and establish a characterization of it. We use this characterization to propose a procedure to test for block-exogeneity in a trivariate system. The proposed procedure has been applied to test the so-called galactic cosmic ray warming hypothesis. The galactic cosmic ray warming hypothesis suggests the existence of an indirect solar influence on Earth's climate. Our results seem to imply that this hypothesis does not hold. In particular, we find that the global temperature is block-exogenous with respect to both sunspot numbers (a measure of the solar activity) and galactic cosmic rays. This implies that the supposed indirect causal link from solar activity to temperature (through cosmic rays), postulated by the galactic cosmic ray warming hypothesis, does not appear to exist.</p>","PeriodicalId":50512,"journal":{"name":"Environmetrics","volume":"36 1","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-03-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140361323","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fast parameter estimation of generalized extreme value distribution using neural networks 利用神经网络快速估计广义极值分布的参数
IF 1.7 3区 环境科学与生态学 Q4 ENVIRONMENTAL SCIENCES Pub Date : 2024-03-12 DOI: 10.1002/env.2845
Sweta Rai, Alexis Hoffman, Soumendra Lahiri, Douglas W. Nychka, Stephan R. Sain, Soutir Bandyopadhyay

The heavy-tailed behavior of the generalized extreme-value distribution makes it a popular choice for modeling extreme events such as floods, droughts, heatwaves, wildfires and so forth. However, estimating the distribution's parameters using conventional maximum likelihood methods can be computationally intensive, even for moderate-sized datasets. To overcome this limitation, we propose a computationally efficient, likelihood-free estimation method utilizing a neural network. Through an extensive simulation study, we demonstrate that the proposed neural network-based method provides generalized extreme value distribution parameter estimates with comparable accuracy to the conventional maximum likelihood method but with a significant computational speedup. To account for estimation uncertainty, we utilize parametric bootstrapping, which is inherent in the trained network. Finally, we apply this method to 1000-year annual maximum temperature data from the Community Climate System Model version 3 across North America for three atmospheric concentrations: 289 ppm CO2$$ {mathrm{CO}}_2 $$ (pre-industrial), 700 ppm CO2$$ {mathrm{CO}}_2 $$ (future conditions), and 1400 ppm CO2$$ {mathrm{CO}}_2 $$, and compare the results with those obtained using the maximum likelihood approach.

广义极值分布的重尾行为使其成为洪水、干旱、热浪、野火等极端事件建模的热门选择。然而,使用传统的最大似然法估计该分布的参数需要大量的计算,即使对于中等规模的数据集也是如此。为了克服这一限制,我们提出了一种利用神经网络的计算高效、无似然法的估计方法。通过广泛的模拟研究,我们证明了所提出的基于神经网络的方法可以提供广义极值分布参数估计,其准确性与传统的最大似然法相当,但计算速度明显加快。为了考虑估计的不确定性,我们利用了参数自举法,这是训练有素的网络所固有的。最后,我们将这一方法应用于共同体气候系统模式第 3 版提供的北美地区三种大气浓度下的 1000 年年度最高气温数据:289 ppm(工业化前)、700 ppm(未来条件)和 1400 ppm,并将结果与使用最大似然法得出的结果进行比较。
{"title":"Fast parameter estimation of generalized extreme value distribution using neural networks","authors":"Sweta Rai,&nbsp;Alexis Hoffman,&nbsp;Soumendra Lahiri,&nbsp;Douglas W. Nychka,&nbsp;Stephan R. Sain,&nbsp;Soutir Bandyopadhyay","doi":"10.1002/env.2845","DOIUrl":"10.1002/env.2845","url":null,"abstract":"<p>The heavy-tailed behavior of the generalized extreme-value distribution makes it a popular choice for modeling extreme events such as floods, droughts, heatwaves, wildfires and so forth. However, estimating the distribution's parameters using conventional maximum likelihood methods can be computationally intensive, even for moderate-sized datasets. To overcome this limitation, we propose a computationally efficient, likelihood-free estimation method utilizing a neural network. Through an extensive simulation study, we demonstrate that the proposed neural network-based method provides generalized extreme value distribution parameter estimates with comparable accuracy to the conventional maximum likelihood method but with a significant computational speedup. To account for estimation uncertainty, we utilize parametric bootstrapping, which is inherent in the trained network. Finally, we apply this method to 1000-year annual maximum temperature data from the Community Climate System Model version 3 across North America for three atmospheric concentrations: 289 ppm <span></span><math>\u0000 <semantics>\u0000 <mrow>\u0000 <msub>\u0000 <mrow>\u0000 <mtext>CO</mtext>\u0000 </mrow>\u0000 <mrow>\u0000 <mn>2</mn>\u0000 </mrow>\u0000 </msub>\u0000 </mrow>\u0000 <annotation>$$ {mathrm{CO}}_2 $$</annotation>\u0000 </semantics></math> (pre-industrial), 700 ppm <span></span><math>\u0000 <semantics>\u0000 <mrow>\u0000 <msub>\u0000 <mrow>\u0000 <mtext>CO</mtext>\u0000 </mrow>\u0000 <mrow>\u0000 <mn>2</mn>\u0000 </mrow>\u0000 </msub>\u0000 </mrow>\u0000 <annotation>$$ {mathrm{CO}}_2 $$</annotation>\u0000 </semantics></math> (future conditions), and 1400 ppm <span></span><math>\u0000 <semantics>\u0000 <mrow>\u0000 <msub>\u0000 <mrow>\u0000 <mtext>CO</mtext>\u0000 </mrow>\u0000 <mrow>\u0000 <mn>2</mn>\u0000 </mrow>\u0000 </msub>\u0000 </mrow>\u0000 <annotation>$$ {mathrm{CO}}_2 $$</annotation>\u0000 </semantics></math>, and compare the results with those obtained using the maximum likelihood approach.</p>","PeriodicalId":50512,"journal":{"name":"Environmetrics","volume":"35 3","pages":""},"PeriodicalIF":1.7,"publicationDate":"2024-03-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140129341","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Recursive nearest neighbor co-kriging models for big multi-fidelity spatial data sets 用于大型多保真度空间数据集的递归近邻协同定位模型
IF 1.7 3区 环境科学与生态学 Q4 ENVIRONMENTAL SCIENCES Pub Date : 2024-02-25 DOI: 10.1002/env.2844
Si Cheng, Bledar A. Konomi, Georgios Karagiannis, Emily L. Kang

Big datasets are gathered daily from different remote sensing platforms. Recently, statistical co-kriging models, with the help of scalable techniques, have been able to combine such datasets by using spatially varying bias corrections. The associated Bayesian inference for these models is usually facilitated via Markov chain Monte Carlo (MCMC) methods which present (sometimes prohibitively) slow mixing and convergence because they require the simulation of high-dimensional random effect vectors from their posteriors given large datasets. To enable fast inference in big data spatial problems, we propose the recursive nearest neighbor co-kriging (RNNC) model. Based on this model, we develop two computationally efficient inferential procedures: (a) the collapsed RNNC which reduces the posterior sampling space by integrating out the latent processes, and (b) the conjugate RNNC, an MCMC free inference which significantly reduces the computational time without sacrificing prediction accuracy. An important highlight of conjugate RNNC is that it enables fast inference in massive multifidelity data sets by avoiding expensive integration algorithms. The efficient computational and good predictive performances of our proposed algorithms are demonstrated on benchmark examples and the analysis of the High-resolution Infrared Radiation Sounder data gathered from two NOAA polar orbiting satellites in which we managed to reduce the computational time from multiple hours to just a few minutes.

每天都有大量数据集从不同的遥感平台收集而来。近来,在可扩展技术的帮助下,统计共轭模型能够通过使用空间变化偏差校正来组合这些数据集。这些模型的相关贝叶斯推断通常通过马尔科夫链蒙特卡罗(MCMC)方法来实现,但由于这些方法需要从大型数据集的后验中模拟高维随机效应向量,因此混合和收敛速度较慢(有时慢得令人望而却步)。为了在大数据空间问题中实现快速推理,我们提出了递归近邻共触发(RNNC)模型。基于该模型,我们开发了两种计算高效的推理程序:(a) 折叠 RNNC,通过整合出潜在过程来减少后验采样空间;以及 (b) 共轭 RNNC,一种无 MCMC 的推理方法,在不牺牲预测精度的情况下显著减少了计算时间。共轭 RNNC 的一个重要亮点是,它通过避免昂贵的积分算法,实现了在海量多保真数据集中的快速推理。我们提出的算法具有高效的计算能力和良好的预测性能,这一点在基准实例中得到了证明,在对两颗 NOAA 极轨道卫星收集的高分辨率红外辐射探测仪数据的分析中,我们成功地将计算时间从数小时缩短到了几分钟。
{"title":"Recursive nearest neighbor co-kriging models for big multi-fidelity spatial data sets","authors":"Si Cheng,&nbsp;Bledar A. Konomi,&nbsp;Georgios Karagiannis,&nbsp;Emily L. Kang","doi":"10.1002/env.2844","DOIUrl":"10.1002/env.2844","url":null,"abstract":"<p>Big datasets are gathered daily from different remote sensing platforms. Recently, statistical co-kriging models, with the help of scalable techniques, have been able to combine such datasets by using spatially varying bias corrections. The associated Bayesian inference for these models is usually facilitated via Markov chain Monte Carlo (MCMC) methods which present (sometimes prohibitively) slow mixing and convergence because they require the simulation of high-dimensional random effect vectors from their posteriors given large datasets. To enable fast inference in big data spatial problems, we propose the recursive nearest neighbor co-kriging (RNNC) model. Based on this model, we develop two computationally efficient inferential procedures: (a) the collapsed RNNC which reduces the posterior sampling space by integrating out the latent processes, and (b) the conjugate RNNC, an MCMC free inference which significantly reduces the computational time without sacrificing prediction accuracy. An important highlight of conjugate RNNC is that it enables fast inference in massive multifidelity data sets by avoiding expensive integration algorithms. The efficient computational and good predictive performances of our proposed algorithms are demonstrated on benchmark examples and the analysis of the High-resolution Infrared Radiation Sounder data gathered from two NOAA polar orbiting satellites in which we managed to reduce the computational time from multiple hours to just a few minutes.</p>","PeriodicalId":50512,"journal":{"name":"Environmetrics","volume":"35 4","pages":""},"PeriodicalIF":1.7,"publicationDate":"2024-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/env.2844","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139969576","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sampling design methods for making improved lake management decisions 改进湖泊管理决策的取样设计方法
IF 1.5 3区 环境科学与生态学 Q4 ENVIRONMENTAL SCIENCES Pub Date : 2024-02-08 DOI: 10.1002/env.2842
Vilja Koski, Jo Eidsvik

The ecological status of lakes is important for understanding an ecosystem's biodiversity as well as for service water quality and policies related to land use and agricultural run-off. If the status is weak, then decisions about management alternatives need to be made. We assess the value of information of lake monitoring in Finland, where lakes are abundant. With reasonable ecological values and restoration costs, the value of information analysis can be compared with the survey's costs. Data are worth gathering if the expected value from the data exceeds the costs. From existing data, we specify a hierarchical Bayesian spatial logistic regression model for the ecological status of lakes. We then rely on functional approximations and Laplace approximations to get closed-form expressions for the value of information of a sampling design. The case study contains thousands of lakes. The combinatorially difficult design problem is to wisely pick the right subset of lakes for data gathering. To solve this optimization problem, we study the performance of various heuristics: greedy forward algorithms, exchange algorithms and Bayesian optimization approaches. The value of information increases quickly when adding lakes to a small design but then flattens out. Good designs are usually composed of lakes that are difficult to manage, while also balancing a variety of covariates and geographic coverage. The designs achieved by forward selection are reasonably good, but we can outperform them with the more nuanced search algorithms. Statistical designs clearly outperform other designs selected according to simpler criteria.

湖泊的生态状况对于了解生态系统的生物多样性、水质服务以及与土地利用和农业径流相关的政策都非常重要。如果湖泊生态状况不佳,就需要做出管理决策。在湖泊众多的芬兰,我们对湖泊监测信息的价值进行了评估。在生态价值和恢复成本合理的情况下,信息分析的价值可与调查成本进行比较。如果数据的预期价值超过成本,那么数据就值得收集。根据现有数据,我们为湖泊生态状况指定了一个分层贝叶斯空间逻辑回归模型。然后,我们依靠函数近似和拉普拉斯近似,得到了抽样设计信息价值的闭式表达式。案例研究包含数千个湖泊。如何明智地选择合适的湖泊子集来收集数据,是一个复杂的设计问题。为了解决这个优化问题,我们研究了各种启发式方法的性能:贪婪前向算法、交换算法和贝叶斯优化方法。在小型设计中添加湖泊时,信息价值会迅速增加,但随后会趋于平稳。好的设计通常由难以管理的湖泊组成,同时还要兼顾各种协变量和地理覆盖范围。通过正向选择获得的设计相当不错,但我们可以通过更细致的搜索算法来超越它们。统计设计明显优于根据更简单标准选出的其他设计。
{"title":"Sampling design methods for making improved lake management decisions","authors":"Vilja Koski,&nbsp;Jo Eidsvik","doi":"10.1002/env.2842","DOIUrl":"10.1002/env.2842","url":null,"abstract":"<p>The ecological status of lakes is important for understanding an ecosystem's biodiversity as well as for service water quality and policies related to land use and agricultural run-off. If the status is weak, then decisions about management alternatives need to be made. We assess the value of information of lake monitoring in Finland, where lakes are abundant. With reasonable ecological values and restoration costs, the value of information analysis can be compared with the survey's costs. Data are worth gathering if the expected value from the data exceeds the costs. From existing data, we specify a hierarchical Bayesian spatial logistic regression model for the ecological status of lakes. We then rely on functional approximations and Laplace approximations to get closed-form expressions for the value of information of a sampling design. The case study contains thousands of lakes. The combinatorially difficult design problem is to wisely pick the right subset of lakes for data gathering. To solve this optimization problem, we study the performance of various heuristics: greedy forward algorithms, exchange algorithms and Bayesian optimization approaches. The value of information increases quickly when adding lakes to a small design but then flattens out. Good designs are usually composed of lakes that are difficult to manage, while also balancing a variety of covariates and geographic coverage. The designs achieved by forward selection are reasonably good, but we can outperform them with the more nuanced search algorithms. Statistical designs clearly outperform other designs selected according to simpler criteria.</p>","PeriodicalId":50512,"journal":{"name":"Environmetrics","volume":"36 1","pages":""},"PeriodicalIF":1.5,"publicationDate":"2024-02-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/env.2842","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139771804","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Environmetrics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1