首页 > 最新文献

Environmetrics最新文献

英文 中文
2023 Editorial Collaborators 2023 编辑合作者
IF 1.7 3区 环境科学与生态学 Q4 ENVIRONMENTAL SCIENCES Pub Date : 2024-01-14 DOI: 10.1002/env.2841
{"title":"2023 Editorial Collaborators","authors":"","doi":"10.1002/env.2841","DOIUrl":"https://doi.org/10.1002/env.2841","url":null,"abstract":"","PeriodicalId":50512,"journal":{"name":"Environmetrics","volume":"35 1","pages":""},"PeriodicalIF":1.7,"publicationDate":"2024-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139474034","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Structural equation models for simultaneous modeling of air pollutants 空气污染物同步建模的结构方程模型
IF 1.7 3区 环境科学与生态学 Q4 ENVIRONMENTAL SCIENCES Pub Date : 2024-01-14 DOI: 10.1002/env.2837
Mariaelena Bottazzi Schenone, Elena Grimaccia, Maurizio Vichi

This paper provides a new modeling for air pollution, simultaneously taking into account the six main pollutants (PM10 and PM2.5, Sulphate Dioxide, Nitrogen Dioxide, Carbon Monoxide, ground level Ozone concentrations) and their key determinants, employing Structural Equation Models (SEMs). The model is able to estimate the complex links among air pollutants, often neglected in literature, and identifies specific drivers of air pollution. In literature, indexes of air pollution achieved using a fully statistical methodology have not been proposed yet. Indeed, an added value of this proposal is the statistical procedure itself, which can be applied also to obtain indexes modeling different phenomena. In particular, in this study, the new Air Pollution Index (API) is based on a modeling approach that allows to assess, through statistical criteria, the goodness of fit of the SEM in modeling pollutants and the significance of their determinants. The performance of the new index is assessed using air quality data for municipal European areas, which are characterized by different socioeconomic, geographical, and meteorological features. SEMs are estimated and evaluated in terms of best fit and model complexity. The index resulting by the best SEM is compared with the well-established Air Quality Index (AQI). The new API is validated by means of a sensitivity analysis, performed with a simulation study. Finally, to visualize the meaningfulness of the obtained results, a model-based cluster analysis is estimated on the municipal areas. The proposed SEM contributes to a better understanding of the relationships between air pollutants and their determinants, and this knowledge can inform policy decisions aimed at reducing air pollution and improving public health.

本文采用结构方程模型(SEM),提供了一种新的空气污染模型,同时考虑了六种主要污染物(PM10 和 PM2.5、二氧化硫、二氧化氮、一氧化碳、地面臭氧浓度)及其主要决定因素。该模型能够估计文献中经常忽略的空气污染物之间的复杂联系,并确定空气污染的具体驱动因素。在文献中,尚未提出使用完全统计方法实现的空气污染指数。事实上,该建议的附加值在于统计程序本身,它也可用于获得模拟不同现象的指数。特别是,在本研究中,新的空气污染指数(API)是基于一种建模方法,可以通过统计标准来评估 SEM 在污染物建模中的拟合度及其决定因素的重要性。新指数的性能使用欧洲城市地区的空气质量数据进行评估,这些地区具有不同的社会经济、地理和气象特征。根据最佳拟合度和模型复杂度对 SEM 进行了估算和评估。最佳 SEM 得出的指数与成熟的空气质量指数(AQI)进行了比较。通过模拟研究进行敏感性分析,验证了新的空气质量指数。最后,为了使所获结果的意义可视化,对城市地区进行了基于模型的聚类分析。所提出的 SEM 有助于更好地理解空气污染物及其决定因素之间的关系,这些知识可以为旨在减少空气污染和改善公众健康的决策提供信息。
{"title":"Structural equation models for simultaneous modeling of air pollutants","authors":"Mariaelena Bottazzi Schenone,&nbsp;Elena Grimaccia,&nbsp;Maurizio Vichi","doi":"10.1002/env.2837","DOIUrl":"10.1002/env.2837","url":null,"abstract":"<p>This paper provides a new modeling for air pollution, simultaneously taking into account the six main pollutants (PM10 and PM2.5, Sulphate Dioxide, Nitrogen Dioxide, Carbon Monoxide, ground level Ozone concentrations) and their key determinants, employing Structural Equation Models (SEMs). The model is able to estimate the complex links among air pollutants, often neglected in literature, and identifies specific drivers of air pollution. In literature, indexes of air pollution achieved using a fully statistical methodology have not been proposed yet. Indeed, an added value of this proposal is the statistical procedure itself, which can be applied also to obtain indexes modeling different phenomena. In particular, in this study, the new Air Pollution Index (API) is based on a modeling approach that allows to assess, through statistical criteria, the goodness of fit of the SEM in modeling pollutants and the significance of their determinants. The performance of the new index is assessed using air quality data for municipal European areas, which are characterized by different socioeconomic, geographical, and meteorological features. SEMs are estimated and evaluated in terms of best fit and model complexity. The index resulting by the best SEM is compared with the well-established Air Quality Index (AQI). The new API is validated by means of a sensitivity analysis, performed with a simulation study. Finally, to visualize the meaningfulness of the obtained results, a model-based cluster analysis is estimated on the municipal areas. The proposed SEM contributes to a better understanding of the relationships between air pollutants and their determinants, and this knowledge can inform policy decisions aimed at reducing air pollution and improving public health.</p>","PeriodicalId":50512,"journal":{"name":"Environmetrics","volume":"35 3","pages":""},"PeriodicalIF":1.7,"publicationDate":"2024-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139482958","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Quantifying and correcting geolocation error in spaceborne LiDAR forest canopy observations using high spatial accuracy data: A Bayesian model approach 利用高空间精度数据量化和纠正空间LiDAR林冠观测中的地理定位误差:贝叶斯模型方法
IF 1.7 3区 环境科学与生态学 Q4 ENVIRONMENTAL SCIENCES Pub Date : 2024-01-08 DOI: 10.1002/env.2840
Elliot S. Shannon, Andrew O. Finley, Daniel J. Hayes, Sylvia N. Noralez, Aaron R. Weiskittel, Bruce D. Cook, Chad Babcock

Geolocation error in spaceborne sampling light detection and ranging (LiDAR) measurements of forest structure can compromise forest attribute estimates and degrade integration with georeferenced field measurements or other remotely sensed data. Data integration is especially problematic when geolocation error is not well quantified. We propose a general model that uses airborne laser scanning data to quantify and correct geolocation error in spaceborne sampling LiDAR. To illustrate the model, LiDAR data from NASA Goddard's LiDAR Hyperspectral and Thermal Imager (G-LiHT) was used with a subset of LiDAR data from NASA's Global Ecosystem Dynamics Investigation (GEDI). The model accommodates multiple canopy height metrics derived from a simulated GEDI footprint kernel using spatially coincident G-LiHT, and incorporates both additive and multiplicative mapping between the canopy height metrics generated from both datasets. A Bayesian implementation provides probabilistic uncertainty quantification in both parameter and geolocation error estimates. Results show a systematic geolocation error of 9.62 m in the southwest direction. In addition, estimated geolocation errors within GEDI footprints were highly variable, with results showing a $$ sim $$0.45 probability the true footprint center is within 20 m. Estimating and correcting geolocation error via the model outlined here can help inform subsequent efforts to integrate spaceborne LiDAR data, like GEDI, with other georeferenced data.

对森林结构进行空间采样光探测与测距(LiDAR)测量时产生的地理定位误差会影响森林属性估算,并降低与地理参照实地测量或其他遥感数据的整合效果。当地理定位误差不能很好量化时,数据整合尤其成问题。我们提出了一个通用模型,利用机载激光扫描数据来量化和纠正空间采样激光雷达的地理定位误差。为了说明该模型,我们使用了 NASA 戈达德激光雷达高光谱和热成像仪(G-LiHT)的激光雷达数据,以及 NASA 全球生态系统动力学调查(GEDI)的激光雷达数据子集。该模型采用空间重合的 G-LiHT 模拟 GEDI 迹线内核得出的多个树冠高度指标,并结合了两个数据集生成的树冠高度指标之间的加法和乘法映射。贝叶斯方法的实施为参数和地理定位误差估计提供了概率不确定性量化。结果显示,西南方向的系统地理定位误差为 9.62 米。此外,GEDI足迹内的估计地理定位误差变化很大,结果显示真实足迹中心在20米以内的概率为∼$$ sim $$0.45。
{"title":"Quantifying and correcting geolocation error in spaceborne LiDAR forest canopy observations using high spatial accuracy data: A Bayesian model approach","authors":"Elliot S. Shannon,&nbsp;Andrew O. Finley,&nbsp;Daniel J. Hayes,&nbsp;Sylvia N. Noralez,&nbsp;Aaron R. Weiskittel,&nbsp;Bruce D. Cook,&nbsp;Chad Babcock","doi":"10.1002/env.2840","DOIUrl":"10.1002/env.2840","url":null,"abstract":"<p>Geolocation error in spaceborne sampling light detection and ranging (LiDAR) measurements of forest structure can compromise forest attribute estimates and degrade integration with georeferenced field measurements or other remotely sensed data. Data integration is especially problematic when geolocation error is not well quantified. We propose a general model that uses airborne laser scanning data to quantify and correct geolocation error in spaceborne sampling LiDAR. To illustrate the model, LiDAR data from NASA Goddard's LiDAR Hyperspectral and Thermal Imager (G-LiHT) was used with a subset of LiDAR data from NASA's Global Ecosystem Dynamics Investigation (GEDI). The model accommodates multiple canopy height metrics derived from a simulated GEDI footprint kernel using spatially coincident G-LiHT, and incorporates both additive and multiplicative mapping between the canopy height metrics generated from both datasets. A Bayesian implementation provides probabilistic uncertainty quantification in both parameter and geolocation error estimates. Results show a systematic geolocation error of 9.62 m in the southwest direction. In addition, estimated geolocation errors within GEDI footprints were highly variable, with results showing a <span></span><math>\u0000 <semantics>\u0000 <mrow>\u0000 <mo>∼</mo>\u0000 </mrow>\u0000 <annotation>$$ sim $$</annotation>\u0000 </semantics></math>0.45 probability the true footprint center is within 20 m. Estimating and correcting geolocation error via the model outlined here can help inform subsequent efforts to integrate spaceborne LiDAR data, like GEDI, with other georeferenced data.</p>","PeriodicalId":50512,"journal":{"name":"Environmetrics","volume":"35 4","pages":""},"PeriodicalIF":1.7,"publicationDate":"2024-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/env.2840","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139409501","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multivariate nearest-neighbors Gaussian processes with random covariance matrices 具有随机协方差矩阵的多变量近邻高斯过程
IF 1.7 3区 环境科学与生态学 Q4 ENVIRONMENTAL SCIENCES Pub Date : 2024-01-02 DOI: 10.1002/env.2839
Isabelle Grenier, Bruno Sansó, Jessica L. Matthews

We propose a non-stationary spatial model based on a normal-inverse-Wishart framework, conditioning on a set of nearest-neighbors. The model, called nearest-neighbor Gaussian process with random covariance matrices is developed for both univariate and multivariate spatial settings and allows for fully flexible covariance structures that impose no stationarity or isotropic restrictions. In addition, the model can handle duplicate observations and missing data. We consider an approach based on integrating out the spatial random effects that allows fast inference for the model parameters. We also consider a full hierarchical approach that leverages the sparse structures induced by the model to perform fast Monte Carlo computations. Strong computational efficiency is achieved by leveraging the adaptive localized structure of the model that allows for a high level of parallelization. We illustrate the performance of the model with univariate and bivariate simulations, as well as with observations from two stationary satellites consisting of albedo measurements.

我们提出了一种基于正态逆 Wishart 框架的非稳态空间模型,以一组最近邻为条件。该模型被称为具有随机协方差矩阵的近邻高斯过程,适用于单变量和多变量空间环境,允许完全灵活的协方差结构,不施加任何静态或各向同性限制。此外,该模型还能处理重复观测数据和缺失数据。我们考虑了一种基于空间随机效应积分的方法,这种方法可以快速推断模型参数。我们还考虑了一种完全分层的方法,利用模型引起的稀疏结构来执行快速蒙特卡罗计算。利用模型的自适应局部结构可以实现高水平的并行化,从而达到很高的计算效率。我们通过单变量和双变量模拟,以及由反照率测量组成的两颗静止卫星的观测数据,说明了该模型的性能。
{"title":"Multivariate nearest-neighbors Gaussian processes with random covariance matrices","authors":"Isabelle Grenier,&nbsp;Bruno Sansó,&nbsp;Jessica L. Matthews","doi":"10.1002/env.2839","DOIUrl":"10.1002/env.2839","url":null,"abstract":"<p>We propose a non-stationary spatial model based on a normal-inverse-Wishart framework, conditioning on a set of nearest-neighbors. The model, called nearest-neighbor Gaussian process with random covariance matrices is developed for both univariate and multivariate spatial settings and allows for fully flexible covariance structures that impose no stationarity or isotropic restrictions. In addition, the model can handle duplicate observations and missing data. We consider an approach based on integrating out the spatial random effects that allows fast inference for the model parameters. We also consider a full hierarchical approach that leverages the sparse structures induced by the model to perform fast Monte Carlo computations. Strong computational efficiency is achieved by leveraging the adaptive localized structure of the model that allows for a high level of parallelization. We illustrate the performance of the model with univariate and bivariate simulations, as well as with observations from two stationary satellites consisting of albedo measurements.</p>","PeriodicalId":50512,"journal":{"name":"Environmetrics","volume":"35 3","pages":""},"PeriodicalIF":1.7,"publicationDate":"2024-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139374292","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Statistical evaluation of a long-memory process using the generalized entropic value-at-risk 利用广义熵风险值对长记忆过程进行统计评估
IF 1.7 3区 环境科学与生态学 Q4 ENVIRONMENTAL SCIENCES Pub Date : 2023-12-25 DOI: 10.1002/env.2838
Hidekazu Yoshioka, Yumi Yoshioka

The modeling and identification of time series data with a long memory are important in various fields. The streamflow discharge is one such example that can be reasonably described as an aggregated stochastic process of randomized affine processes where the probability measure, we call it reversion measure, for the randomization is not directly observable. Accurate identification of the reversion measure is critical because of its omnipresence in the aggregated stochastic process. However, the modeling accuracy is commonly limited by the available real-world data. We resolve this issue by proposing the novel upper and lower bounds of a statistic of interest subject to ambiguity of the reversion measure. Here, we use the Tsallis value-at-risk (TsVaR) as a convex risk functional to generalize the widely used entropic value-at-risk (EVaR) as a sharp statistical indicator. We demonstrate that the EVaR cannot be used for evaluating key statistics, such as mean and variance, of the streamflow discharge due to the blowup of some exponential integrand. We theoretically show that the TsVaR can avoid this issue because it requires only the existence of some polynomial moment, not exponential moment. As a demonstration, we apply the semi-implicit gradient descent method to calculate the TsVaR and corresponding Radon–Nikodym derivative for time series data of actual streamflow discharges in mountainous river environments.

具有长记忆的时间序列数据的建模和识别在各个领域都很重要。溪流排放就是这样一个例子,它可以被合理地描述为随机仿射过程的聚合随机过程,其中随机化的概率度量(我们称之为回归度量)是不可直接观测的。由于回归度量在聚合随机过程中无处不在,因此准确识别回归度量至关重要。然而,建模的准确性通常会受到可用现实数据的限制。为了解决这个问题,我们提出了新颖的上界和下界,即在回归度量不明确的情况下,相关统计量的上界和下界。在此,我们使用 Tsallis 风险值(TsVaR)作为凸风险函数,将广泛使用的熵风险值(EVaR)概括为尖锐的统计指标。我们证明,由于某些指数积分的炸毁,EVaR 不能用于评估溪流排放的平均值和方差等关键统计数据。我们从理论上证明,TsVaR 可以避免这一问题,因为它只要求存在某些多项式矩,而不是指数矩。作为演示,我们应用半隐式梯度下降法计算了山区河流环境中实际溪流排放的时间序列数据的 TsVaR 和相应的 Radon-Nikodym 导数。
{"title":"Statistical evaluation of a long-memory process using the generalized entropic value-at-risk","authors":"Hidekazu Yoshioka,&nbsp;Yumi Yoshioka","doi":"10.1002/env.2838","DOIUrl":"10.1002/env.2838","url":null,"abstract":"<p>The modeling and identification of time series data with a long memory are important in various fields. The streamflow discharge is one such example that can be reasonably described as an aggregated stochastic process of randomized affine processes where the probability measure, we call it reversion measure, for the randomization is not directly observable. Accurate identification of the reversion measure is critical because of its omnipresence in the aggregated stochastic process. However, the modeling accuracy is commonly limited by the available real-world data. We resolve this issue by proposing the novel upper and lower bounds of a statistic of interest subject to ambiguity of the reversion measure. Here, we use the Tsallis value-at-risk (TsVaR) as a convex risk functional to generalize the widely used entropic value-at-risk (EVaR) as a sharp statistical indicator. We demonstrate that the EVaR cannot be used for evaluating key statistics, such as mean and variance, of the streamflow discharge due to the blowup of some exponential integrand. We theoretically show that the TsVaR can avoid this issue because it requires only the existence of some polynomial moment, not exponential moment. As a demonstration, we apply the semi-implicit gradient descent method to calculate the TsVaR and corresponding Radon–Nikodym derivative for time series data of actual streamflow discharges in mountainous river environments.</p>","PeriodicalId":50512,"journal":{"name":"Environmetrics","volume":"35 4","pages":""},"PeriodicalIF":1.7,"publicationDate":"2023-12-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139056396","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
New generalized extreme value distribution with applications to extreme temperature data 应用于极端温度数据的新广义极值分布
IF 1.7 3区 环境科学与生态学 Q4 ENVIRONMENTAL SCIENCES Pub Date : 2023-12-14 DOI: 10.1002/env.2836
Wilson Gyasi, Kahadawala Cooray

A new generalization of the extreme value distribution is presented with its density function, having a wide variety of density and tail shapes for modeling extreme value data. This generalized extreme value distribution will be referred to as the odd generalized extreme value distribution. It is derived by considering the distributions of the odds of the generalized extreme value distribution. Consequently, the new distribution is enlightened by not only having all six families of extreme value distributions; Gumbel, Fréchet, Weibull, reverse-Gumbel, reverse-Fréchet, and reverse-Weibull as submodels but also convenient for modeling bimodal extreme value data that are frequently found in environmental sciences. Basic properties of the distribution, including tail behavior and tail heaviness, are studied. Also, quantile-based aliases of the new distribution are illustrated using Galton's skewness and Moor's kurtosis plane. The adequacy of the new distribution is illustrated using well-known goodness-of-fit measures. A simulation is performed to validate the estimated risk measures due to repeated data points frequently found in temperature data. The Grand Rapids and well-known Wooster temperature data sets are analyzed and compared to nine different extreme value distributions to illustrate the new distribution's bimodality, flexibility, and overall fitness.

提出了极值分布的一种新的推广方法,它的密度函数具有多种密度和尾形,可用于极值数据的建模。这种广义极值分布称为奇广义极值分布。它是通过考虑广义极值分布的概率分布而得到的。因此,新分布不仅具有所有六类极值分布;Gumbel, fracimchet, Weibull, reverse-Gumbel, reverse- fracimchet和reverse-Weibull作为子模型,但也方便建模在环境科学中经常发现的双峰极值数据。研究了尾翼分布的基本性质,包括尾翼行为和尾翼质量。此外,用高尔顿偏度和摩尔峰度平面说明了新分布的基于分位数的别名。用众所周知的拟合优度来说明新分布的充分性。通过模拟来验证由于温度数据中经常发现的重复数据点而估计的风险措施。分析了大急流城和著名的Wooster温度数据集,并将其与9种不同的极值分布进行了比较,以说明新分布的双峰性、灵活性和整体适应性。
{"title":"New generalized extreme value distribution with applications to extreme temperature data","authors":"Wilson Gyasi,&nbsp;Kahadawala Cooray","doi":"10.1002/env.2836","DOIUrl":"10.1002/env.2836","url":null,"abstract":"<p>A new generalization of the extreme value distribution is presented with its density function, having a wide variety of density and tail shapes for modeling extreme value data. This generalized extreme value distribution will be referred to as the odd generalized extreme value distribution. It is derived by considering the distributions of the odds of the generalized extreme value distribution. Consequently, the new distribution is enlightened by not only having all six families of extreme value distributions; Gumbel, Fréchet, Weibull, reverse-Gumbel, reverse-Fréchet, and reverse-Weibull as submodels but also convenient for modeling bimodal extreme value data that are frequently found in environmental sciences. Basic properties of the distribution, including tail behavior and tail heaviness, are studied. Also, quantile-based aliases of the new distribution are illustrated using Galton's skewness and Moor's kurtosis plane. The adequacy of the new distribution is illustrated using well-known goodness-of-fit measures. A simulation is performed to validate the estimated risk measures due to repeated data points frequently found in temperature data. The Grand Rapids and well-known Wooster temperature data sets are analyzed and compared to nine different extreme value distributions to illustrate the new distribution's bimodality, flexibility, and overall fitness.</p>","PeriodicalId":50512,"journal":{"name":"Environmetrics","volume":"35 3","pages":""},"PeriodicalIF":1.7,"publicationDate":"2023-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138628052","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Total least squares bias in climate fingerprinting regressions with heterogeneous noise variances and correlated explanatory variables 具有异质噪声方差和相关解释变量的气候指纹回归中的总最小二乘法偏差
IF 1.7 3区 环境科学与生态学 Q4 ENVIRONMENTAL SCIENCES Pub Date : 2023-12-12 DOI: 10.1002/env.2835
Ross McKitrick

Regression-based “fingerprinting” methods in climate science employ total least squares (TLS) or orthogonal regression to remedy attenuation bias arising from measurement error due to reliance on climate model-generated explanatory variables. Proving the consistency of multivariate TLS requires assuming noise variances are equal across all variables in the model. This assumption has been challenged empirically in the climate context but little is known about TLS biases when the assumption is violated. Monte Carlo analysis is used herein to examine coefficient biases when the noise variances are not equal. The analysis allows the explanatory variables to be negatively correlated which is typical in climate applications. Ordinary least squares (OLS) exhibits the expected attenuation bias which vanishes as the noise variances on the explanatory variables disappear. In some cases, TLS corrects attenuation bias but more typically imparts large and generally positive biases. OLS performs well when the true value of β=0$$ beta =0 $$ whereas TLS performs quite poorly. This implies that TLS is not well suited for tests of the null. When β=1$$ beta =1 $$ TLS tends to exhibit opposite biases to OLS. Diagnostic information specific to each data sample should be consulted before using TLS to avoid spurious inferences and replacing OLS attenuation bias with other, potentially larger biases.

气候科学中基于回归的“指纹”方法采用总最小二乘(TLS)或正交回归来补救由于依赖气候模式生成的解释变量而导致的测量误差所引起的衰减偏差。证明多变量TLS的一致性需要假设模型中所有变量的噪声方差相等。这一假设在气候背景下受到了经验上的挑战,但当这一假设被违反时,人们对TLS偏差知之甚少。本文采用蒙特卡罗分析来检验噪声方差不相等时的系数偏差。分析允许解释变量呈负相关,这在气候应用中是典型的。普通最小二乘(OLS)表现出预期的衰减偏差,随着解释变量上的噪声方差的消失而消失。在某些情况下,TLS可以纠正衰减偏差,但更典型的是输入较大且通常为正的偏差。当β的真值=0 $$ beta =0 $$时,OLS表现良好,而TLS表现相当差。这意味着TLS不适用于null的测试。当β=1 $$ beta =1 $$时,TLS倾向于表现出与OLS相反的偏差。在使用TLS之前,应该参考特定于每个数据样本的诊断信息,以避免错误的推断,并用其他可能更大的偏差替换OLS衰减偏差。
{"title":"Total least squares bias in climate fingerprinting regressions with heterogeneous noise variances and correlated explanatory variables","authors":"Ross McKitrick","doi":"10.1002/env.2835","DOIUrl":"10.1002/env.2835","url":null,"abstract":"<p>Regression-based “fingerprinting” methods in climate science employ total least squares (TLS) or orthogonal regression to remedy attenuation bias arising from measurement error due to reliance on climate model-generated explanatory variables. Proving the consistency of multivariate TLS requires assuming noise variances are equal across all variables in the model. This assumption has been challenged empirically in the climate context but little is known about TLS biases when the assumption is violated. Monte Carlo analysis is used herein to examine coefficient biases when the noise variances are not equal. The analysis allows the explanatory variables to be negatively correlated which is typical in climate applications. Ordinary least squares (OLS) exhibits the expected attenuation bias which vanishes as the noise variances on the explanatory variables disappear. In some cases, TLS corrects attenuation bias but more typically imparts large and generally positive biases. OLS performs well when the true value of <math>\u0000 <semantics>\u0000 <mrow>\u0000 <mi>β</mi>\u0000 <mo>=</mo>\u0000 <mn>0</mn>\u0000 </mrow>\u0000 <annotation>$$ beta =0 $$</annotation>\u0000 </semantics></math> whereas TLS performs quite poorly. This implies that TLS is not well suited for tests of the null. When <math>\u0000 <semantics>\u0000 <mrow>\u0000 <mi>β</mi>\u0000 <mo>=</mo>\u0000 <mn>1</mn>\u0000 </mrow>\u0000 <annotation>$$ beta =1 $$</annotation>\u0000 </semantics></math> TLS tends to exhibit opposite biases to OLS. Diagnostic information specific to each data sample should be consulted before using TLS to avoid spurious inferences and replacing OLS attenuation bias with other, potentially larger biases.</p>","PeriodicalId":50512,"journal":{"name":"Environmetrics","volume":"35 2","pages":""},"PeriodicalIF":1.7,"publicationDate":"2023-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/env.2835","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138631025","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Temporal evolution of the extreme excursions of multivariate k $$ k $$ th order Markov processes with application to oceanographic data 多元k $$ k $$阶马尔可夫过程极值漂移的时间演化及其在海洋资料中的应用
IF 1.7 3区 环境科学与生态学 Q4 ENVIRONMENTAL SCIENCES Pub Date : 2023-12-03 DOI: 10.1002/env.2834
Stan Tendijck, Philip Jonathan, David Randell, Jonathan Tawn

We develop two models for the temporal evolution of extreme events of multivariate k$$ k $$th order Markov processes. The foundation of our methodology lies in the conditional extremes model of Heffernan and Tawn (Journal of the Royal Statistical Society: Series B (Methodology), 2014, 66, 497–546), and it naturally extends the work of Winter and Tawn (Journal of the Royal Statistical Society: Series C (Applied Statistics), 2016, 65, 345–365; Extremes, 2017, 20, 393–415) and Tendijck et al. (Environmetrics 2019, 30, e2541) to include multivariate random variables. We use cross-validation-type techniques to develop a model order selection procedure, and we test our models on two-dimensional meteorological-oceanographic data with directional covariates for a location in the northern North Sea. We conclude that the newly-developed models perform better than the widely used historical matching methodology for these data.

我们建立了两个多元k $$ k $$阶马尔可夫过程的极端事件的时间演化模型。我们的方法基础在于Heffernan和Tawn的条件极值模型(《皇家统计学会杂志》:B辑(方法论),2014年,66,497 - 546),它自然地扩展了Winter和Tawn的工作(《皇家统计学会杂志》:C辑(应用统计),2016年,65,345 - 365;极端,2017,20,393 - 415)和Tendijck等人(Environmetrics 2019, 30, e2541),包括多变量随机变量。我们使用交叉验证型技术来开发模型顺序选择程序,并在北海北部的一个位置使用定向协变量的二维气象-海洋学数据测试我们的模型。我们得出结论,新开发的模型比广泛使用的历史匹配方法对这些数据表现得更好。
{"title":"Temporal evolution of the extreme excursions of multivariate \u0000 \u0000 \u0000 k\u0000 \u0000 $$ k $$\u0000 th order Markov processes with application to oceanographic data","authors":"Stan Tendijck,&nbsp;Philip Jonathan,&nbsp;David Randell,&nbsp;Jonathan Tawn","doi":"10.1002/env.2834","DOIUrl":"10.1002/env.2834","url":null,"abstract":"<p>We develop two models for the temporal evolution of extreme events of multivariate <span></span><math>\u0000 <semantics>\u0000 <mrow>\u0000 <mi>k</mi>\u0000 </mrow>\u0000 <annotation>$$ k $$</annotation>\u0000 </semantics></math>th order Markov processes. The foundation of our methodology lies in the conditional extremes model of Heffernan and Tawn (<i>Journal of the Royal Statistical Society: Series B (Methodology)</i>, 2014, 66, 497–546), and it naturally extends the work of Winter and Tawn (<i>Journal of the Royal Statistical Society: Series C (Applied Statistics)</i>, 2016, 65, 345–365; <i>Extremes</i>, 2017, 20, 393–415) and Tendijck et al. (<i>Environmetrics</i> 2019, 30, e2541) to include multivariate random variables. We use cross-validation-type techniques to develop a model order selection procedure, and we test our models on two-dimensional meteorological-oceanographic data with directional covariates for a location in the northern North Sea. We conclude that the newly-developed models perform better than the widely used historical matching methodology for these data.</p>","PeriodicalId":50512,"journal":{"name":"Environmetrics","volume":"35 3","pages":""},"PeriodicalIF":1.7,"publicationDate":"2023-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/env.2834","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138539060","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Calibrated forecasts of quasi-periodic climate processes with deep echo state networks and penalized quantile regression 基于深度回波状态网络和惩罚分位数回归的准周期气候过程校准预报
IF 1.7 3区 环境科学与生态学 Q4 ENVIRONMENTAL SCIENCES Pub Date : 2023-11-20 DOI: 10.1002/env.2833
Matthew Bonas, Christopher K. Wikle, Stefano Castruccio

Among the most relevant processes in the Earth system for human habitability are quasi-periodic, ocean-driven multi-year events whose dynamics are currently incompletely characterized by physical models, and hence poorly predictable. This work aims at showing how (1) data-driven, stochastic machine learning approaches provide an affordable yet flexible means to forecast these processes; (2) the associated uncertainty can be properly calibrated with fast ensemble-based approaches. While the methodology introduced and discussed in this work pertains to synoptic scale events, the principle of augmenting incomplete or highly sensitive physical systems with data-driven models to improve predictability is far more general and can be extended to environmental problems of any scale in time or space.

地球系统中与人类可居住性最相关的过程之一是准周期性的、由海洋驱动的多年期事件,其动力学目前尚不能完全用物理模型来描述,因此很难预测。这项工作旨在展示(1)数据驱动的随机机器学习方法如何提供一种经济而灵活的方法来预测这些过程;(2)相关的不确定度可以通过基于快速集合的方法进行适当的校准。虽然在这项工作中介绍和讨论的方法与天气尺度事件有关,但用数据驱动模型增加不完整或高度敏感的物理系统以提高可预测性的原理更为普遍,并且可以扩展到任何时间或空间尺度的环境问题。
{"title":"Calibrated forecasts of quasi-periodic climate processes with deep echo state networks and penalized quantile regression","authors":"Matthew Bonas,&nbsp;Christopher K. Wikle,&nbsp;Stefano Castruccio","doi":"10.1002/env.2833","DOIUrl":"10.1002/env.2833","url":null,"abstract":"<p>Among the most relevant processes in the Earth system for human habitability are quasi-periodic, ocean-driven multi-year events whose dynamics are currently incompletely characterized by physical models, and hence poorly predictable. This work aims at showing how (1) data-driven, stochastic machine learning approaches provide an affordable yet flexible means to forecast these processes; (2) the associated uncertainty can be properly calibrated with fast ensemble-based approaches. While the methodology introduced and discussed in this work pertains to synoptic scale events, the principle of augmenting incomplete or highly sensitive physical systems with data-driven models to improve predictability is far more general and can be extended to environmental problems of any scale in time or space.</p>","PeriodicalId":50512,"journal":{"name":"Environmetrics","volume":"35 1","pages":""},"PeriodicalIF":1.7,"publicationDate":"2023-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138539075","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Locally correlated Poisson sampling 局部相关泊松采样
IF 1.7 3区 环境科学与生态学 Q4 ENVIRONMENTAL SCIENCES Pub Date : 2023-11-11 DOI: 10.1002/env.2832
Wilmer Prentius

Designs that produces spatially balanced, or well-spread, samples are desirable as they increase the probability of obtaining a sample highly representative of the population. Spatially correlated Poisson sampling (SCPS) is a method for selecting well-spread samples. In the SCPS method, the sampling outcomes (inclusion or exclusion of units) are decided sequentially. After each decision, the inclusion probabilities of surrounding units are updated. A specific order for deciding the sampling outcomes is not enforced for SCPS, that is, the order can be chosen randomly or be fixed. A new modified method called locally correlated Poisson sampling (LCPS) is suggested. In this new method, the order of the decisions makes sure the inclusion probabilities are updated (more) locally. As a result, a stronger negative correlation between inclusion indicators of nearby units is achieved. Simulations on various data sets show that the resulting samples from LCPS, in general, are more spatially balanced and produce lower variance than samples from SCPS and the local pivotal method.

产生空间均衡或分布均匀样本的设计是可取的,因为它们能提高获得高度代表人口的样本的概率。空间相关泊松抽样 (SCPS) 是一种选择分布均匀样本的方法。在 SCPS 方法中,抽样结果(纳入或排除单位)是按顺序决定的。每次决定后,都会更新周围单位的纳入概率。SCPS 并不强制规定决定抽样结果的具体顺序,也就是说,顺序可以随机选择,也可以固定不变。我们提出了一种新的改进方法,称为局部相关泊松抽样(LCPS)。在这种新方法中,决策顺序确保了包含概率(更多地)在本地更新。因此,邻近单位的纳入指标之间会产生更强的负相关。对各种数据集的模拟表明,与 SCPS 和局部枢轴法相比,LCPS 得出的样本一般在空间上更均衡,产生的方差也更小。
{"title":"Locally correlated Poisson sampling","authors":"Wilmer Prentius","doi":"10.1002/env.2832","DOIUrl":"10.1002/env.2832","url":null,"abstract":"<p>Designs that produces spatially balanced, or well-spread, samples are desirable as they increase the probability of obtaining a sample highly representative of the population. Spatially correlated Poisson sampling (SCPS) is a method for selecting well-spread samples. In the SCPS method, the sampling outcomes (inclusion or exclusion of units) are decided sequentially. After each decision, the inclusion probabilities of surrounding units are updated. A specific order for deciding the sampling outcomes is not enforced for SCPS, that is, the order can be chosen randomly or be fixed. A new modified method called locally correlated Poisson sampling (LCPS) is suggested. In this new method, the order of the decisions makes sure the inclusion probabilities are updated (more) locally. As a result, a stronger negative correlation between inclusion indicators of nearby units is achieved. Simulations on various data sets show that the resulting samples from LCPS, in general, are more spatially balanced and produce lower variance than samples from SCPS and the local pivotal method.</p>","PeriodicalId":50512,"journal":{"name":"Environmetrics","volume":"35 2","pages":""},"PeriodicalIF":1.7,"publicationDate":"2023-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/env.2832","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135042013","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Environmetrics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1