首页 > 最新文献

Environmetrics最新文献

英文 中文
Correction to “Assessing Predictability of Environmental Time Series With Statistical and Machine Learning Models”
IF 1.5 3区 环境科学与生态学 Q4 ENVIRONMENTAL SCIENCES Pub Date : 2025-02-27 DOI: 10.1002/env.70008

Newlands, N.K. and Lyubchich, V. 2025. “ Assessing Predictability of Environmental Time Series With Statistical and Machine Learning Models.” Environmetrics 36(2), e70000. https://doi.org/10.1002/env.70000.

In the initial published version of this article, the title was incorrect. Below is the corrected article title:

Discussion on “Assessing Predictability of Environmental Time Series With Statistical and Machine Learning Models”

We apologize for this error.

{"title":"Correction to “Assessing Predictability of Environmental Time Series With Statistical and Machine Learning Models”","authors":"","doi":"10.1002/env.70008","DOIUrl":"https://doi.org/10.1002/env.70008","url":null,"abstract":"<p>\u0000 <span>Newlands, N.K.</span> and <span>Lyubchich, V.</span> <span>2025</span>. “ <span>Assessing Predictability of Environmental Time Series With Statistical and Machine Learning Models</span>.” <i>Environmetrics</i> <span>36</span>(<span>2</span>), e70000. https://doi.org/10.1002/env.70000.</p><p>In the initial published version of this article, the title was incorrect. Below is the corrected article title:</p><p><b>Discussion on “Assessing Predictability of Environmental Time Series With Statistical and Machine Learning Models”</b></p><p>We apologize for this error.</p>","PeriodicalId":50512,"journal":{"name":"Environmetrics","volume":"36 2","pages":""},"PeriodicalIF":1.5,"publicationDate":"2025-02-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/env.70008","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143497196","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
New Parametric Approach for Modeling Hydrological Data: An Alternative to the Beta, Kumaraswamy, and Simplex Models 水文数据建模的新参数方法:贝塔、库马拉斯瓦米和简约模型的替代方法
IF 1.5 3区 环境科学与生态学 Q4 ENVIRONMENTAL SCIENCES Pub Date : 2025-02-26 DOI: 10.1002/env.70006
Thiago A. N. De Andrade, Frank Gomes-Silva, Indranil Ghosh

We propose a new approach of continuous distributions in the unit interval, focusing on hydrological applications. This study presents the innovative two-parameter model called modified exponentiated generalized (MEG) distribution. The efficiency of the MEG distribution is evidenced through its application to 29 real datasets representing the percentage of useful water volume in hydroelectric power plant reservoirs in Brazil. The model outperforms the beta, simplex, and Kumaraswamy (KW) distributions, which are widely used for this type of analysis. The connection of our proposal with classical distributions, such as the Fréchet and KW distribution, broadens its applicability. While the Fréchet distribution is recognized for its usefulness in modeling extreme values, the proximity to KW allows a comprehensive analysis of hydrological data. The simple and tractable analytical expressions of the MEG's density and cumulative and quantile functions make it computationally feasible and particularly attractive for practical applications. Furthermore, this work highlights the relevance of the related reflected model: the reflected modified exponentiated generalized distribution. This contribution is expected to improve the statistical modeling of hydrological phenomena and provide new perspectives for future scientific investigations.

{"title":"New Parametric Approach for Modeling Hydrological Data: An Alternative to the Beta, Kumaraswamy, and Simplex Models","authors":"Thiago A. N. De Andrade,&nbsp;Frank Gomes-Silva,&nbsp;Indranil Ghosh","doi":"10.1002/env.70006","DOIUrl":"https://doi.org/10.1002/env.70006","url":null,"abstract":"<div>\u0000 \u0000 <p>We propose a new approach of continuous distributions in the unit interval, focusing on hydrological applications. This study presents the innovative two-parameter model called <i>modified exponentiated generalized</i> (MEG) distribution. The efficiency of the MEG distribution is evidenced through its application to 29 real datasets representing the percentage of useful water volume in hydroelectric power plant reservoirs in Brazil. The model outperforms the beta, simplex, and Kumaraswamy (KW) distributions, which are widely used for this type of analysis. The connection of our proposal with classical distributions, such as the Fréchet and KW distribution, broadens its applicability. While the Fréchet distribution is recognized for its usefulness in modeling extreme values, the proximity to KW allows a comprehensive analysis of hydrological data. The simple and tractable analytical expressions of the MEG's density and cumulative and quantile functions make it computationally feasible and particularly attractive for practical applications. Furthermore, this work highlights the relevance of the related reflected model: the <i>reflected modified exponentiated generalized distribution</i>. This contribution is expected to improve the statistical modeling of hydrological phenomena and provide new perspectives for future scientific investigations.</p>\u0000 </div>","PeriodicalId":50512,"journal":{"name":"Environmetrics","volume":"36 2","pages":""},"PeriodicalIF":1.5,"publicationDate":"2025-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143489971","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Effect of the North Atlantic Oscillation on Monthly Precipitation in Selected European Locations: A Non-Linear Time Series Approach
IF 1.5 3区 环境科学与生态学 Q4 ENVIRONMENTAL SCIENCES Pub Date : 2025-02-24 DOI: 10.1002/env.2896
Changli He, Jian Kang, Annastiina Silvennoinen, Timo Teräsvirta

In this article, the relationship between the monthly precipitation in 30 European cities and towns, and two Algerian ones, and the North Atlantic Oscillation (NAO) index is characterized using the Vector Seasonal Shifting Mean and Covariance Autoregressive model, extended to contain exogenous variables. The results, based on monthly time series from 1851 up until 2020, include shifting monthly means for the rainfall series and the estimated coefficients of the exogenous NAO variable. They suggest that in the north and the west, the amount of rain in the boreal winter months has increased or stayed the same during the observation period, whereas in the Mediterranean area, there have been systematic decreases. Results on the North Atlantic Oscillation indicate that the NAO has its strongest effect on precipitation during the winter months. The (negative) effect is particularly strong in Western Europe, Lisbon, and the Mediterranean rim. In contrast, the effect in northern locations is positive for the winter months. The constancy of error variances and correlations is tested and, if rejected, the time-varying alternative is estimated. A spatial relationship between the error correlations and the distance between the corresponding pairs of cities is estimated.

{"title":"The Effect of the North Atlantic Oscillation on Monthly Precipitation in Selected European Locations: A Non-Linear Time Series Approach","authors":"Changli He,&nbsp;Jian Kang,&nbsp;Annastiina Silvennoinen,&nbsp;Timo Teräsvirta","doi":"10.1002/env.2896","DOIUrl":"https://doi.org/10.1002/env.2896","url":null,"abstract":"<p>In this article, the relationship between the monthly precipitation in 30 European cities and towns, and two Algerian ones, and the North Atlantic Oscillation (NAO) index is characterized using the Vector Seasonal Shifting Mean and Covariance Autoregressive model, extended to contain exogenous variables. The results, based on monthly time series from 1851 up until 2020, include shifting monthly means for the rainfall series and the estimated coefficients of the exogenous NAO variable. They suggest that in the north and the west, the amount of rain in the boreal winter months has increased or stayed the same during the observation period, whereas in the Mediterranean area, there have been systematic decreases. Results on the North Atlantic Oscillation indicate that the NAO has its strongest effect on precipitation during the winter months. The (negative) effect is particularly strong in Western Europe, Lisbon, and the Mediterranean rim. In contrast, the effect in northern locations is positive for the winter months. The constancy of error variances and correlations is tested and, if rejected, the time-varying alternative is estimated. A spatial relationship between the error correlations and the distance between the corresponding pairs of cities is estimated.</p>","PeriodicalId":50512,"journal":{"name":"Environmetrics","volume":"36 2","pages":""},"PeriodicalIF":1.5,"publicationDate":"2025-02-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/env.2896","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143481500","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Discussion on Assessing Predictability of Environmental Time Series With Statistical and Machine Learning Models
IF 1.5 3区 环境科学与生态学 Q4 ENVIRONMENTAL SCIENCES Pub Date : 2025-02-24 DOI: 10.1002/env.70003
Ansgar Steland
<p>I congratulate the authors for their interesting and insightful paper. Their findings contribute to the ongoing discussion of the pros and cons of machine learning methods and statistical approaches in environmetrics. In my discussion, I want to address some issues in the design of the comparison study and the interpretation of the results, and add some general thoughts. Moreover, I report about my own analyses of the Indiana citizen science data set used by the authors. Specifically, I developed a non-linear ARIMA regression model with improved heteroscedasticity-consistent uncertainty estimation, which turns out to substantially outperform the best method of (Bonas et al. <span>2025</span>). I also applied a couple of machine learning approaches not examined there, which I regard as quite accessible general purpose machine learning methods. One of those methods is recommended by the early 2025 state-of-the-art AI large language models. I report about the interesting results in the last section.</p><p>The authors argue that for non-linear and non-stationary processes, machine learning methods are typically superior due to their non-parametric structure. But here one should recall that the classical but still popular class of feedforward artificial neural networks trained by backpropagation is non-linear least squares regression with a certain non-linear regression function known up to a parameter vector. In its entirety, this was mainly recognized and elaborated by the econometrician Halbert White, see White (<span>1992</span>). And, of course, there are many methods classified as traditional statistical approaches which are non-parametric as well. The differences are more with respect to the dimensionality and sample size, and how the methods deal with it. Many machine learning methods such as artificial neural networks fit high-dimensional overparametrized parametric models, whereas statistical methods usually work with specifically chosen stochastic models aiming at parsimony. Such ML methods process high-dimensional inputs as given, and the purpose of the first stage of a model is to learn features from the input data, sometimes being agnostic with respect to the type and meaning of the input variables. The guiding principle is that one only fixes the basic topology (e.g., fully-connected layers or convolutional layers), and the model can extract optimal features provided the learning sample is large enough. This is in contrast to statistical approaches, which tend to design a model for specific types of inputs and deal with high dimensionality by suitable preprocessing steps, human-controlled feature generation and/or specific assumptions and related estimation methods, especially sparse models and sparse estimation, which combine estimation and variable selection. A further distinction is that statistical models and modeling using stochastic processes often impose a certain structure, which is derived from domain knowledge (e.g., physic
当然,任何统计学家,无论是应用统计学家还是数理统计学家,都会同意,量化基于数据的决策的不确定性至关重要,这在历史上一直是由统计学作为一门科学来解决的。在机器学习方面,情况已经有了很大的改善,作者为一类回声状态神经网络的不确定性量化开发了一种适当的方法。但在实际应用中,并不存在测试样本,根据迄今为止的测量结果,我们可以查看预测结果和相关的预测区间,从而量化不确定性。现在的问题是,如何解释预测区间。然而,有各种不确定性来源会导致相关的变化,而且根据所选预测方法的不同,预测区间的解释也可能不同。重要的变化来源包括模型类别选择的不确定性(如参数回归或非参数回归模型)、模型内具体规范的不确定性(如选择合适的回归因子)以及估计误差的不确定性。在环境计量学中,位置误差也会出现,即测量位置与我们想要预测的位置不同时(Cressie 和 Kornack,2003 年)。在频数统计中,给定协变量,预测因子是未知的,但却是固定的,并受到产生响应的随机误差项的干扰,预测因子的预测区间是一个随机区间,可能涵盖也可能不涵盖真正的预测因子。在贝叶斯框架中,它也可以是一个具有自身分布的随机量,预测区间可以由其后期分布的量级组成,从而可以解释为预测因子位于预测区间内的概率等于所需的水平。很明显,当预测响应时,这两种方法都会考虑误差的影响。在集合方法中,预测区间是从构建或模拟的集合预测中获得的。预测模型的集合可以而且通常被解释为一组专家提供不同的预测,然后利用集合的群集智能将这些预测组合起来,以产生更好的预测。现在,人们通过生成集合预测来构建预测区间。最后,机器学习中生成模型的兴起导致了不确定性量化的方法,即在给定数据的情况下,利用模型固有的生成采样机制产生一组代表不确定性的输出。总之,我们需要仔细研究预测区间是如何计算出来的,以便理解它的解释。问题在于,不同预测模型在均方误差方面的差异在某种意义上是否具有统计学意义。特别是,由于某些方法的性能指标非常接近,我想就预测中的均方误差讨论这个问题,因为它在研究人员和从业人员中非常常用。对于一大类模型和统计估计方法,Rivers 和 Vuong 2002)基于样本内和样本外的拟合不足标准,提出了动态非线性模型成对比较的渐近检验方法。后一种方法使用训练样本的模型拟合度,并评估其在测试样本中的性能。以印第安纳州公民科学数据集的温度测量为例,(Bonas 等人,2025 年)ARIMA 预测模型的样本内 MSE 为 48 . 01 $$ 48.
{"title":"Discussion on Assessing Predictability of Environmental Time Series With Statistical and Machine Learning Models","authors":"Ansgar Steland","doi":"10.1002/env.70003","DOIUrl":"https://doi.org/10.1002/env.70003","url":null,"abstract":"&lt;p&gt;I congratulate the authors for their interesting and insightful paper. Their findings contribute to the ongoing discussion of the pros and cons of machine learning methods and statistical approaches in environmetrics. In my discussion, I want to address some issues in the design of the comparison study and the interpretation of the results, and add some general thoughts. Moreover, I report about my own analyses of the Indiana citizen science data set used by the authors. Specifically, I developed a non-linear ARIMA regression model with improved heteroscedasticity-consistent uncertainty estimation, which turns out to substantially outperform the best method of (Bonas et al. &lt;span&gt;2025&lt;/span&gt;). I also applied a couple of machine learning approaches not examined there, which I regard as quite accessible general purpose machine learning methods. One of those methods is recommended by the early 2025 state-of-the-art AI large language models. I report about the interesting results in the last section.&lt;/p&gt;&lt;p&gt;The authors argue that for non-linear and non-stationary processes, machine learning methods are typically superior due to their non-parametric structure. But here one should recall that the classical but still popular class of feedforward artificial neural networks trained by backpropagation is non-linear least squares regression with a certain non-linear regression function known up to a parameter vector. In its entirety, this was mainly recognized and elaborated by the econometrician Halbert White, see White (&lt;span&gt;1992&lt;/span&gt;). And, of course, there are many methods classified as traditional statistical approaches which are non-parametric as well. The differences are more with respect to the dimensionality and sample size, and how the methods deal with it. Many machine learning methods such as artificial neural networks fit high-dimensional overparametrized parametric models, whereas statistical methods usually work with specifically chosen stochastic models aiming at parsimony. Such ML methods process high-dimensional inputs as given, and the purpose of the first stage of a model is to learn features from the input data, sometimes being agnostic with respect to the type and meaning of the input variables. The guiding principle is that one only fixes the basic topology (e.g., fully-connected layers or convolutional layers), and the model can extract optimal features provided the learning sample is large enough. This is in contrast to statistical approaches, which tend to design a model for specific types of inputs and deal with high dimensionality by suitable preprocessing steps, human-controlled feature generation and/or specific assumptions and related estimation methods, especially sparse models and sparse estimation, which combine estimation and variable selection. A further distinction is that statistical models and modeling using stochastic processes often impose a certain structure, which is derived from domain knowledge (e.g., physic","PeriodicalId":50512,"journal":{"name":"Environmetrics","volume":"36 2","pages":""},"PeriodicalIF":1.5,"publicationDate":"2025-02-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/env.70003","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143475634","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Semiparametric Copula-Based Confidence Intervals on Level Curves for the Evaluation of the Risk Level Associated to Bivariate Events
IF 1.5 3区 环境科学与生态学 Q4 ENVIRONMENTAL SCIENCES Pub Date : 2025-02-20 DOI: 10.1002/env.70005
Albert Folcher, Jean-François Quessy
<p>If <span></span><math> <semantics> <mrow> <mo>(</mo> <mi>X</mi> <mo>,</mo> <mi>Y</mi> <mo>)</mo> </mrow> <annotation>$$ left(X,Yright) $$</annotation> </semantics></math> is a random pair with distribution function <span></span><math> <semantics> <mrow> <msub> <mrow> <mi>F</mi> </mrow> <mrow> <mi>X</mi> <mo>,</mo> <mi>Y</mi> </mrow> </msub> <mo>(</mo> <mi>x</mi> <mo>,</mo> <mi>y</mi> <mo>)</mo> <mo>=</mo> <mi>ℙ</mi> <mo>(</mo> <mi>X</mi> <mo>≤</mo> <mi>x</mi> <mo>,</mo> <mi>Y</mi> <mo>≤</mo> <mi>y</mi> <mo>)</mo> </mrow> <annotation>$$ {F}_{X,Y}left(x,yright)=mathbb{P}left(Xle x,Yle yright) $$</annotation> </semantics></math>, one can define the level curve of probability <span></span><math> <semantics> <mrow> <mi>p</mi> </mrow> <annotation>$$ p $$</annotation> </semantics></math> as the values of <span></span><math> <semantics> <mrow> <mo>(</mo> <mi>x</mi> <mo>,</mo> <mi>y</mi> <mo>)</mo> <mo>∈</mo> <msup> <mrow> <mi>ℝ</mi> </mrow> <mrow> <mn>2</mn> </mrow> </msup> </mrow> <annotation>$$ left(x,yright)in {mathbb{R}}^2 $$</annotation> </semantics></math> such that <span></span><math> <semantics> <mrow> <msub> <mrow> <mi>F</mi> </mrow> <mrow> <mi>X</mi> <mo>,</mo> <mi>Y</mi> </mrow> </msub> <mo>(</mo> <mi>x</mi> <mo>,</mo> <mi>y</mi>
{"title":"Semiparametric Copula-Based Confidence Intervals on Level Curves for the Evaluation of the Risk Level Associated to Bivariate Events","authors":"Albert Folcher,&nbsp;Jean-François Quessy","doi":"10.1002/env.70005","DOIUrl":"https://doi.org/10.1002/env.70005","url":null,"abstract":"&lt;p&gt;If &lt;span&gt;&lt;/span&gt;&lt;math&gt;\u0000 &lt;semantics&gt;\u0000 &lt;mrow&gt;\u0000 &lt;mo&gt;(&lt;/mo&gt;\u0000 &lt;mi&gt;X&lt;/mi&gt;\u0000 &lt;mo&gt;,&lt;/mo&gt;\u0000 &lt;mi&gt;Y&lt;/mi&gt;\u0000 &lt;mo&gt;)&lt;/mo&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;annotation&gt;$$ left(X,Yright) $$&lt;/annotation&gt;\u0000 &lt;/semantics&gt;&lt;/math&gt; is a random pair with distribution function &lt;span&gt;&lt;/span&gt;&lt;math&gt;\u0000 &lt;semantics&gt;\u0000 &lt;mrow&gt;\u0000 &lt;msub&gt;\u0000 &lt;mrow&gt;\u0000 &lt;mi&gt;F&lt;/mi&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;mrow&gt;\u0000 &lt;mi&gt;X&lt;/mi&gt;\u0000 &lt;mo&gt;,&lt;/mo&gt;\u0000 &lt;mi&gt;Y&lt;/mi&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;/msub&gt;\u0000 &lt;mo&gt;(&lt;/mo&gt;\u0000 &lt;mi&gt;x&lt;/mi&gt;\u0000 &lt;mo&gt;,&lt;/mo&gt;\u0000 &lt;mi&gt;y&lt;/mi&gt;\u0000 &lt;mo&gt;)&lt;/mo&gt;\u0000 &lt;mo&gt;=&lt;/mo&gt;\u0000 &lt;mi&gt;ℙ&lt;/mi&gt;\u0000 &lt;mo&gt;(&lt;/mo&gt;\u0000 &lt;mi&gt;X&lt;/mi&gt;\u0000 &lt;mo&gt;≤&lt;/mo&gt;\u0000 &lt;mi&gt;x&lt;/mi&gt;\u0000 &lt;mo&gt;,&lt;/mo&gt;\u0000 &lt;mi&gt;Y&lt;/mi&gt;\u0000 &lt;mo&gt;≤&lt;/mo&gt;\u0000 &lt;mi&gt;y&lt;/mi&gt;\u0000 &lt;mo&gt;)&lt;/mo&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;annotation&gt;$$ {F}_{X,Y}left(x,yright)=mathbb{P}left(Xle x,Yle yright) $$&lt;/annotation&gt;\u0000 &lt;/semantics&gt;&lt;/math&gt;, one can define the level curve of probability &lt;span&gt;&lt;/span&gt;&lt;math&gt;\u0000 &lt;semantics&gt;\u0000 &lt;mrow&gt;\u0000 &lt;mi&gt;p&lt;/mi&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;annotation&gt;$$ p $$&lt;/annotation&gt;\u0000 &lt;/semantics&gt;&lt;/math&gt; as the values of &lt;span&gt;&lt;/span&gt;&lt;math&gt;\u0000 &lt;semantics&gt;\u0000 &lt;mrow&gt;\u0000 &lt;mo&gt;(&lt;/mo&gt;\u0000 &lt;mi&gt;x&lt;/mi&gt;\u0000 &lt;mo&gt;,&lt;/mo&gt;\u0000 &lt;mi&gt;y&lt;/mi&gt;\u0000 &lt;mo&gt;)&lt;/mo&gt;\u0000 &lt;mo&gt;∈&lt;/mo&gt;\u0000 &lt;msup&gt;\u0000 &lt;mrow&gt;\u0000 &lt;mi&gt;ℝ&lt;/mi&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;mrow&gt;\u0000 &lt;mn&gt;2&lt;/mn&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;/msup&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;annotation&gt;$$ left(x,yright)in {mathbb{R}}^2 $$&lt;/annotation&gt;\u0000 &lt;/semantics&gt;&lt;/math&gt; such that &lt;span&gt;&lt;/span&gt;&lt;math&gt;\u0000 &lt;semantics&gt;\u0000 &lt;mrow&gt;\u0000 &lt;msub&gt;\u0000 &lt;mrow&gt;\u0000 &lt;mi&gt;F&lt;/mi&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;mrow&gt;\u0000 &lt;mi&gt;X&lt;/mi&gt;\u0000 &lt;mo&gt;,&lt;/mo&gt;\u0000 &lt;mi&gt;Y&lt;/mi&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;/msub&gt;\u0000 &lt;mo&gt;(&lt;/mo&gt;\u0000 &lt;mi&gt;x&lt;/mi&gt;\u0000 &lt;mo&gt;,&lt;/mo&gt;\u0000 &lt;mi&gt;y&lt;/mi&gt;\u0000 ","PeriodicalId":50512,"journal":{"name":"Environmetrics","volume":"36 2","pages":""},"PeriodicalIF":1.5,"publicationDate":"2025-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/env.70005","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143456021","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Discussion on “Assessing Predictability of Environmental Time Series With Statistical and Machine Learning Models”
IF 1.5 3区 环境科学与生态学 Q4 ENVIRONMENTAL SCIENCES Pub Date : 2025-02-18 DOI: 10.1002/env.70001
Paolo Maranzano, Paul A. Parker

We contribute to the discussion of the insightful article “Assessing predictability of environmental time series with statistical and machine learning models” by Bonas et al. (2024), in which the authors commend their effort in comparing a wide range of methodologies for the challenging task of predicting environmental time series data. We focus our discussion on two topics of interest to us. First, we consider extensions of the explored methodologies that allow for heteroscedastic error terms. Second, we consider non-Gaussianity and fitting models on transformed data. For both of these points, we will make use of the authors' supplied code and data in order to extend their examples. Ultimately, we find that modeling of heteroscedasticity error terms has the potential to improve both point and interval estimates for these environmental time series. We also find that the use of transformations to handle non-Gaussianity can improve interval estimates.

{"title":"Discussion on “Assessing Predictability of Environmental Time Series With Statistical and Machine Learning Models”","authors":"Paolo Maranzano,&nbsp;Paul A. Parker","doi":"10.1002/env.70001","DOIUrl":"https://doi.org/10.1002/env.70001","url":null,"abstract":"<p>We contribute to the discussion of the insightful article “Assessing predictability of environmental time series with statistical and machine learning models” by Bonas et al. (2024), in which the authors commend their effort in comparing a wide range of methodologies for the challenging task of predicting environmental time series data. We focus our discussion on two topics of interest to us. First, we consider extensions of the explored methodologies that allow for heteroscedastic error terms. Second, we consider non-Gaussianity and fitting models on transformed data. For both of these points, we will make use of the authors' supplied code and data in order to extend their examples. Ultimately, we find that modeling of heteroscedasticity error terms has the potential to improve both point and interval estimates for these environmental time series. We also find that the use of transformations to handle non-Gaussianity can improve interval estimates.</p>","PeriodicalId":50512,"journal":{"name":"Environmetrics","volume":"36 2","pages":""},"PeriodicalIF":1.5,"publicationDate":"2025-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/env.70001","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143431605","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Stacking Weights and Model Space Selection in Frequentist Model Averaging for Benchmark Dose Estimation
IF 1.5 3区 环境科学与生态学 Q4 ENVIRONMENTAL SCIENCES Pub Date : 2025-02-17 DOI: 10.1002/env.70002
Jens Riis Baalkilde, Niels Richard Hansen, Signe Marie Jensen

In dose-response modeling, several models can often yield satisfactory fits to the observed data. The current practice in risk assessment is to use model averaging, which is a way to combine multiple models in a weighted average. A key parameter in risk assessment is the benchmark dose, the dose resulting in a predefined abnormal change in response. Current practice when applying frequentist model averaging is to use weights based on the Akaike Information Criterion (AIC). This paper introduces stacking weights as an alternative for dose-response modeling and generalizes a Diversity Index from dichotomous to continuous responses for model space selection. Three simulation studies were conducted to evaluate the new methods. They showed that, in three realistic scenarios, recommended strategies generally performed well, with stacking weights outperforming AIC weights in several cases. Strategies involving model selection were less effective. However, in a challenging scenario, none of the methods performed well. Due to the promising results of stacking weights, they have been added to the R package “bmd.”

{"title":"Stacking Weights and Model Space Selection in Frequentist Model Averaging for Benchmark Dose Estimation","authors":"Jens Riis Baalkilde,&nbsp;Niels Richard Hansen,&nbsp;Signe Marie Jensen","doi":"10.1002/env.70002","DOIUrl":"https://doi.org/10.1002/env.70002","url":null,"abstract":"<p>In dose-response modeling, several models can often yield satisfactory fits to the observed data. The current practice in risk assessment is to use model averaging, which is a way to combine multiple models in a weighted average. A key parameter in risk assessment is the benchmark dose, the dose resulting in a predefined abnormal change in response. Current practice when applying frequentist model averaging is to use weights based on the Akaike Information Criterion (AIC). This paper introduces stacking weights as an alternative for dose-response modeling and generalizes a Diversity Index from dichotomous to continuous responses for model space selection. Three simulation studies were conducted to evaluate the new methods. They showed that, in three realistic scenarios, recommended strategies generally performed well, with stacking weights outperforming AIC weights in several cases. Strategies involving model selection were less effective. However, in a challenging scenario, none of the methods performed well. Due to the promising results of stacking weights, they have been added to the R package “bmd.”</p>","PeriodicalId":50512,"journal":{"name":"Environmetrics","volume":"36 2","pages":""},"PeriodicalIF":1.5,"publicationDate":"2025-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/env.70002","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143431614","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
“Assessing Predictability of Environmental Time Series With Statistical and Machine Learning Models”
IF 1.5 3区 环境科学与生态学 Q4 ENVIRONMENTAL SCIENCES Pub Date : 2025-02-17 DOI: 10.1002/env.70000
Nathaniel K. Newlands, Vyacheslav Lyubchich

The relative merits of machine learning and statistical methods are discussed recently by Bonas et al. 2004, who raise important open questions for the statistical community regarding the value-added benefits of data science and the future role of environmental statistics. Specifically, they identify three major knowledge gaps where statistics is seen as crucial to strengthening inference in machine learning (ML): to provide an ML model-based framework amenable to explainability, to determine the best approach for quantifying uncertainty in relation to complex environmental dynamics, and to comprehensively identify ML's value-added benefits. We continue this discussion by exploring these general questions and sharing our perspective and insights from our modeling of marine and terrestrial ecosystem dynamics. We propose several lines of inquiry where environmental statisticians and data scientists could collaboratively advance predictive analytics.

{"title":"“Assessing Predictability of Environmental Time Series With Statistical and Machine Learning Models”","authors":"Nathaniel K. Newlands,&nbsp;Vyacheslav Lyubchich","doi":"10.1002/env.70000","DOIUrl":"https://doi.org/10.1002/env.70000","url":null,"abstract":"<p>The relative merits of machine learning and statistical methods are discussed recently by Bonas et al. 2004, who raise important open questions for the statistical community regarding the value-added benefits of data science and the future role of environmental statistics. Specifically, they identify three major knowledge gaps where statistics is seen as crucial to strengthening inference in machine learning (ML): to provide an ML model-based framework amenable to explainability, to determine the best approach for quantifying uncertainty in relation to complex environmental dynamics, and to comprehensively identify ML's value-added benefits. We continue this discussion by exploring these general questions and sharing our perspective and insights from our modeling of marine and terrestrial ecosystem dynamics. We propose several lines of inquiry where environmental statisticians and data scientists could collaboratively advance predictive analytics.</p>","PeriodicalId":50512,"journal":{"name":"Environmetrics","volume":"36 2","pages":""},"PeriodicalIF":1.5,"publicationDate":"2025-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/env.70000","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143431613","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Does the Quality of Political Institutions Matter for the Effectiveness of Environmental Taxes? An Empirical Analysis on CO2 Emissions
IF 1.5 3区 环境科学与生态学 Q4 ENVIRONMENTAL SCIENCES Pub Date : 2025-02-12 DOI: 10.1002/env.2895
Donatella Baiardi, Simona Scabrosetti

Focusing on a sample of 39 countries in the period 1996–2017, we analyze whether the relationship between environmental taxes and CO2 emissions depends on the quality of political institutions. Our results show that an increase in the environmental tax revenue is related to a reduction in CO2 emissions only in countries with more consolidated democratic institutions, higher civil society participation, and less corrupt governments. Moreover, the relationship between CO2 emissions and revenue neutral shifts to different tax sources depends not only on the quality of political institutions, but also on the kind of externality the policymaker aims at correcting.

{"title":"Does the Quality of Political Institutions Matter for the Effectiveness of Environmental Taxes? An Empirical Analysis on CO2 Emissions","authors":"Donatella Baiardi,&nbsp;Simona Scabrosetti","doi":"10.1002/env.2895","DOIUrl":"https://doi.org/10.1002/env.2895","url":null,"abstract":"<p>Focusing on a sample of 39 countries in the period 1996–2017, we analyze whether the relationship between environmental taxes and CO<sub>2</sub> emissions depends on the quality of political institutions. Our results show that an increase in the environmental tax revenue is related to a reduction in CO<sub>2</sub> emissions only in countries with more consolidated democratic institutions, higher civil society participation, and less corrupt governments. Moreover, the relationship between CO<sub>2</sub> emissions and revenue neutral shifts to different tax sources depends not only on the quality of political institutions, but also on the kind of externality the policymaker aims at correcting.</p>","PeriodicalId":50512,"journal":{"name":"Environmetrics","volume":"36 2","pages":""},"PeriodicalIF":1.5,"publicationDate":"2025-02-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/env.2895","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143389315","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fuzzy Clustering of Circular Time Series With Applications to Wind Data
IF 1.5 3区 环境科学与生态学 Q4 ENVIRONMENTAL SCIENCES Pub Date : 2025-02-10 DOI: 10.1002/env.2902
Ángel López-Oriona, Ying Sun, Rosa María Crujeiras

In environmental science, practitioners often deal with data recorded sequentially along time, such as time series of wind direction or wind speed. In this context, clustering of time series is a useful tool to carry out exploratory analyses. While most of the proposals are focused on real-valued time series, very few works consider circular time series, despite the frequent appearance of these objects in many disciplines. In this manuscript, a dissimilarity for circular time series is introduced and used in combination with a soft clustering method. The metric relies on a measure of serial dependence considering circular arcs, thus taking advantage of the directional character inherent to the series range. The clustering approach is able to group together time series generated from similar stochastic processes. Some simulations show that the method exhibits a reasonable clustering effectiveness, outperforming alternative techniques in many contexts. Two interesting applications involving time series of wind direction in Saudi Arabia show the potential of the proposed approach.

{"title":"Fuzzy Clustering of Circular Time Series With Applications to Wind Data","authors":"Ángel López-Oriona,&nbsp;Ying Sun,&nbsp;Rosa María Crujeiras","doi":"10.1002/env.2902","DOIUrl":"https://doi.org/10.1002/env.2902","url":null,"abstract":"<p>In environmental science, practitioners often deal with data recorded sequentially along time, such as time series of wind direction or wind speed. In this context, clustering of time series is a useful tool to carry out exploratory analyses. While most of the proposals are focused on real-valued time series, very few works consider circular time series, despite the frequent appearance of these objects in many disciplines. In this manuscript, a dissimilarity for circular time series is introduced and used in combination with a soft clustering method. The metric relies on a measure of serial dependence considering circular arcs, thus taking advantage of the directional character inherent to the series range. The clustering approach is able to group together time series generated from similar stochastic processes. Some simulations show that the method exhibits a reasonable clustering effectiveness, outperforming alternative techniques in many contexts. Two interesting applications involving time series of wind direction in Saudi Arabia show the potential of the proposed approach.</p>","PeriodicalId":50512,"journal":{"name":"Environmetrics","volume":"36 2","pages":""},"PeriodicalIF":1.5,"publicationDate":"2025-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/env.2902","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143380030","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Environmetrics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1