首页 > 最新文献

Environmental and Ecological Statistics最新文献

英文 中文
Spatiotemporal modelling of $$hbox {PM}_{2.5}$$ concentrations in Lombardy (Italy): a comparative study 意大利伦巴第地区 $$hbox {PM}_{2.5}$$ 浓度的时空模型:一项比较研究
IF 3.8 4区 环境科学与生态学 Q2 ENVIRONMENTAL SCIENCES Pub Date : 2024-02-01 DOI: 10.1007/s10651-023-00589-0
Philipp Otto, Alessandro Fusta Moro, Jacopo Rodeschini, Qendrim Shaboviq, Rosaria Ignaccolo, Natalia Golini, Michela Cameletti, Paolo Maranzano, Francesco Finazzi, Alessandro Fassò

This study presents a comparative analysis of three predictive models with an increasing degree of flexibility: hidden dynamic geostatistical models (HDGM), generalised additive mixed models (GAMM), and the random forest spatiotemporal kriging models (RFSTK). These models are evaluated for their effectiveness in predicting (text {PM}_{2.5}) concentrations in Lombardy (North Italy) from 2016 to 2020. Despite differing methodologies, all models demonstrate proficient capture of spatiotemporal patterns within air pollution data with similar out-of-sample performance. Furthermore, the study delves into station-specific analyses, revealing variable model performance contingent on localised conditions. Model interpretation, facilitated by parametric coefficient analysis and partial dependence plots, unveils consistent associations between predictor variables and (text {PM}_{2.5}) concentrations. Despite nuanced variations in modelling spatiotemporal correlations, all models effectively accounted for the underlying dependence. In summary, this study underscores the efficacy of conventional techniques in modelling correlated spatiotemporal data, concurrently highlighting the complementary potential of Machine Learning and classical statistical approaches.

本研究对三种预测模型进行了比较分析,这三种模型的灵活性越来越高:隐藏动态地理统计模型(HDGM)、广义加法混合模型(GAMM)和随机森林时空克里金模型(RFSTK)。评估了这些模型在预测 2016 年至 2020 年伦巴第大区(意大利北部)(text {PM}_{2.5})浓度方面的有效性。尽管方法不同,但所有模型都能熟练捕捉空气污染数据中的时空模式,并具有相似的样本外性能。此外,该研究还深入分析了特定站点的情况,揭示了因当地条件而异的模型性能。参数系数分析和偏倚图有助于模型解释,揭示了预测变量与 (text {PM}_{2.5}) 浓度之间的一致联系。尽管在模拟时空相关性方面存在细微差别,但所有模型都有效地解释了潜在的依赖关系。总之,这项研究强调了传统技术在建立相关时空数据模型方面的功效,同时也凸显了机器学习和经典统计方法的互补潜力。
{"title":"Spatiotemporal modelling of $$hbox {PM}_{2.5}$$ concentrations in Lombardy (Italy): a comparative study","authors":"Philipp Otto, Alessandro Fusta Moro, Jacopo Rodeschini, Qendrim Shaboviq, Rosaria Ignaccolo, Natalia Golini, Michela Cameletti, Paolo Maranzano, Francesco Finazzi, Alessandro Fassò","doi":"10.1007/s10651-023-00589-0","DOIUrl":"https://doi.org/10.1007/s10651-023-00589-0","url":null,"abstract":"<p>This study presents a comparative analysis of three predictive models with an increasing degree of flexibility: hidden dynamic geostatistical models (HDGM), generalised additive mixed models (GAMM), and the random forest spatiotemporal kriging models (RFSTK). These models are evaluated for their effectiveness in predicting <span>(text {PM}_{2.5})</span> concentrations in Lombardy (North Italy) from 2016 to 2020. Despite differing methodologies, all models demonstrate proficient capture of spatiotemporal patterns within air pollution data with similar out-of-sample performance. Furthermore, the study delves into station-specific analyses, revealing variable model performance contingent on localised conditions. Model interpretation, facilitated by parametric coefficient analysis and partial dependence plots, unveils consistent associations between predictor variables and <span>(text {PM}_{2.5})</span> concentrations. Despite nuanced variations in modelling spatiotemporal correlations, all models effectively accounted for the underlying dependence. In summary, this study underscores the efficacy of conventional techniques in modelling correlated spatiotemporal data, concurrently highlighting the complementary potential of Machine Learning and classical statistical approaches.</p>","PeriodicalId":50519,"journal":{"name":"Environmental and Ecological Statistics","volume":"50 1","pages":""},"PeriodicalIF":3.8,"publicationDate":"2024-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139667425","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Discrete Beta and Shifted Beta-Binomial models for rating and ranking data 用于评级和排名数据的离散贝塔模型和偏移贝塔-二叉模型
IF 3.8 4区 环境科学与生态学 Q2 ENVIRONMENTAL SCIENCES Pub Date : 2024-01-27 DOI: 10.1007/s10651-023-00592-5
Mariangela Sciandra, Salvatore Fasola, Alessandro Albano, Chiara Di Maria, Antonella Plaia

Ranking and rating methods for preference data result in a different underlying organization of data that can lead to manifold probabilistic approaches to data modelling. As an alternative to existing approaches, two new flexible probability distributions are discussed as a modelling framework: the Discrete Beta and the Shifted Beta-Binomial. Through the presentation of three real-world examples, we demonstrate the practical utility of these distributions. These illustrative cases show how these novel distributions can effectively address real-world challenges, with a particular focus on data derived from surveys concerning environmental issues. Our analysis highlights the new distributions’ capability to capture the inherent structures within preference data, offering valuable insights into the field.

偏好数据的排序和评级方法会产生不同的基本数据组织,从而导致数据建模的多方面概率方法。作为现有方法的替代方案,我们讨论了两种新的灵活概率分布作为建模框架:离散贝塔和偏移贝塔-二项式。通过介绍三个真实世界的例子,我们展示了这些分布的实用性。这些说明性案例展示了这些新型分布如何有效地应对现实世界的挑战,其中特别关注来自环境问题调查的数据。我们的分析强调了新分布捕捉偏好数据内在结构的能力,为该领域提供了宝贵的见解。
{"title":"Discrete Beta and Shifted Beta-Binomial models for rating and ranking data","authors":"Mariangela Sciandra, Salvatore Fasola, Alessandro Albano, Chiara Di Maria, Antonella Plaia","doi":"10.1007/s10651-023-00592-5","DOIUrl":"https://doi.org/10.1007/s10651-023-00592-5","url":null,"abstract":"<p>Ranking and rating methods for preference data result in a different underlying organization of data that can lead to manifold probabilistic approaches to data modelling. As an alternative to existing approaches, two new flexible probability distributions are discussed as a modelling framework: the <i>Discrete Beta</i> and the <i>Shifted Beta-Binomial</i>. Through the presentation of three real-world examples, we demonstrate the practical utility of these distributions. These illustrative cases show how these novel distributions can effectively address real-world challenges, with a particular focus on data derived from surveys concerning environmental issues. Our analysis highlights the new distributions’ capability to capture the inherent structures within preference data, offering valuable insights into the field.</p>","PeriodicalId":50519,"journal":{"name":"Environmental and Ecological Statistics","volume":"19 1","pages":""},"PeriodicalIF":3.8,"publicationDate":"2024-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139585893","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Advances in Kth nearest-neighbour clutter removal Kth 近邻杂波去除技术的进展
IF 3.8 4区 环境科学与生态学 Q2 ENVIRONMENTAL SCIENCES Pub Date : 2024-01-24 DOI: 10.1007/s10651-023-00588-1
Nicoletta D’Angelo

We consider the problem of feature detection in the presence of clutter in spatial point processes. Classification methods have been developed in previous studies. Among these, Byers and Raftery (J Am Stat Assoc 93(442):577–584, 1998) models the observed Kth nearest neighbour distances as a mixture distribution and classifies the clutter and feature points consequently. In this paper, we enhance such approach in two manners. First, we propose an automatic procedure for selecting the number of nearest neighbours to consider in the classification method by means of segmented regression models. Secondly, with the aim of applying the procedure multiple times to get a “better" end result, we propose a stopping criterion that minimizes the overall entropy measure of cluster separation between clutter and feature points. The proposed procedures are suitable for a feature with clutter as two superimposed Poisson processes on any space, including linear networks. We present simulations and two case studies of environmental data to illustrate the method.

我们考虑的是空间点过程中存在杂波时的特征检测问题。之前的研究已经开发出了分类方法。其中,Byers 和 Raftery (J Am Stat Assoc 93(442):577-584, 1998) 将观测到的 Kth 近邻距离建模为混合分布,并据此对杂波和特征点进行分类。在本文中,我们从两个方面对这种方法进行了改进。首先,我们提出了一种自动程序,通过分段回归模型来选择分类方法中要考虑的近邻数量。其次,为了多次应用该程序以获得 "更好 "的最终结果,我们提出了一个停止标准,该标准可使杂波和特征点之间聚类分离的整体熵值最小化。所提出的程序适用于任何空间(包括线性网络)上的两个叠加泊松过程的特征与杂波。我们通过模拟和两个环境数据案例研究来说明该方法。
{"title":"Advances in Kth nearest-neighbour clutter removal","authors":"Nicoletta D’Angelo","doi":"10.1007/s10651-023-00588-1","DOIUrl":"https://doi.org/10.1007/s10651-023-00588-1","url":null,"abstract":"<p>We consider the problem of feature detection in the presence of clutter in spatial point processes. Classification methods have been developed in previous studies. Among these, Byers and Raftery (J Am Stat Assoc 93(442):577–584, 1998) models the observed <i>K</i>th nearest neighbour distances as a mixture distribution and classifies the <i>clutter</i> and <i>feature</i> points consequently. In this paper, we enhance such approach in two manners. First, we propose an automatic procedure for selecting the number of nearest neighbours to consider in the classification method by means of segmented regression models. Secondly, with the aim of applying the procedure multiple times to get a “better\" end result, we propose a stopping criterion that minimizes the overall entropy measure of cluster separation between clutter and feature points. The proposed procedures are suitable for a feature with clutter as two superimposed Poisson processes on any space, including linear networks. We present simulations and two case studies of environmental data to illustrate the method.</p>","PeriodicalId":50519,"journal":{"name":"Environmental and Ecological Statistics","volume":"45 1","pages":""},"PeriodicalIF":3.8,"publicationDate":"2024-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139557249","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Logistic regression versus XGBoost for detecting burned areas using satellite images 利用卫星图像探测烧毁区域的逻辑回归与 XGBoost
IF 3.8 4区 环境科学与生态学 Q2 ENVIRONMENTAL SCIENCES Pub Date : 2024-01-20 DOI: 10.1007/s10651-023-00590-7

Abstract

Classical statistical methods prove advantageous for small datasets, whereas machine learning algorithms can excel with larger datasets. Our paper challenges this conventional wisdom by addressing a highly significant problem: the identification of burned areas through satellite imagery, that is a clear example of imbalanced data. The methods are illustrated in the North-Central Portugal and the North-West of Spain in October 2017 within a multi-temporal setting of satellite imagery. Daily satellite images are taken from Moderate Resolution Imaging Spectroradiometer (MODIS) products. Our analysis shows that a classical Logistic regression (LR) model competes on par, if not surpasses, a widely employed machine learning algorithm called the extreme gradient boosting algorithm (XGBoost) within this particular domain.

摘要 经典的统计方法被证明对小型数据集具有优势,而机器学习算法则能在大型数据集上大显身手。我们的论文挑战了这一传统观点,解决了一个非常重要的问题:通过卫星图像识别烧毁区域,这是一个不平衡数据的明显例子。2017 年 10 月,我们在葡萄牙中北部和西班牙西北部的多时卫星图像环境中对这些方法进行了说明。每日卫星图像取自中分辨率成像分光仪(MODIS)产品。我们的分析表明,在这一特定领域,经典的逻辑回归(LR)模型与广泛使用的机器学习算法--极端梯度提升算法(XGBoost)--不相上下,甚至有过之而无不及。
{"title":"Logistic regression versus XGBoost for detecting burned areas using satellite images","authors":"","doi":"10.1007/s10651-023-00590-7","DOIUrl":"https://doi.org/10.1007/s10651-023-00590-7","url":null,"abstract":"<h3>Abstract</h3> <p>Classical statistical methods prove advantageous for small datasets, whereas machine learning algorithms can excel with larger datasets. Our paper challenges this conventional wisdom by addressing a highly significant problem: the identification of burned areas through satellite imagery, that is a clear example of imbalanced data. The methods are illustrated in the North-Central Portugal and the North-West of Spain in October 2017 within a multi-temporal setting of satellite imagery. Daily satellite images are taken from Moderate Resolution Imaging Spectroradiometer (MODIS) products. Our analysis shows that a classical Logistic regression (LR) model competes on par, if not surpasses, a widely employed machine learning algorithm called the extreme gradient boosting algorithm (XGBoost) within this particular domain.</p>","PeriodicalId":50519,"journal":{"name":"Environmental and Ecological Statistics","volume":"19 1","pages":""},"PeriodicalIF":3.8,"publicationDate":"2024-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139509837","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Flash floods in Mediterranean catchments: a meta-model decision support system based on Bayesian networks 地中海流域的山洪暴发:基于贝叶斯网络的元模型决策支持系统
IF 3.8 4区 环境科学与生态学 Q2 ENVIRONMENTAL SCIENCES Pub Date : 2024-01-09 DOI: 10.1007/s10651-023-00587-2
Rosa F. Ropero, M. Julia Flores, Rafael Rumí

Natural disasters, especially those related to water—like storms and floods—have increased over the last decades both in number and intensity. Under the current Climate Change framework, several reports predict an increase in the intensity and duration of these extreme climatic events, where the Mediterranean area would be one of the most affected. This paper develops a decision support system based on Bayesian inference able to predict a flood alert in Andalusian Mediterranean catchments. The key point is that, using simple weather forecasts and live measurements of river level, we can get a flood-alert several hours before it happens. A set of models based on Bayesian networks was learnt for each of the catchments included in the study area, and joined together into a more complex model based on a rule system. This final meta-model was validated using data from both non-extreme and extreme storm events. Results show that the methodology proposed provides an accurate forecast of the flood situation of the greatest catchment areas of Andalusia.

过去几十年来,自然灾害,特别是与水有关的灾害,如风暴和洪水,在数量和强度上都有所增加。在当前的气候变化框架下,一些报告预测这些极端气候事件的强度和持续时间将会增加,而地中海地区将是受影响最严重的地区之一。本文开发了一个基于贝叶斯推理的决策支持系统,能够预测安达卢西亚地中海流域的洪水警报。关键在于,利用简单的天气预报和河流水位的实时测量数据,我们可以在洪水发生前几个小时获得洪水警报。针对研究区域内的每个集水区,我们学习了一套基于贝叶斯网络的模型,并将其连接成一个基于规则系统的更复杂模型。利用非极端和极端暴雨事件的数据对最终的元模型进行了验证。结果表明,所提出的方法能够准确预测安达卢西亚最大集水区的洪水情况。
{"title":"Flash floods in Mediterranean catchments: a meta-model decision support system based on Bayesian networks","authors":"Rosa F. Ropero, M. Julia Flores, Rafael Rumí","doi":"10.1007/s10651-023-00587-2","DOIUrl":"https://doi.org/10.1007/s10651-023-00587-2","url":null,"abstract":"<p>Natural disasters, especially those related to water—like storms and floods—have increased over the last decades both in number and intensity. Under the current Climate Change framework, several reports predict an increase in the intensity and duration of these extreme climatic events, where the Mediterranean area would be one of the most affected. This paper develops a decision support system based on Bayesian inference able to predict a flood alert in Andalusian Mediterranean catchments. The key point is that, using simple weather forecasts and live measurements of river level, we can get a flood-alert several hours before it happens. A set of models based on Bayesian networks was learnt for each of the catchments included in the study area, and joined together into a more complex model based on a rule system. This final meta-model was validated using data from both non-extreme and extreme storm events. Results show that the methodology proposed provides an accurate forecast of the flood situation of the greatest catchment areas of Andalusia.</p>","PeriodicalId":50519,"journal":{"name":"Environmental and Ecological Statistics","volume":"82 1","pages":""},"PeriodicalIF":3.8,"publicationDate":"2024-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139422290","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A zero-inflated model for spatiotemporal count data with extra zeros: application to 1950–2015 tornado data in Kansas 具有额外零的时空计数数据的零膨胀模型:应用于 1950-2015 年堪萨斯州龙卷风数据
IF 3.8 4区 环境科学与生态学 Q2 ENVIRONMENTAL SCIENCES Pub Date : 2023-12-23 DOI: 10.1007/s10651-023-00586-3
Hong-Ding Yang, Audrey Chang, Wei-Wen Hsu, Chun-Shu Chen

In many tornado climate studies, the number of tornado touchdowns is often the primary outcome of interest. These outcome measures are usually generated under a spatiotemporal correlation structure and contains many zeros due to the rarity of tornado occurrence at a specific location and time interval. To model the spatiotemporal count data with excess zeros, we propose a spatiotemporal zero-inflated Poisson (ZIP) model, which lends itself to ease of interpretation and computational simplicity. Technically, we embed a modified conditional autoregressive model in the ZIP model to describe the spatial and temporal correlations, where the probability of a pure zero in the ZIP is purposely designed to depend on locations but independent of time. Illustrated with the longitudinal tornado touchdown data in the state of Kansas from 1950 to 2015, our model suggests that the spatial correlation among the counties and the corresponding temperature are significant factors attributed to the tornado touchdowns. Through the model, we can also estimate the probabilities of no tornado touchdowns for each county over time. These estimated probabilities substantially help us understand the pattern of touchdowns and further identify the risk areas across Kansas. Moreover, these estimates can be iteratively updated when more current touchdown data are available. The final model for Kansas tornado touchdown data is evaluated using more recent data.

在许多龙卷风气候研究中,龙卷风触地次数往往是主要的研究结果。这些结果度量通常是在时空相关结构下产生的,由于龙卷风在特定地点和时间间隔内发生的罕见性,这些结果度量包含许多零。为了对零点过多的时空计数数据建模,我们提出了时空零膨胀泊松(ZIP)模型,该模型易于解释,计算简单。从技术上讲,我们在 ZIP 模型中嵌入了一个修正的条件自回归模型,以描述空间和时间相关性,其中 ZIP 中出现纯零的概率特意设计为与地点相关,但与时间无关。以堪萨斯州 1950 年至 2015 年的龙卷风触地纵向数据为例,我们的模型表明,各县之间的空间相关性和相应的温度是导致龙卷风触地的重要因素。通过该模型,我们还可以估算出每个县在不同时期没有龙卷风触地的概率。这些估算出的概率极大地帮助我们了解龙卷风触地的模式,并进一步确定堪萨斯州的风险区域。此外,当获得更多最新触地数据时,这些估计值还可以不断更新。堪萨斯州龙卷风触地数据的最终模型将使用更新的数据进行评估。
{"title":"A zero-inflated model for spatiotemporal count data with extra zeros: application to 1950–2015 tornado data in Kansas","authors":"Hong-Ding Yang, Audrey Chang, Wei-Wen Hsu, Chun-Shu Chen","doi":"10.1007/s10651-023-00586-3","DOIUrl":"https://doi.org/10.1007/s10651-023-00586-3","url":null,"abstract":"<p>In many tornado climate studies, the number of tornado touchdowns is often the primary outcome of interest. These outcome measures are usually generated under a spatiotemporal correlation structure and contains many zeros due to the rarity of tornado occurrence at a specific location and time interval. To model the spatiotemporal count data with excess zeros, we propose a spatiotemporal zero-inflated Poisson (ZIP) model, which lends itself to ease of interpretation and computational simplicity. Technically, we embed a modified conditional autoregressive model in the ZIP model to describe the spatial and temporal correlations, where the probability of a pure zero in the ZIP is purposely designed to depend on locations but independent of time. Illustrated with the longitudinal tornado touchdown data in the state of Kansas from 1950 to 2015, our model suggests that the spatial correlation among the counties and the corresponding temperature are significant factors attributed to the tornado touchdowns. Through the model, we can also estimate the probabilities of no tornado touchdowns for each county over time. These estimated probabilities substantially help us understand the pattern of touchdowns and further identify the risk areas across Kansas. Moreover, these estimates can be iteratively updated when more current touchdown data are available. The final model for Kansas tornado touchdown data is evaluated using more recent data.</p>","PeriodicalId":50519,"journal":{"name":"Environmental and Ecological Statistics","volume":"78 1","pages":""},"PeriodicalIF":3.8,"publicationDate":"2023-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139030333","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Are the income and price elasticities of economy-wide electricity demand in middle-income countries time-varying? Evidence from panels and individual countries 中等收入国家全经济电力需求的收入和价格弹性是否随时间变化?来自小组和个别国家的证据
IF 3.8 4区 环境科学与生态学 Q2 ENVIRONMENTAL SCIENCES Pub Date : 2023-11-30 DOI: 10.1007/s10651-023-00585-4
Brantley Liddle, Fakhri Hasanov
{"title":"Are the income and price elasticities of economy-wide electricity demand in middle-income countries time-varying? Evidence from panels and individual countries","authors":"Brantley Liddle, Fakhri Hasanov","doi":"10.1007/s10651-023-00585-4","DOIUrl":"https://doi.org/10.1007/s10651-023-00585-4","url":null,"abstract":"","PeriodicalId":50519,"journal":{"name":"Environmental and Ecological Statistics","volume":"25 1","pages":"827 - 849"},"PeriodicalIF":3.8,"publicationDate":"2023-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138533560","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Some statistical problems involved in forecasting and estimating the spread of SARS-CoV-2 using Hawkes point processes and SEIR models 利用Hawkes点过程和SEIR模型预测和估计SARS-CoV-2传播的一些统计问题
IF 3.8 4区 环境科学与生态学 Q2 ENVIRONMENTAL SCIENCES Pub Date : 2023-11-29 DOI: 10.1007/s10651-023-00591-6
Frederic Schoenberg

This article reviews some of the statistical issues involved with modeling SARS-CoV02 (Covid-19) in Los Angeles County, California, using Hawkes point process models and SEIR models. The two types of models are compared, and their pros and cons are discussed. We also discuss particular statistical decisions, such as where to place the upper limits on y-axes, and whether to use a Bayesian or frequentist version of the model, how to estimate seroprevalence, and fitting the density of transmission times in the Hawkes model.

本文回顾了利用Hawkes点过程模型和SEIR模型在加利福尼亚州洛杉矶县建立SARS-CoV02 (Covid-19)模型所涉及的一些统计问题。对这两种模型进行了比较,并讨论了它们的优缺点。我们还讨论了特定的统计决策,例如在y轴上放置上限的位置,以及是否使用贝叶斯模型或频率版本的模型,如何估计血清阳性率,以及拟合Hawkes模型中的传播时间密度。
{"title":"Some statistical problems involved in forecasting and estimating the spread of SARS-CoV-2 using Hawkes point processes and SEIR models","authors":"Frederic Schoenberg","doi":"10.1007/s10651-023-00591-6","DOIUrl":"https://doi.org/10.1007/s10651-023-00591-6","url":null,"abstract":"<p>This article reviews some of the statistical issues involved with modeling SARS-CoV02 (Covid-19) in Los Angeles County, California, using Hawkes point process models and SEIR models. The two types of models are compared, and their pros and cons are discussed. We also discuss particular statistical decisions, such as where to place the upper limits on y-axes, and whether to use a Bayesian or frequentist version of the model, how to estimate seroprevalence, and fitting the density of transmission times in the Hawkes model.</p>","PeriodicalId":50519,"journal":{"name":"Environmental and Ecological Statistics","volume":"13 1","pages":""},"PeriodicalIF":3.8,"publicationDate":"2023-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138533608","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Trade-off between efficiency and variance estimation of spatially balanced augmented samples 空间平衡增广样本的效率和方差估计之间的权衡
IF 3.8 4区 环境科学与生态学 Q2 ENVIRONMENTAL SCIENCES Pub Date : 2023-11-29 DOI: 10.1007/s10651-023-00582-7
Omer Ozturk, Blair L. Robertson, Olena Kravchuk, Jennifer Brown

In this paper, we construct three types of augmented samples, which are samples generated from two separate randomization events. The first type combines a simple random sample (SRS) with a spatially balanced sample (SBS) selected from the same finite population. The second type combines an SBS with an SRS. The third type combines two spatially balanced samples. The simple random sample is constructed without replacement and does not contain any ties. The spatially balanced samples are constructed using the properties of the Halton sequence. We provide the first and second order inclusion probabilities for the augmented samples. Next, using the inclusion probabilities of the augmented samples, we construct estimators for the mean and total of a finite population. The efficiency of the augmented samples varies between the efficiency of SRS and SBS samples. If the number of SRS observations in the augmented sample is large, the efficiency is closer to the efficiency of SRS. Otherwise, it is closer to the efficiency of SBS. We also provide estimators for the variances of the estimators of population total of augmented samples. The stability of these variance estimators depends on the proportion of SRS observations in the augmented samples. The larger number of SRS observations lead to stable variance estimators.

在本文中,我们构造了三种类型的增强样本,它们是由两个独立的随机化事件产生的样本。第一种类型结合了简单随机样本(SRS)和从相同有限总体中选择的空间平衡样本(SBS)。第二种类型将SBS与SRS结合在一起。第三种类型结合了两个空间平衡的样本。简单随机样本不进行替换,不包含任何关系。利用霍尔顿序列的性质构造空间平衡样本。我们提供了增广样本的一阶和二阶包含概率。接下来,使用扩增样本的包含概率,我们构造有限总体的均值和总数的估计量。增强型样品的效率介于SRS和SBS样品的效率之间。增广样本中SRS观测数较多,则效率更接近于SRS效率。否则,更接近SBS的效率。我们还提供了增广样本总体估计量方差的估计量。这些方差估计量的稳定性取决于增广样本中SRS观测值的比例。较大数量的SRS观测值导致稳定的方差估计。
{"title":"Trade-off between efficiency and variance estimation of spatially balanced augmented samples","authors":"Omer Ozturk, Blair L. Robertson, Olena Kravchuk, Jennifer Brown","doi":"10.1007/s10651-023-00582-7","DOIUrl":"https://doi.org/10.1007/s10651-023-00582-7","url":null,"abstract":"<p>In this paper, we construct three types of augmented samples, which are samples generated from two separate randomization events. The first type combines a simple random sample (<i>SRS</i>) with a spatially balanced sample (<i>SBS</i>) selected from the same finite population. The second type combines an <i>SBS</i> with an <i>SRS</i>. The third type combines two spatially balanced samples. The simple random sample is constructed without replacement and does not contain any ties. The spatially balanced samples are constructed using the properties of the Halton sequence. We provide the first and second order inclusion probabilities for the augmented samples. Next, using the inclusion probabilities of the augmented samples, we construct estimators for the mean and total of a finite population. The efficiency of the augmented samples varies between the efficiency of <i>SRS</i> and <i>SBS</i> samples. If the number of <i>SRS</i> observations in the augmented sample is large, the efficiency is closer to the efficiency of <i>SRS</i>. Otherwise, it is closer to the efficiency of <i>SBS</i>. We also provide estimators for the variances of the estimators of population total of augmented samples. The stability of these variance estimators depends on the proportion of <i>SRS</i> observations in the augmented samples. The larger number of <i>SRS</i> observations lead to stable variance estimators.</p>","PeriodicalId":50519,"journal":{"name":"Environmental and Ecological Statistics","volume":"34 1","pages":""},"PeriodicalIF":3.8,"publicationDate":"2023-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138533575","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Finding the number of latent states in hidden Markov models using information criteria 利用信息准则寻找隐马尔可夫模型中潜在状态的数量
IF 3.8 4区 环境科学与生态学 Q2 ENVIRONMENTAL SCIENCES Pub Date : 2023-11-22 DOI: 10.1007/s10651-023-00584-5
Jodie Buckby, Ting Wang, David Fletcher, Jiancang Zhuang, Akiko Takeo, Kazushige Obara

Hidden Markov models (HMMs) are often used to model time series data and are applied in many fields of research. However, estimating the unknown number of hidden states in the Markov chain is a non-trivial component of HMM model selection and an area of active research. Currently, AIC and BIC are commonly used for this purpose, despite theoretical issues and some evidence of poor performance in the literature. Here, motivated by the HMMs developed to model seismic tremor data, we use simulation studies to compare the performance of a number of model selection information criteria when used to select the number of hidden states in HMMs, including an adjusted BIC not previously used with HMMs. We find that AIC and BIC are not always reliable tools for selecting the number of hidden states in HMMs and that other information criteria such as adjusted BIC can actually perform better, depending on factors such as sample size and sojourn times in each state. We apply the information criteria to a set of HMMs fitted to seismic tremor data and compare the models selected by the different criteria.

隐马尔可夫模型(hmm)通常用于时间序列数据的建模,并在许多研究领域得到应用。然而,估计马尔可夫链中未知隐藏状态的数量是HMM模型选择的一个重要组成部分,也是一个活跃的研究领域。目前,AIC和BIC通常用于此目的,尽管存在理论问题和一些文献中表现不佳的证据。在此,受用于模拟地震数据的hmm的激励,我们使用仿真研究来比较用于选择hmm中隐藏状态数量的许多模型选择信息标准的性能,包括先前未用于hmm的调整BIC。我们发现AIC和BIC并不总是选择hmm中隐藏状态数量的可靠工具,而其他信息标准(如调整后的BIC)实际上可以表现得更好,这取决于每个状态的样本量和逗留时间等因素。我们将信息准则应用于一组地震地震资料拟合的hmm模型,并比较了不同准则所选择的模型。
{"title":"Finding the number of latent states in hidden Markov models using information criteria","authors":"Jodie Buckby, Ting Wang, David Fletcher, Jiancang Zhuang, Akiko Takeo, Kazushige Obara","doi":"10.1007/s10651-023-00584-5","DOIUrl":"https://doi.org/10.1007/s10651-023-00584-5","url":null,"abstract":"<p>Hidden Markov models (HMMs) are often used to model time series data and are applied in many fields of research. However, estimating the unknown number of hidden states in the Markov chain is a non-trivial component of HMM model selection and an area of active research. Currently, AIC and BIC are commonly used for this purpose, despite theoretical issues and some evidence of poor performance in the literature. Here, motivated by the HMMs developed to model seismic tremor data, we use simulation studies to compare the performance of a number of model selection information criteria when used to select the number of hidden states in HMMs, including an adjusted BIC not previously used with HMMs. We find that AIC and BIC are not always reliable tools for selecting the number of hidden states in HMMs and that other information criteria such as adjusted BIC can actually perform better, depending on factors such as sample size and sojourn times in each state. We apply the information criteria to a set of HMMs fitted to seismic tremor data and compare the models selected by the different criteria.</p>","PeriodicalId":50519,"journal":{"name":"Environmental and Ecological Statistics","volume":"101 1 1","pages":""},"PeriodicalIF":3.8,"publicationDate":"2023-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138533632","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Environmental and Ecological Statistics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1