首页 > 最新文献

Spatial Statistics最新文献

英文 中文
Spatiotemporal factor models for functional data with application to population map forecast 功能数据时空因素模型在人口分布图预测中的应用
IF 2.1 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2024-08-01 DOI: 10.1016/j.spasta.2024.100849
Tomoya Wakayama , Shonosuke Sugasawa

The proliferation of mobile devices has led to the collection of large amounts of population data. This situation has prompted the need to utilize this rich, multidimensional data in practical applications. In response to this trend, we have integrated functional data analysis (FDA) and factor analysis to address the challenge of predicting hourly population changes across various districts in Tokyo. Specifically, by assuming a Gaussian process, we avoided the large covariance matrix parameters of the multivariate normal distribution. In addition, the data were both time and spatially dependent between districts. To capture various characteristics, a Bayesian factor model was introduced, which modeled the time series of a small number of common factors and expressed the spatial structure through factor loading matrices. Furthermore, the factor loading matrices were made identifiable and sparse to ensure the interpretability of the model. We also proposed a Bayesian shrinkage method as a systematic approach for factor selection. Through numerical experiments and data analysis, we investigated the predictive accuracy and interpretability of our proposed method. We concluded that the flexibility of the method allows for the incorporation of additional time series features, thereby improving its accuracy.

移动设备的普及导致了大量人口数据的收集。这种情况促使人们需要在实际应用中利用这些丰富的多维数据。针对这一趋势,我们整合了功能数据分析(FDA)和因子分析,以应对预测东京各区每小时人口变化的挑战。具体来说,通过假设高斯过程,我们避免了多元正态分布的大协方差矩阵参数。此外,各区之间的数据既与时间有关,也与空间有关。为了捕捉各种特征,我们引入了贝叶斯因子模型,该模型将时间序列建模为少数几个共同因子,并通过因子载荷矩阵表达空间结构。此外,为确保模型的可解释性,我们还使因子载荷矩阵具有可识别性和稀疏性。我们还提出了一种贝叶斯收缩法,作为因子选择的系统方法。通过数值实验和数据分析,我们研究了所提方法的预测准确性和可解释性。我们得出的结论是,该方法的灵活性允许纳入更多的时间序列特征,从而提高了其准确性。
{"title":"Spatiotemporal factor models for functional data with application to population map forecast","authors":"Tomoya Wakayama ,&nbsp;Shonosuke Sugasawa","doi":"10.1016/j.spasta.2024.100849","DOIUrl":"10.1016/j.spasta.2024.100849","url":null,"abstract":"<div><p>The proliferation of mobile devices has led to the collection of large amounts of population data. This situation has prompted the need to utilize this rich, multidimensional data in practical applications. In response to this trend, we have integrated functional data analysis (FDA) and factor analysis to address the challenge of predicting hourly population changes across various districts in Tokyo. Specifically, by assuming a Gaussian process, we avoided the large covariance matrix parameters of the multivariate normal distribution. In addition, the data were both time and spatially dependent between districts. To capture various characteristics, a Bayesian factor model was introduced, which modeled the time series of a small number of common factors and expressed the spatial structure through factor loading matrices. Furthermore, the factor loading matrices were made identifiable and sparse to ensure the interpretability of the model. We also proposed a Bayesian shrinkage method as a systematic approach for factor selection. Through numerical experiments and data analysis, we investigated the predictive accuracy and interpretability of our proposed method. We concluded that the flexibility of the method allows for the incorporation of additional time series features, thereby improving its accuracy.</p></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"62 ","pages":"Article 100849"},"PeriodicalIF":2.1,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141936198","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing bivariate spatial association analysis of network-constrained geographical flows: An incremental scale-based method 加强受网络限制的地理流动的双变量空间关联分析:基于规模的增量方法
IF 2.1 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2024-07-31 DOI: 10.1016/j.spasta.2024.100852
Wenkai Liu , Haonan Cai , Weijie Zhang , Sheng Hu , Zhangzhi Tan , Jiannan Cai , Hanfa Xing

Measuring bivariate spatial association plays a key role in understanding the spatial relationships between two types of geographical flow (hereafter referred to as “flow”). However, existing studies usually use multiple scales to analyze bivariate associations of flows, leading to the results at larger scales can be strongly affected by the results at smaller scales. Moreover, the planar space assumption of most existing studies is unsuitable for network-constrained flows. To solve these problems, a network incremental flow cross K-function (NIFK) is developed in this study by extending the cross K-function for points into a flow context. Specifically, two versions of NIFK were developed in this study: the global version to check whether bivariate associations exist in the whole study area and the local version to identify specific locations where associations occur. Experiments on three simulated datasets demonstrate the advantages of the proposed method over an available alternative method. A case study conducted using Xiamen taxi and ride-hailing service datasets demonstrates the usefulness of the proposed method. The detected bivariate spatial association provides deep insights for understanding the competition between taxi services and ride-hailing services.

测量双变量空间关联对于理解两类地理流量(以下简称 "流量")之间的空间关系起着关键作用。然而,现有研究通常使用多个尺度来分析流量的双变量关联,导致较大尺度的结果会受到较小尺度结果的强烈影响。此外,大多数现有研究的平面空间假设并不适合网络约束流。为了解决这些问题,本研究通过将点的交叉 K 函数扩展到流的背景下,开发了网络增量流交叉 K 函数(NIFK)。具体来说,本研究开发了两个版本的 NIFK:全局版本用于检查整个研究区域是否存在二元关联,局部版本用于识别发生关联的特定位置。在三个模拟数据集上进行的实验表明,与现有的替代方法相比,本研究提出的方法更具优势。利用厦门出租车和打车服务数据集进行的案例研究证明了所提方法的实用性。检测到的二元空间关联为理解出租车服务和打车服务之间的竞争提供了深刻的见解。
{"title":"Enhancing bivariate spatial association analysis of network-constrained geographical flows: An incremental scale-based method","authors":"Wenkai Liu ,&nbsp;Haonan Cai ,&nbsp;Weijie Zhang ,&nbsp;Sheng Hu ,&nbsp;Zhangzhi Tan ,&nbsp;Jiannan Cai ,&nbsp;Hanfa Xing","doi":"10.1016/j.spasta.2024.100852","DOIUrl":"10.1016/j.spasta.2024.100852","url":null,"abstract":"<div><p>Measuring bivariate spatial association plays a key role in understanding the spatial relationships between two types of geographical flow (hereafter referred to as “flow”). However, existing studies usually use multiple scales to analyze bivariate associations of flows, leading to the results at larger scales can be strongly affected by the results at smaller scales. Moreover, the planar space assumption of most existing studies is unsuitable for network-constrained flows. To solve these problems, a network incremental flow cross K-function (<em>NIFK</em>) is developed in this study by extending the cross K-function for points into a flow context. Specifically, two versions of <em>NIFK</em> were developed in this study: the global version to check whether bivariate associations exist in the whole study area and the local version to identify specific locations where associations occur. Experiments on three simulated datasets demonstrate the advantages of the proposed method over an available alternative method. A case study conducted using Xiamen taxi and ride-hailing service datasets demonstrates the usefulness of the proposed method. The detected bivariate spatial association provides deep insights for understanding the competition between taxi services and ride-hailing services.</p></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"63 ","pages":"Article 100852"},"PeriodicalIF":2.1,"publicationDate":"2024-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141964309","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Analysis of the spatial distribution and future trends of coal mine accidents: A case study of coal mine accidents in China from 2005–2022 煤矿事故的空间分布和未来趋势分析:2005-2022 年中国煤矿事故案例研究
IF 2.1 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2024-07-30 DOI: 10.1016/j.spasta.2024.100851
He Yinnan , Qin Ruxiang

A scientific grasp of the macro law of coal mining accidents can contribute to strengthening their prevention and control and guaranteeing a stable energy supply. In this study, 2,269 investigation reports of China's coal mining accidents from 2005 to 2022 were adopted as the basic data source, and GIS spatial analysis and rescaled range analysis methods were utilized to comprehensively reveal the spatial-temporal distribution features, and evolutionary patterns of coal mining accidents in China. The findings indicate that the numbers of gas explosion, permeability, outburst, suffocation and roof fall accidents has rapidly declined. The coverage area of coal mining accidents has gradually moved toward western of China. However, the center of the area covered by coal mining accidents during the study period was mainly concentrated in Shanxi and Henan Provinces. Besides, the number of deaths resulting from coal mining accidents across the country has gradually decreased, while the time series exhibited high continuity, with future changes consistent with past changes. The average cycle period of the coal mining accident sequence was 5 years. Through the systematic analysis of coal mine accidents conducted in this research, the law of accident occurrence was more comprehensively revealed, providing a reference and basis for the government and enterprises to implement precise preventive measures.

科学把握煤矿事故发生的宏观规律,有助于加强煤矿事故防控,保障能源稳定供应。本研究以2005-2022年中国煤矿事故调查报告2269份为基础数据,利用GIS空间分析和重标度范围分析方法,全面揭示了中国煤矿事故的时空分布特征和演变规律。研究结果表明,瓦斯爆炸、透水、突水、窒息和顶板冒落事故数量迅速下降,煤矿事故覆盖区域不断扩大,事故发生率逐年上升。煤矿事故的覆盖区域逐渐向西部转移。然而,研究期间煤矿事故覆盖区域的中心主要集中在山西省和河南省。此外,全国煤矿事故死亡人数逐渐减少,时间序列表现出较强的连续性,未来的变化与过去的变化相一致。煤矿事故序列的平均周期为 5 年。通过对煤矿事故的系统分析,较为全面地揭示了事故发生的规律,为政府和企业实施精准预防措施提供了参考和依据。
{"title":"Analysis of the spatial distribution and future trends of coal mine accidents: A case study of coal mine accidents in China from 2005–2022","authors":"He Yinnan ,&nbsp;Qin Ruxiang","doi":"10.1016/j.spasta.2024.100851","DOIUrl":"10.1016/j.spasta.2024.100851","url":null,"abstract":"<div><p>A scientific grasp of the macro law of coal mining accidents can contribute to strengthening their prevention and control and guaranteeing a stable energy supply. In this study, 2,269 investigation reports of China's coal mining accidents from 2005 to 2022 were adopted as the basic data source, and GIS spatial analysis and rescaled range analysis methods were utilized to comprehensively reveal the spatial-temporal distribution features, and evolutionary patterns of coal mining accidents in China. The findings indicate that the numbers of gas explosion, permeability, outburst, suffocation and roof fall accidents has rapidly declined. The coverage area of coal mining accidents has gradually moved toward western of China. However, the center of the area covered by coal mining accidents during the study period was mainly concentrated in Shanxi and Henan Provinces. Besides, the number of deaths resulting from coal mining accidents across the country has gradually decreased, while the time series exhibited high continuity, with future changes consistent with past changes. The average cycle period of the coal mining accident sequence was 5 years. Through the systematic analysis of coal mine accidents conducted in this research, the law of accident occurrence was more comprehensively revealed, providing a reference and basis for the government and enterprises to implement precise preventive measures.</p></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"63 ","pages":"Article 100851"},"PeriodicalIF":2.1,"publicationDate":"2024-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141936197","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The SPDE approach for spatio-temporal datasets with advection and diffusion 针对具有平流和扩散的时空数据集的 SPDE 方法
IF 2.1 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2024-07-02 DOI: 10.1016/j.spasta.2024.100847
Lucia Clarotto , Denis Allard , Thomas Romary , Nicolas Desassis

In the task of predicting spatio-temporal fields in environmental science using statistical methods, introducing statistical models inspired by the physics of the underlying phenomena that are numerically efficient is of growing interest. Large space–time datasets call for new numerical methods to efficiently process them. The Stochastic Partial Differential Equation (SPDE) approach has proven to be effective for the estimation and the prediction in a spatial context. We present here the advection–diffusion SPDE with first–order derivative in time which defines a large class of nonseparable spatio-temporal models. A Gaussian Markov random field approximation of the solution to the SPDE is built by discretizing the temporal derivative with a finite difference method (implicit Euler) and by solving the spatial SPDE with a finite element method (continuous Galerkin) at each time step. The “Streamline Diffusion” stabilization technique is introduced when the advection term dominates the diffusion. Computationally efficient methods are proposed to estimate the parameters of the SPDE and to predict the spatio-temporal field by kriging, as well as to perform conditional simulations. The approach is applied to a solar radiation dataset. Its advantages and limitations are discussed.

在利用统计方法预测环境科学中的时空场时,引入受基本现象物理学启发的高效数值统计模型越来越受到关注。大型时空数据集需要新的数值方法来高效处理。事实证明,随机偏微分方程(SPDE)方法对空间范围内的估计和预测非常有效。我们在此介绍具有一阶时间导数的平流-扩散 SPDE,它定义了一大类不可分割的时空模型。通过使用有限差分法(隐式欧拉)对时间导数进行离散化,并在每个时间步使用有限元法(连续 Galerkin)求解空间 SPDE,建立了 SPDE 解的高斯马尔可夫随机场近似。当平流项主导扩散时,引入 "流线扩散 "稳定技术。提出了计算效率高的方法来估计 SPDE 的参数,通过克里格法预测时空场,以及进行条件模拟。该方法应用于太阳辐射数据集。讨论了该方法的优势和局限性。
{"title":"The SPDE approach for spatio-temporal datasets with advection and diffusion","authors":"Lucia Clarotto ,&nbsp;Denis Allard ,&nbsp;Thomas Romary ,&nbsp;Nicolas Desassis","doi":"10.1016/j.spasta.2024.100847","DOIUrl":"https://doi.org/10.1016/j.spasta.2024.100847","url":null,"abstract":"<div><p>In the task of predicting spatio-temporal fields in environmental science using statistical methods, introducing statistical models inspired by the physics of the underlying phenomena that are numerically efficient is of growing interest. Large space–time datasets call for new numerical methods to efficiently process them. The Stochastic Partial Differential Equation (SPDE) approach has proven to be effective for the estimation and the prediction in a spatial context. We present here the advection–diffusion SPDE with first–order derivative in time which defines a large class of nonseparable spatio-temporal models. A Gaussian Markov random field approximation of the solution to the SPDE is built by discretizing the temporal derivative with a finite difference method (implicit Euler) and by solving the spatial SPDE with a finite element method (continuous Galerkin) at each time step. The “Streamline Diffusion” stabilization technique is introduced when the advection term dominates the diffusion. Computationally efficient methods are proposed to estimate the parameters of the SPDE and to predict the spatio-temporal field by kriging, as well as to perform conditional simulations. The approach is applied to a solar radiation dataset. Its advantages and limitations are discussed.</p></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"62 ","pages":"Article 100847"},"PeriodicalIF":2.1,"publicationDate":"2024-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2211675324000381/pdfft?md5=b6c10160a911d1d2dde2fa58d62da6e3&pid=1-s2.0-S2211675324000381-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141607530","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Spatial non-stationarity test of regression relationships in the multiscale geographically weighted regression model 多尺度地理加权回归模型中回归关系的空间非平稳性检验
IF 2.1 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2024-06-13 DOI: 10.1016/j.spasta.2024.100846
Feng Chen , Yee Leung , Qiang Wang , Yu Zhou

By allowing covariate-specific bandwidths for estimating spatially varying coefficients, the multiscale geographically weighted regression (MGWR) model can simultaneously explore spatial non-stationarity and multiple operational scales of the corresponding geographical processes. Treating the constant coefficients as an extreme situation which corresponds to the global scale and infinite covariate bandwidth, the traditional linear regression, GWR and mixed GWR models are special cases of the MGWR model. An appropriately-specified GWR-based model would be beneficial to the understanding of the general underlying processes, especially for their operational scales. To specify an appropriate model, the key issue is to determine how many MGWR coefficient(s) should be constant. Along the traditional statistical line of thought, we propose a residual-based bootstrap method to test spatial non-stationarity of the MGWR coefficients, which can underpin our understanding of the characteristics of regression relationships in statistics. The simulation experiment validates the proposed test, and demonstrates that it is of valid Type I error and satisfactory power, and is robust to different types of model error distributions. The applicability of the proposed test is demonstrated in a real-world case study on the Shanghai housing prices.

多尺度地理加权回归(MGWR)模型通过允许特定协变量带宽来估计空间变化系数,可以同时探索相应地理过程的空间非平稳性和多种操作尺度。将常数系数视为对应于全球尺度和无限协变量带宽的极端情况,传统的线性回归、GWR 和混合 GWR 模型都是 MGWR 模型的特例。一个基于 GWR 的适当指定模型将有助于理解一般的基本过程,特别是其运行尺度。要指定一个合适的模型,关键问题是确定有多少 MGWR 系数应该是常数。按照传统的统计思路,我们提出了一种基于残差的引导方法来检验 MGWR 系数的空间非平稳性,这可以巩固我们对统计学中回归关系特征的理解。模拟实验验证了所提出的检验方法,证明其具有有效的 I 类误差和令人满意的功率,并对不同类型的模型误差分布具有稳健性。通过对上海房价的实际案例研究,证明了所提检验的适用性。
{"title":"Spatial non-stationarity test of regression relationships in the multiscale geographically weighted regression model","authors":"Feng Chen ,&nbsp;Yee Leung ,&nbsp;Qiang Wang ,&nbsp;Yu Zhou","doi":"10.1016/j.spasta.2024.100846","DOIUrl":"10.1016/j.spasta.2024.100846","url":null,"abstract":"<div><p>By allowing covariate-specific bandwidths for estimating spatially varying coefficients, the multiscale geographically weighted regression (MGWR) model can simultaneously explore spatial non-stationarity and multiple operational scales of the corresponding geographical processes. Treating the constant coefficients as an extreme situation which corresponds to the global scale and infinite covariate bandwidth, the traditional linear regression, GWR and mixed GWR models are special cases of the MGWR model. An appropriately-specified GWR-based model would be beneficial to the understanding of the general underlying processes, especially for their operational scales. To specify an appropriate model, the key issue is to determine how many MGWR coefficient(s) should be constant. Along the traditional statistical line of thought, we propose a residual-based bootstrap method to test spatial non-stationarity of the MGWR coefficients, which can underpin our understanding of the characteristics of regression relationships in statistics. The simulation experiment validates the proposed test, and demonstrates that it is of valid Type I error and satisfactory power, and is robust to different types of model error distributions. The applicability of the proposed test is demonstrated in a real-world case study on the Shanghai housing prices.</p></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"62 ","pages":"Article 100846"},"PeriodicalIF":2.1,"publicationDate":"2024-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141396270","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Analytical simulation methodology for nonlinear spatiotemporal models: Spatial salience in Covid-19 contagion 非线性时空模型的分析模拟方法:Covid-19 传染的空间显著性
IF 2.1 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2024-06-13 DOI: 10.1016/j.spasta.2024.100844
Michael Beenstock , Yoel Cohen , Daniel Felsenstein

‘Outdegree’ from directed graph theory is used to measure the salience of individual locations in the transmission of Covid-19 morbidity through the spatiotemporal network of contagion and their salience in the spatiotemporal diffusion of vaccination rollout. A spatial econometric model in which morbidity varies inversely with vaccination rollout, and vaccination rollout varies directly with morbidity is used to calculate dynamic auto-outdegrees for morbidity and dynamic cross-outdegrees for the effect of vaccination on morbidity. The former identifies hot spots of contagion, and the latter identifies locations in which vaccination rollout is particularly effective in reducing national morbidity. These outdegrees are calculated analytically rather than simulated numerically.

有向图理论中的 "出度 "用于衡量个别地点在通过传染病时空网络传播 Covid-19 发病率时的显著性,以及它们在疫苗接种推广的时空扩散中的显著性。在一个空间计量经济学模型中,发病率与疫苗接种推广情况成反比变化,而疫苗接种推广情况与发病率直接变化,该模型用于计算发病率的动态自动淘汰度和疫苗接种对发病率影响的动态交叉淘汰度。前者确定传染热点,后者确定疫苗接种推广对降低全国发病率特别有效的地点。这些跨度是通过分析而不是数字模拟计算出来的。
{"title":"Analytical simulation methodology for nonlinear spatiotemporal models: Spatial salience in Covid-19 contagion","authors":"Michael Beenstock ,&nbsp;Yoel Cohen ,&nbsp;Daniel Felsenstein","doi":"10.1016/j.spasta.2024.100844","DOIUrl":"10.1016/j.spasta.2024.100844","url":null,"abstract":"<div><p>‘Outdegree’ from directed graph theory is used to measure the salience of individual locations in the transmission of Covid-19 morbidity through the spatiotemporal network of contagion and their salience in the spatiotemporal diffusion of vaccination rollout. A spatial econometric model in which morbidity varies inversely with vaccination rollout, and vaccination rollout varies directly with morbidity is used to calculate dynamic auto-outdegrees for morbidity and dynamic cross-outdegrees for the effect of vaccination on morbidity. The former identifies hot spots of contagion, and the latter identifies locations in which vaccination rollout is particularly effective in reducing national morbidity. These outdegrees are calculated analytically rather than simulated numerically.</p></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"62 ","pages":"Article 100844"},"PeriodicalIF":2.1,"publicationDate":"2024-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141403265","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automatic cross-validation in structured models: Is it time to leave out leave-one-out? 结构化模型中的自动交叉验证:是时候摒弃 "leave-one-out "了吗?
IF 2.3 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2024-06-12 DOI: 10.1016/j.spasta.2024.100843
Aritz Adin , Elias Teixeira Krainski , Amanda Lenzi , Zhedong Liu , Joaquín Martínez-Minaya , Håvard Rue

Standard techniques such as leave-one-out cross-validation (LOOCV) might not be suitable for evaluating the predictive performance of models incorporating structured random effects. In such cases, the correlation between the training and test sets could have a notable impact on the model’s prediction error. To overcome this issue, an automatic group construction procedure for leave-group-out cross validation (LGOCV) has recently emerged as a valuable tool for enhancing predictive performance measurement in structured models. The purpose of this paper is (i) to compare LOOCV and LGOCV within structured models, emphasizing model selection and predictive performance, and (ii) to provide real data applications in spatial statistics using complex structured models fitted with INLA, showcasing the utility of the automatic LGOCV method. First, we briefly review the key aspects of the recently proposed LGOCV method for automatic group construction in latent Gaussian models. We also demonstrate the effectiveness of this method for selecting the model with the highest predictive performance by simulating extrapolation tasks in both temporal and spatial data analyses. Finally, we provide insights into the effectiveness of the LGOCV method in modeling complex structured data, encompassing spatio-temporal multivariate count data, spatial compositional data, and spatio-temporal geospatial data.

留一交叉验证(LOOCV)等标准技术可能不适合评估包含结构随机效应的模型的预测性能。在这种情况下,训练集和测试集之间的相关性可能会对模型的预测误差产生显著影响。为了克服这一问题,最近出现了一种用于留空交叉验证(LGOCV)的自动建组程序,它是提高结构化模型预测性能测量的重要工具。本文的目的是:(i) 比较结构化模型中的 LOOCV 和 LGOCV,强调模型选择和预测性能;(ii) 提供空间统计学中使用 INLA 拟合的复杂结构化模型的实际数据应用,展示自动 LGOCV 方法的实用性。首先,我们简要回顾了最近提出的在潜在高斯模型中自动构建分组的 LGOCV 方法的主要方面。我们还通过模拟时间和空间数据分析中的外推任务,展示了该方法在选择预测性能最高的模型方面的有效性。最后,我们深入探讨了 LGOCV 方法在复杂结构数据建模中的有效性,包括时空多变量计数数据、空间组合数据和时空地理空间数据。
{"title":"Automatic cross-validation in structured models: Is it time to leave out leave-one-out?","authors":"Aritz Adin ,&nbsp;Elias Teixeira Krainski ,&nbsp;Amanda Lenzi ,&nbsp;Zhedong Liu ,&nbsp;Joaquín Martínez-Minaya ,&nbsp;Håvard Rue","doi":"10.1016/j.spasta.2024.100843","DOIUrl":"https://doi.org/10.1016/j.spasta.2024.100843","url":null,"abstract":"<div><p>Standard techniques such as leave-one-out cross-validation (LOOCV) might not be suitable for evaluating the predictive performance of models incorporating structured random effects. In such cases, the correlation between the training and test sets could have a notable impact on the model’s prediction error. To overcome this issue, an automatic group construction procedure for leave-group-out cross validation (LGOCV) has recently emerged as a valuable tool for enhancing predictive performance measurement in structured models. The purpose of this paper is (i) to compare LOOCV and LGOCV within structured models, emphasizing model selection and predictive performance, and (ii) to provide real data applications in spatial statistics using complex structured models fitted with INLA, showcasing the utility of the automatic LGOCV method. First, we briefly review the key aspects of the recently proposed LGOCV method for automatic group construction in latent Gaussian models. We also demonstrate the effectiveness of this method for selecting the model with the highest predictive performance by simulating extrapolation tasks in both temporal and spatial data analyses. Finally, we provide insights into the effectiveness of the LGOCV method in modeling complex structured data, encompassing spatio-temporal multivariate count data, spatial compositional data, and spatio-temporal geospatial data.</p></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"62 ","pages":"Article 100843"},"PeriodicalIF":2.3,"publicationDate":"2024-06-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2211675324000344/pdfft?md5=58ade5e28808d907246b86bb20b2c270&pid=1-s2.0-S2211675324000344-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141429838","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The circular Matérn covariance function and its link to Markov random fields on the circle 圆马特恩协方差函数及其与圆上马尔可夫随机场的联系
IF 2.3 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2024-06-04 DOI: 10.1016/j.spasta.2024.100837
Chunfeng Huang , Ao Li , Nicholas W. Bussberg , Haimeng Zhang

The connection between Gaussian random fields and Markov random fields has been well-established in Euclidean spaces, with Matérn covariance functions playing a pivotal role. In this paper, we explore the extension of this link to circular spaces and uncover different results. It is known that Matérn covariance functions are not always positive definite on the circle; however, the circular Matérn covariance functions are shown to be valid on the circle and are the focus of this paper. For these circular Matérn random fields on the circle, we show that the corresponding Markov random fields can be obtained explicitly on equidistance grids. Consequently, the equivalence between the circular Matérn random fields and Markov random fields is then exact and this marks a departure from the Euclidean space counterpart, where only approximations are achieved. Moreover, the key motivation in Euclidean spaces for establishing such link relies on the assumption that the corresponding Markov random field is sparse. We show that such sparsity does not hold in general on the circle. In addition, for the sparse Markov random field on the circle, we derive its corresponding Gaussian random field.

高斯随机场和马尔可夫随机场之间的联系在欧几里得空间中已经得到了很好的证实,其中马特恩协方差函数发挥了关键作用。在本文中,我们将探索这一联系在圆空间中的延伸,并揭示出不同的结果。众所周知,圆上的 Matérn 协方差函数并不总是正定的;然而,圆上的 Matérn 协方差函数被证明在圆上是有效的,这也是本文的重点。对于圆上的这些圆 Matérn 随机场,我们证明相应的马尔可夫随机场可以在等距网格上明确得到。因此,圆 Matérn 随机场和马尔可夫随机场之间的等价性是精确的,这标志着与欧几里得空间对应场的不同,后者只能得到近似值。此外,欧几里得空间中建立这种联系的关键动机依赖于假设相应的马尔可夫随机场是稀疏的。我们证明,这种稀疏性在圆上一般不成立。此外,对于圆上的稀疏马尔科夫随机场,我们推导出了其相应的高斯随机场。
{"title":"The circular Matérn covariance function and its link to Markov random fields on the circle","authors":"Chunfeng Huang ,&nbsp;Ao Li ,&nbsp;Nicholas W. Bussberg ,&nbsp;Haimeng Zhang","doi":"10.1016/j.spasta.2024.100837","DOIUrl":"https://doi.org/10.1016/j.spasta.2024.100837","url":null,"abstract":"<div><p>The connection between Gaussian random fields and Markov random fields has been well-established in Euclidean spaces, with Matérn covariance functions playing a pivotal role. In this paper, we explore the extension of this link to circular spaces and uncover different results. It is known that Matérn covariance functions are not always positive definite on the circle; however, the circular Matérn covariance functions are shown to be valid on the circle and are the focus of this paper. For these circular Matérn random fields on the circle, we show that the corresponding Markov random fields can be obtained explicitly on equidistance grids. Consequently, the equivalence between the circular Matérn random fields and Markov random fields is then exact and this marks a departure from the Euclidean space counterpart, where only approximations are achieved. Moreover, the key motivation in Euclidean spaces for establishing such link relies on the assumption that the corresponding Markov random field is sparse. We show that such sparsity does not hold in general on the circle. In addition, for the sparse Markov random field on the circle, we derive its corresponding Gaussian random field.</p></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"62 ","pages":"Article 100837"},"PeriodicalIF":2.3,"publicationDate":"2024-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141323908","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dimension reduction for spatial regression: Spatial predictor envelope 空间回归的降维:空间预测包络
IF 2.3 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2024-06-01 DOI: 10.1016/j.spasta.2024.100838
Paul May , Hossein Moradi Rekabdarkolaee

Natural sciences such as geology and forestry often utilize regression models for spatial data with many predictors and small to moderate sample sizes. In these settings, efficient estimation of the regression parameters is crucial for both model interpretation and prediction. We propose a dimension reduction approach for spatial regression that assumes certain linear combinations of the predictors are immaterial to the regression. The model and corresponding inference provide efficient estimation of regression parameters while accounting for spatial correlation in the data. We employed the maximum likelihood estimation approach to estimate the parameters of the model. The effectiveness of the proposed model is illustrated through simulation studies and the analysis of a geochemical data set, predicting rare earth element concentrations within an oil and gas reserve in Wyoming. Simulation results indicate that our proposed model offers a significant reduction in the mean square errors and variation of the regression coefficients. Furthermore, the method provided a 50% reduction in prediction variance for rare earth element concentrations within our data analysis.

地质学和林业等自然科学领域经常利用回归模型来处理预测因子多、样本量小到中等的空间数据。在这些情况下,有效估计回归参数对模型解释和预测都至关重要。我们提出了一种空间回归的降维方法,该方法假定预测因子的某些线性组合对回归无关紧要。该模型和相应的推论在考虑数据空间相关性的同时,提供了回归参数的有效估计。我们采用最大似然估计法来估计模型参数。通过模拟研究和对地球化学数据集的分析,预测了怀俄明州油气储量中稀土元素的浓度,从而说明了所提模型的有效性。模拟结果表明,我们提出的模型显著减少了均方误差和回归系数的变化。此外,在我们的数据分析中,该方法还将稀土元素浓度的预测方差减少了 50%。
{"title":"Dimension reduction for spatial regression: Spatial predictor envelope","authors":"Paul May ,&nbsp;Hossein Moradi Rekabdarkolaee","doi":"10.1016/j.spasta.2024.100838","DOIUrl":"10.1016/j.spasta.2024.100838","url":null,"abstract":"<div><p>Natural sciences such as geology and forestry often utilize regression models for spatial data with many predictors and small to moderate sample sizes. In these settings, efficient estimation of the regression parameters is crucial for both model interpretation and prediction. We propose a dimension reduction approach for spatial regression that assumes certain linear combinations of the predictors are immaterial to the regression. The model and corresponding inference provide efficient estimation of regression parameters while accounting for spatial correlation in the data. We employed the maximum likelihood estimation approach to estimate the parameters of the model. The effectiveness of the proposed model is illustrated through simulation studies and the analysis of a geochemical data set, predicting rare earth element concentrations within an oil and gas reserve in Wyoming. Simulation results indicate that our proposed model offers a significant reduction in the mean square errors and variation of the regression coefficients. Furthermore, the method provided a 50% reduction in prediction variance for rare earth element concentrations within our data analysis.</p></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"61 ","pages":"Article 100838"},"PeriodicalIF":2.3,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141132058","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Integrated deviance information criterion for spatial autoregressive models with heteroskedasticity 具有异方差性的空间自回归模型的综合偏差信息准则
IF 2.3 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2024-05-18 DOI: 10.1016/j.spasta.2024.100842
Osman Doğan

In this study, we introduce the integrated deviance information criterion (DIC) for nested and non-nested model selection problems in heteroskedastic spatial autoregressive models. In a Bayesian estimation setting, we assume that the idiosyncratic error terms of our spatial autoregressive model have a scale mixture of normal distributions, where the scale mixture variables are latent variables that induce heteroskedasticity. We first derive the integrated likelihood function by analytically integrating out the scale mixture variables from the complete-data likelihood function. We then use the integrated likelihood function to formulate the integrated DIC measure. We investigate the finite sample performance of the integrated DIC in selecting the true model in a simulation study. The simulation results show that the integrated DIC performs satisfactorily and can be useful for selecting the correct model in specification search exercises. Finally, in a spatially augmented economic growth model, we use the integrated DIC to choose the spatial weights matrix that leads to better predictive accuracy.

在本研究中,我们针对异方差空间自回归模型中的嵌套和非嵌套模型选择问题引入了综合偏差信息准则(DIC)。在贝叶斯估计环境下,我们假设空间自回归模型的特异性误差项具有正态分布的尺度混合物,其中尺度混合物变量是引起异方差的潜变量。我们首先从完整数据似然函数中分析积分出尺度混合变量,从而得出积分似然函数。然后,我们使用积分似然函数来制定积分 DIC 度量。我们在模拟研究中考察了综合 DIC 在选择真实模型时的有限样本性能。模拟结果表明,综合 DIC 的性能令人满意,可用于在规范搜索练习中选择正确的模型。最后,在空间增强经济增长模型中,我们利用综合 DIC 选择空间权重矩阵,从而获得更好的预测精度。
{"title":"Integrated deviance information criterion for spatial autoregressive models with heteroskedasticity","authors":"Osman Doğan","doi":"10.1016/j.spasta.2024.100842","DOIUrl":"https://doi.org/10.1016/j.spasta.2024.100842","url":null,"abstract":"<div><p>In this study, we introduce the integrated deviance information criterion (DIC) for nested and non-nested model selection problems in heteroskedastic spatial autoregressive models. In a Bayesian estimation setting, we assume that the idiosyncratic error terms of our spatial autoregressive model have a scale mixture of normal distributions, where the scale mixture variables are latent variables that induce heteroskedasticity. We first derive the integrated likelihood function by analytically integrating out the scale mixture variables from the complete-data likelihood function. We then use the integrated likelihood function to formulate the integrated DIC measure. We investigate the finite sample performance of the integrated DIC in selecting the true model in a simulation study. The simulation results show that the integrated DIC performs satisfactorily and can be useful for selecting the correct model in specification search exercises. Finally, in a spatially augmented economic growth model, we use the integrated DIC to choose the spatial weights matrix that leads to better predictive accuracy.</p></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"61 ","pages":"Article 100842"},"PeriodicalIF":2.3,"publicationDate":"2024-05-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141095822","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Spatial Statistics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1