Pub Date : 2024-04-16DOI: 10.1016/j.spasta.2024.100835
Alfred Stein , Christopher K. Wikle
{"title":"Space, uncertainty, and the environment: honoring the distinguished career of noel Cressie","authors":"Alfred Stein , Christopher K. Wikle","doi":"10.1016/j.spasta.2024.100835","DOIUrl":"10.1016/j.spasta.2024.100835","url":null,"abstract":"","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":null,"pages":null},"PeriodicalIF":2.3,"publicationDate":"2024-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2211675324000265/pdfft?md5=46b044e34139ffbdeaccb37cb6b0fbff&pid=1-s2.0-S2211675324000265-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140758247","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-04-14DOI: 10.1016/j.spasta.2024.100831
Cécile Spychala, Clément Dombry, Camelia Goga
In order to prevent and/or forecast road accidents, the statistical modeling of spatial dependence and potential risk factors is a major asset. The main goal of this article is to predict the number of accidents on a certain area by considering georeferenced accident locations crossed with variables characterizing the studied geographical area such as road characteristics as well as sociodemographic and global infrastructure variables. We model the accident point pattern by a spatial log-Gaussian Cox process (LGCP). To reduce the computation burden of LGCP models in this high-dimensional setting, we suggest a two-step procedure: to perform first automatic variable selection methods based on Poisson regression, Poisson aggregation and random forest and in a second step, to use the selected variables and perform LGCP model analysis. The dataset consists in road accidents occurred between 2017 and 2019 in the CAGB (urban community of Besançon), France. Based on LGCP analysis, we are able to identify the principal risk factors of road accidents and risky areas from CAGB region.
{"title":"Variable selection methods for Log-Gaussian Cox processes: A case-study on accident data","authors":"Cécile Spychala, Clément Dombry, Camelia Goga","doi":"10.1016/j.spasta.2024.100831","DOIUrl":"10.1016/j.spasta.2024.100831","url":null,"abstract":"<div><p>In order to prevent and/or forecast road accidents, the statistical modeling of spatial dependence and potential risk factors is a major asset. The main goal of this article is to predict the number of accidents on a certain area by considering georeferenced accident locations crossed with variables characterizing the studied geographical area such as road characteristics as well as sociodemographic and global infrastructure variables. We model the accident point pattern by a spatial log-Gaussian Cox process (LGCP). To reduce the computation burden of LGCP models in this high-dimensional setting, we suggest a two-step procedure: to perform first automatic variable selection methods based on Poisson regression, Poisson aggregation and random forest and in a second step, to use the selected variables and perform LGCP model analysis. The dataset consists in road accidents occurred between 2017 and 2019 in the CAGB (urban community of Besançon), France. Based on LGCP analysis, we are able to identify the principal risk factors of road accidents and risky areas from CAGB region.</p></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":null,"pages":null},"PeriodicalIF":2.3,"publicationDate":"2024-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140756592","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-04-13DOI: 10.1016/j.spasta.2024.100830
Denise Duarte , Débora F. Magalhães , Aline M. Piroutek , Caio Alves
We present the Probabilistic Context Neighborhood model designed for two-dimensional lattices as a variation of a Markov random field assuming discrete values. In this model, the neighborhood structure has a fixed geometry but a variable order, depending on the neighbors’ values. Our model extends the Probabilistic Context Tree model, originally applicable to one-dimensional space. It retains advantageous properties, such as representing the dependence neighborhood structure as a graph in a tree format, facilitating an understanding of model complexity. Furthermore, we adapt the algorithm used to estimate the Probabilistic Context Tree to estimate the parameters of the proposed model. We illustrate the accuracy of our estimation methodology through simulation studies. Additionally, we apply the Probabilistic Context Neighborhood model to spatial real-world data, showcasing its practical utility.
{"title":"Probabilistic Context Neighborhood model for lattices","authors":"Denise Duarte , Débora F. Magalhães , Aline M. Piroutek , Caio Alves","doi":"10.1016/j.spasta.2024.100830","DOIUrl":"https://doi.org/10.1016/j.spasta.2024.100830","url":null,"abstract":"<div><p>We present the Probabilistic Context Neighborhood model designed for two-dimensional lattices as a variation of a Markov random field assuming discrete values. In this model, the neighborhood structure has a fixed geometry but a variable order, depending on the neighbors’ values. Our model extends the Probabilistic Context Tree model, originally applicable to one-dimensional space. It retains advantageous properties, such as representing the dependence neighborhood structure as a graph in a tree format, facilitating an understanding of model complexity. Furthermore, we adapt the algorithm used to estimate the Probabilistic Context Tree to estimate the parameters of the proposed model. We illustrate the accuracy of our estimation methodology through simulation studies. Additionally, we apply the Probabilistic Context Neighborhood model to spatial real-world data, showcasing its practical utility.</p></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":null,"pages":null},"PeriodicalIF":2.3,"publicationDate":"2024-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140558031","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-04-06DOI: 10.1016/j.spasta.2024.100815
Miguel De la Llave , Fernando A. López
Selecting correct specification in spatial model frameworks is a relevant research topic in spatial econometrics. The purpose of this paper is to examine and contrast two well-known model selection strategies, Specific-to-General, Stge, and General-to-Specific, Gets, in the context of spatial probit models. The results obtained from these classical methods are juxtaposed with those generated through the utilization of a powerful machine learning algorithm: Gradient Boosting. The paper includes an extensive Monte Carlo experiment to compare the performance of these three strategies with small and medium sample sizes. The results show that under ideal conditions, both classical strategies obtain similar results for medium-sized samples, but for small samples, Stge performs slightly better than Gets. The Gradient Boosting algorithm obtains slightly higher success rates than the classical strategies, especially with small samples sizes. Finally, the flow of both strategies is illustrated using a well-known dataset on the probability of businesses reopening in New Orleans in the aftermath of Hurricane Katrina.
{"title":"Searching for correct specification in spatial probit models. Classical approaches versus Gradient Boosting algorithm","authors":"Miguel De la Llave , Fernando A. López","doi":"10.1016/j.spasta.2024.100815","DOIUrl":"https://doi.org/10.1016/j.spasta.2024.100815","url":null,"abstract":"<div><p>Selecting correct specification in spatial model frameworks is a relevant research topic in spatial econometrics. The purpose of this paper is to examine and contrast two well-known model selection strategies, Specific-to-General, Stge, and General-to-Specific, Gets, in the context of spatial probit models. The results obtained from these classical methods are juxtaposed with those generated through the utilization of a powerful machine learning algorithm: Gradient Boosting. The paper includes an extensive Monte Carlo experiment to compare the performance of these three strategies with small and medium sample sizes. The results show that under ideal conditions, both classical strategies obtain similar results for medium-sized samples, but for small samples, Stge performs slightly better than Gets. The Gradient Boosting algorithm obtains slightly higher success rates than the classical strategies, especially with small samples sizes. Finally, the flow of both strategies is illustrated using a well-known dataset on the probability of businesses reopening in New Orleans in the aftermath of Hurricane Katrina.</p></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":null,"pages":null},"PeriodicalIF":2.3,"publicationDate":"2024-04-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140644629","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-04-01DOI: 10.1016/j.spasta.2024.100827
Nicholas Grieshop, Christopher K. Wikle
Binary spatio-temporal data are common in many application areas. Such data can be considered from many perspectives, including via deterministic or stochastic cellular automata (CA), where local rules govern the transition probabilities that describe the evolution of the 0 and 1 states across space and time. One implementation of a stochastic CA for such data is via a spatio-temporal generalized linear model (or mixed model), with the local rule covariates being included in the transformed mean response. However, in many applications we do have a complete understanding of the local rules and must instead explore the rules space, which can be accomplished through symbolic regression. Even with a learned rule space, the data-driven rules may be insufficient to describe the process behavior and it is helpful to augment the transformed linear predictor with a latent spatio-temporal dynamic process. Here, we demonstrate for the first time that an echo state network (ESN) latent process can be used to enhance symbolic regression-learned local rule covariates. We implement this in a hierarchical Bayesian framework with regularized horseshoe priors on the ESN output weight matrices, which extends the ESN literature as well. Finally, we gain added expressiveness from the ESNs by considering an ensemble of ESN reservoirs, which we accommodate through weighted model averaging, which is also new to the ESN literature. We demonstrate our methodology on a simulated process in which we assume we do not know all of the local CA rules, as well as on multiple environmental data sets.
{"title":"Echo state network-enhanced symbolic regression for spatio-temporal binary stochastic cellular automata","authors":"Nicholas Grieshop, Christopher K. Wikle","doi":"10.1016/j.spasta.2024.100827","DOIUrl":"https://doi.org/10.1016/j.spasta.2024.100827","url":null,"abstract":"<div><p>Binary spatio-temporal data are common in many application areas. Such data can be considered from many perspectives, including via deterministic or stochastic cellular automata (CA), where local rules govern the transition probabilities that describe the evolution of the 0 and 1 states across space and time. One implementation of a stochastic CA for such data is via a spatio-temporal generalized linear model (or mixed model), with the local rule covariates being included in the transformed mean response. However, in many applications we do have a complete understanding of the local rules and must instead explore the rules space, which can be accomplished through symbolic regression. Even with a learned rule space, the data-driven rules may be insufficient to describe the process behavior and it is helpful to augment the transformed linear predictor with a latent spatio-temporal dynamic process. Here, we demonstrate for the first time that an echo state network (ESN) latent process can be used to enhance symbolic regression-learned local rule covariates. We implement this in a hierarchical Bayesian framework with regularized horseshoe priors on the ESN output weight matrices, which extends the ESN literature as well. Finally, we gain added expressiveness from the ESNs by considering an ensemble of ESN reservoirs, which we accommodate through weighted model averaging, which is also new to the ESN literature. We demonstrate our methodology on a simulated process in which we assume we do not know all of the local CA rules, as well as on multiple environmental data sets.</p></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":null,"pages":null},"PeriodicalIF":2.3,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140343882","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-04-01DOI: 10.1016/j.spasta.2024.100829
Alan R. Pearse, Noel Cressie, David Gunawan
This article studies the use of asymmetric loss functions for the optimal prediction of positive-valued spatial processes. We focus on the family of power-divergence loss functions with properties such as continuity, convexity, connections to well known divergence measures, and the ability to control the asymmetry and behaviour of the loss function via a power parameter. The properties of power-divergence loss functions, optimal power-divergence (OPD) spatial predictors, and related measures of uncertainty quantification are studied. In addition, we examine in general the notion of asymmetry in loss functions defined for positive-valued spatial processes and define an asymmetry measure, which we apply to the family of power-divergence loss functions and other common loss functions. The paper concludes with a simulation study comparing the optimal power-divergence predictor to predictors derived from other common loss functions. Finally, we illustrate OPD spatial prediction on a dataset of zinc measurements in the soil of a floodplain of the Meuse River, Netherlands.
{"title":"Optimal prediction of positive-valued spatial processes: Asymmetric power-divergence loss","authors":"Alan R. Pearse, Noel Cressie, David Gunawan","doi":"10.1016/j.spasta.2024.100829","DOIUrl":"https://doi.org/10.1016/j.spasta.2024.100829","url":null,"abstract":"<div><p>This article studies the use of asymmetric loss functions for the optimal prediction of positive-valued spatial processes. We focus on the family of power-divergence loss functions with properties such as continuity, convexity, connections to well known divergence measures, and the ability to control the asymmetry and behaviour of the loss function via a power parameter. The properties of power-divergence loss functions, optimal power-divergence (OPD) spatial predictors, and related measures of uncertainty quantification are studied. In addition, we examine in general the notion of asymmetry in loss functions defined for positive-valued spatial processes and define an asymmetry measure, which we apply to the family of power-divergence loss functions and other common loss functions. The paper concludes with a simulation study comparing the optimal power-divergence predictor to predictors derived from other common loss functions. Finally, we illustrate OPD spatial prediction on a dataset of zinc measurements in the soil of a floodplain of the Meuse River, Netherlands.</p></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":null,"pages":null},"PeriodicalIF":2.3,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2211675324000204/pdfft?md5=871aab873ba22ac88102220b2fa98a6a&pid=1-s2.0-S2211675324000204-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140540607","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-04-01DOI: 10.1016/j.spasta.2024.100822
Marianne Abémgnigni Njifon , Dominic Schuhmacher
Several deep learning methods for spatial data have been developed that report good performance in a big data setting. These methods typically require the choice of an appropriate kernel and some tuning of hyperparameters, which are contributing reasons for poor performance on smaller data sets.
In this paper, we propose a mathematical construction of a graph-based neural network for spatial prediction that substantially generalizes the KCN model in [Appleby, Liu and Liu (2020). Kriging convolutional networks. In Proc. AAAI Conf. AI 34, pp. 3187–3194]. In particular, our model, referred to as SPONGE, allows for integrated learning of the convolutional kernel, admits higher order neighborhood structures and can make use of the distance between locations in the neighborhood and between labels of neighboring nodes. All of this yields higher flexibility in capturing spatial correlations.
We investigate in simulation studies including small, medium and (reasonably) large data sets in what situations and to what extent SPONGE comes close to or (if the conditions for optimality are violated) even beats universal Kriging, whose predictions incur a high computational cost if is large. Furthermore we study the improvement for general SPONGE in comparison with the usual KCN.
Finally, we compare various graph-based neural network models on larger real world data sets and apply our method to the prediction of soil organic carbon in the southern part of Malawi.
{"title":"Graph convolutional networks for spatial interpolation of correlated data","authors":"Marianne Abémgnigni Njifon , Dominic Schuhmacher","doi":"10.1016/j.spasta.2024.100822","DOIUrl":"10.1016/j.spasta.2024.100822","url":null,"abstract":"<div><p>Several deep learning methods for spatial data have been developed that report good performance in a big data setting. These methods typically require the choice of an appropriate kernel and some tuning of hyperparameters, which are contributing reasons for poor performance on smaller data sets.</p><p>In this paper, we propose a mathematical construction of a graph-based neural network for spatial prediction that substantially generalizes the KCN model in [Appleby, Liu and Liu (2020). Kriging convolutional networks. In <em>Proc. AAAI Conf. AI</em> 34, pp. 3187–3194]. In particular, our model, referred to as SPONGE, allows for integrated learning of the convolutional kernel, admits higher order neighborhood structures and can make use of the distance between locations in the neighborhood and between labels of neighboring nodes. All of this yields higher flexibility in capturing spatial correlations.</p><p>We investigate in simulation studies including small, medium and (reasonably) large data sets in what situations and to what extent SPONGE comes close to or (if the conditions for optimality are violated) even beats universal Kriging, whose predictions incur a high computational cost if <span><math><mi>n</mi></math></span> is large. Furthermore we study the improvement for general SPONGE in comparison with the usual KCN.</p><p>Finally, we compare various graph-based neural network models on larger real world data sets and apply our method to the prediction of soil organic carbon in the southern part of Malawi.</p></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":null,"pages":null},"PeriodicalIF":2.3,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2211675324000137/pdfft?md5=69156f8ee056977ff579b35133b1c08a&pid=1-s2.0-S2211675324000137-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140203769","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-04-01DOI: 10.1016/j.spasta.2024.100821
Ruoyong Xu, Patrick Brown
Profile likelihoods are rarely used in geostatistical models due to the computational burden imposed by repeated decompositions of large variance matrices. Accounting for uncertainty in covariance parameters can be highly consequential in geostatistical models as some covariance parameters are poorly identified, the problem is severe enough that the differentiability parameter of the Matern correlation function is typically treated as fixed. The problem is compounded with anisotropic spatial models as there are two additional parameters to consider. In this paper, we make the following contributions: Firstly, a methodology is created for profile likelihoods for Gaussian spatial models with Matérn family of correlation functions, including anisotropic models. This methodology adopts a novel reparameterization for generation of representative points, and uses GPUs for parallel profile likelihoods computation in software implementation. Then, we show the profile likelihood of the Matérn shape parameter is often quite flat but still identifiable, it can usually rule out very small values. Finally, simulation studies and applications on real data examples show that profile-based confidence intervals of covariance parameters and regression parameters have superior coverage to the traditional standard Wald type confidence intervals.
{"title":"Profile likelihoods for parameters in trans-Gaussian geostatistical models","authors":"Ruoyong Xu, Patrick Brown","doi":"10.1016/j.spasta.2024.100821","DOIUrl":"https://doi.org/10.1016/j.spasta.2024.100821","url":null,"abstract":"<div><p>Profile likelihoods are rarely used in geostatistical models due to the computational burden imposed by repeated decompositions of large variance matrices. Accounting for uncertainty in covariance parameters can be highly consequential in geostatistical models as some covariance parameters are poorly identified, the problem is severe enough that the differentiability parameter of the Matern correlation function is typically treated as fixed. The problem is compounded with anisotropic spatial models as there are two additional parameters to consider. In this paper, we make the following contributions: Firstly, a methodology is created for profile likelihoods for Gaussian spatial models with Matérn family of correlation functions, including anisotropic models. This methodology adopts a novel reparameterization for generation of representative points, and uses GPUs for parallel profile likelihoods computation in software implementation. Then, we show the profile likelihood of the Matérn shape parameter is often quite flat but still identifiable, it can usually rule out very small values. Finally, simulation studies and applications on real data examples show that profile-based confidence intervals of covariance parameters and regression parameters have superior coverage to the traditional standard Wald type confidence intervals.</p></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":null,"pages":null},"PeriodicalIF":2.3,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2211675324000125/pdfft?md5=45a72d58be1d0dec5797130ec2cd5b10&pid=1-s2.0-S2211675324000125-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140349720","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-04-01DOI: 10.1016/j.spasta.2024.100825
Andrew Zammit-Mangion , Michael D. Kaminski , Ba-Hien Tran , Maurizio Filippone , Noel Cressie
Statistical models for spatial processes play a central role in analyses of spatial data. Yet, it is the simple, interpretable, and well understood models that are routinely employed even though, as is revealed through prior and posterior predictive checks, these can poorly characterise the spatial heterogeneity in the underlying process of interest. Here, we propose a new, flexible class of spatial-process models, which we refer to as spatial Bayesian neural networks (SBNNs). An SBNN leverages the representational capacity of a Bayesian neural network; it is tailored to a spatial setting by incorporating a spatial “embedding layer” into the network and, possibly, spatially-varying network parameters. An SBNN is calibrated by matching its finite-dimensional distribution at locations on a fine gridding of space to that of a target process of interest. That process could be easy to simulate from or we may have many realisations from it. We propose several variants of SBNNs, most of which are able to match the finite-dimensional distribution of the target process at the selected grid better than conventional BNNs of similar complexity. We also show that an SBNN can be used to represent a variety of spatial processes often used in practice, such as Gaussian processes, lognormal processes, and max-stable processes. We briefly discuss the tools that could be used to make inference with SBNNs, and we conclude with a discussion of their advantages and limitations.
{"title":"Spatial Bayesian neural networks","authors":"Andrew Zammit-Mangion , Michael D. Kaminski , Ba-Hien Tran , Maurizio Filippone , Noel Cressie","doi":"10.1016/j.spasta.2024.100825","DOIUrl":"https://doi.org/10.1016/j.spasta.2024.100825","url":null,"abstract":"<div><p>Statistical models for spatial processes play a central role in analyses of spatial data. Yet, it is the simple, interpretable, and well understood models that are routinely employed even though, as is revealed through prior and posterior predictive checks, these can poorly characterise the spatial heterogeneity in the underlying process of interest. Here, we propose a new, flexible class of spatial-process models, which we refer to as spatial Bayesian neural networks (SBNNs). An SBNN leverages the representational capacity of a Bayesian neural network; it is tailored to a spatial setting by incorporating a spatial “embedding layer” into the network and, possibly, spatially-varying network parameters. An SBNN is calibrated by matching its finite-dimensional distribution at locations on a fine gridding of space to that of a target process of interest. That process could be easy to simulate from or we may have many realisations from it. We propose several variants of SBNNs, most of which are able to match the finite-dimensional distribution of the target process at the selected grid better than conventional BNNs of similar complexity. We also show that an SBNN can be used to represent a variety of spatial processes often used in practice, such as Gaussian processes, lognormal processes, and max-stable processes. We briefly discuss the tools that could be used to make inference with SBNNs, and we conclude with a discussion of their advantages and limitations.</p></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":null,"pages":null},"PeriodicalIF":2.3,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2211675324000162/pdfft?md5=9797c18031f7060a28da14735b49c179&pid=1-s2.0-S2211675324000162-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140543633","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-04-01DOI: 10.1016/j.spasta.2024.100832
Alvaro Alexander Burbano-Moreno, Vinícius Diniz Mayrink
Spatial Functional Data (SFD) analysis is an emerging statistical framework that combines Functional Data Analysis (FDA) and spatial dependency modeling. Unlike traditional statistical methods, which treat data as scalar values or vectors, SFD considers data as continuous functions, allowing for a more comprehensive understanding of their behavior and variability. This approach is well-suited for analyzing data collected over time, space, or any other continuous domain. SFD has found applications in various fields, including economics, finance, medicine, environmental science, and engineering. This study proposes new functional Gaussian models incorporating spatial dependence structures, focusing on irregularly spaced data and reflecting spatially correlated curves. The model is based on Bernstein polynomial (BP) basis functions and utilizes a Bayesian approach for estimating unknown quantities and parameters. The paper explores the advantages and limitations of the BP model in capturing complex shapes and patterns while ensuring numerical stability. The main contributions of this work include the development of an innovative model designed for SFD using BP, the presence of a random effect to address associations between irregularly spaced observations, and a comprehensive simulation study to evaluate models’ performance under various scenarios. The work also presents one real application of Temperature in Mexico City, showcasing practical illustrations of the proposed model.
空间函数数据(SFD)分析是一种新兴的统计框架,它结合了函数数据分析(FDA)和空间依赖性建模。与将数据视为标量值或向量的传统统计方法不同,SFD 将数据视为连续函数,从而可以更全面地了解数据的行为和可变性。这种方法非常适合分析在时间、空间或任何其他连续领域收集的数据。SFD 已在经济、金融、医学、环境科学和工程学等多个领域得到应用。本研究提出了包含空间依赖结构的新函数高斯模型,重点关注不规则间距数据和反映空间相关曲线。该模型基于伯恩斯坦多项式(BP)基函数,利用贝叶斯方法估计未知量和参数。论文探讨了 BP 模型在捕捉复杂形状和模式的同时确保数值稳定性方面的优势和局限性。这项工作的主要贡献包括:利用贝叶斯方法开发了一种专为 SFD 设计的创新模型;随机效应的存在解决了不规则间距观测值之间的关联问题;综合模拟研究评估了模型在各种情况下的性能。这项工作还介绍了墨西哥城温度的一个实际应用,展示了拟议模型的实际说明。
{"title":"Spatial Functional Data analysis: Irregular spacing and Bernstein polynomials","authors":"Alvaro Alexander Burbano-Moreno, Vinícius Diniz Mayrink","doi":"10.1016/j.spasta.2024.100832","DOIUrl":"https://doi.org/10.1016/j.spasta.2024.100832","url":null,"abstract":"<div><p>Spatial Functional Data (SFD) analysis is an emerging statistical framework that combines Functional Data Analysis (FDA) and spatial dependency modeling. Unlike traditional statistical methods, which treat data as scalar values or vectors, SFD considers data as continuous functions, allowing for a more comprehensive understanding of their behavior and variability. This approach is well-suited for analyzing data collected over time, space, or any other continuous domain. SFD has found applications in various fields, including economics, finance, medicine, environmental science, and engineering. This study proposes new functional Gaussian models incorporating spatial dependence structures, focusing on irregularly spaced data and reflecting spatially correlated curves. The model is based on Bernstein polynomial (BP) basis functions and utilizes a Bayesian approach for estimating unknown quantities and parameters. The paper explores the advantages and limitations of the BP model in capturing complex shapes and patterns while ensuring numerical stability. The main contributions of this work include the development of an innovative model designed for SFD using BP, the presence of a random effect to address associations between irregularly spaced observations, and a comprehensive simulation study to evaluate models’ performance under various scenarios. The work also presents one real application of Temperature in Mexico City, showcasing practical illustrations of the proposed model.</p></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":null,"pages":null},"PeriodicalIF":2.3,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140554300","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}