首页 > 最新文献

Spatial Statistics最新文献

英文 中文
A hypothesis test for detecting spatial patterns in categorical areal data 检测分类面积数据空间模式的假设检验
IF 2.3 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2024-05-04 DOI: 10.1016/j.spasta.2024.100839
Stella Self , Xingpei Zhao , Anja Zgodic , Anna Overby , David White , Alexander C. McLain , Caitlin Dyckman

The vast growth of spatial datasets in recent decades has fueled the development of many statistical methods for detecting spatial patterns. Two of the most commonly studied spatial patterns are clustering, loosely defined as datapoints with similar attributes existing close together, and dispersion, loosely defined as the semi-regular placement of datapoints with similar attributes. In this work, we develop a hypothesis test to detect spatial clustering or dispersion at specific distances in categorical areal data. Such data consists of a set of spatial regions whose boundaries are fixed and known (e.g., counties) associated with a categorical random variable (e.g. whether the county is rural, micropolitan, or metropolitan). We propose a method to extend the positive area proportion function (developed for detecting spatial clustering in binary areal data) to the categorical case. This proposal, referred to as the categorical positive areal proportion function test, can detect various spatial patterns, including homogeneous clusters, heterogeneous clusters, and dispersion. Our approach is the first method capable of distinguishing between different types of clustering in categorical areal data. After validating our method using an extensive simulation study, we use the categorical positive area proportion function test to detect spatial patterns in Boulder County, Colorado USA biological, agricultural, built and open conservation easements.

近几十年来,空间数据集的大量增加推动了许多用于检测空间模式的统计方法的发展。其中最常研究的两种空间模式是聚类和离散,前者宽泛地定义为具有相似属性的数据点紧靠在一起,后者宽泛地定义为具有相似属性的数据点的半规则分布。在这项工作中,我们开发了一种假设检验方法,用于检测分类区域数据中特定距离的空间聚类或分散。此类数据由一组边界固定且已知的空间区域(如县)组成,这些区域与一个分类随机变量(如县是农村、微型城市还是大都市)相关联。我们提出了一种将正面积比例函数(为检测二元面积数据中的空间聚类而开发)扩展到分类情况的方法。该方法被称为分类正面积比例函数检验法,可以检测出各种空间模式,包括同质聚类、异质聚类和离散模式。我们的方法是第一种能够区分分类方差数据中不同类型聚类的方法。在通过大量模拟研究验证了我们的方法后,我们使用分类正面积比例函数检验法检测了美国科罗拉多州博尔德县的生物、农业、建筑和开放式保护地役权的空间模式。
{"title":"A hypothesis test for detecting spatial patterns in categorical areal data","authors":"Stella Self ,&nbsp;Xingpei Zhao ,&nbsp;Anja Zgodic ,&nbsp;Anna Overby ,&nbsp;David White ,&nbsp;Alexander C. McLain ,&nbsp;Caitlin Dyckman","doi":"10.1016/j.spasta.2024.100839","DOIUrl":"https://doi.org/10.1016/j.spasta.2024.100839","url":null,"abstract":"<div><p>The vast growth of spatial datasets in recent decades has fueled the development of many statistical methods for detecting spatial patterns. Two of the most commonly studied spatial patterns are clustering, loosely defined as datapoints with similar attributes existing close together, and dispersion, loosely defined as the semi-regular placement of datapoints with similar attributes. In this work, we develop a hypothesis test to detect spatial clustering or dispersion at specific distances in categorical areal data. Such data consists of a set of spatial regions whose boundaries are fixed and known (e.g., counties) associated with a categorical random variable (e.g. whether the county is rural, micropolitan, or metropolitan). We propose a method to extend the positive area proportion function (developed for detecting spatial clustering in binary areal data) to the categorical case. This proposal, referred to as the categorical positive areal proportion function test, can detect various spatial patterns, including homogeneous clusters, heterogeneous clusters, and dispersion. Our approach is the first method capable of distinguishing between different types of clustering in categorical areal data. After validating our method using an extensive simulation study, we use the categorical positive area proportion function test to detect spatial patterns in Boulder County, Colorado USA biological, agricultural, built and open conservation easements.</p></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"61 ","pages":"Article 100839"},"PeriodicalIF":2.3,"publicationDate":"2024-05-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140906483","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Summary statistics for spatio-temporal point processes on linear networks 线性网络时空点过程的汇总统计
IF 2.3 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2024-05-03 DOI: 10.1016/j.spasta.2024.100840
Mehdi Moradi, Ali Sharifi

We propose novel second/higher-order summary statistics for inhomogeneous spatio-temporal point processes when the spatial locations are limited to a linear network. More specifically, letting the spatial distance between events be measured by a regular distance metric, appropriate forms of K- and J-functions are introduced, and their theoretical relationships are studied. The theoretical forms of our proposed summary statistics are investigated under homogeneity, Poissonness, and independent thinning. Moreover, non-parametric estimators are derived, facilitating the use of our proposed summary statistics to study the spatio-temporal dependence between events. Through simulation studies, we demonstrate that our proposed J-function effectively identifies spatio-temporal clustering, inhibition, and randomness. Finally, we examine spatio-temporal dependencies for street crimes in Valencia, Spain, and traffic accidents in New York, USA.

当空间位置局限于线性网络时,我们为非均质时空点过程提出了新的二阶/高阶汇总统计量。更具体地说,让事件之间的空间距离用常规距离度量来测量,引入适当形式的 K 函数和 J 函数,并研究它们之间的理论关系。在同质性、泊松性和独立稀疏性条件下,研究了我们提出的汇总统计的理论形式。此外,我们还推导出了非参数估计器,便于使用我们提出的汇总统计量来研究事件之间的时空依赖性。通过模拟研究,我们证明了我们提出的 J 函数能有效识别时空聚类、抑制和随机性。最后,我们研究了西班牙巴伦西亚街头犯罪和美国纽约交通事故的时空依赖性。
{"title":"Summary statistics for spatio-temporal point processes on linear networks","authors":"Mehdi Moradi,&nbsp;Ali Sharifi","doi":"10.1016/j.spasta.2024.100840","DOIUrl":"https://doi.org/10.1016/j.spasta.2024.100840","url":null,"abstract":"<div><p>We propose novel second/higher-order summary statistics for inhomogeneous spatio-temporal point processes when the spatial locations are limited to a linear network. More specifically, letting the spatial distance between events be measured by a regular distance metric, appropriate forms of <span><math><mi>K</mi></math></span>- and <span><math><mi>J</mi></math></span>-functions are introduced, and their theoretical relationships are studied. The theoretical forms of our proposed summary statistics are investigated under homogeneity, Poissonness, and independent thinning. Moreover, non-parametric estimators are derived, facilitating the use of our proposed summary statistics to study the spatio-temporal dependence between events. Through simulation studies, we demonstrate that our proposed <span><math><mi>J</mi></math></span>-function effectively identifies spatio-temporal clustering, inhibition, and randomness. Finally, we examine spatio-temporal dependencies for street crimes in Valencia, Spain, and traffic accidents in New York, USA.</p></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"61 ","pages":"Article 100840"},"PeriodicalIF":2.3,"publicationDate":"2024-05-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2211675324000319/pdfft?md5=345d0dcd771c5b3d1a2f681cc0522723&pid=1-s2.0-S2211675324000319-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140893753","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Rapid outlier detection, model selection and variable selection using penalized likelihood estimation for general spatial models 利用一般空间模型的惩罚似然估计快速检测离群值、选择模型和变量
IF 2.3 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2024-04-27 DOI: 10.1016/j.spasta.2024.100834
Yunquan Song, Minglu Fang, Yuanfeng Wang, Yiming Hou

The outliers in the data set have a potential influence on the statistical inference and can provide some useful information behind the data set, the methodology for outlier detection and accommodation is always an important topic in data analysis. For spatial data, its influence not only affects coefficient estimation but model selection. The traditional method usually carries out outlier detection, model selection and variable selection step by step, so the data processing efficiency is not high. In order to further improve the efficiency and accuracy of data processing, based on the general spatial model, we consider a technique to achieve outlier detection, along with model and variable estimation in one step. In the general spatial model, we add a mean shift parameter for each data point to identify outliers. Penalized likelihood estimation (PLE) is proposed to simultaneously detect outliers, and to select spatial models and explanatory variables for spatial data. This method correctly identifies multiple outliers, provides a proper spatial model, and corrects coefficient estimation without removing outliers in numerical simulation and case analysis. Compared to current methods, PLE detects outliers more quickly, and solves the optimization problem to select spatial models and explanatory variables. Calculation is easy using the optimized solnp function in R software.

数据集中的离群值对统计推断有潜在影响,并能提供数据集背后的一些有用信息,因此离群值的检测和容纳方法始终是数据分析中的一个重要课题。对于空间数据而言,其影响不仅会影响系数估计,还会影响模型选择。传统的方法通常是逐步进行离群点检测、模型选择和变量选择,因此数据处理效率不高。为了进一步提高数据处理的效率和准确性,我们在一般空间模型的基础上,考虑采用一种技术来实现离群点检测、模型和变量估计的一步到位。在一般空间模型中,我们为每个数据点添加一个均值偏移参数,以识别离群值。我们提出了惩罚似然估计法(PLE)来同时检测异常值,并为空间数据选择空间模型和解释变量。在数值模拟和案例分析中,该方法能正确识别多个离群值,提供合适的空间模型,并在不去除离群值的情况下修正系数估计。与现有方法相比,PLE 能更快地发现异常值,并解决选择空间模型和解释变量的优化问题。使用 R 软件中的优化 solnp 函数,计算非常简单。
{"title":"Rapid outlier detection, model selection and variable selection using penalized likelihood estimation for general spatial models","authors":"Yunquan Song,&nbsp;Minglu Fang,&nbsp;Yuanfeng Wang,&nbsp;Yiming Hou","doi":"10.1016/j.spasta.2024.100834","DOIUrl":"https://doi.org/10.1016/j.spasta.2024.100834","url":null,"abstract":"<div><p>The outliers in the data set have a potential influence on the statistical inference and can provide some useful information behind the data set, the methodology for outlier detection and accommodation is always an important topic in data analysis. For spatial data, its influence not only affects coefficient estimation but model selection. The traditional method usually carries out outlier detection, model selection and variable selection step by step, so the data processing efficiency is not high. In order to further improve the efficiency and accuracy of data processing, based on the general spatial model, we consider a technique to achieve outlier detection, along with model and variable estimation in one step. In the general spatial model, we add a mean shift parameter for each data point to identify outliers. Penalized likelihood estimation (PLE) is proposed to simultaneously detect outliers, and to select spatial models and explanatory variables for spatial data. This method correctly identifies multiple outliers, provides a proper spatial model, and corrects coefficient estimation without removing outliers in numerical simulation and case analysis. Compared to current methods, PLE detects outliers more quickly, and solves the optimization problem to select spatial models and explanatory variables. Calculation is easy using the optimized solnp function in R software.</p></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"61 ","pages":"Article 100834"},"PeriodicalIF":2.3,"publicationDate":"2024-04-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140815588","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Incremental transfer learning for spatial autoregressive model with linear constraints 具有线性约束条件的空间自回归模型的增量转移学习
IF 2.3 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2024-04-27 DOI: 10.1016/j.spasta.2024.100833
Jie Li, Yunquan Song

Transfer learning is generally regarded as a beneficial technique for utilizing external information to enhance learning performance on target tasks. However, current research on transfer learning in high-dimensional regression models does not take into account both the location information of the data and the explicit utilization of prior knowledge. In the framework of transfer learning, this study seeks to resolve the spatial autoregressive problem and investigate the impact of introducing linear constraints. In this paper, a two-step transfer learning approach and a transferable source detection algorithm based on cross-validation are proposed when the input dimensions of the source and target datasets are the same. When the input dimensions are different, this paper suggests a straightforward and workable incremental transfer learning method. Additionally, for the estimating model developed under this method, Karush–Kuhn–Tucker (KKT) conditions and degrees of freedom are determined, and a Bayesian Information Criterion (BIC) is created for choosing hyperparameters. The effectiveness of the proposed methods is proven by numerical calculations, and the performance of the model in transfer learning estimation is improved by the addition of linear constraints.

迁移学习通常被认为是一种利用外部信息提高目标任务学习成绩的有益技术。然而,目前关于高维回归模型中迁移学习的研究并没有考虑数据的位置信息和先验知识的明确利用。在迁移学习的框架下,本研究试图解决空间自回归问题,并研究引入线性约束的影响。当源数据集和目标数据集的输入维度相同时,本文提出了基于交叉验证的两步迁移学习方法和可迁移源检测算法。当输入维度不同时,本文提出了一种简单可行的增量迁移学习方法。此外,本文还确定了根据该方法建立的估计模型的卡鲁什-库恩-塔克(KKT)条件和自由度,并创建了贝叶斯信息准则(BIC)来选择超参数。通过数值计算证明了所提方法的有效性,并通过添加线性约束提高了模型在迁移学习估计中的性能。
{"title":"Incremental transfer learning for spatial autoregressive model with linear constraints","authors":"Jie Li,&nbsp;Yunquan Song","doi":"10.1016/j.spasta.2024.100833","DOIUrl":"https://doi.org/10.1016/j.spasta.2024.100833","url":null,"abstract":"<div><p>Transfer learning is generally regarded as a beneficial technique for utilizing external information to enhance learning performance on target tasks. However, current research on transfer learning in high-dimensional regression models does not take into account both the location information of the data and the explicit utilization of prior knowledge. In the framework of transfer learning, this study seeks to resolve the spatial autoregressive problem and investigate the impact of introducing linear constraints. In this paper, a two-step transfer learning approach and a transferable source detection algorithm based on cross-validation are proposed when the input dimensions of the source and target datasets are the same. When the input dimensions are different, this paper suggests a straightforward and workable incremental transfer learning method. Additionally, for the estimating model developed under this method, Karush–Kuhn–Tucker (KKT) conditions and degrees of freedom are determined, and a Bayesian Information Criterion (BIC) is created for choosing hyperparameters. The effectiveness of the proposed methods is proven by numerical calculations, and the performance of the model in transfer learning estimation is improved by the addition of linear constraints.</p></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"61 ","pages":"Article 100833"},"PeriodicalIF":2.3,"publicationDate":"2024-04-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140824975","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exploring heterogeneity and dynamics of meteorological influences on US PM2.5: A distributed learning approach with spatiotemporal varying coefficient models 探索气象对美国 PM2.5 影响的异质性和动态性:采用时空变化系数模型的分布式学习方法
IF 2.3 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2024-04-25 DOI: 10.1016/j.spasta.2024.100826
Lily Wang , Guannan Wang , Annie S. Gao

Particulate matter (PM) has emerged as a primary air quality concern due to its substantial impact on human health. Many recent research works suggest that PM2.5 concentrations depend on meteorological conditions. Enhancing current pollution control strategies necessitates a more holistic comprehension of PM2.5 dynamics and the precise quantification of spatiotemporal heterogeneity in the relationship between meteorological factors and PM2.5 levels. The spatiotemporal varying coefficient model stands as a prominent spatial regression technique adept at addressing this heterogeneity. Amidst the challenges posed by the substantial scale of modern spatiotemporal datasets, we propose a pioneering distributed estimation method (DEM) founded on multivariate spline smoothing across a domain’s triangulation. This DEM algorithm ensures an easily implementable, highly scalable, and communication-efficient strategy, demonstrating almost linear speedup potential. We validate the effectiveness of our proposed DEM through extensive simulation studies, demonstrating that it achieves coefficient estimations akin to those of global estimators derived from complete datasets. Applying the proposed model and method to the US daily PM2.5 and meteorological data, we investigate the influence of meteorological variables on PM2.5 concentrations, revealing both spatial and seasonal variations in this relationship.

颗粒物(PM)由于对人类健康有重大影响,已成为空气质量的首要问题。最近的许多研究表明,PM2.5 的浓度取决于气象条件。要加强当前的污染控制策略,就必须更全面地了解 PM2.5 的动态变化,并精确量化气象因素与 PM2.5 浓度之间的时空异质性关系。时空变化系数模型是善于处理这种异质性的一种突出的空间回归技术。面对现代时空数据集的巨大规模所带来的挑战,我们提出了一种开创性的分布式估算方法(DEM),该方法建立在对域的三角剖分进行多元样条平滑的基础上。这种 DEM 算法确保了策略的易实施性、高度可扩展性和通信效率,展示了几乎线性的加速潜力。我们通过大量的模拟研究验证了所提出的 DEM 算法的有效性,证明其系数估算结果与从完整数据集得出的全局估算结果相近。我们将提出的模型和方法应用于美国每日 PM2.5 和气象数据,研究了气象变量对 PM2.5 浓度的影响,揭示了这种关系的空间和季节变化。
{"title":"Exploring heterogeneity and dynamics of meteorological influences on US PM2.5: A distributed learning approach with spatiotemporal varying coefficient models","authors":"Lily Wang ,&nbsp;Guannan Wang ,&nbsp;Annie S. Gao","doi":"10.1016/j.spasta.2024.100826","DOIUrl":"10.1016/j.spasta.2024.100826","url":null,"abstract":"<div><p>Particulate matter (PM) has emerged as a primary air quality concern due to its substantial impact on human health. Many recent research works suggest that PM<sub>2.5</sub> concentrations depend on meteorological conditions. Enhancing current pollution control strategies necessitates a more holistic comprehension of PM<sub>2.5</sub> dynamics and the precise quantification of spatiotemporal heterogeneity in the relationship between meteorological factors and PM<sub>2.5</sub> levels. The spatiotemporal varying coefficient model stands as a prominent spatial regression technique adept at addressing this heterogeneity. Amidst the challenges posed by the substantial scale of modern spatiotemporal datasets, we propose a pioneering distributed estimation method (DEM) founded on multivariate spline smoothing across a domain’s triangulation. This DEM algorithm ensures an easily implementable, highly scalable, and communication-efficient strategy, demonstrating almost linear speedup potential. We validate the effectiveness of our proposed DEM through extensive simulation studies, demonstrating that it achieves coefficient estimations akin to those of global estimators derived from complete datasets. Applying the proposed model and method to the US daily PM<sub>2.5</sub> and meteorological data, we investigate the influence of meteorological variables on PM<sub>2.5</sub> concentrations, revealing both spatial and seasonal variations in this relationship.</p></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"61 ","pages":"Article 100826"},"PeriodicalIF":2.3,"publicationDate":"2024-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140766668","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A nonparametric penalized likelihood approach to density estimation of space–time point patterns 时空点模式密度估计的非参数惩罚似然法
IF 2.3 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2024-04-18 DOI: 10.1016/j.spasta.2024.100824
Blerta Begu , Simone Panzeri , Eleonora Arnone , Michelle Carey , Laura M. Sangalli

In this work, we consider space–time point processes and study their continuous space–time evolution. We propose an innovative nonparametric methodology to estimate the unknown space–time density of the point pattern, or, equivalently, to estimate the intensity of an inhomogeneous space–time Poisson point process. The presented approach combines maximum likelihood estimation with roughness penalties, based on differential operators, defined over the spatial and temporal domains of interest. We first establish some important theoretical properties of the considered estimator, including its consistency. We then develop an efficient and flexible estimation procedure that leverages advanced numerical and computation techniques. Thanks to a discretization based on finite elements in space and B-splines in time, the proposed method can effectively capture complex multi-modal and strongly anisotropic spatio-temporal point patterns; moreover, these point patterns may be observed over planar or curved domains with non-trivial geometries, due to geographic constraints, such as coastal regions with complicated shorelines, or curved regions with complex orography. In addition to providing estimates, the method’s functionalities also include the introduction of appropriate uncertainty quantification tools. We thoroughly validate the proposed method, by means of simulation studies and applications to real-world data. The obtained results highlight significant advantages over state-of-the-art competing approaches.

在这项工作中,我们考虑了时空点过程,并研究了它们的连续时空演变。我们提出了一种创新的非参数方法来估算点模式的未知时空密度,或者等同于估算不均匀时空泊松点过程的强度。所提出的方法将最大似然估计与基于微分算子的粗糙度惩罚相结合,微分算子定义在感兴趣的空间和时间域上。我们首先确定了所考虑的估计器的一些重要理论特性,包括其一致性。然后,我们利用先进的数值和计算技术,开发出一种高效灵活的估算程序。由于采用了基于空间有限元和时间 B-样条的离散化方法,所提出的方法可以有效捕捉复杂的多模式和强各向异性的时空点模式;此外,由于地理条件的限制,这些点模式可能会在具有非三维几何形状的平面或曲面域上观测到,例如具有复杂海岸线的沿海地区或具有复杂地形的曲面区域。除了提供估计值,该方法的功能还包括引入适当的不确定性量化工具。我们通过模拟研究和实际数据应用,对所提出的方法进行了全面验证。所获得的结果凸显了与最先进的竞争方法相比的显著优势。
{"title":"A nonparametric penalized likelihood approach to density estimation of space–time point patterns","authors":"Blerta Begu ,&nbsp;Simone Panzeri ,&nbsp;Eleonora Arnone ,&nbsp;Michelle Carey ,&nbsp;Laura M. Sangalli","doi":"10.1016/j.spasta.2024.100824","DOIUrl":"10.1016/j.spasta.2024.100824","url":null,"abstract":"<div><p>In this work, we consider space–time point processes and study their continuous space–time evolution. We propose an innovative nonparametric methodology to estimate the unknown space–time density of the point pattern, or, equivalently, to estimate the intensity of an inhomogeneous space–time Poisson point process. The presented approach combines maximum likelihood estimation with roughness penalties, based on differential operators, defined over the spatial and temporal domains of interest. We first establish some important theoretical properties of the considered estimator, including its consistency. We then develop an efficient and flexible estimation procedure that leverages advanced numerical and computation techniques. Thanks to a discretization based on finite elements in space and B-splines in time, the proposed method can effectively capture complex multi-modal and strongly anisotropic spatio-temporal point patterns; moreover, these point patterns may be observed over planar or curved domains with non-trivial geometries, due to geographic constraints, such as coastal regions with complicated shorelines, or curved regions with complex orography. In addition to providing estimates, the method’s functionalities also include the introduction of appropriate uncertainty quantification tools. We thoroughly validate the proposed method, by means of simulation studies and applications to real-world data. The obtained results highlight significant advantages over state-of-the-art competing approaches.</p></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"61 ","pages":"Article 100824"},"PeriodicalIF":2.3,"publicationDate":"2024-04-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2211675324000150/pdfft?md5=fcab55472ed3f4b5aa4f0e9b44fe624a&pid=1-s2.0-S2211675324000150-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140757441","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Space, uncertainty, and the environment: honoring the distinguished career of noel Cressie 空间、不确定性与环境:纪念诺埃尔-克雷西的杰出职业生涯
IF 2.3 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2024-04-16 DOI: 10.1016/j.spasta.2024.100835
Alfred Stein , Christopher K. Wikle
{"title":"Space, uncertainty, and the environment: honoring the distinguished career of noel Cressie","authors":"Alfred Stein ,&nbsp;Christopher K. Wikle","doi":"10.1016/j.spasta.2024.100835","DOIUrl":"10.1016/j.spasta.2024.100835","url":null,"abstract":"","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"61 ","pages":"Article 100835"},"PeriodicalIF":2.3,"publicationDate":"2024-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2211675324000265/pdfft?md5=46b044e34139ffbdeaccb37cb6b0fbff&pid=1-s2.0-S2211675324000265-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140758247","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Variable selection methods for Log-Gaussian Cox processes: A case-study on accident data 对数高斯 Cox 过程的变量选择方法:事故数据案例研究
IF 2.3 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2024-04-14 DOI: 10.1016/j.spasta.2024.100831
Cécile Spychala, Clément Dombry, Camelia Goga

In order to prevent and/or forecast road accidents, the statistical modeling of spatial dependence and potential risk factors is a major asset. The main goal of this article is to predict the number of accidents on a certain area by considering georeferenced accident locations crossed with variables characterizing the studied geographical area such as road characteristics as well as sociodemographic and global infrastructure variables. We model the accident point pattern by a spatial log-Gaussian Cox process (LGCP). To reduce the computation burden of LGCP models in this high-dimensional setting, we suggest a two-step procedure: to perform first automatic variable selection methods based on Poisson regression, Poisson aggregation and random forest and in a second step, to use the selected variables and perform LGCP model analysis. The dataset consists in road accidents occurred between 2017 and 2019 in the CAGB (urban community of Besançon), France. Based on LGCP analysis, we are able to identify the principal risk factors of road accidents and risky areas from CAGB region.

为了预防和/或预测道路交通事故,对空间依赖性和潜在风险因素进行统计建模是一项重要资产。本文的主要目标是通过考虑地理参照的事故地点与所研究地理区域的特征变量(如道路特征以及社会人口和全球基础设施变量)交叉,预测某一区域的事故数量。我们通过空间对数-高斯考克斯过程(LGCP)对事故点模式进行建模。为了减轻 LGCP 模型在这种高维环境下的计算负担,我们建议分两步进行:第一步是基于泊松回归、泊松聚合和随机森林的自动变量选择方法,第二步是使用所选变量并进行 LGCP 模型分析。数据集包括 2017 年至 2019 年期间在法国 CAGB(贝桑松城市社区)发生的交通事故。基于 LGCP 分析,我们能够确定 CAGB 地区道路事故的主要风险因素和风险区域。
{"title":"Variable selection methods for Log-Gaussian Cox processes: A case-study on accident data","authors":"Cécile Spychala,&nbsp;Clément Dombry,&nbsp;Camelia Goga","doi":"10.1016/j.spasta.2024.100831","DOIUrl":"10.1016/j.spasta.2024.100831","url":null,"abstract":"<div><p>In order to prevent and/or forecast road accidents, the statistical modeling of spatial dependence and potential risk factors is a major asset. The main goal of this article is to predict the number of accidents on a certain area by considering georeferenced accident locations crossed with variables characterizing the studied geographical area such as road characteristics as well as sociodemographic and global infrastructure variables. We model the accident point pattern by a spatial log-Gaussian Cox process (LGCP). To reduce the computation burden of LGCP models in this high-dimensional setting, we suggest a two-step procedure: to perform first automatic variable selection methods based on Poisson regression, Poisson aggregation and random forest and in a second step, to use the selected variables and perform LGCP model analysis. The dataset consists in road accidents occurred between 2017 and 2019 in the CAGB (urban community of Besançon), France. Based on LGCP analysis, we are able to identify the principal risk factors of road accidents and risky areas from CAGB region.</p></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"61 ","pages":"Article 100831"},"PeriodicalIF":2.3,"publicationDate":"2024-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140756592","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Probabilistic Context Neighborhood model for lattices 网格的概率上下文邻域模型
IF 2.3 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2024-04-13 DOI: 10.1016/j.spasta.2024.100830
Denise Duarte , Débora F. Magalhães , Aline M. Piroutek , Caio Alves

We present the Probabilistic Context Neighborhood model designed for two-dimensional lattices as a variation of a Markov random field assuming discrete values. In this model, the neighborhood structure has a fixed geometry but a variable order, depending on the neighbors’ values. Our model extends the Probabilistic Context Tree model, originally applicable to one-dimensional space. It retains advantageous properties, such as representing the dependence neighborhood structure as a graph in a tree format, facilitating an understanding of model complexity. Furthermore, we adapt the algorithm used to estimate the Probabilistic Context Tree to estimate the parameters of the proposed model. We illustrate the accuracy of our estimation methodology through simulation studies. Additionally, we apply the Probabilistic Context Neighborhood model to spatial real-world data, showcasing its practical utility.

我们介绍了为二维网格设计的概率上下文邻域模型,它是马尔可夫随机场假设离散值的一种变体。在这一模型中,邻域结构具有固定的几何形状,但顺序可变,这取决于邻域的值。我们的模型扩展了最初适用于一维空间的概率上下文树模型。它保留了一些有利的特性,如以树形格式将依赖邻域结构表示为图形,从而便于理解模型的复杂性。此外,我们调整了用于估计概率上下文树的算法,以估计所提模型的参数。我们通过模拟研究说明了估算方法的准确性。此外,我们还将概率内涵邻接模型应用于现实世界的空间数据,展示了该模型的实用性。
{"title":"Probabilistic Context Neighborhood model for lattices","authors":"Denise Duarte ,&nbsp;Débora F. Magalhães ,&nbsp;Aline M. Piroutek ,&nbsp;Caio Alves","doi":"10.1016/j.spasta.2024.100830","DOIUrl":"https://doi.org/10.1016/j.spasta.2024.100830","url":null,"abstract":"<div><p>We present the Probabilistic Context Neighborhood model designed for two-dimensional lattices as a variation of a Markov random field assuming discrete values. In this model, the neighborhood structure has a fixed geometry but a variable order, depending on the neighbors’ values. Our model extends the Probabilistic Context Tree model, originally applicable to one-dimensional space. It retains advantageous properties, such as representing the dependence neighborhood structure as a graph in a tree format, facilitating an understanding of model complexity. Furthermore, we adapt the algorithm used to estimate the Probabilistic Context Tree to estimate the parameters of the proposed model. We illustrate the accuracy of our estimation methodology through simulation studies. Additionally, we apply the Probabilistic Context Neighborhood model to spatial real-world data, showcasing its practical utility.</p></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"61 ","pages":"Article 100830"},"PeriodicalIF":2.3,"publicationDate":"2024-04-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140558031","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Searching for correct specification in spatial probit models. Classical approaches versus Gradient Boosting algorithm 在空间概率模型中寻找正确的规范。经典方法与梯度提升算法
IF 2.3 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2024-04-06 DOI: 10.1016/j.spasta.2024.100815
Miguel De la Llave , Fernando A. López

Selecting correct specification in spatial model frameworks is a relevant research topic in spatial econometrics. The purpose of this paper is to examine and contrast two well-known model selection strategies, Specific-to-General, Stge, and General-to-Specific, Gets, in the context of spatial probit models. The results obtained from these classical methods are juxtaposed with those generated through the utilization of a powerful machine learning algorithm: Gradient Boosting. The paper includes an extensive Monte Carlo experiment to compare the performance of these three strategies with small and medium sample sizes. The results show that under ideal conditions, both classical strategies obtain similar results for medium-sized samples, but for small samples, Stge performs slightly better than Gets. The Gradient Boosting algorithm obtains slightly higher success rates than the classical strategies, especially with small samples sizes. Finally, the flow of both strategies is illustrated using a well-known dataset on the probability of businesses reopening in New Orleans in the aftermath of Hurricane Katrina.

在空间模型框架中选择正确的规范是空间计量经济学的一个相关研究课题。本文旨在以空间 probit 模型为背景,研究和对比两种著名的模型选择策略:Specific-to-General(Stge)和 General-to-Specific(Gets)。本文将这些经典方法得出的结果与利用强大的机器学习算法得出的结果进行对比:梯度提升。论文包括一个广泛的蒙特卡罗实验,以比较这三种策略在中小样本量下的性能。结果表明,在理想条件下,两种经典策略在中等样本量时都能获得相似的结果,但在小样本量时,Stge 的表现略好于 Gets。梯度提升算法的成功率略高于经典策略,尤其是在样本量较小的情况下。最后,我们使用一个著名的数据集来说明这两种策略的流程,该数据集涉及卡特里娜飓风过后新奥尔良企业重新开业的概率。
{"title":"Searching for correct specification in spatial probit models. Classical approaches versus Gradient Boosting algorithm","authors":"Miguel De la Llave ,&nbsp;Fernando A. López","doi":"10.1016/j.spasta.2024.100815","DOIUrl":"https://doi.org/10.1016/j.spasta.2024.100815","url":null,"abstract":"<div><p>Selecting correct specification in spatial model frameworks is a relevant research topic in spatial econometrics. The purpose of this paper is to examine and contrast two well-known model selection strategies, Specific-to-General, Stge, and General-to-Specific, Gets, in the context of spatial probit models. The results obtained from these classical methods are juxtaposed with those generated through the utilization of a powerful machine learning algorithm: Gradient Boosting. The paper includes an extensive Monte Carlo experiment to compare the performance of these three strategies with small and medium sample sizes. The results show that under ideal conditions, both classical strategies obtain similar results for medium-sized samples, but for small samples, Stge performs slightly better than Gets. The Gradient Boosting algorithm obtains slightly higher success rates than the classical strategies, especially with small samples sizes. Finally, the flow of both strategies is illustrated using a well-known dataset on the probability of businesses reopening in New Orleans in the aftermath of Hurricane Katrina.</p></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"61 ","pages":"Article 100815"},"PeriodicalIF":2.3,"publicationDate":"2024-04-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140644629","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Spatial Statistics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1