首页 > 最新文献

Spatial Statistics最新文献

英文 中文
A framework for analysing point patterns on nonconvex domains using visibility graphs and multidimensional scaling 一个使用可见性图和多维尺度分析非凸域上点模式的框架
IF 2.5 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2025-12-01 Epub Date: 2025-09-25 DOI: 10.1016/j.spasta.2025.100935
Kabelo Mahloromela, Inger Fabris-Rotelli
A point pattern is typically analysed to understand the first- and second-order properties of the underlying point process. These properties are usually inferred using estimation procedures that depend on interpoint distance and are thus sensitive to the choice of distance metric. Euclidean distance is conventionally used to quantify proximity between points, but it does not accurately reflect spatial relationships when points are constrained within irregular, nonconvex spatial domains. Herein, we propose a strategy to embed visibility graph distances into Euclidean metric space using multidimensional scaling. The aim is to simplify analyses, leverage well-developed methods based on Euclidean distance, and retain, as far as possible, the true proximity relationships on a nonconvex spatial domain. The kernel smoothed intensity estimate and the K-function are computed in this new spatial context and used to validate the effectiveness of the embedding strategy.
分析点模式通常是为了了解底层点过程的一阶和二阶性质。这些属性通常使用依赖于点间距离的估计过程来推断,因此对距离度量的选择很敏感。欧几里得距离通常用于量化点之间的接近程度,但是当点被约束在不规则的非凸空间域中时,它不能准确地反映空间关系。本文提出了一种利用多维尺度将可见性图距离嵌入欧氏度量空间的策略。其目的是简化分析,利用基于欧几里得距离的成熟方法,并尽可能地保留非凸空间域上的真正接近关系。在新的空间环境下计算核平滑强度估计和k函数,并用于验证嵌入策略的有效性。
{"title":"A framework for analysing point patterns on nonconvex domains using visibility graphs and multidimensional scaling","authors":"Kabelo Mahloromela,&nbsp;Inger Fabris-Rotelli","doi":"10.1016/j.spasta.2025.100935","DOIUrl":"10.1016/j.spasta.2025.100935","url":null,"abstract":"<div><div>A point pattern is typically analysed to understand the first- and second-order properties of the underlying point process. These properties are usually inferred using estimation procedures that depend on interpoint distance and are thus sensitive to the choice of distance metric. Euclidean distance is conventionally used to quantify proximity between points, but it does not accurately reflect spatial relationships when points are constrained within irregular, nonconvex spatial domains. Herein, we propose a strategy to embed visibility graph distances into Euclidean metric space using multidimensional scaling. The aim is to simplify analyses, leverage well-developed methods based on Euclidean distance, and retain, as far as possible, the true proximity relationships on a nonconvex spatial domain. The kernel smoothed intensity estimate and the <span><math><mi>K</mi></math></span>-function are computed in this new spatial context and used to validate the effectiveness of the embedding strategy.</div></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"70 ","pages":"Article 100935"},"PeriodicalIF":2.5,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145222040","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bandwidth selection for the intensity in spatial point processes 空间点过程中强度的带宽选择
IF 2.5 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2025-12-01 Epub Date: 2025-09-25 DOI: 10.1016/j.spasta.2025.100928
Yangha Chung , Ji Meng Loh , Woncheol Jang
We introduce a doubly-smoothed bandwidth selection method to obtain bandwidth matrices H for estimating the intensity function of a spatial point process. The doubly-smoothed bootstrap involves taking bootstrap samples by adding random noise and using Dirichlet rather than multinomial weights. The mean integrated squared error (MISE) and asymptotic mean integrated squared error (AMISE) as a function of H can then be computed numerically using the bootstrap samples, with optimal H obtained by minimizing the MISE or AMISE with respect to H. We present simulation results comparing the doubly-smoothed bandwidth selection method with other methods for a number of intensity functions. We also apply our methods to a data set of police pedestrian stops in New York City.
提出了一种双平滑带宽选择方法,用于估计空间点过程的强度函数,从而得到带宽矩阵H。双平滑自举包括通过添加随机噪声和使用狄利克雷而不是多项权重来获取自举样本。然后可以使用自举样本数值计算平均积分平方误差(MISE)和渐近平均积分平方误差(AMISE)作为H的函数,通过最小化MISE或AMISE相对于H获得最优H。我们给出了将双平滑带宽选择方法与许多强度函数的其他方法进行比较的仿真结果。我们还将我们的方法应用于纽约市警察行人站的数据集。
{"title":"Bandwidth selection for the intensity in spatial point processes","authors":"Yangha Chung ,&nbsp;Ji Meng Loh ,&nbsp;Woncheol Jang","doi":"10.1016/j.spasta.2025.100928","DOIUrl":"10.1016/j.spasta.2025.100928","url":null,"abstract":"<div><div>We introduce a doubly-smoothed bandwidth selection method to obtain bandwidth matrices <span><math><mi>H</mi></math></span> for estimating the intensity function of a spatial point process. The doubly-smoothed bootstrap involves taking bootstrap samples by adding random noise and using Dirichlet rather than multinomial weights. The mean integrated squared error (MISE) and asymptotic mean integrated squared error (AMISE) as a function of <span><math><mi>H</mi></math></span> can then be computed numerically using the bootstrap samples, with optimal <span><math><mi>H</mi></math></span> obtained by minimizing the MISE or AMISE with respect to <span><math><mrow><mi>H</mi><mo>.</mo></mrow></math></span> We present simulation results comparing the doubly-smoothed bandwidth selection method with other methods for a number of intensity functions. We also apply our methods to a data set of police pedestrian stops in New York City.</div></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"70 ","pages":"Article 100928"},"PeriodicalIF":2.5,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145222042","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Uncertain spatial autoregressive model with applications to regional economic analysis and regional air quality analysis 不确定空间自回归模型及其在区域经济分析和区域空气质量分析中的应用
IF 2.5 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2025-12-01 Epub Date: 2025-09-22 DOI: 10.1016/j.spasta.2025.100932
Jinsheng Xie
This study aims to establish uncertain spatial statistics by exploring the uncertain spatial autoregressive model firstly. Modeling the observations of the response variable via uncertain variables and assuming they are affected by neighboring observations, this paper explores an approach of the uncertain spatial autoregressive model to estimate relationships among the uncertain variables with spatial locations. By employing the principle of least squares, a minimization problem is provided to estimate unknown parameters in the uncertain spatial autoregressive model. Finally, two real-world examples of regional economic analysis and regional air quality analysis are given to clearly demonstrate the uncertain spatial autoregressive model.
本研究旨在通过探索不确定空间自回归模型,建立不确定空间统计量。本文通过不确定变量对响应变量的观测值进行建模,并假设它们受到相邻观测值的影响,探索了一种不确定空间自回归模型来估计不确定变量与空间位置之间关系的方法。利用最小二乘原理,给出了不确定空间自回归模型中未知参数估计的最小化问题。最后,以区域经济分析和区域空气质量分析为例,对不确定空间自回归模型进行了论证。
{"title":"Uncertain spatial autoregressive model with applications to regional economic analysis and regional air quality analysis","authors":"Jinsheng Xie","doi":"10.1016/j.spasta.2025.100932","DOIUrl":"10.1016/j.spasta.2025.100932","url":null,"abstract":"<div><div>This study aims to establish uncertain spatial statistics by exploring the uncertain spatial autoregressive model firstly. Modeling the observations of the response variable via uncertain variables and assuming they are affected by neighboring observations, this paper explores an approach of the uncertain spatial autoregressive model to estimate relationships among the uncertain variables with spatial locations. By employing the principle of least squares, a minimization problem is provided to estimate unknown parameters in the uncertain spatial autoregressive model. Finally, two real-world examples of regional economic analysis and regional air quality analysis are given to clearly demonstrate the uncertain spatial autoregressive model.</div></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"70 ","pages":"Article 100932"},"PeriodicalIF":2.5,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145222043","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Spatial empirical best predictor of small area linear parameter for positively skewed outcomes 空间经验是小面积线性参数对正偏倚结果的最佳预测因子
IF 2.5 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2025-12-01 Epub Date: 2025-11-04 DOI: 10.1016/j.spasta.2025.100941
Dian Handayani , Khairil Anwar Notodiputro , Asep Saefuddin , I Wayan Mangku , Anang Kurnia
Small area estimation (SAE) addresses the estimation of parameters for population subsets when the sample itself is too small to produce reliable direct estimates. The standard method, empirical best linear unbiased prediction, uses a predictor under a linear mixed model that assumes normality of the variable of interest and independence among small areas. However, in practical studies, the distribution of the variable of interest tends to be positively skewed and there exists spatial dependence among the small areas. To address both of these, a previous study had proposed a spatial synthetic (SYNT) predictor that predicts non-sampled values of the variable of interest using its unconditional mean. The SYNT predictor is derived based on a unit-level spatial lognormal mixed model. Herein, we propose spatial empirical best predictor (SEBP) to improve the SYNT predictor by using its conditional mean to predict the non-sampled values of the variable of interest. We perform simulation studies to evaluate the performance of SEBP and compare them with those of the SYNT predictor and other existing methods. Our results reveal that the SEBP performs better in terms of the average relative bias and average relative root mean square error when the spatial correlation among small areas is small, medium or large. In an SAE application on the average monthly household per-capita expenditure for sub-districts in Bogor, Indonesia, the proposed SEBP provides better estimates than other established methods.
当样本本身太小而无法产生可靠的直接估计时,小面积估计(SAE)解决了总体子集参数的估计。标准方法,经验最佳线性无偏预测,使用线性混合模型下的预测器,该模型假设感兴趣变量的正态性和小区域之间的独立性。但在实际研究中,兴趣变量的分布趋于正偏,小区域间存在空间依赖性。为了解决这两个问题,之前的一项研究提出了一个空间合成(SYNT)预测器,该预测器使用目标变量的无条件平均值来预测其非采样值。SYNT预测器是基于单位级空间对数正态混合模型导出的。在此,我们提出空间经验最佳预测器(SEBP)来改进SYNT预测器,通过使用其条件均值来预测感兴趣变量的非采样值。我们进行了模拟研究来评估SEBP的性能,并将其与SYNT预测器和其他现有方法进行了比较。结果表明,当小区域间的空间相关性为小、中、大时,SEBP在平均相对偏差和平均相对均方根误差方面表现较好。在一项SAE关于印度尼西亚茂物街道平均每月家庭人均支出的应用中,拟议的SEBP提供了比其他现有方法更好的估计。
{"title":"Spatial empirical best predictor of small area linear parameter for positively skewed outcomes","authors":"Dian Handayani ,&nbsp;Khairil Anwar Notodiputro ,&nbsp;Asep Saefuddin ,&nbsp;I Wayan Mangku ,&nbsp;Anang Kurnia","doi":"10.1016/j.spasta.2025.100941","DOIUrl":"10.1016/j.spasta.2025.100941","url":null,"abstract":"<div><div>Small area estimation (SAE) addresses the estimation of parameters for population subsets when the sample itself is too small to produce reliable direct estimates. The standard method, empirical best linear unbiased prediction, uses a predictor under a linear mixed model that assumes normality of the variable of interest and independence among small areas. However, in practical studies, the distribution of the variable of interest tends to be positively skewed and there exists spatial dependence among the small areas. To address both of these, a previous study had proposed a spatial synthetic (SYNT) predictor that predicts non-sampled values of the variable of interest using its unconditional mean. The SYNT predictor is derived based on a unit-level spatial lognormal mixed model. Herein, we propose spatial empirical best predictor (SEBP) to improve the SYNT predictor by using its conditional mean to predict the non-sampled values of the variable of interest. We perform simulation studies to evaluate the performance of SEBP and compare them with those of the SYNT predictor and other existing methods. Our results reveal that the SEBP performs better in terms of the average relative bias and average relative root mean square error when the spatial correlation among small areas is small, medium or large. In an SAE application on the average monthly household per-capita expenditure for sub-districts in Bogor, Indonesia, the proposed SEBP provides better estimates than other established methods.</div></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"70 ","pages":"Article 100941"},"PeriodicalIF":2.5,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145528797","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Specifying spatial effects in panel data: Locally robust vs. conditional tests 指定面板数据中的空间效果:局部鲁棒测试与条件测试
IF 2.5 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2025-12-01 Epub Date: 2025-09-23 DOI: 10.1016/j.spasta.2025.100934
Giovanni Millo
We address the issue of specifying a spatial lag vs. spatial error process in spatial panel models. The popular locally robust Lagrange multiplier (RLM) tests for spatial lag vs. error are compared to optimal alternatives based on maximum likelihood estimation: Wald and likelihood ratio (LR) tests requiring estimation of the full encompassing model, and conditional Lagrange multiplier (CLM) tests drawing on the reduced specification. Monte Carlo simulations are performed in a typical spatial panel context. Individual effects are successfully eliminated through the forward orthogonal deviations transformation, making the RLM suitable for panel data. Nevertheless, the statistical properties of Wald and LR are superior to those of the RLM. The CLM also dominates the RLM, as long as the sample is at least of moderate size. The RLM are computationally very convenient, but ML-based tests are feasible in most usage cases on mainstream hardware.
我们解决了在空间面板模型中指定空间滞后与空间误差过程的问题。将流行的局部鲁棒拉格朗日乘数(RLM)空间滞后与误差测试与基于最大似然估计的最佳替代方案进行比较:Wald和似然比(LR)测试需要估计完整的包含模型,以及基于简化规范的条件拉格朗日乘数(CLM)测试。蒙特卡罗模拟是在典型的空间面板环境中进行的。通过前向正交偏差变换,成功地消除了个体影响,使RLM适用于面板数据。然而,Wald和LR的统计性质优于RLM。只要样本至少是中等大小,CLM也支配着RLM。RLM在计算上非常方便,但是基于ml的测试在主流硬件上的大多数使用情况下是可行的。
{"title":"Specifying spatial effects in panel data: Locally robust vs. conditional tests","authors":"Giovanni Millo","doi":"10.1016/j.spasta.2025.100934","DOIUrl":"10.1016/j.spasta.2025.100934","url":null,"abstract":"<div><div>We address the issue of specifying a spatial lag vs. spatial error process in spatial panel models. The popular locally robust Lagrange multiplier (RLM) tests for spatial lag vs. error are compared to optimal alternatives based on maximum likelihood estimation: Wald and likelihood ratio (LR) tests requiring estimation of the full encompassing model, and conditional Lagrange multiplier (CLM) tests drawing on the reduced specification. Monte Carlo simulations are performed in a typical spatial panel context. Individual effects are successfully eliminated through the forward orthogonal deviations transformation, making the RLM suitable for panel data. Nevertheless, the statistical properties of Wald and LR are superior to those of the RLM. The CLM also dominates the RLM, as long as the sample is at least of moderate size. The RLM are computationally very convenient, but ML-based tests are feasible in most usage cases on mainstream hardware.</div></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"70 ","pages":"Article 100934"},"PeriodicalIF":2.5,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145159568","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Growth of spatial statistics for agriculture and environment: The example of BioSP at INRAE 农业与环境空间统计的增长:以印度农业与环境研究所的BioSP为例
IF 2.5 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2025-12-01 Epub Date: 2025-10-01 DOI: 10.1016/j.spasta.2025.100938
Denis Allard
This paper illustrates how progress in spatial statistics is fueled by scientific questions arising from applications in agriculture and environment. The unifying theme is the work that has been carried out at BioSP, a statistics and mathematics research unit mainly affiliated to the “Mathematics and Digital Technologies” division at INRAE, the French National Research Institute for Agriculture, Food and Environment. Starting from the 20 contributions that BioSP members have published in Spatial Statistics since its creation in 2012, almost fifteen years of advances are reviewed, spanning point processes, (multivariate) spatio-temporal Gaussian processes, compositional data, stochastic weather generators and extreme value theory. Most of the content is focused on theoretical and methodological developments, with examples being limited due to length constraints for the article. Attention is given to how these advances have been inspired by problems arising in other research domains. In return, it will be shown how they have opened new research questions in spatial statistics and how they had impact in the scientific fields they originated from. In conclusion, some perspectives and outlooks are discussed, in particular in relation to the AI revolution.
本文阐述了空间统计的进步是如何由农业和环境应用中产生的科学问题推动的。统一的主题是在BioSP进行的工作,BioSP是一个统计和数学研究单位,主要隶属于法国国家农业、食品和环境研究所(INRAE)的“数学和数字技术”部门。自2012年创立以来,BioSP成员在《空间统计学》上发表了20篇文章,回顾了近15年来的进展,包括点过程、(多元)时空高斯过程、成分数据、随机天气发生器和极值理论。大部分内容集中在理论和方法的发展,由于文章的长度限制,示例有限。关注这些进步是如何受到其他研究领域中出现的问题的启发的。作为回报,它将展示他们如何在空间统计中开辟了新的研究问题,以及他们如何在他们所起源于的科学领域产生影响。最后,讨论了一些观点和展望,特别是与人工智能革命有关的观点和展望。
{"title":"Growth of spatial statistics for agriculture and environment: The example of BioSP at INRAE","authors":"Denis Allard","doi":"10.1016/j.spasta.2025.100938","DOIUrl":"10.1016/j.spasta.2025.100938","url":null,"abstract":"<div><div>This paper illustrates how progress in spatial statistics is fueled by scientific questions arising from applications in agriculture and environment. The unifying theme is the work that has been carried out at BioSP, a statistics and mathematics research unit mainly affiliated to the “Mathematics and Digital Technologies” division at INRAE, the French National Research Institute for Agriculture, Food and Environment. Starting from the 20 contributions that BioSP members have published in <em>Spatial Statistics</em> since its creation in 2012, almost fifteen years of advances are reviewed, spanning point processes, (multivariate) spatio-temporal Gaussian processes, compositional data, stochastic weather generators and extreme value theory. Most of the content is focused on theoretical and methodological developments, with examples being limited due to length constraints for the article. Attention is given to how these advances have been inspired by problems arising in other research domains. In return, it will be shown how they have opened new research questions in spatial statistics and how they had impact in the scientific fields they originated from. In conclusion, some perspectives and outlooks are discussed, in particular in relation to the AI revolution.</div></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"70 ","pages":"Article 100938"},"PeriodicalIF":2.5,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145269553","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Joint model for zero-inflated data combining fishery-dependent and fishery-independent sources 结合渔业依赖和渔业独立来源的零膨胀数据联合模型
IF 2.5 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2025-12-01 Epub Date: 2025-09-11 DOI: 10.1016/j.spasta.2025.100930
Daniela Silva , Raquel Menezes , Gonçalo Araújo , Renato Rosa , Ana Moreno , Alexandra Silva , Susana Garrido
Accurately identifying spatial patterns of species distribution is crucial for scientific insight and societal benefit, aiding our understanding of species fluctuations. The increasing quantity and quality of ecological datasets present heightened statistical challenges, complicating spatial species dynamics comprehension. Addressing the complex task of integrating multiple data sources to enhance spatial fish distribution understanding in marine ecology, this study introduces a pioneering five-layer Joint model. The model adeptly integrates fishery-independent and fishery-dependent data, accommodating zero-inflated data and distinct sampling processes. A comprehensive simulation study evaluates the model performance across various preferential sampling scenarios and sample sizes, elucidating its advantages and challenges. Our findings highlight the model’s robustness in estimating preferential parameters, emphasizing differentiation between presence–absence and biomass observations. Evaluation of estimation of spatial covariance and prediction performance underscores the model’s reliability. Augmenting sample sizes reduces parameter estimation variability, aligning with the principle that increased information enhances certainty. Assessing the contribution of each data source reveals successful integration, providing a comprehensive representation of biomass patterns. Empirical application within a real-world context further solidifies the model’s efficacy in capturing species’ spatial distribution. This research advances methodologies for integrating diverse datasets with different sampling natures further contributing to a more informed understanding of spatial dynamics of marine species.
准确识别物种分布的空间模式对于科学洞察力和社会效益至关重要,有助于我们理解物种波动。生态数据集的数量和质量的不断提高对统计提出了更高的挑战,使空间物种动态的理解复杂化。为了解决整合多个数据源以增强海洋生态学中鱼类空间分布认识的复杂任务,本研究引入了一个开创性的五层联合模型。该模型巧妙地整合了渔业独立和渔业依赖的数据,适应零膨胀数据和不同的采样过程。综合仿真研究评估了模型在各种优先采样场景和样本量下的性能,阐明了其优势和挑战。我们的发现突出了模型在估计优先参数方面的稳健性,强调了存在-缺失和生物量观测之间的区别。对空间协方差估计和预测性能的评价强调了模型的可靠性。增加样本量减少了参数估计的可变性,与增加的信息增强确定性的原则保持一致。评估每个数据源的贡献揭示了成功的整合,提供了生物量模式的全面表示。在现实环境中的经验应用进一步巩固了模型在捕捉物种空间分布方面的有效性。这项研究推进了整合不同采样性质的不同数据集的方法,进一步有助于对海洋物种的空间动态有更深入的了解。
{"title":"Joint model for zero-inflated data combining fishery-dependent and fishery-independent sources","authors":"Daniela Silva ,&nbsp;Raquel Menezes ,&nbsp;Gonçalo Araújo ,&nbsp;Renato Rosa ,&nbsp;Ana Moreno ,&nbsp;Alexandra Silva ,&nbsp;Susana Garrido","doi":"10.1016/j.spasta.2025.100930","DOIUrl":"10.1016/j.spasta.2025.100930","url":null,"abstract":"<div><div>Accurately identifying spatial patterns of species distribution is crucial for scientific insight and societal benefit, aiding our understanding of species fluctuations. The increasing quantity and quality of ecological datasets present heightened statistical challenges, complicating spatial species dynamics comprehension. Addressing the complex task of integrating multiple data sources to enhance spatial fish distribution understanding in marine ecology, this study introduces a pioneering five-layer Joint model. The model adeptly integrates fishery-independent and fishery-dependent data, accommodating zero-inflated data and distinct sampling processes. A comprehensive simulation study evaluates the model performance across various preferential sampling scenarios and sample sizes, elucidating its advantages and challenges. Our findings highlight the model’s robustness in estimating preferential parameters, emphasizing differentiation between presence–absence and biomass observations. Evaluation of estimation of spatial covariance and prediction performance underscores the model’s reliability. Augmenting sample sizes reduces parameter estimation variability, aligning with the principle that increased information enhances certainty. Assessing the contribution of each data source reveals successful integration, providing a comprehensive representation of biomass patterns. Empirical application within a real-world context further solidifies the model’s efficacy in capturing species’ spatial distribution. This research advances methodologies for integrating diverse datasets with different sampling natures further contributing to a more informed understanding of spatial dynamics of marine species.</div></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"70 ","pages":"Article 100930"},"PeriodicalIF":2.5,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145107607","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Spatial and spatio-temporal cluster detection using stacking 基于叠加的时空聚类检测
IF 2.5 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2025-12-01 Epub Date: 2025-09-24 DOI: 10.1016/j.spasta.2025.100933
Maria E. Kamenetsky , Jun Zhu , Ronald E. Gangnon
Patterns in disease across space and time are important to epidemiologists and health professionals because they may indicate underlying elevated disease risk. In some cases, elevated risk may be driven by environmental exposures, infectious diseases or other factors where timely public health interventions are important. The spatial and spatio-temporal scan statistics identify a single most likely cluster or equivalently select a single correct model. We instead consider an ensemble of single cluster models. We use stacking, a model-averaging technique, to combine relative risk estimates from all of the single cluster models into a sequence of meta-models indexed by the effective number of parameters/clusters. The number of parameters/spatio-temporal clusters is chosen using information criteria. A simulation study is conducted to demonstrate the statistical properties of the stacking method. The method is illustrated using a dataset of female breast cancer incidence data at the municipality level in Japan.
跨空间和时间的疾病模式对流行病学家和卫生专业人员很重要,因为它们可能表明潜在的疾病风险升高。在某些情况下,风险升高可能是由环境暴露、传染病或及时的公共卫生干预很重要的其他因素造成的。空间和时空扫描统计信息确定一个最可能的集群或选择一个正确的模型。我们转而考虑单个集群模型的集合。我们使用叠加,一种模型平均技术,将所有单个聚类模型的相对风险估计合并到由参数/聚类的有效数量索引的元模型序列中。使用信息标准选择参数/时空聚类的数量。通过仿真研究验证了该方法的统计特性。该方法使用日本市级女性乳腺癌发病率数据集进行说明。
{"title":"Spatial and spatio-temporal cluster detection using stacking","authors":"Maria E. Kamenetsky ,&nbsp;Jun Zhu ,&nbsp;Ronald E. Gangnon","doi":"10.1016/j.spasta.2025.100933","DOIUrl":"10.1016/j.spasta.2025.100933","url":null,"abstract":"<div><div>Patterns in disease across space and time are important to epidemiologists and health professionals because they may indicate underlying elevated disease risk. In some cases, elevated risk may be driven by environmental exposures, infectious diseases or other factors where timely public health interventions are important. The spatial and spatio-temporal scan statistics identify a single most likely cluster or equivalently select a single correct model. We instead consider an ensemble of single cluster models. We use stacking, a model-averaging technique, to combine relative risk estimates from all of the single cluster models into a sequence of meta-models indexed by the effective number of parameters/clusters. The number of parameters/spatio-temporal clusters is chosen using information criteria. A simulation study is conducted to demonstrate the statistical properties of the stacking method. The method is illustrated using a dataset of female breast cancer incidence data at the municipality level in Japan.</div></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"70 ","pages":"Article 100933"},"PeriodicalIF":2.5,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145159567","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Variational autoencoded multivariate spatial Fay–Herriot models 变分自编码多元空间费-赫里奥特模型
IF 2.5 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2025-12-01 Epub Date: 2025-09-10 DOI: 10.1016/j.spasta.2025.100929
Zhenhua Wang , Paul A. Parker , Scott H. Holan
Small area estimation models are essential for estimating population characteristics in regions with limited sample sizes, thereby supporting policy decisions, demographic studies, and resource allocation, among other use cases. The spatial Fay–Herriot model is one such approach that incorporates spatial dependence to improve estimation by borrowing strength from neighboring regions. However, this approach often requires substantial computational resources, limiting its scalability for high-dimensional datasets, especially when considering multiple (multivariate) responses. This paper proposes two methods that integrate the multivariate spatial Fay–Herriot model with spatial random effects, learned through variational autoencoders, to efficiently leverage spatial structure. Importantly, after training the variational autoencoder to represent spatial dependence for a given set of geographies, it may be used again in future modeling efforts, without the need for retraining. Additionally, the use of the variational autoencoder to represent spatial dependence results in extreme improvements in computational efficiency, even for massive datasets. We demonstrate the effectiveness of our approach using 5-year period estimates from the American Community Survey over all census tracts in California.
小区域估计模型对于在样本数量有限的地区估计人口特征是必不可少的,因此支持政策决策、人口统计研究和资源分配,以及其他用例。空间Fay-Herriot模型就是这样一种方法,它结合了空间依赖性,通过借鉴邻近区域的强度来改进估计。然而,这种方法通常需要大量的计算资源,限制了其对高维数据集的可伸缩性,特别是在考虑多个(多变量)响应时。本文提出了两种将变分自编码器学习到的多元空间Fay-Herriot模型与空间随机效应相结合的方法,以有效利用空间结构。重要的是,在训练变分自编码器以表示给定地理集合的空间依赖性之后,它可以在未来的建模工作中再次使用,而无需再训练。此外,使用变分自编码器来表示空间依赖性可以极大地提高计算效率,即使对于大量数据集也是如此。我们使用美国社区调查对加州所有人口普查区的5年估计来证明我们方法的有效性。
{"title":"Variational autoencoded multivariate spatial Fay–Herriot models","authors":"Zhenhua Wang ,&nbsp;Paul A. Parker ,&nbsp;Scott H. Holan","doi":"10.1016/j.spasta.2025.100929","DOIUrl":"10.1016/j.spasta.2025.100929","url":null,"abstract":"<div><div>Small area estimation models are essential for estimating population characteristics in regions with limited sample sizes, thereby supporting policy decisions, demographic studies, and resource allocation, among other use cases. The spatial Fay–Herriot model is one such approach that incorporates spatial dependence to improve estimation by borrowing strength from neighboring regions. However, this approach often requires substantial computational resources, limiting its scalability for high-dimensional datasets, especially when considering multiple (multivariate) responses. This paper proposes two methods that integrate the multivariate spatial Fay–Herriot model with spatial random effects, learned through variational autoencoders, to efficiently leverage spatial structure. Importantly, after training the variational autoencoder to represent spatial dependence for a given set of geographies, it may be used again in future modeling efforts, without the need for retraining. Additionally, the use of the variational autoencoder to represent spatial dependence results in extreme improvements in computational efficiency, even for massive datasets. We demonstrate the effectiveness of our approach using 5-year period estimates from the American Community Survey over all census tracts in California.</div></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"70 ","pages":"Article 100929"},"PeriodicalIF":2.5,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145050405","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Model averaging for spatial autoregressive panel data models 空间自回归面板数据模型的模型平均
IF 2.5 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2025-12-01 Epub Date: 2025-09-23 DOI: 10.1016/j.spasta.2025.100931
Aibing Ji, Jingxuan Li, Qingqing Li
The spatial autoregressive panel data models are widely employed in regional economics to capture spatial dependencies, but conventional specifications rely on a single spatial weight matrix, heightening the risk of model misspecification. Current research lacks systematic model averaging methods for integrating multiple weight matrices and addressing spatial effect uncertainty. This study proposes a novel model averaging framework for spatial autoregressive panel data models with fixed effects, extending model averaging methodology to the spatial panel context and enabling flexible combinations of multiple weight matrices for both dependent variables and error terms. An adaptive Mallows-type criterion is developed, dynamically adjusting to the presence or absence of spatial effects, with its asymptotic optimality established. Monte Carlo simulations confirm robustness across scenarios with no, single, or mixed spatial dependencies. An empirical application to Chinese provincial housing prices identifies economic adjacency as the key spatial dependence driver, validating the method’s predictive accuracy and policy utility for spatiotemporal data analysis.
空间自回归面板数据模型在区域经济学中被广泛应用于捕获空间依赖关系,但传统的规范依赖于单一的空间权重矩阵,增加了模型错误规范的风险。目前的研究缺乏系统的模型平均方法来积分多个权重矩阵和处理空间效应的不确定性。本研究为具有固定效应的空间自回归面板数据模型提出了一种新的模型平均框架,将模型平均方法扩展到空间面板环境,并实现了因变量和误差项的多个权重矩阵的灵活组合。提出了一种自适应mallows型准则,根据空间效应的存在或不存在进行动态调整,并建立了其渐近最优性。蒙特卡罗模拟证实了在没有、单一或混合空间依赖性的情况下的鲁棒性。通过对中国省级房价的实证分析,发现经济邻接性是主要的空间依赖驱动因素,验证了该方法在时空数据分析中的预测准确性和政策实用性。
{"title":"Model averaging for spatial autoregressive panel data models","authors":"Aibing Ji,&nbsp;Jingxuan Li,&nbsp;Qingqing Li","doi":"10.1016/j.spasta.2025.100931","DOIUrl":"10.1016/j.spasta.2025.100931","url":null,"abstract":"<div><div>The spatial autoregressive panel data models are widely employed in regional economics to capture spatial dependencies, but conventional specifications rely on a single spatial weight matrix, heightening the risk of model misspecification. Current research lacks systematic model averaging methods for integrating multiple weight matrices and addressing spatial effect uncertainty. This study proposes a novel model averaging framework for spatial autoregressive panel data models with fixed effects, extending model averaging methodology to the spatial panel context and enabling flexible combinations of multiple weight matrices for both dependent variables and error terms. An adaptive Mallows-type criterion is developed, dynamically adjusting to the presence or absence of spatial effects, with its asymptotic optimality established. Monte Carlo simulations confirm robustness across scenarios with no, single, or mixed spatial dependencies. An empirical application to Chinese provincial housing prices identifies economic adjacency as the key spatial dependence driver, validating the method’s predictive accuracy and policy utility for spatiotemporal data analysis.</div></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"70 ","pages":"Article 100931"},"PeriodicalIF":2.5,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145159566","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Spatial Statistics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1