首页 > 最新文献

Spatial Statistics最新文献

英文 中文
Bandwidth selection for the intensity in spatial point processes 空间点过程中强度的带宽选择
IF 2.5 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2025-09-25 DOI: 10.1016/j.spasta.2025.100928
Yangha Chung , Ji Meng Loh , Woncheol Jang
We introduce a doubly-smoothed bandwidth selection method to obtain bandwidth matrices H for estimating the intensity function of a spatial point process. The doubly-smoothed bootstrap involves taking bootstrap samples by adding random noise and using Dirichlet rather than multinomial weights. The mean integrated squared error (MISE) and asymptotic mean integrated squared error (AMISE) as a function of H can then be computed numerically using the bootstrap samples, with optimal H obtained by minimizing the MISE or AMISE with respect to H. We present simulation results comparing the doubly-smoothed bandwidth selection method with other methods for a number of intensity functions. We also apply our methods to a data set of police pedestrian stops in New York City.
提出了一种双平滑带宽选择方法,用于估计空间点过程的强度函数,从而得到带宽矩阵H。双平滑自举包括通过添加随机噪声和使用狄利克雷而不是多项权重来获取自举样本。然后可以使用自举样本数值计算平均积分平方误差(MISE)和渐近平均积分平方误差(AMISE)作为H的函数,通过最小化MISE或AMISE相对于H获得最优H。我们给出了将双平滑带宽选择方法与许多强度函数的其他方法进行比较的仿真结果。我们还将我们的方法应用于纽约市警察行人站的数据集。
{"title":"Bandwidth selection for the intensity in spatial point processes","authors":"Yangha Chung ,&nbsp;Ji Meng Loh ,&nbsp;Woncheol Jang","doi":"10.1016/j.spasta.2025.100928","DOIUrl":"10.1016/j.spasta.2025.100928","url":null,"abstract":"<div><div>We introduce a doubly-smoothed bandwidth selection method to obtain bandwidth matrices <span><math><mi>H</mi></math></span> for estimating the intensity function of a spatial point process. The doubly-smoothed bootstrap involves taking bootstrap samples by adding random noise and using Dirichlet rather than multinomial weights. The mean integrated squared error (MISE) and asymptotic mean integrated squared error (AMISE) as a function of <span><math><mi>H</mi></math></span> can then be computed numerically using the bootstrap samples, with optimal <span><math><mi>H</mi></math></span> obtained by minimizing the MISE or AMISE with respect to <span><math><mrow><mi>H</mi><mo>.</mo></mrow></math></span> We present simulation results comparing the doubly-smoothed bandwidth selection method with other methods for a number of intensity functions. We also apply our methods to a data set of police pedestrian stops in New York City.</div></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"70 ","pages":"Article 100928"},"PeriodicalIF":2.5,"publicationDate":"2025-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145222042","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Spatial and spatio-temporal cluster detection using stacking 基于叠加的时空聚类检测
IF 2.5 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2025-09-24 DOI: 10.1016/j.spasta.2025.100933
Maria E. Kamenetsky , Jun Zhu , Ronald E. Gangnon
Patterns in disease across space and time are important to epidemiologists and health professionals because they may indicate underlying elevated disease risk. In some cases, elevated risk may be driven by environmental exposures, infectious diseases or other factors where timely public health interventions are important. The spatial and spatio-temporal scan statistics identify a single most likely cluster or equivalently select a single correct model. We instead consider an ensemble of single cluster models. We use stacking, a model-averaging technique, to combine relative risk estimates from all of the single cluster models into a sequence of meta-models indexed by the effective number of parameters/clusters. The number of parameters/spatio-temporal clusters is chosen using information criteria. A simulation study is conducted to demonstrate the statistical properties of the stacking method. The method is illustrated using a dataset of female breast cancer incidence data at the municipality level in Japan.
跨空间和时间的疾病模式对流行病学家和卫生专业人员很重要,因为它们可能表明潜在的疾病风险升高。在某些情况下,风险升高可能是由环境暴露、传染病或及时的公共卫生干预很重要的其他因素造成的。空间和时空扫描统计信息确定一个最可能的集群或选择一个正确的模型。我们转而考虑单个集群模型的集合。我们使用叠加,一种模型平均技术,将所有单个聚类模型的相对风险估计合并到由参数/聚类的有效数量索引的元模型序列中。使用信息标准选择参数/时空聚类的数量。通过仿真研究验证了该方法的统计特性。该方法使用日本市级女性乳腺癌发病率数据集进行说明。
{"title":"Spatial and spatio-temporal cluster detection using stacking","authors":"Maria E. Kamenetsky ,&nbsp;Jun Zhu ,&nbsp;Ronald E. Gangnon","doi":"10.1016/j.spasta.2025.100933","DOIUrl":"10.1016/j.spasta.2025.100933","url":null,"abstract":"<div><div>Patterns in disease across space and time are important to epidemiologists and health professionals because they may indicate underlying elevated disease risk. In some cases, elevated risk may be driven by environmental exposures, infectious diseases or other factors where timely public health interventions are important. The spatial and spatio-temporal scan statistics identify a single most likely cluster or equivalently select a single correct model. We instead consider an ensemble of single cluster models. We use stacking, a model-averaging technique, to combine relative risk estimates from all of the single cluster models into a sequence of meta-models indexed by the effective number of parameters/clusters. The number of parameters/spatio-temporal clusters is chosen using information criteria. A simulation study is conducted to demonstrate the statistical properties of the stacking method. The method is illustrated using a dataset of female breast cancer incidence data at the municipality level in Japan.</div></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"70 ","pages":"Article 100933"},"PeriodicalIF":2.5,"publicationDate":"2025-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145159567","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Specifying spatial effects in panel data: Locally robust vs. conditional tests 指定面板数据中的空间效果:局部鲁棒测试与条件测试
IF 2.5 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2025-09-23 DOI: 10.1016/j.spasta.2025.100934
Giovanni Millo
We address the issue of specifying a spatial lag vs. spatial error process in spatial panel models. The popular locally robust Lagrange multiplier (RLM) tests for spatial lag vs. error are compared to optimal alternatives based on maximum likelihood estimation: Wald and likelihood ratio (LR) tests requiring estimation of the full encompassing model, and conditional Lagrange multiplier (CLM) tests drawing on the reduced specification. Monte Carlo simulations are performed in a typical spatial panel context. Individual effects are successfully eliminated through the forward orthogonal deviations transformation, making the RLM suitable for panel data. Nevertheless, the statistical properties of Wald and LR are superior to those of the RLM. The CLM also dominates the RLM, as long as the sample is at least of moderate size. The RLM are computationally very convenient, but ML-based tests are feasible in most usage cases on mainstream hardware.
我们解决了在空间面板模型中指定空间滞后与空间误差过程的问题。将流行的局部鲁棒拉格朗日乘数(RLM)空间滞后与误差测试与基于最大似然估计的最佳替代方案进行比较:Wald和似然比(LR)测试需要估计完整的包含模型,以及基于简化规范的条件拉格朗日乘数(CLM)测试。蒙特卡罗模拟是在典型的空间面板环境中进行的。通过前向正交偏差变换,成功地消除了个体影响,使RLM适用于面板数据。然而,Wald和LR的统计性质优于RLM。只要样本至少是中等大小,CLM也支配着RLM。RLM在计算上非常方便,但是基于ml的测试在主流硬件上的大多数使用情况下是可行的。
{"title":"Specifying spatial effects in panel data: Locally robust vs. conditional tests","authors":"Giovanni Millo","doi":"10.1016/j.spasta.2025.100934","DOIUrl":"10.1016/j.spasta.2025.100934","url":null,"abstract":"<div><div>We address the issue of specifying a spatial lag vs. spatial error process in spatial panel models. The popular locally robust Lagrange multiplier (RLM) tests for spatial lag vs. error are compared to optimal alternatives based on maximum likelihood estimation: Wald and likelihood ratio (LR) tests requiring estimation of the full encompassing model, and conditional Lagrange multiplier (CLM) tests drawing on the reduced specification. Monte Carlo simulations are performed in a typical spatial panel context. Individual effects are successfully eliminated through the forward orthogonal deviations transformation, making the RLM suitable for panel data. Nevertheless, the statistical properties of Wald and LR are superior to those of the RLM. The CLM also dominates the RLM, as long as the sample is at least of moderate size. The RLM are computationally very convenient, but ML-based tests are feasible in most usage cases on mainstream hardware.</div></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"70 ","pages":"Article 100934"},"PeriodicalIF":2.5,"publicationDate":"2025-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145159568","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Model averaging for spatial autoregressive panel data models 空间自回归面板数据模型的模型平均
IF 2.5 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2025-09-23 DOI: 10.1016/j.spasta.2025.100931
Aibing Ji, Jingxuan Li, Qingqing Li
The spatial autoregressive panel data models are widely employed in regional economics to capture spatial dependencies, but conventional specifications rely on a single spatial weight matrix, heightening the risk of model misspecification. Current research lacks systematic model averaging methods for integrating multiple weight matrices and addressing spatial effect uncertainty. This study proposes a novel model averaging framework for spatial autoregressive panel data models with fixed effects, extending model averaging methodology to the spatial panel context and enabling flexible combinations of multiple weight matrices for both dependent variables and error terms. An adaptive Mallows-type criterion is developed, dynamically adjusting to the presence or absence of spatial effects, with its asymptotic optimality established. Monte Carlo simulations confirm robustness across scenarios with no, single, or mixed spatial dependencies. An empirical application to Chinese provincial housing prices identifies economic adjacency as the key spatial dependence driver, validating the method’s predictive accuracy and policy utility for spatiotemporal data analysis.
空间自回归面板数据模型在区域经济学中被广泛应用于捕获空间依赖关系,但传统的规范依赖于单一的空间权重矩阵,增加了模型错误规范的风险。目前的研究缺乏系统的模型平均方法来积分多个权重矩阵和处理空间效应的不确定性。本研究为具有固定效应的空间自回归面板数据模型提出了一种新的模型平均框架,将模型平均方法扩展到空间面板环境,并实现了因变量和误差项的多个权重矩阵的灵活组合。提出了一种自适应mallows型准则,根据空间效应的存在或不存在进行动态调整,并建立了其渐近最优性。蒙特卡罗模拟证实了在没有、单一或混合空间依赖性的情况下的鲁棒性。通过对中国省级房价的实证分析,发现经济邻接性是主要的空间依赖驱动因素,验证了该方法在时空数据分析中的预测准确性和政策实用性。
{"title":"Model averaging for spatial autoregressive panel data models","authors":"Aibing Ji,&nbsp;Jingxuan Li,&nbsp;Qingqing Li","doi":"10.1016/j.spasta.2025.100931","DOIUrl":"10.1016/j.spasta.2025.100931","url":null,"abstract":"<div><div>The spatial autoregressive panel data models are widely employed in regional economics to capture spatial dependencies, but conventional specifications rely on a single spatial weight matrix, heightening the risk of model misspecification. Current research lacks systematic model averaging methods for integrating multiple weight matrices and addressing spatial effect uncertainty. This study proposes a novel model averaging framework for spatial autoregressive panel data models with fixed effects, extending model averaging methodology to the spatial panel context and enabling flexible combinations of multiple weight matrices for both dependent variables and error terms. An adaptive Mallows-type criterion is developed, dynamically adjusting to the presence or absence of spatial effects, with its asymptotic optimality established. Monte Carlo simulations confirm robustness across scenarios with no, single, or mixed spatial dependencies. An empirical application to Chinese provincial housing prices identifies economic adjacency as the key spatial dependence driver, validating the method’s predictive accuracy and policy utility for spatiotemporal data analysis.</div></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"70 ","pages":"Article 100931"},"PeriodicalIF":2.5,"publicationDate":"2025-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145159566","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Uncertain spatial autoregressive model with applications to regional economic analysis and regional air quality analysis 不确定空间自回归模型及其在区域经济分析和区域空气质量分析中的应用
IF 2.5 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2025-09-22 DOI: 10.1016/j.spasta.2025.100932
Jinsheng Xie
This study aims to establish uncertain spatial statistics by exploring the uncertain spatial autoregressive model firstly. Modeling the observations of the response variable via uncertain variables and assuming they are affected by neighboring observations, this paper explores an approach of the uncertain spatial autoregressive model to estimate relationships among the uncertain variables with spatial locations. By employing the principle of least squares, a minimization problem is provided to estimate unknown parameters in the uncertain spatial autoregressive model. Finally, two real-world examples of regional economic analysis and regional air quality analysis are given to clearly demonstrate the uncertain spatial autoregressive model.
本研究旨在通过探索不确定空间自回归模型,建立不确定空间统计量。本文通过不确定变量对响应变量的观测值进行建模,并假设它们受到相邻观测值的影响,探索了一种不确定空间自回归模型来估计不确定变量与空间位置之间关系的方法。利用最小二乘原理,给出了不确定空间自回归模型中未知参数估计的最小化问题。最后,以区域经济分析和区域空气质量分析为例,对不确定空间自回归模型进行了论证。
{"title":"Uncertain spatial autoregressive model with applications to regional economic analysis and regional air quality analysis","authors":"Jinsheng Xie","doi":"10.1016/j.spasta.2025.100932","DOIUrl":"10.1016/j.spasta.2025.100932","url":null,"abstract":"<div><div>This study aims to establish uncertain spatial statistics by exploring the uncertain spatial autoregressive model firstly. Modeling the observations of the response variable via uncertain variables and assuming they are affected by neighboring observations, this paper explores an approach of the uncertain spatial autoregressive model to estimate relationships among the uncertain variables with spatial locations. By employing the principle of least squares, a minimization problem is provided to estimate unknown parameters in the uncertain spatial autoregressive model. Finally, two real-world examples of regional economic analysis and regional air quality analysis are given to clearly demonstrate the uncertain spatial autoregressive model.</div></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"70 ","pages":"Article 100932"},"PeriodicalIF":2.5,"publicationDate":"2025-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145222043","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Joint model for zero-inflated data combining fishery-dependent and fishery-independent sources 结合渔业依赖和渔业独立来源的零膨胀数据联合模型
IF 2.5 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2025-09-11 DOI: 10.1016/j.spasta.2025.100930
Daniela Silva , Raquel Menezes , Gonçalo Araújo , Renato Rosa , Ana Moreno , Alexandra Silva , Susana Garrido
Accurately identifying spatial patterns of species distribution is crucial for scientific insight and societal benefit, aiding our understanding of species fluctuations. The increasing quantity and quality of ecological datasets present heightened statistical challenges, complicating spatial species dynamics comprehension. Addressing the complex task of integrating multiple data sources to enhance spatial fish distribution understanding in marine ecology, this study introduces a pioneering five-layer Joint model. The model adeptly integrates fishery-independent and fishery-dependent data, accommodating zero-inflated data and distinct sampling processes. A comprehensive simulation study evaluates the model performance across various preferential sampling scenarios and sample sizes, elucidating its advantages and challenges. Our findings highlight the model’s robustness in estimating preferential parameters, emphasizing differentiation between presence–absence and biomass observations. Evaluation of estimation of spatial covariance and prediction performance underscores the model’s reliability. Augmenting sample sizes reduces parameter estimation variability, aligning with the principle that increased information enhances certainty. Assessing the contribution of each data source reveals successful integration, providing a comprehensive representation of biomass patterns. Empirical application within a real-world context further solidifies the model’s efficacy in capturing species’ spatial distribution. This research advances methodologies for integrating diverse datasets with different sampling natures further contributing to a more informed understanding of spatial dynamics of marine species.
准确识别物种分布的空间模式对于科学洞察力和社会效益至关重要,有助于我们理解物种波动。生态数据集的数量和质量的不断提高对统计提出了更高的挑战,使空间物种动态的理解复杂化。为了解决整合多个数据源以增强海洋生态学中鱼类空间分布认识的复杂任务,本研究引入了一个开创性的五层联合模型。该模型巧妙地整合了渔业独立和渔业依赖的数据,适应零膨胀数据和不同的采样过程。综合仿真研究评估了模型在各种优先采样场景和样本量下的性能,阐明了其优势和挑战。我们的发现突出了模型在估计优先参数方面的稳健性,强调了存在-缺失和生物量观测之间的区别。对空间协方差估计和预测性能的评价强调了模型的可靠性。增加样本量减少了参数估计的可变性,与增加的信息增强确定性的原则保持一致。评估每个数据源的贡献揭示了成功的整合,提供了生物量模式的全面表示。在现实环境中的经验应用进一步巩固了模型在捕捉物种空间分布方面的有效性。这项研究推进了整合不同采样性质的不同数据集的方法,进一步有助于对海洋物种的空间动态有更深入的了解。
{"title":"Joint model for zero-inflated data combining fishery-dependent and fishery-independent sources","authors":"Daniela Silva ,&nbsp;Raquel Menezes ,&nbsp;Gonçalo Araújo ,&nbsp;Renato Rosa ,&nbsp;Ana Moreno ,&nbsp;Alexandra Silva ,&nbsp;Susana Garrido","doi":"10.1016/j.spasta.2025.100930","DOIUrl":"10.1016/j.spasta.2025.100930","url":null,"abstract":"<div><div>Accurately identifying spatial patterns of species distribution is crucial for scientific insight and societal benefit, aiding our understanding of species fluctuations. The increasing quantity and quality of ecological datasets present heightened statistical challenges, complicating spatial species dynamics comprehension. Addressing the complex task of integrating multiple data sources to enhance spatial fish distribution understanding in marine ecology, this study introduces a pioneering five-layer Joint model. The model adeptly integrates fishery-independent and fishery-dependent data, accommodating zero-inflated data and distinct sampling processes. A comprehensive simulation study evaluates the model performance across various preferential sampling scenarios and sample sizes, elucidating its advantages and challenges. Our findings highlight the model’s robustness in estimating preferential parameters, emphasizing differentiation between presence–absence and biomass observations. Evaluation of estimation of spatial covariance and prediction performance underscores the model’s reliability. Augmenting sample sizes reduces parameter estimation variability, aligning with the principle that increased information enhances certainty. Assessing the contribution of each data source reveals successful integration, providing a comprehensive representation of biomass patterns. Empirical application within a real-world context further solidifies the model’s efficacy in capturing species’ spatial distribution. This research advances methodologies for integrating diverse datasets with different sampling natures further contributing to a more informed understanding of spatial dynamics of marine species.</div></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"70 ","pages":"Article 100930"},"PeriodicalIF":2.5,"publicationDate":"2025-09-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145107607","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Variational autoencoded multivariate spatial Fay–Herriot models 变分自编码多元空间费-赫里奥特模型
IF 2.5 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2025-09-10 DOI: 10.1016/j.spasta.2025.100929
Zhenhua Wang , Paul A. Parker , Scott H. Holan
Small area estimation models are essential for estimating population characteristics in regions with limited sample sizes, thereby supporting policy decisions, demographic studies, and resource allocation, among other use cases. The spatial Fay–Herriot model is one such approach that incorporates spatial dependence to improve estimation by borrowing strength from neighboring regions. However, this approach often requires substantial computational resources, limiting its scalability for high-dimensional datasets, especially when considering multiple (multivariate) responses. This paper proposes two methods that integrate the multivariate spatial Fay–Herriot model with spatial random effects, learned through variational autoencoders, to efficiently leverage spatial structure. Importantly, after training the variational autoencoder to represent spatial dependence for a given set of geographies, it may be used again in future modeling efforts, without the need for retraining. Additionally, the use of the variational autoencoder to represent spatial dependence results in extreme improvements in computational efficiency, even for massive datasets. We demonstrate the effectiveness of our approach using 5-year period estimates from the American Community Survey over all census tracts in California.
小区域估计模型对于在样本数量有限的地区估计人口特征是必不可少的,因此支持政策决策、人口统计研究和资源分配,以及其他用例。空间Fay-Herriot模型就是这样一种方法,它结合了空间依赖性,通过借鉴邻近区域的强度来改进估计。然而,这种方法通常需要大量的计算资源,限制了其对高维数据集的可伸缩性,特别是在考虑多个(多变量)响应时。本文提出了两种将变分自编码器学习到的多元空间Fay-Herriot模型与空间随机效应相结合的方法,以有效利用空间结构。重要的是,在训练变分自编码器以表示给定地理集合的空间依赖性之后,它可以在未来的建模工作中再次使用,而无需再训练。此外,使用变分自编码器来表示空间依赖性可以极大地提高计算效率,即使对于大量数据集也是如此。我们使用美国社区调查对加州所有人口普查区的5年估计来证明我们方法的有效性。
{"title":"Variational autoencoded multivariate spatial Fay–Herriot models","authors":"Zhenhua Wang ,&nbsp;Paul A. Parker ,&nbsp;Scott H. Holan","doi":"10.1016/j.spasta.2025.100929","DOIUrl":"10.1016/j.spasta.2025.100929","url":null,"abstract":"<div><div>Small area estimation models are essential for estimating population characteristics in regions with limited sample sizes, thereby supporting policy decisions, demographic studies, and resource allocation, among other use cases. The spatial Fay–Herriot model is one such approach that incorporates spatial dependence to improve estimation by borrowing strength from neighboring regions. However, this approach often requires substantial computational resources, limiting its scalability for high-dimensional datasets, especially when considering multiple (multivariate) responses. This paper proposes two methods that integrate the multivariate spatial Fay–Herriot model with spatial random effects, learned through variational autoencoders, to efficiently leverage spatial structure. Importantly, after training the variational autoencoder to represent spatial dependence for a given set of geographies, it may be used again in future modeling efforts, without the need for retraining. Additionally, the use of the variational autoencoder to represent spatial dependence results in extreme improvements in computational efficiency, even for massive datasets. We demonstrate the effectiveness of our approach using 5-year period estimates from the American Community Survey over all census tracts in California.</div></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"70 ","pages":"Article 100929"},"PeriodicalIF":2.5,"publicationDate":"2025-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145050405","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Regime changes and spatial dependence in the 2020 US presidential election polls 2020年美国总统大选民意调查中的政权更迭和空间依赖
IF 2.5 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2025-08-26 DOI: 10.1016/j.spasta.2025.100927
Giampiero M. Gallo , Demetrio Lacava , Edoardo Otranto
This paper introduces a novel two-stage modeling framework that combines Markov Switching (MS) models with an autoregressive model augmented by spatial effects to analyze the dynamics and spatial interdependence of Biden’s polling percentages during the 2020 electoral campaign. In the first stage, we employ MS models to segment each state’s daily polling time series into distinct regimes — interpreted as phases of decline, stability, and growth. This segmentation captures abrupt changes and local trends in public opinion, enabling us to link regime shifts with key political events such as debates, party conventions, and milestone campaign achievements. The inherent nonlinearity of polling data would otherwise be lost by first differencing. By removing the regime-specific components, we generate stationary residuals modeled using an Autoregressive model with exogenous variables (ARX) that incorporates political spatial interactions through two complementary effects. The spillover effect captures lagged influences arising from politically influential states, while the contagion effect reflects the contemporaneous impact of neighboring states. A recursive algorithm based on partial correlations is implemented to select the most relevant spillover sources for each state. Empirical results, based on daily data from 13 swing states, reveal robust evidence of persistent regime structures and marked spatial dependencies. While contagion effects are uniformly significant across states, spillover dynamics exhibit considerable heterogeneity in both magnitude and direction. This integrated modeling approach enhances our understanding of the complex, nonlinear temporal evolution of polling trends and the spatial diffusion of political opinions that underpinned the 2020 electoral outcome.
本文提出了一种新的两阶段建模框架,将马尔可夫切换(MS)模型与空间效应增强的自回归模型相结合,分析了拜登2020年大选期间民调百分比的动态和空间相互依赖性。在第一阶段,我们使用MS模型将每个州的日常投票时间序列划分为不同的制度-解释为下降,稳定和增长的阶段。这种划分捕捉到了公众舆论的突然变化和当地趋势,使我们能够将政权更迭与关键政治事件(如辩论、政党大会和里程碑式的竞选成就)联系起来。否则,轮询数据固有的非线性就会因第一次差分而丧失。通过去除特定于政权的成分,我们使用带有外生变量(ARX)的自回归模型生成平稳残差,该模型通过两种互补效应整合了政治空间相互作用。溢出效应反映的是政治上有影响力的国家产生的滞后影响,而传染效应反映的是邻国同时产生的影响。采用一种基于部分相关性的递归算法,为每个状态选择最相关的溢出源。基于13个摇摆州的日常数据,实证结果揭示了持久的政权结构和显著的空间依赖性的有力证据。虽然传染效应在各州都同样显著,但溢出动态在大小和方向上都表现出相当大的异质性。这种综合建模方法增强了我们对民意调查趋势的复杂、非线性时间演变的理解,以及支撑2020年选举结果的政治观点的空间扩散。
{"title":"Regime changes and spatial dependence in the 2020 US presidential election polls","authors":"Giampiero M. Gallo ,&nbsp;Demetrio Lacava ,&nbsp;Edoardo Otranto","doi":"10.1016/j.spasta.2025.100927","DOIUrl":"10.1016/j.spasta.2025.100927","url":null,"abstract":"<div><div>This paper introduces a novel two-stage modeling framework that combines Markov Switching (MS) models with an autoregressive model augmented by spatial effects to analyze the dynamics and spatial interdependence of Biden’s polling percentages during the 2020 electoral campaign. In the first stage, we employ MS models to segment each state’s daily polling time series into distinct regimes — interpreted as phases of decline, stability, and growth. This segmentation captures abrupt changes and local trends in public opinion, enabling us to link regime shifts with key political events such as debates, party conventions, and milestone campaign achievements. The inherent nonlinearity of polling data would otherwise be lost by first differencing. By removing the regime-specific components, we generate stationary residuals modeled using an Autoregressive model with exogenous variables (ARX) that incorporates political spatial interactions through two complementary effects. The spillover effect captures lagged influences arising from politically influential states, while the contagion effect reflects the contemporaneous impact of neighboring states. A recursive algorithm based on partial correlations is implemented to select the most relevant spillover sources for each state. Empirical results, based on daily data from 13 swing states, reveal robust evidence of persistent regime structures and marked spatial dependencies. While contagion effects are uniformly significant across states, spillover dynamics exhibit considerable heterogeneity in both magnitude and direction. This integrated modeling approach enhances our understanding of the complex, nonlinear temporal evolution of polling trends and the spatial diffusion of political opinions that underpinned the 2020 electoral outcome.</div></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"69 ","pages":"Article 100927"},"PeriodicalIF":2.5,"publicationDate":"2025-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144909070","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
GMM inference for the spatial autoregressive kink model with an unknown threshold 未知阈值空间自回归扭结模型的GMM推理
IF 2.5 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2025-08-22 DOI: 10.1016/j.spasta.2025.100926
Wentao Wang , Dengkui Li
This paper considers spatial autoregressive kink models with an unknown threshold, where the impact of a specific explanatory variable on the response variable is piecewise linear but differs below and above this threshold. To address the endogeneity issue, the paper presents the modified generalized method of moments (GMM) that consistently estimates the threshold location and slope changes. Asymptotic properties, including the consistency and asymptotic normality of the GMM estimators, and the limiting distribution of the Sup-Wald statistic, are established under a set of regularity assumptions. In view of the nonstandard asymptotic null distribution, we use a multiplier bootstrap to approximate the p-value of the Sup-Wald statistic to detect the presence of the threshold. Simulation study illustrates that the estimators and inference are well-behaved in finite samples. An empirical application to the secondary industrial structure data of 280 Chinese prefecture-level cities further highlights the practical merits of our methods.
本文考虑具有未知阈值的空间自回归扭结模型,其中特定解释变量对响应变量的影响是分段线性的,但在该阈值以下和以上有所不同。为了解决内生性问题,本文提出了改进的广义矩量法(GMM),该方法可以一致地估计阈值位置和斜率变化。在一组正则性假设下,建立了GMM估计量的渐近性质,包括一致性和渐近正态性,以及Sup-Wald统计量的极限分布。考虑到非标准渐近零分布,我们使用乘法器自举来近似Sup-Wald统计量的p值来检测阈值的存在。仿真研究表明,该估计器和推理器在有限样本下表现良好。对中国280个地级市第二产业结构数据的实证应用进一步凸显了本文方法的实用性。
{"title":"GMM inference for the spatial autoregressive kink model with an unknown threshold","authors":"Wentao Wang ,&nbsp;Dengkui Li","doi":"10.1016/j.spasta.2025.100926","DOIUrl":"10.1016/j.spasta.2025.100926","url":null,"abstract":"<div><div>This paper considers spatial autoregressive kink models with an unknown threshold, where the impact of a specific explanatory variable on the response variable is piecewise linear but differs below and above this threshold. To address the endogeneity issue, the paper presents the modified generalized method of moments (GMM) that consistently estimates the threshold location and slope changes. Asymptotic properties, including the consistency and asymptotic normality of the GMM estimators, and the limiting distribution of the Sup-Wald statistic, are established under a set of regularity assumptions. In view of the nonstandard asymptotic null distribution, we use a multiplier bootstrap to approximate the <span><math><mi>p</mi></math></span>-value of the Sup-Wald statistic to detect the presence of the threshold. Simulation study illustrates that the estimators and inference are well-behaved in finite samples. An empirical application to the secondary industrial structure data of 280 Chinese prefecture-level cities further highlights the practical merits of our methods.</div></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"69 ","pages":"Article 100926"},"PeriodicalIF":2.5,"publicationDate":"2025-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144909069","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Conformal novelty detection for replicate point patterns with FDR or FWER control 用FDR或FWER控制复制点模式的保形新颖性检测
IF 2.5 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2025-08-05 DOI: 10.1016/j.spasta.2025.100924
Christophe A.N. Biscio , Adrien Mazoyer , Martin V. Vejling
Monte Carlo tests are widely used for computing valid p-values without requiring known distributions of test statistics. When performing multiple Monte Carlo tests, it is essential to maintain control of the type I error. Some techniques for multiplicity control pose requirements on the joint distribution of the p-values, for instance independence, which can be computationally intensive to achieve, as it requires simulating disjoint null samples for each test. We refer to this as naïve multiple Monte Carlo testing. We highlight in this work that multiple Monte Carlo testing is an instance of conformal novelty detection. Leveraging this insight enables a more efficient multiple Monte Carlo testing procedure, avoiding excessive simulations by using a single null sample for all the tests, while still ensuring exact control over the false discovery rate or the family-wise error rate. We call this approach conformal multiple Monte Carlo testing. The performance is investigated in the context of global envelope tests for point pattern data through a simulation study and an application to a sweat gland data set. Results reveal that with a fixed simulation budget, our proposed method yields substantial improvements in power of the testing procedure as compared to the naïve multiple Monte Carlo testing procedure.
蒙特卡罗检验被广泛用于计算有效的p值,而不需要已知的检验统计量分布。在执行多个蒙特卡罗测试时,必须保持对第一类误差的控制。一些多重性控制技术对p值的联合分布提出了要求,例如独立性,这可能需要大量的计算才能实现,因为它需要为每个测试模拟不相交的零样本。我们将此称为naïve多重蒙特卡罗测试。在这项工作中,我们强调多重蒙特卡罗测试是保角新颖性检测的一个实例。利用这种洞察力可以实现更高效的多个蒙特卡罗测试过程,避免通过对所有测试使用单个零样本来进行过度模拟,同时仍然确保对错误发现率或家庭错误率进行精确控制。我们称这种方法为保形多重蒙特卡洛测试。通过模拟研究和对汗腺数据集的应用,在点模式数据的全局包络测试的背景下研究了性能。结果表明,在固定的模拟预算下,与naïve多重蒙特卡罗测试过程相比,我们提出的方法在测试过程的功率方面有了实质性的改进。
{"title":"Conformal novelty detection for replicate point patterns with FDR or FWER control","authors":"Christophe A.N. Biscio ,&nbsp;Adrien Mazoyer ,&nbsp;Martin V. Vejling","doi":"10.1016/j.spasta.2025.100924","DOIUrl":"10.1016/j.spasta.2025.100924","url":null,"abstract":"<div><div>Monte Carlo tests are widely used for computing valid <span><math><mi>p</mi></math></span>-values without requiring known distributions of test statistics. When performing multiple Monte Carlo tests, it is essential to maintain control of the type I error. Some techniques for multiplicity control pose requirements on the joint distribution of the <span><math><mi>p</mi></math></span>-values, for instance independence, which can be computationally intensive to achieve, as it requires simulating disjoint null samples for each test. We refer to this as naïve multiple Monte Carlo testing. We highlight in this work that multiple Monte Carlo testing is an instance of conformal novelty detection. Leveraging this insight enables a more efficient multiple Monte Carlo testing procedure, avoiding excessive simulations by using a single null sample for all the tests, while still ensuring exact control over the false discovery rate or the family-wise error rate. We call this approach conformal multiple Monte Carlo testing. The performance is investigated in the context of global envelope tests for point pattern data through a simulation study and an application to a sweat gland data set. Results reveal that with a fixed simulation budget, our proposed method yields substantial improvements in power of the testing procedure as compared to the naïve multiple Monte Carlo testing procedure.</div></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"69 ","pages":"Article 100924"},"PeriodicalIF":2.5,"publicationDate":"2025-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144860831","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Spatial Statistics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1