首页 > 最新文献

Spatial Statistics最新文献

英文 中文
Measuring unit relevance and stability in hierarchical spatio-temporal clustering 层次时空聚类中度量单元相关性和稳定性
IF 2.1 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2025-01-13 DOI: 10.1016/j.spasta.2025.100880
Roy Cerqueti , Raffaele Mattera
Understanding the significance of individual data points within clustering structures is critical to effective data analysis. Traditional stability methods, while valuable, often overlook the nuanced impact of individual units, particularly in spatial contexts. In this paper, we explore the concept of unit relevance in clustering analysis, emphasizing its importance in capturing the spatio-temporal nature of the clustering problem. We propose a simple measure of unit relevance, the Unit Relevance Index (URI), and define an overall measure of clustering stability based on the aggregation of computed URIs. Considering two experiments on real datasets with geo-referenced time series, we find that the use of spatial constraints in the clustering task yields more stable results. Therefore, the inclusion of the spatial dimension can be seen as a way to stabilize the clustering.
理解聚类结构中单个数据点的重要性对于有效的数据分析至关重要。传统的稳定性方法虽然有价值,但往往忽略了单个单元的细微影响,特别是在空间环境中。在本文中,我们探讨了聚类分析中单位相关性的概念,强调了它在捕捉聚类问题的时空本质方面的重要性。我们提出了一种简单的单元相关性度量,即单元相关性索引(unit relevance Index, URI),并定义了基于计算的URI聚合的聚类稳定性的总体度量。通过对具有地理参考时间序列的真实数据集的两个实验,我们发现在聚类任务中使用空间约束可以获得更稳定的结果。因此,空间维度的包含可以看作是稳定聚类的一种方式。
{"title":"Measuring unit relevance and stability in hierarchical spatio-temporal clustering","authors":"Roy Cerqueti ,&nbsp;Raffaele Mattera","doi":"10.1016/j.spasta.2025.100880","DOIUrl":"10.1016/j.spasta.2025.100880","url":null,"abstract":"<div><div>Understanding the significance of individual data points within clustering structures is critical to effective data analysis. Traditional stability methods, while valuable, often overlook the nuanced impact of individual units, particularly in spatial contexts. In this paper, we explore the concept of unit relevance in clustering analysis, emphasizing its importance in capturing the spatio-temporal nature of the clustering problem. We propose a simple measure of unit relevance, the Unit Relevance Index (URI), and define an overall measure of clustering stability based on the aggregation of computed URIs. Considering two experiments on real datasets with geo-referenced time series, we find that the use of spatial constraints in the clustering task yields more stable results. Therefore, the inclusion of the spatial dimension can be seen as a way to stabilize the clustering.</div></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"66 ","pages":"Article 100880"},"PeriodicalIF":2.1,"publicationDate":"2025-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143151695","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Multifractal Gaussian Mixture Model for unsupervised segmentation of complex data sets 复杂数据集无监督分割的多重分形高斯混合模型
IF 2.1 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2025-01-10 DOI: 10.1016/j.spasta.2025.100879
Garry Jacyna, Damon Frezza, David M. Slater, James R. Thompson
We derive the Multifractal Gaussian Mixture Model algorithm for decomposing data sets into different multifractal regimes building on the empirical observation that simulated multifractals have log wavelet leaders that are well-approximated by a Gaussian distribution. We test the algorithm on composite images constructed from multifractal random walks with known multifractal spectra. The algorithm is able to correctly segment the pixels corresponding to different multifractals when the constituent multifractals are most distinct from each other. It also estimates the multifractal parameters with minimal error when compared to the theoretical spectra used to generate the original multifractal random walks. We also apply the algorithm to satellite images with varying degrees of cloud cover taken from the LandSat 8 Cloud Validation Data set. The algorithm is able to segment the pixels into their corresponding cloud mask category, and it detects different texture and features in the images that are unrelated to clouds. The results indicate that the Multifractal Gaussian Mixture Model algorithm is well-suited for semi-automated unsupervised data segmentation when the data being analyzed exhibit complex, scale-invariant characteristics.
我们推导了多重分形高斯混合模型算法,用于将数据集分解为不同的多重分形制度,建立在经验观察的基础上,模拟多重分形具有由高斯分布很好地近似的对数小波前导。我们在已知多重分形谱的多重分形随机漫步合成图像上对算法进行了测试。该算法能够在组成的多重分形之间差异最大时,正确分割出不同多重分形对应的像素。与用于生成原始多重分形随机漫步的理论谱相比,它还以最小的误差估计多重分形参数。我们还将该算法应用于从LandSat 8云验证数据集中获取的不同程度云覆盖的卫星图像。该算法能够将像素分割到相应的云掩模类别中,并检测图像中与云无关的不同纹理和特征。结果表明,多重分形高斯混合模型算法非常适合于被分析数据具有复杂、尺度不变特征的半自动无监督数据分割。
{"title":"The Multifractal Gaussian Mixture Model for unsupervised segmentation of complex data sets","authors":"Garry Jacyna,&nbsp;Damon Frezza,&nbsp;David M. Slater,&nbsp;James R. Thompson","doi":"10.1016/j.spasta.2025.100879","DOIUrl":"10.1016/j.spasta.2025.100879","url":null,"abstract":"<div><div>We derive the Multifractal Gaussian Mixture Model algorithm for decomposing data sets into different multifractal regimes building on the empirical observation that simulated multifractals have log wavelet leaders that are well-approximated by a Gaussian distribution. We test the algorithm on composite images constructed from multifractal random walks with known multifractal spectra. The algorithm is able to correctly segment the pixels corresponding to different multifractals when the constituent multifractals are most distinct from each other. It also estimates the multifractal parameters with minimal error when compared to the theoretical spectra used to generate the original multifractal random walks. We also apply the algorithm to satellite images with varying degrees of cloud cover taken from the LandSat 8 Cloud Validation Data set. The algorithm is able to segment the pixels into their corresponding cloud mask category, and it detects different texture and features in the images that are unrelated to clouds. The results indicate that the Multifractal Gaussian Mixture Model algorithm is well-suited for semi-automated unsupervised data segmentation when the data being analyzed exhibit complex, scale-invariant characteristics.</div></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"66 ","pages":"Article 100879"},"PeriodicalIF":2.1,"publicationDate":"2025-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143151694","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bias-corrected instrumental variable estimation for spatial autoregressive models with measurement errors 带有测量误差的空间自回归模型的偏差校正工具变量估计
IF 2.1 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2024-12-27 DOI: 10.1016/j.spasta.2024.100878
Guowang Luo , Mixia Wu
In this paper, bias-corrected instrumental variable estimation methods, specifically the bias-corrected two-stage least square (2SLS) estimation and the bias-corrected asymptotically best 2SLS estimation, are proposed for spatial autoregressive (SAR) models with covariate measurement errors, utilizing available information regarding the variance of the measurement error. Under mild assumptions, the consistency and asymptotic normality of the proposed estimators are derived. Simulation studies further reveal that the proposed methods exhibit robustness regardless of the presence of spatial dependence in the model. Additionally, a real data example is utilized to illustrate the developed methods.
本文利用有关测量误差方差的可用信息,针对具有协变量测量误差的空间自回归(SAR)模型,提出了偏差校正的工具变量估计方法,特别是偏差校正的两阶段最小二乘(2SLS)估计和偏差校正的渐近最佳2SLS估计。在温和的假设条件下,得到了所提估计量的相合性和渐近正态性。仿真研究进一步表明,无论模型中是否存在空间依赖性,所提出的方法都具有鲁棒性。此外,还利用一个实际的数据实例来说明所开发的方法。
{"title":"Bias-corrected instrumental variable estimation for spatial autoregressive models with measurement errors","authors":"Guowang Luo ,&nbsp;Mixia Wu","doi":"10.1016/j.spasta.2024.100878","DOIUrl":"10.1016/j.spasta.2024.100878","url":null,"abstract":"<div><div>In this paper, bias-corrected instrumental variable estimation methods, specifically the bias-corrected two-stage least square (2SLS) estimation and the bias-corrected asymptotically best 2SLS estimation, are proposed for spatial autoregressive (SAR) models with covariate measurement errors, utilizing available information regarding the variance of the measurement error. Under mild assumptions, the consistency and asymptotic normality of the proposed estimators are derived. Simulation studies further reveal that the proposed methods exhibit robustness regardless of the presence of spatial dependence in the model. Additionally, a real data example is utilized to illustrate the developed methods.</div></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"65 ","pages":"Article 100878"},"PeriodicalIF":2.1,"publicationDate":"2024-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143163870","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An optimised rabies vaccination schedule for rural settlements 优化农村居民点狂犬病疫苗接种计划
IF 2.1 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2024-12-17 DOI: 10.1016/j.spasta.2024.100877
Rian Botes , Inger Fabris-Rotelli , Kabelo Mahloromela , Ding-Geng Chen
The timely and efficient administration of rabies vaccinations to animals in rural villages is necessary to attain a state of herd immunity. Efficient sampling of households in a rural village is of utmost importance in reaching the most animals for vaccination, with the least effort, and in the lowest time. This research seeks to both optimise the spatial sampling scheme used to sample households, as well as the route travelled by persons performing door-to-door vaccinations. The walking time in minutes is regarded as the cost of a vaccination scheme and is minimised in this paper. The distribution of houses in a rural village constitutes a spatial point pattern in R2, and as such, spatial point pattern analysis techniques as well as some spatial sampling schemes are applied throughout this research. The penultimate aim of this work is to provide policy makers with additional tools to combat rabies, a disease which remains endemic to some countries in West and Central Africa, and Asia.
及时和有效地对农村动物进行狂犬病疫苗接种是实现群体免疫状态所必需的。对农村家庭进行有效抽样对于以最少的努力和最短的时间为最多的动物接种疫苗至关重要。本研究旨在优化用于对家庭进行抽样的空间抽样方案,以及进行挨家挨户接种疫苗的人员所经过的路线。以分钟为单位的步行时间被视为疫苗接种方案的成本,并在本文中最小化。乡村房屋的分布构成了R2中的空间点格局,因此在本研究中采用了空间点格局分析技术和一些空间采样方案。这项工作的第二个目标是为决策者提供防治狂犬病的额外工具,这种疾病在西非和中非以及亚洲的一些国家仍然流行。
{"title":"An optimised rabies vaccination schedule for rural settlements","authors":"Rian Botes ,&nbsp;Inger Fabris-Rotelli ,&nbsp;Kabelo Mahloromela ,&nbsp;Ding-Geng Chen","doi":"10.1016/j.spasta.2024.100877","DOIUrl":"10.1016/j.spasta.2024.100877","url":null,"abstract":"<div><div>The timely and efficient administration of rabies vaccinations to animals in rural villages is necessary to attain a state of herd immunity. Efficient sampling of households in a rural village is of utmost importance in reaching the most animals for vaccination, with the least effort, and in the lowest time. This research seeks to both optimise the spatial sampling scheme used to sample households, as well as the route travelled by persons performing door-to-door vaccinations. The walking time in minutes is regarded as the cost of a vaccination scheme and is minimised in this paper. The distribution of houses in a rural village constitutes a spatial point pattern in <span><math><msup><mrow><mi>R</mi></mrow><mrow><mn>2</mn></mrow></msup></math></span>, and as such, spatial point pattern analysis techniques as well as some spatial sampling schemes are applied throughout this research. The penultimate aim of this work is to provide policy makers with additional tools to combat rabies, a disease which remains endemic to some countries in West and Central Africa, and Asia.</div></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"65 ","pages":"Article 100877"},"PeriodicalIF":2.1,"publicationDate":"2024-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143163871","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Softening the criteria for determining inner and outer predicted exceedance sets 软化确定内部和外部预测超出集的标准
IF 2.1 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2024-12-17 DOI: 10.1016/j.spasta.2024.100876
Thomas Suesse , Alexander Brenning
Determining exceedance regions, such as regions where a specified threshold of a pollutant in the environment is exceeded, is of critical importance for decision-making in environmental management and public health. Inner and outer predicted exceedance sets express the uncertainties in predicted exceedance regions as they sandwich the unknown true exceedance region with high confidence, analogous to confidence regions for point estimates. It is therefore desirable to reduce the uncertainty about the locations of the true exceedance region, resulting in a narrow band between the inner and outer sets. However, in practice this is not often the case mainly due to the strict statistical subset criteria being set, which are equivalent to a multiple testing problem controlling the familywise error rate (FWER). It is well known that the FWER leads to fewer rejections compared to other criteria; in the context of exceedance regions, this would correspond to an extremely small, conservative inner predicted exceedance region. In this paper, we loosen the criteria slightly to obtain a narrower band between inner and outer sets, allowing for more nuanced uncertainty assessments. A new algorithm is proposed to construct these exceedance sets, and the methods are compared in a simulation study to assess whether they indeed control the new criteria. The methods are illustrated on two data sets: average rainfall in the state of Paraná, Brazil, and nitrogen dioxide air pollution in Germany in the year 2018.
确定超标区域,例如环境中污染物超过规定阈值的区域,对环境管理和公共卫生决策至关重要。内部和外部预测超越集表示预测超越区域的不确定性,因为它们以高置信度夹在未知的真实超越区域中,类似于点估计的置信度区域。因此,需要减少真实超出区域位置的不确定性,从而使内外集之间的频带变窄。然而,在实践中,这种情况并不常见,主要是由于设置了严格的统计子集标准,这相当于控制家庭错误率(FWER)的多个测试问题。众所周知,与其他标准相比,FWER会导致更少的拒绝;在超出区域的情况下,这将对应于一个极小的、保守的内部预测超出区域。在本文中,我们稍微放宽了标准,以在内外集之间获得更窄的频带,从而允许更细微的不确定性评估。提出了一种新的算法来构造这些超越集,并在仿真研究中比较了各种方法,以评估它们是否确实控制了新的准则。这些方法在两个数据集上得到了说明:巴西帕拉纳州的平均降雨量和德国2018年的二氧化氮空气污染。
{"title":"Softening the criteria for determining inner and outer predicted exceedance sets","authors":"Thomas Suesse ,&nbsp;Alexander Brenning","doi":"10.1016/j.spasta.2024.100876","DOIUrl":"10.1016/j.spasta.2024.100876","url":null,"abstract":"<div><div>Determining exceedance regions, such as regions where a specified threshold of a pollutant in the environment is exceeded, is of critical importance for decision-making in environmental management and public health. Inner and outer predicted exceedance sets express the uncertainties in predicted exceedance regions as they sandwich the unknown true exceedance region with high confidence, analogous to confidence regions for point estimates. It is therefore desirable to reduce the uncertainty about the locations of the true exceedance region, resulting in a narrow band between the inner and outer sets. However, in practice this is not often the case mainly due to the strict statistical subset criteria being set, which are equivalent to a multiple testing problem controlling the familywise error rate (FWER). It is well known that the FWER leads to fewer rejections compared to other criteria; in the context of exceedance regions, this would correspond to an extremely small, conservative inner predicted exceedance region. In this paper, we loosen the criteria slightly to obtain a narrower band between inner and outer sets, allowing for more nuanced uncertainty assessments. A new algorithm is proposed to construct these exceedance sets, and the methods are compared in a simulation study to assess whether they indeed control the new criteria. The methods are illustrated on two data sets: average rainfall in the state of Paraná, Brazil, and nitrogen dioxide air pollution in Germany in the year 2018.</div></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"65 ","pages":"Article 100876"},"PeriodicalIF":2.1,"publicationDate":"2024-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143163872","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fixed effects spatial panel interval-valued autoregressive models and applications 固定效应空间面板区间值自回归模型及其应用
IF 2.1 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2024-11-26 DOI: 10.1016/j.spasta.2024.100875
Qingqing Li, Ruizhuo Zheng, Aibing Ji, Hongyan Ma
Interval-valued data has garnered attention across various applications, leading to increased research into spatial interval-valued data models. The integration of uncertainty variables into spatial panel data models has become crucial. This paper presents a spatial panel interval-valued autoregressive model with fixed effects, utilizing the parametric method. The quasi-maximum likelihood method is employed for parameter estimation, and its consistency and asymptotic properties are discussed. Additionally, three special cases and two degenerated models derived from our framework are presented, elucidating their significance in spatial statistics. Monte Carlo simulations are used to validate the fitting and forecasting performance of our proposed models across diverse scenarios. Furthermore, the models are implemented in real-world air quality and house price datasets for forecasting purposes. Through rigorous experimentation, the superior performance of the models is demonstrated. These results highlight the practical utility of the spatial panel interval-valued autoregressive models in addressing spatial data challenges.
区间值数据在各种应用中引起了人们的关注,导致对空间区间值数据模型的研究增加。将不确定性变量整合到空间面板数据模型中变得至关重要。本文利用参数化方法建立了具有固定效应的空间面板区间值自回归模型。采用拟极大似然方法进行参数估计,讨论了拟极大似然方法的相合性和渐近性。此外,本文还提出了三个特例和两个退化模型,阐明了它们在空间统计中的意义。蒙特卡罗模拟用于验证我们提出的模型在不同情景下的拟合和预测性能。此外,这些模型在现实世界的空气质量和房价数据集中实施,用于预测目的。通过严格的实验,证明了模型的优越性能。这些结果突出了空间面板区间值自回归模型在解决空间数据挑战方面的实际效用。
{"title":"Fixed effects spatial panel interval-valued autoregressive models and applications","authors":"Qingqing Li,&nbsp;Ruizhuo Zheng,&nbsp;Aibing Ji,&nbsp;Hongyan Ma","doi":"10.1016/j.spasta.2024.100875","DOIUrl":"10.1016/j.spasta.2024.100875","url":null,"abstract":"<div><div>Interval-valued data has garnered attention across various applications, leading to increased research into spatial interval-valued data models. The integration of uncertainty variables into spatial panel data models has become crucial. This paper presents a spatial panel interval-valued autoregressive model with fixed effects, utilizing the parametric method. The quasi-maximum likelihood method is employed for parameter estimation, and its consistency and asymptotic properties are discussed. Additionally, three special cases and two degenerated models derived from our framework are presented, elucidating their significance in spatial statistics. Monte Carlo simulations are used to validate the fitting and forecasting performance of our proposed models across diverse scenarios. Furthermore, the models are implemented in real-world air quality and house price datasets for forecasting purposes. Through rigorous experimentation, the superior performance of the models is demonstrated. These results highlight the practical utility of the spatial panel interval-valued autoregressive models in addressing spatial data challenges.</div></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"65 ","pages":"Article 100875"},"PeriodicalIF":2.1,"publicationDate":"2024-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142747091","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fuzzy clustering of mixed data with spatial regularization 利用空间正则化对混合数据进行模糊聚类
IF 2.1 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2024-11-23 DOI: 10.1016/j.spasta.2024.100874
Pierpaolo D’Urso , Livia De Giovanni , Lorenzo Federico , Vincenzina Vitale
A fuzzy clustering model for data with mixed features and spatial constraints is proposed. The clustering model allows different types of variables, or attributes, to be taken into account. This result is achieved by combining the dissimilarity measures for each attribute employing a weighting scheme, to obtain a distance measure for multiple attributes. The weights are objectively computed during the optimization process. The weights reflect the relevance of each attribute type in the clustering results. A spatial term is taken into account, considering a wide definition of contiguity, either physical contiguity or the adjacency matrix in a network. Simulation studies and two empirical applications, including both physical and abstract definitions of contiguity are presented that show the effectiveness of the proposed clustering model.
本文提出了一种针对具有混合特征和空间限制的数据的模糊聚类模型。该聚类模型允许考虑不同类型的变量或属性。这一结果是通过采用加权方案将每个属性的不相似度量结合起来,从而获得多个属性的距离度量。权重是在优化过程中客观计算出来的。权重反映了每种属性类型在聚类结果中的相关性。考虑到连续性的广泛定义,即物理连续性或网络中的邻接矩阵,空间项也被考虑在内。模拟研究和两个经验应用(包括物理和抽象定义的毗连性)显示了建议聚类模型的有效性。
{"title":"Fuzzy clustering of mixed data with spatial regularization","authors":"Pierpaolo D’Urso ,&nbsp;Livia De Giovanni ,&nbsp;Lorenzo Federico ,&nbsp;Vincenzina Vitale","doi":"10.1016/j.spasta.2024.100874","DOIUrl":"10.1016/j.spasta.2024.100874","url":null,"abstract":"<div><div>A fuzzy clustering model for data with mixed features and spatial constraints is proposed. The clustering model allows different types of variables, or attributes, to be taken into account. This result is achieved by combining the dissimilarity measures for each attribute employing a weighting scheme, to obtain a distance measure for multiple attributes. The weights are objectively computed during the optimization process. The weights reflect the relevance of each attribute type in the clustering results. A spatial term is taken into account, considering a wide definition of contiguity, either physical contiguity or the adjacency matrix in a network. Simulation studies and two empirical applications, including both physical and abstract definitions of contiguity are presented that show the effectiveness of the proposed clustering model.</div></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"65 ","pages":"Article 100874"},"PeriodicalIF":2.1,"publicationDate":"2024-11-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142719822","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fast mixture spatial regression: A mixture in the geographical and feature space applied to predict porosity in the post-salt 快速混合空间回归:将地理空间和特征空间的混合应用于预测盐湖开采后的孔隙度
IF 2.1 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2024-11-22 DOI: 10.1016/j.spasta.2024.100873
Lucas Michelin , Lucas C. Godoy , Heitor S. Ramos , Marcos O. Prates
Extracting geological resources like hydrocarbon fluids requires significant investments and precise decision-making processes. To optimize the efficiency of the extraction process, researchers and industry experts have explored innovative methodologies, including the prediction of optimal drilling locations. Porosity, a key attribute of reservoir rocks, plays a crucial role in determining fluid storage capacity. Geostatistical techniques, such as kriging, have been widely used for estimating porosity by capturing spatial dependence in sampled point-referenced data. However, the reliance on geographical coordinates for determining spatial distances may present challenges in scenarios with small and widely separated samples. In this paper, we develop a mixture model that combines the covariance generated by geographical space and the covariance generated in an appropriate feature space to enhance estimation accuracy. Developed within the Bayesian framework, our approach utilizes flexible Markov Chain Monte Carlo (MCMC) methods and leverages the Nearest-Neighbor Gaussian Process (NNGP) strategy for scalability. We present a controlled empirical comparison, considering various data generation configurations, to assess the performance of the mixture model in comparison to the marginal models. Applying our models to a three-dimensional reservoir demonstrates its practical applicability and scalability. This research presents a novel approach for improved porosity estimation by integrating spatial and covariate information, offering the potential for optimizing reservoir exploration and extraction activities.
开采碳氢化合物流体等地质资源需要大量投资和精确的决策过程。为了优化开采过程的效率,研究人员和行业专家探索了创新方法,包括预测最佳钻井位置。孔隙度是储层岩石的一个关键属性,在确定流体存储能力方面起着至关重要的作用。地质统计技术,如克里格法,通过捕捉采样点参照数据中的空间依赖性,已被广泛用于估算孔隙度。然而,依赖地理坐标来确定空间距离可能会给样本量小且相距甚远的情况带来挑战。在本文中,我们开发了一种混合模型,将地理空间产生的协方差与适当特征空间产生的协方差结合起来,以提高估算精度。我们的方法是在贝叶斯框架内开发的,采用了灵活的马尔可夫链蒙特卡罗(MCMC)方法,并利用了近邻高斯过程(NNGP)策略来提高可扩展性。考虑到各种数据生成配置,我们进行了有控制的实证比较,以评估混合模型与边际模型的性能比较。将我们的模型应用于三维储层,证明了其实际适用性和可扩展性。这项研究提出了一种通过整合空间信息和协变量信息来改进孔隙度估算的新方法,为优化储层勘探和开采活动提供了可能。
{"title":"Fast mixture spatial regression: A mixture in the geographical and feature space applied to predict porosity in the post-salt","authors":"Lucas Michelin ,&nbsp;Lucas C. Godoy ,&nbsp;Heitor S. Ramos ,&nbsp;Marcos O. Prates","doi":"10.1016/j.spasta.2024.100873","DOIUrl":"10.1016/j.spasta.2024.100873","url":null,"abstract":"<div><div>Extracting geological resources like hydrocarbon fluids requires significant investments and precise decision-making processes. To optimize the efficiency of the extraction process, researchers and industry experts have explored innovative methodologies, including the prediction of optimal drilling locations. Porosity, a key attribute of reservoir rocks, plays a crucial role in determining fluid storage capacity. Geostatistical techniques, such as kriging, have been widely used for estimating porosity by capturing spatial dependence in sampled point-referenced data. However, the reliance on geographical coordinates for determining spatial distances may present challenges in scenarios with small and widely separated samples. In this paper, we develop a mixture model that combines the covariance generated by geographical space and the covariance generated in an appropriate feature space to enhance estimation accuracy. Developed within the Bayesian framework, our approach utilizes flexible Markov Chain Monte Carlo (MCMC) methods and leverages the Nearest-Neighbor Gaussian Process (NNGP) strategy for scalability. We present a controlled empirical comparison, considering various data generation configurations, to assess the performance of the mixture model in comparison to the marginal models. Applying our models to a three-dimensional reservoir demonstrates its practical applicability and scalability. This research presents a novel approach for improved porosity estimation by integrating spatial and covariate information, offering the potential for optimizing reservoir exploration and extraction activities.</div></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"65 ","pages":"Article 100873"},"PeriodicalIF":2.1,"publicationDate":"2024-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142719821","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Pixel isotropy test based on directional perimeters 基于方向周长的像素各向同性测试
IF 2.1 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2024-11-20 DOI: 10.1016/j.spasta.2024.100869
Mariem Abaach , Hermine Biermé , Elena Di Bernardino , Anne Estrade
In this paper we consider the so-called directional perimeters of a thresholded gray-level image. These geometrical quantities are built by considering separately the horizontal and vertical contributions of the pixel. We explicitly compute the first two moments of the directional perimeter under the hypothesis of an underlying discrete Gaussian stationary random field. We establish a central limit theorem (CLT), as the number of pixels goes to infinity, for the joint directional perimeters at various levels under a weak summability condition of the covariance function. By using the CLT previously established, we construct a consistent pixel isotropy test, based on the ratio of the directional perimeters. Our theoretical study is completed by extensive numerical illustrations based on simulated data. Finally, we apply our method to detect pixel anisotropy in calcaneus X-ray images.
在本文中,我们考虑的是阈值灰度图像的所谓方向周长。这些几何量是通过分别考虑像素的水平和垂直贡献而建立的。在底层离散高斯静态随机场的假设下,我们明确计算了方向周长的前两个矩。在协方差函数的弱求和条件下,当像素数达到无穷大时,我们建立了各层次联合方向周长的中心极限定理(CLT)。利用之前建立的 CLT,我们构建了一个基于方向周长比的一致像素各向同性检验。基于模拟数据的大量数值说明完成了我们的理论研究。最后,我们将我们的方法应用于检测小腿X光图像中的像素各向异性。
{"title":"Pixel isotropy test based on directional perimeters","authors":"Mariem Abaach ,&nbsp;Hermine Biermé ,&nbsp;Elena Di Bernardino ,&nbsp;Anne Estrade","doi":"10.1016/j.spasta.2024.100869","DOIUrl":"10.1016/j.spasta.2024.100869","url":null,"abstract":"<div><div>In this paper we consider the so-called directional perimeters of a thresholded gray-level image. These geometrical quantities are built by considering separately the horizontal and vertical contributions of the pixel. We explicitly compute the first two moments of the directional perimeter under the hypothesis of an underlying discrete Gaussian stationary random field. We establish a central limit theorem (CLT), as the number of pixels goes to infinity, for the joint directional perimeters at various levels under a weak summability condition of the covariance function. By using the CLT previously established, we construct a consistent pixel isotropy test, based on the ratio of the directional perimeters. Our theoretical study is completed by extensive numerical illustrations based on simulated data. Finally, we apply our method to detect pixel anisotropy in calcaneus X-ray images.</div></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"65 ","pages":"Article 100869"},"PeriodicalIF":2.1,"publicationDate":"2024-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142707295","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Simulation of conditional non-Gaussian random fields with directional asymmetry 模拟具有方向不对称性的条件非高斯随机场
IF 2.1 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Pub Date : 2024-11-17 DOI: 10.1016/j.spasta.2024.100872
Sebastian Hörning , András Bárdossy
Observed environmental are usually the results of physical, chemical, or biological processes. These processes often introduce asymmetries which should be considered when analysing and modelling the observed variables. In a geostatistical context, there are two main types of asymmetry. The first is rank-asymmetry, i.e., low and high values exhibit different spatial dependence structures. The second is order-asymmetry, i.e., the spatial dependence structure is distinguishable in different directions. Both asymmetries, if significant, indicate that the corresponding random field has a non-Gaussian dependence structure. These asymmetries are not part of the classical geostatistical workflow. Taking asymmetry into account however is likely to improve the estimation and the uncertainty assessment at unobserved locations. In this contribution a stochastic model which can be used to simulate asymmetrical random fields with any of the asymmetries or with their combination is presented. Synthetically simulated flow fields and the well known Walker lake dataset are used to demonstrate the methodology.
观测到的环境通常是物理、化学或生物过程的结果。这些过程通常会带来不对称现象,在分析和模拟观测变量时应加以考虑。在地统计学中,不对称主要有两种类型。第一种是等级不对称,即低值和高值表现出不同的空间依赖结构。第二种是阶次不对称,即空间依赖结构在不同方向上有区别。如果这两种不对称现象显著,则表明相应的随机场具有非高斯依赖结构。这些非对称性不属于经典的地质统计工作流程。然而,将非对称性考虑在内很可能会改进未观测地点的估算和不确定性评估。本文介绍了一个随机模型,该模型可用于模拟任何一种不对称或其组合的不对称随机场。合成模拟的流场和众所周知的沃克湖数据集用于演示该方法。
{"title":"Simulation of conditional non-Gaussian random fields with directional asymmetry","authors":"Sebastian Hörning ,&nbsp;András Bárdossy","doi":"10.1016/j.spasta.2024.100872","DOIUrl":"10.1016/j.spasta.2024.100872","url":null,"abstract":"<div><div>Observed environmental are usually the results of physical, chemical, or biological processes. These processes often introduce asymmetries which should be considered when analysing and modelling the observed variables. In a geostatistical context, there are two main types of asymmetry. The first is rank-asymmetry, i.e., low and high values exhibit different spatial dependence structures. The second is order-asymmetry, i.e., the spatial dependence structure is distinguishable in different directions. Both asymmetries, if significant, indicate that the corresponding random field has a non-Gaussian dependence structure. These asymmetries are not part of the classical geostatistical workflow. Taking asymmetry into account however is likely to improve the estimation and the uncertainty assessment at unobserved locations. In this contribution a stochastic model which can be used to simulate asymmetrical random fields with any of the asymmetries or with their combination is presented. Synthetically simulated flow fields and the well known Walker lake dataset are used to demonstrate the methodology.</div></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"65 ","pages":"Article 100872"},"PeriodicalIF":2.1,"publicationDate":"2024-11-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142707342","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Spatial Statistics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1