A hypothesis test for detecting distance-specific clustering and dispersion in areal data

IF 2.5 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Spatial Statistics Pub Date : 2023-06-01 DOI:10.1016/j.spasta.2023.100757
Stella Self , Anna Overby , Anja Zgodic , David White , Alexander McLain , Caitlin Dyckman
{"title":"A hypothesis test for detecting distance-specific clustering and dispersion in areal data","authors":"Stella Self ,&nbsp;Anna Overby ,&nbsp;Anja Zgodic ,&nbsp;David White ,&nbsp;Alexander McLain ,&nbsp;Caitlin Dyckman","doi":"10.1016/j.spasta.2023.100757","DOIUrl":null,"url":null,"abstract":"<div><p><span>Spatial clustering detection has a variety of applications in diverse fields, including identifying infectious disease outbreaks, pinpointing crime hotspots, and identifying clusters of neurons in brain imaging applications. Ripley’s K-function is a popular method for detecting clustering (or dispersion) in point process data at specific distances. Ripley’s K-function measures the expected number of points within a given distance of any observed point. Clustering can be assessed by comparing the observed value of Ripley’s K-function to the expected value under complete spatial randomness. While performing spatial clustering analysis on point process data is common, applications to areal data commonly arise and need to be accurately assessed. Inspired by Ripley’s K-function, we develop the </span><em>positive area proportion function (PAPF)</em><span><span> and use it to develop a hypothesis testing procedure for the detection of spatial clustering and dispersion at specific distances in areal data. We compare the performance of the proposed PAPF hypothesis test to that of the global Moran’s I </span>statistic, the Getis–Ord general G statistic, and the spatial scan statistic with extensive simulation studies. We then evaluate the real-world performance of our method by using it to detect spatial clustering in land parcels containing conservation easements and US counties with high pediatric overweight/obesity rates.</span></p></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"55 ","pages":"Article 100757"},"PeriodicalIF":2.5000,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10312012/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Spatial Statistics","FirstCategoryId":"100","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2211675323000325","RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"GEOSCIENCES, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

Spatial clustering detection has a variety of applications in diverse fields, including identifying infectious disease outbreaks, pinpointing crime hotspots, and identifying clusters of neurons in brain imaging applications. Ripley’s K-function is a popular method for detecting clustering (or dispersion) in point process data at specific distances. Ripley’s K-function measures the expected number of points within a given distance of any observed point. Clustering can be assessed by comparing the observed value of Ripley’s K-function to the expected value under complete spatial randomness. While performing spatial clustering analysis on point process data is common, applications to areal data commonly arise and need to be accurately assessed. Inspired by Ripley’s K-function, we develop the positive area proportion function (PAPF) and use it to develop a hypothesis testing procedure for the detection of spatial clustering and dispersion at specific distances in areal data. We compare the performance of the proposed PAPF hypothesis test to that of the global Moran’s I statistic, the Getis–Ord general G statistic, and the spatial scan statistic with extensive simulation studies. We then evaluate the real-world performance of our method by using it to detect spatial clustering in land parcels containing conservation easements and US counties with high pediatric overweight/obesity rates.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
用于检测区域数据中特定距离的聚类和离散的假设检验
空间聚类检测在不同领域有多种应用,包括识别传染病爆发、精确定位犯罪热点以及在脑成像应用中识别神经元簇。Ripley的K函数是检测特定距离点过程数据中的聚类(或分散)的常用方法。Ripley的K函数测量任何观测点在给定距离内的预期点数。聚类可以通过将Ripley的K函数的观测值与完全空间随机性下的期望值进行比较来评估。虽然对点过程数据执行空间聚类分析是常见的,但对区域数据的应用通常会出现,并且需要准确评估。受Ripley的K函数的启发,我们开发了正面积比例函数(PAPF),并用它来开发一个假设检验程序,用于检测区域数据中特定距离的空间聚类和分散。我们通过大量的模拟研究,将所提出的PAPF假设检验的性能与全局Moran’s I统计量、Getis–Ord广义G统计量和空间扫描统计量的性能进行了比较。然后,我们通过使用我们的方法来检测包含保护地役权的地块和美国儿童超重/肥胖率高的县的空间聚类,来评估我们的方法在现实世界中的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Spatial Statistics
Spatial Statistics GEOSCIENCES, MULTIDISCIPLINARY-MATHEMATICS, INTERDISCIPLINARY APPLICATIONS
CiteScore
4.00
自引率
21.70%
发文量
89
审稿时长
55 days
期刊介绍: Spatial Statistics publishes articles on the theory and application of spatial and spatio-temporal statistics. It favours manuscripts that present theory generated by new applications, or in which new theory is applied to an important practical case. A purely theoretical study will only rarely be accepted. Pure case studies without methodological development are not acceptable for publication. Spatial statistics concerns the quantitative analysis of spatial and spatio-temporal data, including their statistical dependencies, accuracy and uncertainties. Methodology for spatial statistics is typically found in probability theory, stochastic modelling and mathematical statistics as well as in information science. Spatial statistics is used in mapping, assessing spatial data quality, sampling design optimisation, modelling of dependence structures, and drawing of valid inference from a limited set of spatio-temporal data.
期刊最新文献
Estimating and plotting species-area relationship: Does aggregate distribution of species really matter? A delayed acceptance auxiliary variable MCMC for spatial models with intractable likelihood function Copula-based spatio-temporal modeling of air pollutant data incorporating covariate dependence Voronoi linkage between mismatching voting stations and census tracts in analyzing the 2018 Brazilian presidential election data Determinants of vote transitions by ecological inference within small areas
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1