Computationally efficient localised spatial smoothing of disease rates using anisotropic basis functions and penalised regression fitting

IF 2.1 2区 数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Spatial Statistics Pub Date : 2023-11-29 DOI:10.1016/j.spasta.2023.100796
Duncan Lee
{"title":"Computationally efficient localised spatial smoothing of disease rates using anisotropic basis functions and penalised regression fitting","authors":"Duncan Lee","doi":"10.1016/j.spasta.2023.100796","DOIUrl":null,"url":null,"abstract":"<div><p>The spatial variation in population-level disease rates can be estimated from aggregated disease data relating to <span><math><mi>N</mi></math></span> areal units using Bayesian hierarchical models. Spatial autocorrelation in these data is captured by random effects that are assigned a Conditional autoregressive (CAR) prior, which assumes that neighbouring areal units exhibit similar disease rates. This approach ignores boundaries in the disease rate surface, which are locations where neighbouring units exhibit a step-change in their rates. CAR type models have been extended to account for this localised spatial smoothness, but they are computationally prohibitive for big data sets. Therefore this paper proposes a novel computationally efficient approach for localised spatial smoothing, which is motivated by a new study of mental ill health across <span><math><mrow><mi>N</mi><mo>=</mo><mtext>32,754</mtext></mrow></math></span> Lower Super Output Areas in England. The approach is based on a computationally efficient ridge regression framework, where the spatial trend in disease rates is modelled by a set of anisotropic spatial basis functions that can exhibit either smooth or step change transitions in values between neighbouring areal units. The efficacy of this approach is evidenced by simulation, before using it to identify the highest rate areas and the magnitude of the health inequalities in four measures of mental ill health, namely antidepressant usage, benefit claims, depression diagnoses and hospitalisations.</p></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":null,"pages":null},"PeriodicalIF":2.1000,"publicationDate":"2023-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2211675323000714/pdfft?md5=e39e73b4b9ffa4f14ba8a5e003868f43&pid=1-s2.0-S2211675323000714-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Spatial Statistics","FirstCategoryId":"100","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2211675323000714","RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"GEOSCIENCES, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

The spatial variation in population-level disease rates can be estimated from aggregated disease data relating to N areal units using Bayesian hierarchical models. Spatial autocorrelation in these data is captured by random effects that are assigned a Conditional autoregressive (CAR) prior, which assumes that neighbouring areal units exhibit similar disease rates. This approach ignores boundaries in the disease rate surface, which are locations where neighbouring units exhibit a step-change in their rates. CAR type models have been extended to account for this localised spatial smoothness, but they are computationally prohibitive for big data sets. Therefore this paper proposes a novel computationally efficient approach for localised spatial smoothing, which is motivated by a new study of mental ill health across N=32,754 Lower Super Output Areas in England. The approach is based on a computationally efficient ridge regression framework, where the spatial trend in disease rates is modelled by a set of anisotropic spatial basis functions that can exhibit either smooth or step change transitions in values between neighbouring areal units. The efficacy of this approach is evidenced by simulation, before using it to identify the highest rate areas and the magnitude of the health inequalities in four measures of mental ill health, namely antidepressant usage, benefit claims, depression diagnoses and hospitalisations.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用各向异性基函数和惩罚回归拟合,计算效率高的疾病率局部空间平滑
使用贝叶斯层次模型,可以从与N个区域单位相关的汇总疾病数据中估计人口水平疾病发病率的空间变化。这些数据中的空间自相关性是通过分配条件自回归(CAR)先验的随机效应捕获的,它假设邻近的区域单位表现出相似的发病率。这种方法忽略了发病率表面的边界,即相邻单位在其发病率上表现出阶梯变化的位置。CAR类型的模型已经扩展到考虑这种局部空间平滑,但它们在计算上对大数据集是禁止的。因此,本文提出了一种新的计算高效的局部空间平滑方法,这是由一项关于英国N=32,754个低超级输出区域的精神疾病健康的新研究激发的。该方法以计算效率高的脊回归框架为基础,其中发病率的空间趋势由一组各向异性空间基函数模拟,这些函数可以在相邻面积单位之间表现出平滑或阶跃变化的值转换。这种方法的有效性通过模拟得到证明,然后用它来确定精神疾病的四种衡量标准(即抗抑郁药的使用、福利申请、抑郁症诊断和住院治疗)中发病率最高的地区和健康不平等的程度。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Spatial Statistics
Spatial Statistics GEOSCIENCES, MULTIDISCIPLINARY-MATHEMATICS, INTERDISCIPLINARY APPLICATIONS
CiteScore
4.00
自引率
21.70%
发文量
89
审稿时长
55 days
期刊介绍: Spatial Statistics publishes articles on the theory and application of spatial and spatio-temporal statistics. It favours manuscripts that present theory generated by new applications, or in which new theory is applied to an important practical case. A purely theoretical study will only rarely be accepted. Pure case studies without methodological development are not acceptable for publication. Spatial statistics concerns the quantitative analysis of spatial and spatio-temporal data, including their statistical dependencies, accuracy and uncertainties. Methodology for spatial statistics is typically found in probability theory, stochastic modelling and mathematical statistics as well as in information science. Spatial statistics is used in mapping, assessing spatial data quality, sampling design optimisation, modelling of dependence structures, and drawing of valid inference from a limited set of spatio-temporal data.
期刊最新文献
Uncovering hidden alignments in two-dimensional point fields Spatio-temporal data fusion for the analysis of in situ and remote sensing data using the INLA-SPDE approach Exploiting nearest-neighbour maps for estimating the variance of sample mean in equal-probability systematic sampling of spatial populations Variable selection of nonparametric spatial autoregressive models via deep learning Estimation and inference of multi-effect generalized geographically and temporally weighted regression models
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1