Computationally efficient localised spatial smoothing of disease rates using anisotropic basis functions and penalised regression fitting

IF 2.1 2区数学 Q3 GEOSCIENCES, MULTIDISCIPLINARY Spatial Statistics Pub Date : 2023-11-29 DOI:10.1016/j.spasta.2023.100796

Duncan Lee

{"title":"Computationally efficient localised spatial smoothing of disease rates using anisotropic basis functions and penalised regression fitting","authors":"Duncan Lee","doi":"10.1016/j.spasta.2023.100796","DOIUrl":null,"url":null,"abstract":"<div><p>The spatial variation in population-level disease rates can be estimated from aggregated disease data relating to <span><math><mi>N</mi></math></span> areal units using Bayesian hierarchical models. Spatial autocorrelation in these data is captured by random effects that are assigned a Conditional autoregressive (CAR) prior, which assumes that neighbouring areal units exhibit similar disease rates. This approach ignores boundaries in the disease rate surface, which are locations where neighbouring units exhibit a step-change in their rates. CAR type models have been extended to account for this localised spatial smoothness, but they are computationally prohibitive for big data sets. Therefore this paper proposes a novel computationally efficient approach for localised spatial smoothing, which is motivated by a new study of mental ill health across <span><math><mrow><mi>N</mi><mo>=</mo><mtext>32,754</mtext></mrow></math></span> Lower Super Output Areas in England. The approach is based on a computationally efficient ridge regression framework, where the spatial trend in disease rates is modelled by a set of anisotropic spatial basis functions that can exhibit either smooth or step change transitions in values between neighbouring areal units. The efficacy of this approach is evidenced by simulation, before using it to identify the highest rate areas and the magnitude of the health inequalities in four measures of mental ill health, namely antidepressant usage, benefit claims, depression diagnoses and hospitalisations.</p></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":"59 ","pages":"Article 100796"},"PeriodicalIF":2.1000,"publicationDate":"2023-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2211675323000714/pdfft?md5=e39e73b4b9ffa4f14ba8a5e003868f43&pid=1-s2.0-S2211675323000714-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Spatial Statistics","FirstCategoryId":"100","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2211675323000714","RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"GEOSCIENCES, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 0

Abstract

The spatial variation in population-level disease rates can be estimated from aggregated disease data relating to $N$ areal units using Bayesian hierarchical models. Spatial autocorrelation in these data is captured by random effects that are assigned a Conditional autoregressive (CAR) prior, which assumes that neighbouring areal units exhibit similar disease rates. This approach ignores boundaries in the disease rate surface, which are locations where neighbouring units exhibit a step-change in their rates. CAR type models have been extended to account for this localised spatial smoothness, but they are computationally prohibitive for big data sets. Therefore this paper proposes a novel computationally efficient approach for localised spatial smoothing, which is motivated by a new study of mental ill health across $N = 32,754$ Lower Super Output Areas in England. The approach is based on a computationally efficient ridge regression framework, where the spatial trend in disease rates is modelled by a set of anisotropic spatial basis functions that can exhibit either smooth or step change transitions in values between neighbouring areal units. The efficacy of this approach is evidenced by simulation, before using it to identify the highest rate areas and the magnitude of the health inequalities in four measures of mental ill health, namely antidepressant usage, benefit claims, depression diagnoses and hospitalisations.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

利用各向异性基函数和惩罚回归拟合，计算效率高的疾病率局部空间平滑

使用贝叶斯层次模型，可以从与N个区域单位相关的汇总疾病数据中估计人口水平疾病发病率的空间变化。这些数据中的空间自相关性是通过分配条件自回归(CAR)先验的随机效应捕获的，它假设邻近的区域单位表现出相似的发病率。这种方法忽略了发病率表面的边界，即相邻单位在其发病率上表现出阶梯变化的位置。CAR类型的模型已经扩展到考虑这种局部空间平滑，但它们在计算上对大数据集是禁止的。因此，本文提出了一种新的计算高效的局部空间平滑方法，这是由一项关于英国N=32,754个低超级输出区域的精神疾病健康的新研究激发的。该方法以计算效率高的脊回归框架为基础，其中发病率的空间趋势由一组各向异性空间基函数模拟，这些函数可以在相邻面积单位之间表现出平滑或阶跃变化的值转换。这种方法的有效性通过模拟得到证明，然后用它来确定精神疾病的四种衡量标准(即抗抑郁药的使用、福利申请、抑郁症诊断和住院治疗)中发病率最高的地区和健康不平等的程度。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Spatial Statistics GEOSCIENCES, MULTIDISCIPLINARY-MATHEMATICS, INTERDISCIPLINARY APPLICATIONS

CiteScore

4.00

自引率

21.70%

发文量

审稿时长

55 days

期刊介绍： Spatial Statistics publishes articles on the theory and application of spatial and spatio-temporal statistics. It favours manuscripts that present theory generated by new applications, or in which new theory is applied to an important practical case. A purely theoretical study will only rarely be accepted. Pure case studies without methodological development are not acceptable for publication. Spatial statistics concerns the quantitative analysis of spatial and spatio-temporal data, including their statistical dependencies, accuracy and uncertainties. Methodology for spatial statistics is typically found in probability theory, stochastic modelling and mathematical statistics as well as in information science. Spatial statistics is used in mapping, assessing spatial data quality, sampling design optimisation, modelling of dependence structures, and drawing of valid inference from a limited set of spatio-temporal data.