{"title":"Computationally efficient localised spatial smoothing of disease rates using anisotropic basis functions and penalised regression fitting","authors":"Duncan Lee","doi":"10.1016/j.spasta.2023.100796","DOIUrl":null,"url":null,"abstract":"<div><p>The spatial variation in population-level disease rates can be estimated from aggregated disease data relating to <span><math><mi>N</mi></math></span> areal units using Bayesian hierarchical models. Spatial autocorrelation in these data is captured by random effects that are assigned a Conditional autoregressive (CAR) prior, which assumes that neighbouring areal units exhibit similar disease rates. This approach ignores boundaries in the disease rate surface, which are locations where neighbouring units exhibit a step-change in their rates. CAR type models have been extended to account for this localised spatial smoothness, but they are computationally prohibitive for big data sets. Therefore this paper proposes a novel computationally efficient approach for localised spatial smoothing, which is motivated by a new study of mental ill health across <span><math><mrow><mi>N</mi><mo>=</mo><mtext>32,754</mtext></mrow></math></span> Lower Super Output Areas in England. The approach is based on a computationally efficient ridge regression framework, where the spatial trend in disease rates is modelled by a set of anisotropic spatial basis functions that can exhibit either smooth or step change transitions in values between neighbouring areal units. The efficacy of this approach is evidenced by simulation, before using it to identify the highest rate areas and the magnitude of the health inequalities in four measures of mental ill health, namely antidepressant usage, benefit claims, depression diagnoses and hospitalisations.</p></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":null,"pages":null},"PeriodicalIF":2.1000,"publicationDate":"2023-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2211675323000714/pdfft?md5=e39e73b4b9ffa4f14ba8a5e003868f43&pid=1-s2.0-S2211675323000714-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Spatial Statistics","FirstCategoryId":"100","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2211675323000714","RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"GEOSCIENCES, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
The spatial variation in population-level disease rates can be estimated from aggregated disease data relating to areal units using Bayesian hierarchical models. Spatial autocorrelation in these data is captured by random effects that are assigned a Conditional autoregressive (CAR) prior, which assumes that neighbouring areal units exhibit similar disease rates. This approach ignores boundaries in the disease rate surface, which are locations where neighbouring units exhibit a step-change in their rates. CAR type models have been extended to account for this localised spatial smoothness, but they are computationally prohibitive for big data sets. Therefore this paper proposes a novel computationally efficient approach for localised spatial smoothing, which is motivated by a new study of mental ill health across Lower Super Output Areas in England. The approach is based on a computationally efficient ridge regression framework, where the spatial trend in disease rates is modelled by a set of anisotropic spatial basis functions that can exhibit either smooth or step change transitions in values between neighbouring areal units. The efficacy of this approach is evidenced by simulation, before using it to identify the highest rate areas and the magnitude of the health inequalities in four measures of mental ill health, namely antidepressant usage, benefit claims, depression diagnoses and hospitalisations.
期刊介绍:
Spatial Statistics publishes articles on the theory and application of spatial and spatio-temporal statistics. It favours manuscripts that present theory generated by new applications, or in which new theory is applied to an important practical case. A purely theoretical study will only rarely be accepted. Pure case studies without methodological development are not acceptable for publication.
Spatial statistics concerns the quantitative analysis of spatial and spatio-temporal data, including their statistical dependencies, accuracy and uncertainties. Methodology for spatial statistics is typically found in probability theory, stochastic modelling and mathematical statistics as well as in information science. Spatial statistics is used in mapping, assessing spatial data quality, sampling design optimisation, modelling of dependence structures, and drawing of valid inference from a limited set of spatio-temporal data.