Evaluating the impact of misspecified spatial neighboring structures in Bayesian CAR models

IF 3.3 Q2 MULTIDISCIPLINARY SCIENCES Scientific African Pub Date : 2025-03-01 Epub Date: 2024-12-19 DOI:10.1016/j.sciaf.2024.e02498
Ernest Somua-Wiafe , Richard Minkah , Kwabena Doku-Amponsah , Louis Asiedu , Edward Acheampong , Samuel Iddi
{"title":"Evaluating the impact of misspecified spatial neighboring structures in Bayesian CAR models","authors":"Ernest Somua-Wiafe ,&nbsp;Richard Minkah ,&nbsp;Kwabena Doku-Amponsah ,&nbsp;Louis Asiedu ,&nbsp;Edward Acheampong ,&nbsp;Samuel Iddi","doi":"10.1016/j.sciaf.2024.e02498","DOIUrl":null,"url":null,"abstract":"<div><div>Spatial neighboring graphs play a crucial role in accounting for global spatial dependency, particularly in spatial models that utilize the Conditional Autoregressive (CAR) covariance structure. The Bayesian modified Besag–York–Molliè (BYM2) model, which falls under the category of CAR models, introduces a precision parameter to quantify the variability not captured by the fixed risk components and a mixing parameter to decipher the proportion of random effects attributed to the spatial component and the aspatial random noise. Despite the advantages these extra features bring, misspecification of BYM2 model components is common, and its effects are not well understood. Previous studies often avoid simulations due to computational demands, relying instead on performance metrics for inferences and model comparisons using empirical data.</div><div>This study uses comprehensive simulations to examine the impact of erroneously specified spatial neighborhood structures on the BYM2 model. We considered three different neighborhood structures: a first-order adjacency-based structure and two minimum distance-based structures with threshold distances of 70 km and 140 km at various sparsity levels. For each structure, we simulate data under that structure and then analyze it using the remaining two structures as misspecified cases to evaluate their impact on model fit. Fixed PC prior settings were applied to control for prior specification effects in examining bias and MSE. The study was further validated through practical analyses of road crash incidents in Ghana and a lip cancer cases data in Scotland, UK.</div><div>Our findings reveal that incorrect specification of the neighboring structure does not significantly impact the fixed effects. However, it affects the estimates of the mixing parameter and precision term, thus impacting the spatial component. In cases of high spatial dependency and misspecified neighborhood structures, the BYM2 model tends to underestimate the mixing parameter. Under-specifying the neighborhood structure results in underestimated hyper-parameter values while over-specifying it leads to an overfitted spatial smooth. The empirical application results which were consistent with the simulation also emphasized the critical importance of accurately specifying spatial structures in BYM2 models. Relying solely on metrics like the Watanabe-Akaike Information Criterion (WAIC), Deviance Information Criterion (DIC), and Conditional Predictive Ordinate (CPO) estimates to determine an optimal spatial structure can be misleading. Instead, the Moran’s Index (MI) statistic is more reliable for identifying the most suitable neighborhood structure.</div></div>","PeriodicalId":21690,"journal":{"name":"Scientific African","volume":"27 ","pages":"Article e02498"},"PeriodicalIF":3.3000,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scientific African","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S246822762400440X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/12/19 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Spatial neighboring graphs play a crucial role in accounting for global spatial dependency, particularly in spatial models that utilize the Conditional Autoregressive (CAR) covariance structure. The Bayesian modified Besag–York–Molliè (BYM2) model, which falls under the category of CAR models, introduces a precision parameter to quantify the variability not captured by the fixed risk components and a mixing parameter to decipher the proportion of random effects attributed to the spatial component and the aspatial random noise. Despite the advantages these extra features bring, misspecification of BYM2 model components is common, and its effects are not well understood. Previous studies often avoid simulations due to computational demands, relying instead on performance metrics for inferences and model comparisons using empirical data.
This study uses comprehensive simulations to examine the impact of erroneously specified spatial neighborhood structures on the BYM2 model. We considered three different neighborhood structures: a first-order adjacency-based structure and two minimum distance-based structures with threshold distances of 70 km and 140 km at various sparsity levels. For each structure, we simulate data under that structure and then analyze it using the remaining two structures as misspecified cases to evaluate their impact on model fit. Fixed PC prior settings were applied to control for prior specification effects in examining bias and MSE. The study was further validated through practical analyses of road crash incidents in Ghana and a lip cancer cases data in Scotland, UK.
Our findings reveal that incorrect specification of the neighboring structure does not significantly impact the fixed effects. However, it affects the estimates of the mixing parameter and precision term, thus impacting the spatial component. In cases of high spatial dependency and misspecified neighborhood structures, the BYM2 model tends to underestimate the mixing parameter. Under-specifying the neighborhood structure results in underestimated hyper-parameter values while over-specifying it leads to an overfitted spatial smooth. The empirical application results which were consistent with the simulation also emphasized the critical importance of accurately specifying spatial structures in BYM2 models. Relying solely on metrics like the Watanabe-Akaike Information Criterion (WAIC), Deviance Information Criterion (DIC), and Conditional Predictive Ordinate (CPO) estimates to determine an optimal spatial structure can be misleading. Instead, the Moran’s Index (MI) statistic is more reliable for identifying the most suitable neighborhood structure.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
贝叶斯CAR模型中错误指定空间邻近结构的影响评估
空间邻近图在考虑全局空间依赖性方面起着至关重要的作用,特别是在利用条件自回归(CAR)协方差结构的空间模型中。贝叶斯修正Besag-York-Molliè (BYM2)模型属于CAR模型的范畴,它引入了一个精度参数来量化固定风险分量未捕获的变异性,并引入了一个混合参数来破译归因于空间分量和非空间随机噪声的随机效应的比例。尽管这些额外的特性带来了好处,但对BYM2模型组件的错误规范是常见的,其影响也没有得到很好的理解。以前的研究往往由于计算需求而避免模拟,而是依赖于使用经验数据进行推断和模型比较的性能指标。本研究使用综合模拟来检验错误指定的空间邻域结构对BYM2模型的影响。我们考虑了三种不同的邻域结构:一阶邻接结构和两种最小距离结构,阈值距离分别为70 km和140 km。对于每个结构,我们模拟该结构下的数据,然后使用剩余的两个结构作为错误指定的情况来分析它,以评估它们对模型拟合的影响。在检验偏倚和MSE时,应用固定PC先验设置来控制先验规范效应。通过对加纳道路交通事故和英国苏格兰唇癌病例数据的实际分析,进一步验证了该研究。我们的研究结果表明,不正确的相邻结构规格不会显著影响固定效果。但是,它会影响混合参数和精度项的估计,从而影响空间分量。在高空间依赖性和邻域结构不明确的情况下,BYM2模型倾向于低估混合参数。邻域结构指定不充分会导致超参数值被低估,而邻域结构指定过度会导致空间平滑过拟合。与模拟结果一致的经验应用结果也强调了在BYM2模型中精确指定空间结构的重要性。仅仅依靠Watanabe-Akaike信息标准(WAIC)、偏差信息标准(DIC)和条件预测坐标(CPO)估计等指标来确定最佳空间结构可能会产生误导。相反,莫兰指数(MI)统计数据在确定最合适的社区结构方面更为可靠。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Scientific African
Scientific African Multidisciplinary-Multidisciplinary
CiteScore
5.60
自引率
3.40%
发文量
332
审稿时长
10 weeks
期刊最新文献
A nonparametric framework for linear–circular regression: Applications in environmental and biological sciences A unified H∞ dynamic observer for vehicle state and road profile estimation in half-vehicle models Powering growth or stunting futures? Energy poverty and long-term child nutrition in Malawi A new family of generalized distributions based on Modi-G transformation, with applications in Epidemiology, Engineering, and Environmental Science Efficiency assessment of two pyrolyzer models for converting waste plastics into fuels
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1