Evaluating the impact of misspecified spatial neighboring structures in Bayesian CAR models

IF 2.7 Q2 MULTIDISCIPLINARY SCIENCES Scientific African Pub Date : 2024-12-19 DOI:10.1016/j.sciaf.2024.e02498
Ernest Somua-Wiafe , Richard Minkah , Kwabena Doku-Amponsah , Louis Asiedu , Edward Acheampong , Samuel Iddi
{"title":"Evaluating the impact of misspecified spatial neighboring structures in Bayesian CAR models","authors":"Ernest Somua-Wiafe ,&nbsp;Richard Minkah ,&nbsp;Kwabena Doku-Amponsah ,&nbsp;Louis Asiedu ,&nbsp;Edward Acheampong ,&nbsp;Samuel Iddi","doi":"10.1016/j.sciaf.2024.e02498","DOIUrl":null,"url":null,"abstract":"<div><div>Spatial neighboring graphs play a crucial role in accounting for global spatial dependency, particularly in spatial models that utilize the Conditional Autoregressive (CAR) covariance structure. The Bayesian modified Besag–York–Molliè (BYM2) model, which falls under the category of CAR models, introduces a precision parameter to quantify the variability not captured by the fixed risk components and a mixing parameter to decipher the proportion of random effects attributed to the spatial component and the aspatial random noise. Despite the advantages these extra features bring, misspecification of BYM2 model components is common, and its effects are not well understood. Previous studies often avoid simulations due to computational demands, relying instead on performance metrics for inferences and model comparisons using empirical data.</div><div>This study uses comprehensive simulations to examine the impact of erroneously specified spatial neighborhood structures on the BYM2 model. We considered three different neighborhood structures: a first-order adjacency-based structure and two minimum distance-based structures with threshold distances of 70 km and 140 km at various sparsity levels. For each structure, we simulate data under that structure and then analyze it using the remaining two structures as misspecified cases to evaluate their impact on model fit. Fixed PC prior settings were applied to control for prior specification effects in examining bias and MSE. The study was further validated through practical analyses of road crash incidents in Ghana and a lip cancer cases data in Scotland, UK.</div><div>Our findings reveal that incorrect specification of the neighboring structure does not significantly impact the fixed effects. However, it affects the estimates of the mixing parameter and precision term, thus impacting the spatial component. In cases of high spatial dependency and misspecified neighborhood structures, the BYM2 model tends to underestimate the mixing parameter. Under-specifying the neighborhood structure results in underestimated hyper-parameter values while over-specifying it leads to an overfitted spatial smooth. The empirical application results which were consistent with the simulation also emphasized the critical importance of accurately specifying spatial structures in BYM2 models. Relying solely on metrics like the Watanabe-Akaike Information Criterion (WAIC), Deviance Information Criterion (DIC), and Conditional Predictive Ordinate (CPO) estimates to determine an optimal spatial structure can be misleading. Instead, the Moran’s Index (MI) statistic is more reliable for identifying the most suitable neighborhood structure.</div></div>","PeriodicalId":21690,"journal":{"name":"Scientific African","volume":"27 ","pages":"Article e02498"},"PeriodicalIF":2.7000,"publicationDate":"2024-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scientific African","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S246822762400440X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Spatial neighboring graphs play a crucial role in accounting for global spatial dependency, particularly in spatial models that utilize the Conditional Autoregressive (CAR) covariance structure. The Bayesian modified Besag–York–Molliè (BYM2) model, which falls under the category of CAR models, introduces a precision parameter to quantify the variability not captured by the fixed risk components and a mixing parameter to decipher the proportion of random effects attributed to the spatial component and the aspatial random noise. Despite the advantages these extra features bring, misspecification of BYM2 model components is common, and its effects are not well understood. Previous studies often avoid simulations due to computational demands, relying instead on performance metrics for inferences and model comparisons using empirical data.
This study uses comprehensive simulations to examine the impact of erroneously specified spatial neighborhood structures on the BYM2 model. We considered three different neighborhood structures: a first-order adjacency-based structure and two minimum distance-based structures with threshold distances of 70 km and 140 km at various sparsity levels. For each structure, we simulate data under that structure and then analyze it using the remaining two structures as misspecified cases to evaluate their impact on model fit. Fixed PC prior settings were applied to control for prior specification effects in examining bias and MSE. The study was further validated through practical analyses of road crash incidents in Ghana and a lip cancer cases data in Scotland, UK.
Our findings reveal that incorrect specification of the neighboring structure does not significantly impact the fixed effects. However, it affects the estimates of the mixing parameter and precision term, thus impacting the spatial component. In cases of high spatial dependency and misspecified neighborhood structures, the BYM2 model tends to underestimate the mixing parameter. Under-specifying the neighborhood structure results in underestimated hyper-parameter values while over-specifying it leads to an overfitted spatial smooth. The empirical application results which were consistent with the simulation also emphasized the critical importance of accurately specifying spatial structures in BYM2 models. Relying solely on metrics like the Watanabe-Akaike Information Criterion (WAIC), Deviance Information Criterion (DIC), and Conditional Predictive Ordinate (CPO) estimates to determine an optimal spatial structure can be misleading. Instead, the Moran’s Index (MI) statistic is more reliable for identifying the most suitable neighborhood structure.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
求助全文
约1分钟内获得全文 去求助
来源期刊
Scientific African
Scientific African Multidisciplinary-Multidisciplinary
CiteScore
5.60
自引率
3.40%
发文量
332
审稿时长
10 weeks
期刊最新文献
DockCADD: A streamlined in silico pipeline for the identification of potent ribosomal S6 Kinase 2 (RSK2) inhibitors Allometric models for estimating aboveground biomass and carbon stocks of the semi-arid savanna woody species, Detarium microcarpum Guill. et Perr. Nanocomposite treatment of hospital wastewater; Prophylaxis toxicity in the freshwater crayfish muscles and hepatopancreas Spatial epidemiology based on the analysis of COVID-19 in Africa Exploring indigenous wisdom: Ethnobotanical documentation and conservation of medicinal plants in Goba District, Southwest Ethiopia
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1