Qi Zhang , Alexandra M. Schmidt , Yogendra P. Chaubey
{"title":"Modeling left-censored skewed spatial processes: The case of arsenic drinking water contamination","authors":"Qi Zhang , Alexandra M. Schmidt , Yogendra P. Chaubey","doi":"10.1016/j.spasta.2024.100816","DOIUrl":null,"url":null,"abstract":"<div><p>Commonly, observations from environmental processes are spatially structured and present skewed distributions. Recently, different models have been proposed to model spatial processes in their original scale. This work was motivated by modeling the levels of arsenic groundwater concentration in Comilla, a district of Bangladesh. Some of the observations are left censored. We propose spatial gamma models and explore different parametrizations of the gamma distribution. The gamma model naturally accounts for the skewness present in the data and the fact that arsenic levels are positive. We compare our proposed approaches with two skewed models proposed in the literature. Inference is performed under the Bayesian paradigm and interpolation to unobserved locations of interest naturally accounts for the estimation of the parameters in the proposed model. For the arsenic dataset, one of our proposed gamma models performs best in comparison to previous spatial models for skewed data, in terms of scoring rules criteria. Moreover, under the skewed models, some of the lower limits of the 95% posterior predictive distributions provide negative values violating the assumption that observations are strictly positive. The gamma distribution provides a reasonable, and simpler, alternative to account for the skewness present in the data and provide forecasts that are within the valid values of the observations.</p></div>","PeriodicalId":48771,"journal":{"name":"Spatial Statistics","volume":null,"pages":null},"PeriodicalIF":2.1000,"publicationDate":"2024-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2211675324000071/pdfft?md5=4e6657431bd93da01f8723d3f2fdc303&pid=1-s2.0-S2211675324000071-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Spatial Statistics","FirstCategoryId":"100","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2211675324000071","RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"GEOSCIENCES, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Commonly, observations from environmental processes are spatially structured and present skewed distributions. Recently, different models have been proposed to model spatial processes in their original scale. This work was motivated by modeling the levels of arsenic groundwater concentration in Comilla, a district of Bangladesh. Some of the observations are left censored. We propose spatial gamma models and explore different parametrizations of the gamma distribution. The gamma model naturally accounts for the skewness present in the data and the fact that arsenic levels are positive. We compare our proposed approaches with two skewed models proposed in the literature. Inference is performed under the Bayesian paradigm and interpolation to unobserved locations of interest naturally accounts for the estimation of the parameters in the proposed model. For the arsenic dataset, one of our proposed gamma models performs best in comparison to previous spatial models for skewed data, in terms of scoring rules criteria. Moreover, under the skewed models, some of the lower limits of the 95% posterior predictive distributions provide negative values violating the assumption that observations are strictly positive. The gamma distribution provides a reasonable, and simpler, alternative to account for the skewness present in the data and provide forecasts that are within the valid values of the observations.
期刊介绍:
Spatial Statistics publishes articles on the theory and application of spatial and spatio-temporal statistics. It favours manuscripts that present theory generated by new applications, or in which new theory is applied to an important practical case. A purely theoretical study will only rarely be accepted. Pure case studies without methodological development are not acceptable for publication.
Spatial statistics concerns the quantitative analysis of spatial and spatio-temporal data, including their statistical dependencies, accuracy and uncertainties. Methodology for spatial statistics is typically found in probability theory, stochastic modelling and mathematical statistics as well as in information science. Spatial statistics is used in mapping, assessing spatial data quality, sampling design optimisation, modelling of dependence structures, and drawing of valid inference from a limited set of spatio-temporal data.