The effects of station density in geostatistical prediction of air temperatures in Sweden: A comparison of two interpolation techniques

IF 12.4 Q1 ENVIRONMENTAL SCIENCES Resources Environment and Sustainability Pub Date : 2023-03-01 DOI:10.1016/j.resenv.2022.100092

Elijah Akwarandu Njoku , Patrick Etim Akpan , Augustine Edet Effiong , Isaac Oluwatosin Babatunde

{"title":"The effects of station density in geostatistical prediction of air temperatures in Sweden: A comparison of two interpolation techniques","authors":"Elijah Akwarandu Njoku , Patrick Etim Akpan , Augustine Edet Effiong , Isaac Oluwatosin Babatunde","doi":"10.1016/j.resenv.2022.100092","DOIUrl":null,"url":null,"abstract":"<div><p>High fidelity gridded temperature datasets are difficult to obtain for areas with sparse coverage of meteorological stations given that sparsity of stations is known to introduce uncertainty in the interpolation of climatic variables generally. Inspired by their potential for optimal results especially for small sample datasets, we assessed and compared the accuracy of interpolation results of Empirical Bayesian Kriging (EBK) and EBK-Regression Prediction (EBKRP) spatial prediction techniques under varying sampling density scenarios, using monthly maximum temperature normals (1991–2020) for the entire area of Sweden. The objectives of the study were to understand how EBK and EBKRP interpolation techniques perform in different sampling density scenarios and particularly in a sparse data setting, and the possible difference in the prediction accuracy between the two techniques. The 708 sampled stations obtained from the historical climatology database of the Swedish Meteorological and Hydrological Institute (SMHI) were split into seven sampling density subsets, ranging from 1 sample per 63,614 km<sup>2</sup> to 1 sample per 634,350 km<sup>2</sup> and representing both low and high sampling density scenarios. EBK interpolation technique was implemented using temperature data while land use land cover (LULC) and digital elevation model (DEM) were used as temperature covariates for the EBKRP interpolation models. The prediction accuracy assessment was based on five robust prediction performance indicators – mean error, mean absolute error, mean square error, root mean square error and Pearson correlation (R) – obtained from independent validation/cross-validation operations. Prediction accuracy was found to be generally positively related to sampling density, and sampling density accounted for about 85%–87% of interpolation accuracy for both EBK and EBKRP techniques. Although sampling density increased linearly, the rate of change in accuracy from one sampling density step to the next was not particularly proportional. For equivalent sampling density set-ups, EBKRP consistently outperformed EBK in all the accuracy metrics and EBKRP proved to be approximately 40% better than EBK. However, the two interpolation techniques produced generally low prediction biases for all the sampling density scenarios investigated. Our study suggests that potential effects of low sampling density and non-stationarity of temperature data can be significantly reduced by applying EBK but especially EBKRP when coupled with relevant covariates. This is especially true for continuous and slowly varying phenomena such as temperature and similar variables.</p></div>","PeriodicalId":34479,"journal":{"name":"Resources Environment and Sustainability","volume":"11 ","pages":"Article 100092"},"PeriodicalIF":12.4000,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Resources Environment and Sustainability","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666916122000366","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}

引用次数: 6

Abstract

High fidelity gridded temperature datasets are difficult to obtain for areas with sparse coverage of meteorological stations given that sparsity of stations is known to introduce uncertainty in the interpolation of climatic variables generally. Inspired by their potential for optimal results especially for small sample datasets, we assessed and compared the accuracy of interpolation results of Empirical Bayesian Kriging (EBK) and EBK-Regression Prediction (EBKRP) spatial prediction techniques under varying sampling density scenarios, using monthly maximum temperature normals (1991–2020) for the entire area of Sweden. The objectives of the study were to understand how EBK and EBKRP interpolation techniques perform in different sampling density scenarios and particularly in a sparse data setting, and the possible difference in the prediction accuracy between the two techniques. The 708 sampled stations obtained from the historical climatology database of the Swedish Meteorological and Hydrological Institute (SMHI) were split into seven sampling density subsets, ranging from 1 sample per 63,614 km² to 1 sample per 634,350 km² and representing both low and high sampling density scenarios. EBK interpolation technique was implemented using temperature data while land use land cover (LULC) and digital elevation model (DEM) were used as temperature covariates for the EBKRP interpolation models. The prediction accuracy assessment was based on five robust prediction performance indicators – mean error, mean absolute error, mean square error, root mean square error and Pearson correlation (R) – obtained from independent validation/cross-validation operations. Prediction accuracy was found to be generally positively related to sampling density, and sampling density accounted for about 85%–87% of interpolation accuracy for both EBK and EBKRP techniques. Although sampling density increased linearly, the rate of change in accuracy from one sampling density step to the next was not particularly proportional. For equivalent sampling density set-ups, EBKRP consistently outperformed EBK in all the accuracy metrics and EBKRP proved to be approximately 40% better than EBK. However, the two interpolation techniques produced generally low prediction biases for all the sampling density scenarios investigated. Our study suggests that potential effects of low sampling density and non-stationarity of temperature data can be significantly reduced by applying EBK but especially EBKRP when coupled with relevant covariates. This is especially true for continuous and slowly varying phenomena such as temperature and similar variables.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

站点密度在瑞典气温地质统计预测中的作用：两种插值技术的比较

对于气象站覆盖范围稀疏的地区，很难获得高保真网格温度数据集，因为众所周知，气象站的稀疏性会在气候变量的插值中引入不确定性。受其优化结果潜力的启发，特别是对于小样本数据集，我们评估并比较了经验贝叶斯克里格（EBK）和EBK回归预测（EBKRP）空间预测技术在不同采样密度场景下的插值结果的准确性，使用了瑞典整个地区的月最高温度正常值（1991-2020）。该研究的目的是了解EBK和EBKRP插值技术在不同采样密度场景中，特别是在稀疏数据设置中的表现，以及这两种技术之间预测精度的可能差异。从瑞典气象和水文研究所（SMHI）的历史气候学数据库中获得的708个采样站被分为七个采样密度子集，从每63614平方公里一个样本到每634350平方公里一一个样本，分别代表低采样密度和高采样密度情景。使用温度数据实现EBK插值技术，同时使用土地利用-土地覆盖（LULC）和数字高程模型（DEM）作为EBKRP插值模型的温度协变量。预测准确性评估基于从独立验证/交叉验证操作中获得的五个稳健预测性能指标——平均误差、平均绝对误差、均方误差、均方根误差和Pearson相关性（R）。预测精度通常与采样密度呈正相关，采样密度约占EBK和EBKRP技术插值精度的85%-87%。尽管采样密度线性增加，但从一个采样密度步骤到下一个采样浓度步骤的精度变化率并不是特别成比例。对于等效采样密度设置，EBKRP在所有精度指标上始终优于EBK，并且EBKRP被证明比EBK好大约40%。然而，对于所调查的所有采样密度场景，这两种插值技术通常产生较低的预测偏差。我们的研究表明，通过应用EBK，可以显著降低低采样密度和温度数据非平稳性的潜在影响，尤其是当与相关协变量相结合时，可以显著减少EBKRP。对于连续和缓慢变化的现象，如温度和类似的变量，尤其如此。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊