{"title":"Novel definition of local baseline values for potentially toxic elements in Czech farmland using adaptive spatial weighting","authors":"Jan Skála , Tomáš Matys Grygar , Alla Achasova","doi":"10.1016/j.apgeochem.2024.106082","DOIUrl":null,"url":null,"abstract":"<div><p>Exploratory data analysis is commonly used in geoscientific research to identify various data populations within datasets. Frequently, the distribution-wise metrics for the statistical centre and spread are combined to define normal (background) ranges of topsoil contents of potentially toxic elements. When decreasing the geographical scale, the survey areas turn too heterogeneous for the statistical definition of a single background range. The traditional solution is the domain approach wherein various data populations (and their statistical parameters) can be attributed to contextual (geological, ecological) information. Nevertheless, summarising the entire data set from large areas as a single statistical entity would provide too much data reduction which would decrease sensitivity of detecting localised anthropogenic contamination and work wrong in areas of geogenic anomalies spatially larger than contamination. In this paper, we tested a novel numerical solution for deriving local distribution-wise baseline values via spatially limited sliding window combined with geographical weighting. Considering environmental variables (soil and topographical properties) at every analysed soil sample point, we extended the geographical kernel weighting approach which considers only spatial dimension (given by geographical coordinates). The advanced version combines two similarity modes to assign highest weights to the nearest points expected to share similar environmental contexts within the user-defined moving kernel. The method was implemented for data-mining in the Czech high-density monitoring data for agricultural soils which had to be firstly regressed to achieve analytical harmony between two distinct extraction methods employed in that monitoring, in particular cold diluted nitric acid and hot aqua regia. After the reliable harmonisation, local baseline values were delivered as the localised outer limits of variation using the proposed double-weighted kernel approach. We compared the estimated localised background ranges for 10 potentially toxic elements (As, Be, Cd, Co, Cr, Cu, Ni, Pb, V, and Zn) with those based on the conventional substrate-wise domain approach and single nationwide legislation thresholds. The comparison was also efficient in identifying an inappropriate aggregation of some geological units. Finally, the kernel approach delivered regional outer limits of variability sensitive to subtle regional variations of topsoil geochemistry.</p></div>","PeriodicalId":8064,"journal":{"name":"Applied Geochemistry","volume":"170 ","pages":"Article 106082"},"PeriodicalIF":3.1000,"publicationDate":"2024-06-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Geochemistry","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0883292724001872","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GEOCHEMISTRY & GEOPHYSICS","Score":null,"Total":0}
引用次数: 0
Abstract
Exploratory data analysis is commonly used in geoscientific research to identify various data populations within datasets. Frequently, the distribution-wise metrics for the statistical centre and spread are combined to define normal (background) ranges of topsoil contents of potentially toxic elements. When decreasing the geographical scale, the survey areas turn too heterogeneous for the statistical definition of a single background range. The traditional solution is the domain approach wherein various data populations (and their statistical parameters) can be attributed to contextual (geological, ecological) information. Nevertheless, summarising the entire data set from large areas as a single statistical entity would provide too much data reduction which would decrease sensitivity of detecting localised anthropogenic contamination and work wrong in areas of geogenic anomalies spatially larger than contamination. In this paper, we tested a novel numerical solution for deriving local distribution-wise baseline values via spatially limited sliding window combined with geographical weighting. Considering environmental variables (soil and topographical properties) at every analysed soil sample point, we extended the geographical kernel weighting approach which considers only spatial dimension (given by geographical coordinates). The advanced version combines two similarity modes to assign highest weights to the nearest points expected to share similar environmental contexts within the user-defined moving kernel. The method was implemented for data-mining in the Czech high-density monitoring data for agricultural soils which had to be firstly regressed to achieve analytical harmony between two distinct extraction methods employed in that monitoring, in particular cold diluted nitric acid and hot aqua regia. After the reliable harmonisation, local baseline values were delivered as the localised outer limits of variation using the proposed double-weighted kernel approach. We compared the estimated localised background ranges for 10 potentially toxic elements (As, Be, Cd, Co, Cr, Cu, Ni, Pb, V, and Zn) with those based on the conventional substrate-wise domain approach and single nationwide legislation thresholds. The comparison was also efficient in identifying an inappropriate aggregation of some geological units. Finally, the kernel approach delivered regional outer limits of variability sensitive to subtle regional variations of topsoil geochemistry.
期刊介绍:
Applied Geochemistry is an international journal devoted to publication of original research papers, rapid research communications and selected review papers in geochemistry and urban geochemistry which have some practical application to an aspect of human endeavour, such as the preservation of the environment, health, waste disposal and the search for resources. Papers on applications of inorganic, organic and isotope geochemistry and geochemical processes are therefore welcome provided they meet the main criterion. Spatial and temporal monitoring case studies are only of interest to our international readership if they present new ideas of broad application.
Topics covered include: (1) Environmental geochemistry (including natural and anthropogenic aspects, and protection and remediation strategies); (2) Hydrogeochemistry (surface and groundwater); (3) Medical (urban) geochemistry; (4) The search for energy resources (in particular unconventional oil and gas or emerging metal resources); (5) Energy exploitation (in particular geothermal energy and CCS); (6) Upgrading of energy and mineral resources where there is a direct geochemical application; and (7) Waste disposal, including nuclear waste disposal.