{"title":"Delineation of geochemical anomalies through empirical cumulative distribution function for mineral exploration","authors":"Shahed Shahrestani, Ioan Sanislav","doi":"10.1016/j.gexplo.2024.107662","DOIUrl":null,"url":null,"abstract":"<div><div>In this paper, a statistical outlier detection technique based on empirical cumulative distribution functions (ECOD) is applied to a multivariate geochemical dataset from southeastern Iran, which is known for its porphyry and vein-type copper mineral occurrences. The ECOD method assumes that outlier samples are situated in both the left and right tails of the cumulative distribution functions, and it determines whether the outliers are located in the right or left tails using the concept of skewness. Anomaly maps produced by the ECOD method are compared with those generated by the local outlier factor (LOF) method. Both ECOD and LOF are applied to two subsets, including 4 and 12 trace elements. The anomaly maps are evaluated by comparing the number of delineated known mineral deposits and using ROC curves. The result revealed that LOF was outperformed by ECOD in the delineation of known Cu mineralization and in the identification of zones containing mineralized samples collected during the anomaly checking stage. The ECOD anomaly map is also compared with results from the k-means clustering method, and the superiority of ECOD over k-means clustering is demonstrated. The implementation of ECOD on clr-transformed multivariate geochemical data shows promise but assumes statistical independence among features, often unmet in geochemical exploration. To address this, we transformed clr data into new principal and independent feature spaces using principal component analysis (PCA) and independent component analysis (ICA), enhancing anomaly detection efficiency. ECOD_ICA outperformed ECOD_PCA, successfully classifying all mineralized samples and 15 of 18 Cu mineral occurrences in the highest score class (Q4), as confirmed by ROC analysis. However, the reliance of the ECOD method on univariate tail probabilities limits its ability to detect multivariate anomalies arising from complex inter-element relationships. Strong correlations in geochemical datasets can lead to false positives, necessitating dimension reduction techniques. While PCA and ICA help manage these correlations, they may obscure meaningful signals. The ECOD outlier detection method is also sensitive to the skewness of the dimensions, so a careful feature selection stage is recommended before applying it. The method is less sensitive to the number of dimensions, which enhances its robustness. Additionally, the absence of hyperparameter tuning makes ECOD a reliable and efficient outlier detection method.</div></div>","PeriodicalId":16336,"journal":{"name":"Journal of Geochemical Exploration","volume":"270 ","pages":"Article 107662"},"PeriodicalIF":3.4000,"publicationDate":"2024-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Geochemical Exploration","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0375674224002784","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GEOCHEMISTRY & GEOPHYSICS","Score":null,"Total":0}
引用次数: 0
Abstract
In this paper, a statistical outlier detection technique based on empirical cumulative distribution functions (ECOD) is applied to a multivariate geochemical dataset from southeastern Iran, which is known for its porphyry and vein-type copper mineral occurrences. The ECOD method assumes that outlier samples are situated in both the left and right tails of the cumulative distribution functions, and it determines whether the outliers are located in the right or left tails using the concept of skewness. Anomaly maps produced by the ECOD method are compared with those generated by the local outlier factor (LOF) method. Both ECOD and LOF are applied to two subsets, including 4 and 12 trace elements. The anomaly maps are evaluated by comparing the number of delineated known mineral deposits and using ROC curves. The result revealed that LOF was outperformed by ECOD in the delineation of known Cu mineralization and in the identification of zones containing mineralized samples collected during the anomaly checking stage. The ECOD anomaly map is also compared with results from the k-means clustering method, and the superiority of ECOD over k-means clustering is demonstrated. The implementation of ECOD on clr-transformed multivariate geochemical data shows promise but assumes statistical independence among features, often unmet in geochemical exploration. To address this, we transformed clr data into new principal and independent feature spaces using principal component analysis (PCA) and independent component analysis (ICA), enhancing anomaly detection efficiency. ECOD_ICA outperformed ECOD_PCA, successfully classifying all mineralized samples and 15 of 18 Cu mineral occurrences in the highest score class (Q4), as confirmed by ROC analysis. However, the reliance of the ECOD method on univariate tail probabilities limits its ability to detect multivariate anomalies arising from complex inter-element relationships. Strong correlations in geochemical datasets can lead to false positives, necessitating dimension reduction techniques. While PCA and ICA help manage these correlations, they may obscure meaningful signals. The ECOD outlier detection method is also sensitive to the skewness of the dimensions, so a careful feature selection stage is recommended before applying it. The method is less sensitive to the number of dimensions, which enhances its robustness. Additionally, the absence of hyperparameter tuning makes ECOD a reliable and efficient outlier detection method.
期刊介绍:
Journal of Geochemical Exploration is mostly dedicated to publication of original studies in exploration and environmental geochemistry and related topics.
Contributions considered of prevalent interest for the journal include researches based on the application of innovative methods to:
define the genesis and the evolution of mineral deposits including transfer of elements in large-scale mineralized areas.
analyze complex systems at the boundaries between bio-geochemistry, metal transport and mineral accumulation.
evaluate effects of historical mining activities on the surface environment.
trace pollutant sources and define their fate and transport models in the near-surface and surface environments involving solid, fluid and aerial matrices.
assess and quantify natural and technogenic radioactivity in the environment.
determine geochemical anomalies and set baseline reference values using compositional data analysis, multivariate statistics and geo-spatial analysis.
assess the impacts of anthropogenic contamination on ecosystems and human health at local and regional scale to prioritize and classify risks through deterministic and stochastic approaches.
Papers dedicated to the presentation of newly developed methods in analytical geochemistry to be applied in the field or in laboratory are also within the topics of interest for the journal.