{"title":"Unsupervised detection of multivariate geochemical anomalies using a high-performance deep autoencoder Gaussian mixture model","authors":"Xuemei Wang, Yongliang Chen","doi":"10.1016/j.gexplo.2025.107671","DOIUrl":null,"url":null,"abstract":"<div><div>It is of great significance to construct an efficient geochemical anomaly detection model for the successful accomplishment of a mineral exploration process in a complex geological environment. However, the complex geological environment of the prospecting area often results in the high-dimensional unknown complex population distribution of geochemical exploration data. This complex distribution is difficult to fit with a theoretical probability distribution model. As a result, it becomes a challenge to carry out an effective detection of geochemical anomalies. Therefore, to develop an anomaly detection model that can effectively fit the complex population distribution of geochemical exploration data is the key for accurately detecting geochemical anomalies. For this reason, the deep autoencoder Gaussian mixture model (DAGMM) was adopted to model the geochemical exploration data obtained in the 1:200,000 geological survey conducted in the Baishan area (Jilin, China) to check its superiority in identifying multivariate geochemical anomalies. As an innovative deep learning framework for unsupervised anomaly detection, DAGMM ingeniously combines the data dimensionality reduction and compression capabilities of a deep autoencoder (DAE) with the probability density estimation advantage of the Gaussian mixture model (GMM). The DAGMM model can deeply explore the deep-level features of geochemical exploration data and effectively model the complex unknown data distribution through the synergistically work and joint optimization strategy in training the DAE and GMM model, so it can accurately identify geochemical anomalies. To show the superiority of the DAGMM model in detecting polymetallic geochemical anomalies, the DAGMM model was compared with the GMM and DAE models. The receiver operating characteristic (ROC) curves of the three models were plotted, and the areas under the ROC curves (AUCs) and lift indices were calculated. The ROC curve of the DAGMM model dominates that of the DAE model and GMM model. The DAGMM model has an AUC of 0.904 and a lift index of 10.44, respectively, which are much larger than those of the GMM model (AUC = 0.858, lift index = 3.63) and DAE model (AUC = 0.83, lift index = 5.31). Therefore, the DAGMM model significantly outperforms the other two models in detecting multivariate geochemical anomalies and the polymetallic geochemical anomalies detected by the DAGMM model contain all the known polymetallic deposits. Compared with DAE and GMM, DAGMM is more efficient and more powerful in detecting multivariate geochemical anomalies in complex geological environments.</div></div>","PeriodicalId":16336,"journal":{"name":"Journal of Geochemical Exploration","volume":"271 ","pages":"Article 107671"},"PeriodicalIF":3.4000,"publicationDate":"2025-01-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Geochemical Exploration","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0375674225000032","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GEOCHEMISTRY & GEOPHYSICS","Score":null,"Total":0}
引用次数: 0
Abstract
It is of great significance to construct an efficient geochemical anomaly detection model for the successful accomplishment of a mineral exploration process in a complex geological environment. However, the complex geological environment of the prospecting area often results in the high-dimensional unknown complex population distribution of geochemical exploration data. This complex distribution is difficult to fit with a theoretical probability distribution model. As a result, it becomes a challenge to carry out an effective detection of geochemical anomalies. Therefore, to develop an anomaly detection model that can effectively fit the complex population distribution of geochemical exploration data is the key for accurately detecting geochemical anomalies. For this reason, the deep autoencoder Gaussian mixture model (DAGMM) was adopted to model the geochemical exploration data obtained in the 1:200,000 geological survey conducted in the Baishan area (Jilin, China) to check its superiority in identifying multivariate geochemical anomalies. As an innovative deep learning framework for unsupervised anomaly detection, DAGMM ingeniously combines the data dimensionality reduction and compression capabilities of a deep autoencoder (DAE) with the probability density estimation advantage of the Gaussian mixture model (GMM). The DAGMM model can deeply explore the deep-level features of geochemical exploration data and effectively model the complex unknown data distribution through the synergistically work and joint optimization strategy in training the DAE and GMM model, so it can accurately identify geochemical anomalies. To show the superiority of the DAGMM model in detecting polymetallic geochemical anomalies, the DAGMM model was compared with the GMM and DAE models. The receiver operating characteristic (ROC) curves of the three models were plotted, and the areas under the ROC curves (AUCs) and lift indices were calculated. The ROC curve of the DAGMM model dominates that of the DAE model and GMM model. The DAGMM model has an AUC of 0.904 and a lift index of 10.44, respectively, which are much larger than those of the GMM model (AUC = 0.858, lift index = 3.63) and DAE model (AUC = 0.83, lift index = 5.31). Therefore, the DAGMM model significantly outperforms the other two models in detecting multivariate geochemical anomalies and the polymetallic geochemical anomalies detected by the DAGMM model contain all the known polymetallic deposits. Compared with DAE and GMM, DAGMM is more efficient and more powerful in detecting multivariate geochemical anomalies in complex geological environments.
期刊介绍:
Journal of Geochemical Exploration is mostly dedicated to publication of original studies in exploration and environmental geochemistry and related topics.
Contributions considered of prevalent interest for the journal include researches based on the application of innovative methods to:
define the genesis and the evolution of mineral deposits including transfer of elements in large-scale mineralized areas.
analyze complex systems at the boundaries between bio-geochemistry, metal transport and mineral accumulation.
evaluate effects of historical mining activities on the surface environment.
trace pollutant sources and define their fate and transport models in the near-surface and surface environments involving solid, fluid and aerial matrices.
assess and quantify natural and technogenic radioactivity in the environment.
determine geochemical anomalies and set baseline reference values using compositional data analysis, multivariate statistics and geo-spatial analysis.
assess the impacts of anthropogenic contamination on ecosystems and human health at local and regional scale to prioritize and classify risks through deterministic and stochastic approaches.
Papers dedicated to the presentation of newly developed methods in analytical geochemistry to be applied in the field or in laboratory are also within the topics of interest for the journal.