Unsupervised detection of multivariate geochemical anomalies using a high-performance deep autoencoder Gaussian mixture model

IF 3.3 2区 地球科学 Q1 GEOCHEMISTRY & GEOPHYSICS Journal of Geochemical Exploration Pub Date : 2025-04-01 Epub Date: 2025-01-11 DOI:10.1016/j.gexplo.2025.107671
Xuemei Wang, Yongliang Chen
{"title":"Unsupervised detection of multivariate geochemical anomalies using a high-performance deep autoencoder Gaussian mixture model","authors":"Xuemei Wang,&nbsp;Yongliang Chen","doi":"10.1016/j.gexplo.2025.107671","DOIUrl":null,"url":null,"abstract":"<div><div>It is of great significance to construct an efficient geochemical anomaly detection model for the successful accomplishment of a mineral exploration process in a complex geological environment. However, the complex geological environment of the prospecting area often results in the high-dimensional unknown complex population distribution of geochemical exploration data. This complex distribution is difficult to fit with a theoretical probability distribution model. As a result, it becomes a challenge to carry out an effective detection of geochemical anomalies. Therefore, to develop an anomaly detection model that can effectively fit the complex population distribution of geochemical exploration data is the key for accurately detecting geochemical anomalies. For this reason, the deep autoencoder Gaussian mixture model (DAGMM) was adopted to model the geochemical exploration data obtained in the 1:200,000 geological survey conducted in the Baishan area (Jilin, China) to check its superiority in identifying multivariate geochemical anomalies. As an innovative deep learning framework for unsupervised anomaly detection, DAGMM ingeniously combines the data dimensionality reduction and compression capabilities of a deep autoencoder (DAE) with the probability density estimation advantage of the Gaussian mixture model (GMM). The DAGMM model can deeply explore the deep-level features of geochemical exploration data and effectively model the complex unknown data distribution through the synergistically work and joint optimization strategy in training the DAE and GMM model, so it can accurately identify geochemical anomalies. To show the superiority of the DAGMM model in detecting polymetallic geochemical anomalies, the DAGMM model was compared with the GMM and DAE models. The receiver operating characteristic (ROC) curves of the three models were plotted, and the areas under the ROC curves (AUCs) and lift indices were calculated. The ROC curve of the DAGMM model dominates that of the DAE model and GMM model. The DAGMM model has an AUC of 0.904 and a lift index of 10.44, respectively, which are much larger than those of the GMM model (AUC = 0.858, lift index = 3.63) and DAE model (AUC = 0.83, lift index = 5.31). Therefore, the DAGMM model significantly outperforms the other two models in detecting multivariate geochemical anomalies and the polymetallic geochemical anomalies detected by the DAGMM model contain all the known polymetallic deposits. Compared with DAE and GMM, DAGMM is more efficient and more powerful in detecting multivariate geochemical anomalies in complex geological environments.</div></div>","PeriodicalId":16336,"journal":{"name":"Journal of Geochemical Exploration","volume":"271 ","pages":"Article 107671"},"PeriodicalIF":3.3000,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Geochemical Exploration","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0375674225000032","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/11 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"GEOCHEMISTRY & GEOPHYSICS","Score":null,"Total":0}
引用次数: 0

Abstract

It is of great significance to construct an efficient geochemical anomaly detection model for the successful accomplishment of a mineral exploration process in a complex geological environment. However, the complex geological environment of the prospecting area often results in the high-dimensional unknown complex population distribution of geochemical exploration data. This complex distribution is difficult to fit with a theoretical probability distribution model. As a result, it becomes a challenge to carry out an effective detection of geochemical anomalies. Therefore, to develop an anomaly detection model that can effectively fit the complex population distribution of geochemical exploration data is the key for accurately detecting geochemical anomalies. For this reason, the deep autoencoder Gaussian mixture model (DAGMM) was adopted to model the geochemical exploration data obtained in the 1:200,000 geological survey conducted in the Baishan area (Jilin, China) to check its superiority in identifying multivariate geochemical anomalies. As an innovative deep learning framework for unsupervised anomaly detection, DAGMM ingeniously combines the data dimensionality reduction and compression capabilities of a deep autoencoder (DAE) with the probability density estimation advantage of the Gaussian mixture model (GMM). The DAGMM model can deeply explore the deep-level features of geochemical exploration data and effectively model the complex unknown data distribution through the synergistically work and joint optimization strategy in training the DAE and GMM model, so it can accurately identify geochemical anomalies. To show the superiority of the DAGMM model in detecting polymetallic geochemical anomalies, the DAGMM model was compared with the GMM and DAE models. The receiver operating characteristic (ROC) curves of the three models were plotted, and the areas under the ROC curves (AUCs) and lift indices were calculated. The ROC curve of the DAGMM model dominates that of the DAE model and GMM model. The DAGMM model has an AUC of 0.904 and a lift index of 10.44, respectively, which are much larger than those of the GMM model (AUC = 0.858, lift index = 3.63) and DAE model (AUC = 0.83, lift index = 5.31). Therefore, the DAGMM model significantly outperforms the other two models in detecting multivariate geochemical anomalies and the polymetallic geochemical anomalies detected by the DAGMM model contain all the known polymetallic deposits. Compared with DAE and GMM, DAGMM is more efficient and more powerful in detecting multivariate geochemical anomalies in complex geological environments.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于高性能深度自编码器高斯混合模型的多变量地球化学异常无监督检测
构建高效的地球化学异常检测模型,对于在复杂地质环境下顺利完成找矿过程具有重要意义。然而,由于找矿区复杂的地质环境,往往导致化探资料呈现高维未知的复杂种群分布。这种复杂的分布很难用理论概率分布模型拟合。因此,如何进行有效的地球化学异常检测成为一个挑战。因此,建立能有效拟合化探数据复杂种群分布的异常检测模型是准确检测化探异常的关键。为此,采用深度自编码器高斯混合模型(DAGMM)对吉林白山地区1:20万地质调查化探数据进行建模,验证其在多元地球化学异常识别方面的优势。作为一种创新的无监督异常检测深度学习框架,DAGMM巧妙地将深度自编码器(DAE)的数据降维和压缩能力与高斯混合模型(GMM)的概率密度估计优势结合起来。DAGMM模型通过DAE和GMM模型训练中的协同工作和联合优化策略,能够深入挖掘化探数据的深层次特征,有效建模复杂的未知数据分布,从而准确识别地球化学异常。为了证明DAGMM模型在多金属地球化学异常检测中的优越性,将DAGMM模型与GMM和DAE模型进行了比较。绘制3种模型的受试者工作特征(ROC)曲线,计算ROC曲线下面积(aus)和升力指数。DAGMM模型的ROC曲线优于DAE模型和GMM模型。DAGMM模型的AUC为0.904,升力指数为10.44,远远大于GMM模型(AUC = 0.858,升力指数= 3.63)和DAE模型(AUC = 0.83,升力指数= 5.31)。因此,DAGMM模型在多元地球化学异常检测方面明显优于其他两种模型,并且DAGMM模型检测到的多金属地球化学异常包含了已知的所有多金属矿床。与DAE和GMM相比,DAGMM在复杂地质环境中检测多变量地球化学异常的效率更高,功能更强。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Journal of Geochemical Exploration
Journal of Geochemical Exploration 地学-地球化学与地球物理
CiteScore
7.40
自引率
7.70%
发文量
148
审稿时长
8.1 months
期刊介绍: Journal of Geochemical Exploration is mostly dedicated to publication of original studies in exploration and environmental geochemistry and related topics. Contributions considered of prevalent interest for the journal include researches based on the application of innovative methods to: define the genesis and the evolution of mineral deposits including transfer of elements in large-scale mineralized areas. analyze complex systems at the boundaries between bio-geochemistry, metal transport and mineral accumulation. evaluate effects of historical mining activities on the surface environment. trace pollutant sources and define their fate and transport models in the near-surface and surface environments involving solid, fluid and aerial matrices. assess and quantify natural and technogenic radioactivity in the environment. determine geochemical anomalies and set baseline reference values using compositional data analysis, multivariate statistics and geo-spatial analysis. assess the impacts of anthropogenic contamination on ecosystems and human health at local and regional scale to prioritize and classify risks through deterministic and stochastic approaches. Papers dedicated to the presentation of newly developed methods in analytical geochemistry to be applied in the field or in laboratory are also within the topics of interest for the journal.
期刊最新文献
Application of short-wave infrared spectroscopy in sandstone-type uranium deposits: A case study of the Yuejin-II uranium deposit in the Qaidam Basin, China Two-dimensional data examination by exploratory functional data analysis to improve detection of scattered soil contamination by Cu-bearing pesticides K-feldspar oscillatory zoning constraining efficient crystal fractionation in high-silica magma: A case study of the Dacaowu pluton, eastern China From geology to environmental management: Defining geochemical baselines for stream sediments in a mining-impacted Brazilian watershed Vapor phase facilitated rhenium enrichment in porphyry Mo deposit: Insights from thermodynamic simulations and molybdenum isotopes at Jinduicheng deposit, central China
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1