A SMOTified-GAN-augmented bagging ensemble model of extreme learning machines for detecting geochemical anomalies associated with mineralization

IF 2.9 3区 地球科学 Q2 GEOCHEMISTRY & GEOPHYSICS Chemie Der Erde-Geochemistry Pub Date : 2024-11-01 Epub Date: 2024-06-14 DOI:10.1016/j.chemer.2024.126156
Min Guo, Yongliang Chen
{"title":"A SMOTified-GAN-augmented bagging ensemble model of extreme learning machines for detecting geochemical anomalies associated with mineralization","authors":"Min Guo,&nbsp;Yongliang Chen","doi":"10.1016/j.chemer.2024.126156","DOIUrl":null,"url":null,"abstract":"<div><div><span>The use of supervised machine learning and deep learning techniques to automatically detect geochemical anomalies associated with mineralization has become a current research hotspot. However, due to the scarcity of known mineral deposits in the study area, the establishment of supervised machine learning and deep learning models faces the challenge of highly imbalanced data classification. To address this challenge, the SMOTified-GAN oversampling technique and bagging strategy were combined with extreme learning machines (ELMs) to construct a robust high-performance ensemble classification model for detecting geochemical anomalies associated with mineralization. In this ensemble model, SMOTified-GAN is used to balance the ratio of positive (mineralized) to negative (background) samples in geochemical exploration data set, while keeping the sample distribution pattern of positive samples unchanged. The bagging strategy is used to construct a robust ensemble model from the simple ELM classifiers to improve the robustness of the supervised anomaly detection model. Taking the Helong area (Jilin Province, China) as the case study area, three bagging ensemble models of the simple ELM classifiers with SMOTified-GAN, GAN and SMOTE augmentations were established on the 1: 50,000 </span>stream sediment survey data, and used to automatically detect geochemical anomalies associated with polymetallic mineralization. The receiver operating characteristic (ROC) curve of the three ensemble models are very close to the upper left corner the of the ROC space, with the SMOTified-GAN augmented bagging ensemble model dominating the other two; and the area under the ROC curves (AUCs) of the three ensemble models are very close to 1.0 (0.99998, 0.99681, and 0.96803, respectively). Therefore, in terms of ROC curves and AUCs, the SMOTified-GAN augmented bagging ensemble model has the best performance in detecting geochemical anomalies associated with polymetallic mineralization. In addition, the geochemical anomalies associated with polymetallic mineralization detected by the SMOTified-GAN augmented bagging ensemble model have the close spatial correlation with the ore-forming control factors in the study area, and are mainly distributed around known polymetallic deposits. In other words, a bagging ensemble model with high performance can be constructed from the simple ELM classifiers with SMOTified-GAN augmentation in detecting geochemical anomalies associated with mineralization.</div></div>","PeriodicalId":55973,"journal":{"name":"Chemie Der Erde-Geochemistry","volume":"84 4","pages":"Article 126156"},"PeriodicalIF":2.9000,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chemie Der Erde-Geochemistry","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0009281924000813","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/6/14 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"GEOCHEMISTRY & GEOPHYSICS","Score":null,"Total":0}
引用次数: 0

Abstract

The use of supervised machine learning and deep learning techniques to automatically detect geochemical anomalies associated with mineralization has become a current research hotspot. However, due to the scarcity of known mineral deposits in the study area, the establishment of supervised machine learning and deep learning models faces the challenge of highly imbalanced data classification. To address this challenge, the SMOTified-GAN oversampling technique and bagging strategy were combined with extreme learning machines (ELMs) to construct a robust high-performance ensemble classification model for detecting geochemical anomalies associated with mineralization. In this ensemble model, SMOTified-GAN is used to balance the ratio of positive (mineralized) to negative (background) samples in geochemical exploration data set, while keeping the sample distribution pattern of positive samples unchanged. The bagging strategy is used to construct a robust ensemble model from the simple ELM classifiers to improve the robustness of the supervised anomaly detection model. Taking the Helong area (Jilin Province, China) as the case study area, three bagging ensemble models of the simple ELM classifiers with SMOTified-GAN, GAN and SMOTE augmentations were established on the 1: 50,000 stream sediment survey data, and used to automatically detect geochemical anomalies associated with polymetallic mineralization. The receiver operating characteristic (ROC) curve of the three ensemble models are very close to the upper left corner the of the ROC space, with the SMOTified-GAN augmented bagging ensemble model dominating the other two; and the area under the ROC curves (AUCs) of the three ensemble models are very close to 1.0 (0.99998, 0.99681, and 0.96803, respectively). Therefore, in terms of ROC curves and AUCs, the SMOTified-GAN augmented bagging ensemble model has the best performance in detecting geochemical anomalies associated with polymetallic mineralization. In addition, the geochemical anomalies associated with polymetallic mineralization detected by the SMOTified-GAN augmented bagging ensemble model have the close spatial correlation with the ore-forming control factors in the study area, and are mainly distributed around known polymetallic deposits. In other words, a bagging ensemble model with high performance can be constructed from the simple ELM classifiers with SMOTified-GAN augmentation in detecting geochemical anomalies associated with mineralization.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
用于探测与矿化有关的地球化学异常现象的极限学习机 SMOTified-GAN 增强型袋式集合模型
利用监督式机器学习和深度学习技术自动检测与矿化相关的地球化学异常已成为当前的研究热点。然而,由于研究区域已知矿床的稀缺性,有监督机器学习和深度学习模型的建立面临着数据分类高度不平衡的挑战。为了解决这一挑战,将SMOTified-GAN过采样技术和套袋策略与极限学习机(elm)相结合,构建了一个鲁棒的高性能集合分类模型,用于检测与矿化相关的地球化学异常。在该集成模型中,利用SMOTified-GAN平衡化探数据集中正(矿化)与负(背景)样本的比例,同时保持正样本的样本分布模式不变。采用bagging策略从简单ELM分类器中构建鲁棒集成模型,提高监督异常检测模型的鲁棒性。以吉林和龙地区为例,在1:5万水系沉积物测量数据基础上,建立了smotifzed -GAN、GAN和SMOTE增强的简单ELM分类器3种bagging集合模型,并用于多金属成矿相关地球化学异常的自动检测。三种集成模型的接收者工作特征(ROC)曲线都非常接近于ROC空间的左上角,其中SMOTified-GAN增强套袋集成模型占主导地位;三种集合模型的ROC曲线下面积(auc)均非常接近1.0,分别为0.99998、0.99681和0.96803。因此,在ROC曲线和auc方面,smotifid - gan增强bagging系综模型在探测多金属成矿相关地球化学异常方面表现最好。此外,SMOTified-GAN增强bagging系综模型探测到的多金属成矿相关地球化学异常与研究区成矿控制因素具有密切的空间相关性,且主要分布在已知多金属矿床周围。换句话说,通过简单的ELM分类器和smotifed - gan增强,可以构建一个高性能的bagging集合模型,用于检测与矿化相关的地球化学异常。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Chemie Der Erde-Geochemistry
Chemie Der Erde-Geochemistry 地学-地球化学与地球物理
CiteScore
7.10
自引率
0.00%
发文量
40
审稿时长
3.0 months
期刊介绍: GEOCHEMISTRY was founded as Chemie der Erde 1914 in Jena, and, hence, is one of the oldest journals for geochemistry-related topics. GEOCHEMISTRY (formerly Chemie der Erde / Geochemistry) publishes original research papers, short communications, reviews of selected topics, and high-class invited review articles addressed at broad geosciences audience. Publications dealing with interdisciplinary questions are particularly welcome. Young scientists are especially encouraged to submit their work. Contributions will be published exclusively in English. The journal, through very personalized consultation and its worldwide distribution, offers entry into the world of international scientific communication, and promotes interdisciplinary discussion on chemical problems in a broad spectrum of geosciences. The following topics are covered by the expertise of the members of the editorial board (see below): -cosmochemistry, meteoritics- igneous, metamorphic, and sedimentary petrology- volcanology- low & high temperature geochemistry- experimental - theoretical - field related studies- mineralogy - crystallography- environmental geosciences- archaeometry
期刊最新文献
Trace element, Nd-Hf-Pb isotope geochemistry, and zircon U-Pb geochronology of the Baskil Dikes (Elazığ, Eastern Türkiye): Tracing Late Cretaceous continental arc magmatism in the Southern Neotethys Petrogenesis and tectonic setting of peridotites and associated dnite from the Tam Ky-Phuoc Son Ophiolite Complex, Central Vietnam Shear-deformation related Li-Rb-Cs mineralization: The case study from mineral chemistry and 40Ar39Ar ages of phlogopite in the Gaoligong Shear Zone, eastern Tibetan Mineralogy, geochemistry and genesis of newly discovered Sinandede kaolin deposit (Balıkesir, NW Türkiye): Potential applications Mineral composition, trace element geochemistry and metallogenic processes of the early Cambrian black rock series-hosted Xiajiadian gold‑vanadium deposit, southern Qinling, China
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1