全面评估 OPTICS、GMM 和 K-means 聚类方法在与样本集水盆地相关的地球化学异常检测中的应用

IF 2.6 3区 地球科学 Q2 GEOCHEMISTRY & GEOPHYSICS Chemie Der Erde-Geochemistry Pub Date : 2024-05-01 DOI:10.1016/j.chemer.2024.126094
Mahsa Hajihosseinlou , Abbas Maghsoudi , Reza Ghezelbash
{"title":"全面评估 OPTICS、GMM 和 K-means 聚类方法在与样本集水盆地相关的地球化学异常检测中的应用","authors":"Mahsa Hajihosseinlou ,&nbsp;Abbas Maghsoudi ,&nbsp;Reza Ghezelbash","doi":"10.1016/j.chemer.2024.126094","DOIUrl":null,"url":null,"abstract":"<div><p>The process of data-driven clustering to uncover geochemical anomalies linked to sample catchment basins (SCBs) includes a comprehensive framework to discern areas exhibiting unique geochemical attributes within a specified study area. The Ordering Points to Identify the Clustering Structure (OPTICS) method can serve as a robust methodology for detecting geochemical anomalies in SCBs. This is attributed to its capacity to effectively manage varying cluster densities, adaptively identify cluster numbers, exhibit resilience to noise, and display minimum sensitivity to parameters. A comparison was conducted in this research between the outcomes of the OPTICS clustering algorithm and two traditional clustering techniques, namely the Gaussian Mixture Model (GMM) and K-means clustering. In the following, the Expectation-Maximization (EM) technique is employed to train the GMM for clustering. Moreover, the Akaike Information Criterion (AIC) and Bayesian Information Criterion (BIC) as two validate statistical metrics implemented to ascertain the optimal number of components (clusters) belong to the GMM. It should be noted that the effectiveness of the clustering algorithms was further assessed using the Calinski-Harabasz (CH) index and the success-rate curves. OPTICS, a density-based clustering approach, was confirmed to be more effective than K-means and GMM for identifying MVT Pb<img>Zn anomalies in Varcheh district, western Iran. Furthermore, the specified anomalies show a geo-spatial correspondence with the geological facts, and it has been observed that strong anomalies are more discoverable in close proximity to MVT Pb<img>Zn occurrences. This work suggests a novel anomaly detection approach based on OPTICS, which exhibits superior performance and data-modeling efficiency. The main emphasis is on effectively distinguishing geochemical anomalies from sample data originating from populations with uncertain distributions.</p></div>","PeriodicalId":55973,"journal":{"name":"Chemie Der Erde-Geochemistry","volume":"84 2","pages":"Article 126094"},"PeriodicalIF":2.6000,"publicationDate":"2024-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A comprehensive evaluation of OPTICS, GMM and K-means clustering methodologies for geochemical anomaly detection connected with sample catchment basins\",\"authors\":\"Mahsa Hajihosseinlou ,&nbsp;Abbas Maghsoudi ,&nbsp;Reza Ghezelbash\",\"doi\":\"10.1016/j.chemer.2024.126094\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>The process of data-driven clustering to uncover geochemical anomalies linked to sample catchment basins (SCBs) includes a comprehensive framework to discern areas exhibiting unique geochemical attributes within a specified study area. The Ordering Points to Identify the Clustering Structure (OPTICS) method can serve as a robust methodology for detecting geochemical anomalies in SCBs. This is attributed to its capacity to effectively manage varying cluster densities, adaptively identify cluster numbers, exhibit resilience to noise, and display minimum sensitivity to parameters. A comparison was conducted in this research between the outcomes of the OPTICS clustering algorithm and two traditional clustering techniques, namely the Gaussian Mixture Model (GMM) and K-means clustering. In the following, the Expectation-Maximization (EM) technique is employed to train the GMM for clustering. Moreover, the Akaike Information Criterion (AIC) and Bayesian Information Criterion (BIC) as two validate statistical metrics implemented to ascertain the optimal number of components (clusters) belong to the GMM. It should be noted that the effectiveness of the clustering algorithms was further assessed using the Calinski-Harabasz (CH) index and the success-rate curves. OPTICS, a density-based clustering approach, was confirmed to be more effective than K-means and GMM for identifying MVT Pb<img>Zn anomalies in Varcheh district, western Iran. Furthermore, the specified anomalies show a geo-spatial correspondence with the geological facts, and it has been observed that strong anomalies are more discoverable in close proximity to MVT Pb<img>Zn occurrences. This work suggests a novel anomaly detection approach based on OPTICS, which exhibits superior performance and data-modeling efficiency. The main emphasis is on effectively distinguishing geochemical anomalies from sample data originating from populations with uncertain distributions.</p></div>\",\"PeriodicalId\":55973,\"journal\":{\"name\":\"Chemie Der Erde-Geochemistry\",\"volume\":\"84 2\",\"pages\":\"Article 126094\"},\"PeriodicalIF\":2.6000,\"publicationDate\":\"2024-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Chemie Der Erde-Geochemistry\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0009281924000187\",\"RegionNum\":3,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"GEOCHEMISTRY & GEOPHYSICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chemie Der Erde-Geochemistry","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0009281924000187","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GEOCHEMISTRY & GEOPHYSICS","Score":null,"Total":0}
引用次数: 0

摘要

通过数据驱动的聚类来发现与样本集水盆地(SCB)相关的地球化学异常现象的过程包括一个综合框架,用于识别特定研究区域内表现出独特地球化学属性的区域。点排序以识别聚类结构(OPTICS)方法可作为检测 SCB 中地球化学异常的可靠方法。这归功于该方法能够有效地管理不同的聚类密度,自适应地识别聚类数量,对噪声表现出弹性,并对参数表现出最小的敏感性。本研究比较了 OPTICS 聚类算法与两种传统聚类技术(即高斯混合模型 (GMM) 和 K-means 聚类)的结果。下文将采用期望最大化(EM)技术来训练用于聚类的 GMM。此外,还采用了 Akaike 信息准则(AIC)和贝叶斯信息准则(BIC)这两个有效的统计指标来确定属于 GMM 的最佳成分(聚类)数量。值得注意的是,聚类算法的有效性是通过卡林斯基-哈拉巴什(CH)指数和成功率曲线进一步评估的。在识别伊朗西部瓦尔切地区的 MVT 铅锌异常方面,基于密度的聚类方法 OPTICS 被证实比 K-means 和 GMM 更有效。此外,指定的异常点与地质事实显示出地理空间上的对应关系,并且观察到在靠近 MVT 铅锌矿点的地方更容易发现强异常点。这项工作提出了一种基于 OPTICS 的新型异常检测方法,该方法具有卓越的性能和数据建模效率。主要重点是从不确定分布的样本数据中有效区分地球化学异常。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
A comprehensive evaluation of OPTICS, GMM and K-means clustering methodologies for geochemical anomaly detection connected with sample catchment basins

The process of data-driven clustering to uncover geochemical anomalies linked to sample catchment basins (SCBs) includes a comprehensive framework to discern areas exhibiting unique geochemical attributes within a specified study area. The Ordering Points to Identify the Clustering Structure (OPTICS) method can serve as a robust methodology for detecting geochemical anomalies in SCBs. This is attributed to its capacity to effectively manage varying cluster densities, adaptively identify cluster numbers, exhibit resilience to noise, and display minimum sensitivity to parameters. A comparison was conducted in this research between the outcomes of the OPTICS clustering algorithm and two traditional clustering techniques, namely the Gaussian Mixture Model (GMM) and K-means clustering. In the following, the Expectation-Maximization (EM) technique is employed to train the GMM for clustering. Moreover, the Akaike Information Criterion (AIC) and Bayesian Information Criterion (BIC) as two validate statistical metrics implemented to ascertain the optimal number of components (clusters) belong to the GMM. It should be noted that the effectiveness of the clustering algorithms was further assessed using the Calinski-Harabasz (CH) index and the success-rate curves. OPTICS, a density-based clustering approach, was confirmed to be more effective than K-means and GMM for identifying MVT PbZn anomalies in Varcheh district, western Iran. Furthermore, the specified anomalies show a geo-spatial correspondence with the geological facts, and it has been observed that strong anomalies are more discoverable in close proximity to MVT PbZn occurrences. This work suggests a novel anomaly detection approach based on OPTICS, which exhibits superior performance and data-modeling efficiency. The main emphasis is on effectively distinguishing geochemical anomalies from sample data originating from populations with uncertain distributions.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Chemie Der Erde-Geochemistry
Chemie Der Erde-Geochemistry 地学-地球化学与地球物理
CiteScore
7.10
自引率
0.00%
发文量
40
审稿时长
3.0 months
期刊介绍: GEOCHEMISTRY was founded as Chemie der Erde 1914 in Jena, and, hence, is one of the oldest journals for geochemistry-related topics. GEOCHEMISTRY (formerly Chemie der Erde / Geochemistry) publishes original research papers, short communications, reviews of selected topics, and high-class invited review articles addressed at broad geosciences audience. Publications dealing with interdisciplinary questions are particularly welcome. Young scientists are especially encouraged to submit their work. Contributions will be published exclusively in English. The journal, through very personalized consultation and its worldwide distribution, offers entry into the world of international scientific communication, and promotes interdisciplinary discussion on chemical problems in a broad spectrum of geosciences. The following topics are covered by the expertise of the members of the editorial board (see below): -cosmochemistry, meteoritics- igneous, metamorphic, and sedimentary petrology- volcanology- low & high temperature geochemistry- experimental - theoretical - field related studies- mineralogy - crystallography- environmental geosciences- archaeometry
期刊最新文献
Editorial Board Contrasting fluids and implications for ore genesis in the Jiawula-Chaganbulagen Porphyry Mo-epithermal PbZn metallogenetic system: Evidence from fluid inclusions and H-O-He-Ar isotopes Ediacaran anorogenic alkaline magmatism and wolframite mineralization linked to mantle plume activity in the north Arabian-Nubian Shield (Egypt) A hydrous sub-arc mantle domain within the northeastern Neo-Tethyan ophiolites: Insights from cumulate hornblendites Hydrothermal alteration of accessory minerals (allanite and titanite) in the late Archean Closepet granitoid (Dharwar Craton, India): A TEM study
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1