Mahsa Hajihosseinlou , Abbas Maghsoudi , Reza Ghezelbash
{"title":"全面评估 OPTICS、GMM 和 K-means 聚类方法在与样本集水盆地相关的地球化学异常检测中的应用","authors":"Mahsa Hajihosseinlou , Abbas Maghsoudi , Reza Ghezelbash","doi":"10.1016/j.chemer.2024.126094","DOIUrl":null,"url":null,"abstract":"<div><p>The process of data-driven clustering to uncover geochemical anomalies linked to sample catchment basins (SCBs) includes a comprehensive framework to discern areas exhibiting unique geochemical attributes within a specified study area. The Ordering Points to Identify the Clustering Structure (OPTICS) method can serve as a robust methodology for detecting geochemical anomalies in SCBs. This is attributed to its capacity to effectively manage varying cluster densities, adaptively identify cluster numbers, exhibit resilience to noise, and display minimum sensitivity to parameters. A comparison was conducted in this research between the outcomes of the OPTICS clustering algorithm and two traditional clustering techniques, namely the Gaussian Mixture Model (GMM) and K-means clustering. In the following, the Expectation-Maximization (EM) technique is employed to train the GMM for clustering. Moreover, the Akaike Information Criterion (AIC) and Bayesian Information Criterion (BIC) as two validate statistical metrics implemented to ascertain the optimal number of components (clusters) belong to the GMM. It should be noted that the effectiveness of the clustering algorithms was further assessed using the Calinski-Harabasz (CH) index and the success-rate curves. OPTICS, a density-based clustering approach, was confirmed to be more effective than K-means and GMM for identifying MVT Pb<img>Zn anomalies in Varcheh district, western Iran. Furthermore, the specified anomalies show a geo-spatial correspondence with the geological facts, and it has been observed that strong anomalies are more discoverable in close proximity to MVT Pb<img>Zn occurrences. This work suggests a novel anomaly detection approach based on OPTICS, which exhibits superior performance and data-modeling efficiency. The main emphasis is on effectively distinguishing geochemical anomalies from sample data originating from populations with uncertain distributions.</p></div>","PeriodicalId":55973,"journal":{"name":"Chemie Der Erde-Geochemistry","volume":"84 2","pages":"Article 126094"},"PeriodicalIF":2.6000,"publicationDate":"2024-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A comprehensive evaluation of OPTICS, GMM and K-means clustering methodologies for geochemical anomaly detection connected with sample catchment basins\",\"authors\":\"Mahsa Hajihosseinlou , Abbas Maghsoudi , Reza Ghezelbash\",\"doi\":\"10.1016/j.chemer.2024.126094\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>The process of data-driven clustering to uncover geochemical anomalies linked to sample catchment basins (SCBs) includes a comprehensive framework to discern areas exhibiting unique geochemical attributes within a specified study area. The Ordering Points to Identify the Clustering Structure (OPTICS) method can serve as a robust methodology for detecting geochemical anomalies in SCBs. This is attributed to its capacity to effectively manage varying cluster densities, adaptively identify cluster numbers, exhibit resilience to noise, and display minimum sensitivity to parameters. A comparison was conducted in this research between the outcomes of the OPTICS clustering algorithm and two traditional clustering techniques, namely the Gaussian Mixture Model (GMM) and K-means clustering. In the following, the Expectation-Maximization (EM) technique is employed to train the GMM for clustering. Moreover, the Akaike Information Criterion (AIC) and Bayesian Information Criterion (BIC) as two validate statistical metrics implemented to ascertain the optimal number of components (clusters) belong to the GMM. It should be noted that the effectiveness of the clustering algorithms was further assessed using the Calinski-Harabasz (CH) index and the success-rate curves. OPTICS, a density-based clustering approach, was confirmed to be more effective than K-means and GMM for identifying MVT Pb<img>Zn anomalies in Varcheh district, western Iran. Furthermore, the specified anomalies show a geo-spatial correspondence with the geological facts, and it has been observed that strong anomalies are more discoverable in close proximity to MVT Pb<img>Zn occurrences. This work suggests a novel anomaly detection approach based on OPTICS, which exhibits superior performance and data-modeling efficiency. The main emphasis is on effectively distinguishing geochemical anomalies from sample data originating from populations with uncertain distributions.</p></div>\",\"PeriodicalId\":55973,\"journal\":{\"name\":\"Chemie Der Erde-Geochemistry\",\"volume\":\"84 2\",\"pages\":\"Article 126094\"},\"PeriodicalIF\":2.6000,\"publicationDate\":\"2024-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Chemie Der Erde-Geochemistry\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0009281924000187\",\"RegionNum\":3,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"GEOCHEMISTRY & GEOPHYSICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chemie Der Erde-Geochemistry","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0009281924000187","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GEOCHEMISTRY & GEOPHYSICS","Score":null,"Total":0}
A comprehensive evaluation of OPTICS, GMM and K-means clustering methodologies for geochemical anomaly detection connected with sample catchment basins
The process of data-driven clustering to uncover geochemical anomalies linked to sample catchment basins (SCBs) includes a comprehensive framework to discern areas exhibiting unique geochemical attributes within a specified study area. The Ordering Points to Identify the Clustering Structure (OPTICS) method can serve as a robust methodology for detecting geochemical anomalies in SCBs. This is attributed to its capacity to effectively manage varying cluster densities, adaptively identify cluster numbers, exhibit resilience to noise, and display minimum sensitivity to parameters. A comparison was conducted in this research between the outcomes of the OPTICS clustering algorithm and two traditional clustering techniques, namely the Gaussian Mixture Model (GMM) and K-means clustering. In the following, the Expectation-Maximization (EM) technique is employed to train the GMM for clustering. Moreover, the Akaike Information Criterion (AIC) and Bayesian Information Criterion (BIC) as two validate statistical metrics implemented to ascertain the optimal number of components (clusters) belong to the GMM. It should be noted that the effectiveness of the clustering algorithms was further assessed using the Calinski-Harabasz (CH) index and the success-rate curves. OPTICS, a density-based clustering approach, was confirmed to be more effective than K-means and GMM for identifying MVT PbZn anomalies in Varcheh district, western Iran. Furthermore, the specified anomalies show a geo-spatial correspondence with the geological facts, and it has been observed that strong anomalies are more discoverable in close proximity to MVT PbZn occurrences. This work suggests a novel anomaly detection approach based on OPTICS, which exhibits superior performance and data-modeling efficiency. The main emphasis is on effectively distinguishing geochemical anomalies from sample data originating from populations with uncertain distributions.
期刊介绍:
GEOCHEMISTRY was founded as Chemie der Erde 1914 in Jena, and, hence, is one of the oldest journals for geochemistry-related topics.
GEOCHEMISTRY (formerly Chemie der Erde / Geochemistry) publishes original research papers, short communications, reviews of selected topics, and high-class invited review articles addressed at broad geosciences audience. Publications dealing with interdisciplinary questions are particularly welcome. Young scientists are especially encouraged to submit their work. Contributions will be published exclusively in English. The journal, through very personalized consultation and its worldwide distribution, offers entry into the world of international scientific communication, and promotes interdisciplinary discussion on chemical problems in a broad spectrum of geosciences.
The following topics are covered by the expertise of the members of the editorial board (see below):
-cosmochemistry, meteoritics-
igneous, metamorphic, and sedimentary petrology-
volcanology-
low & high temperature geochemistry-
experimental - theoretical - field related studies-
mineralogy - crystallography-
environmental geosciences-
archaeometry