Pengpeng Yu , Yuan Liu , Hanyu Wang , Xi Chen , Yi Zheng , Wei Cao , Yiqu Xiong , Hongxiang Shan
{"title":"Machine learning of pyrite geochemistry reconstructs the multi-stage history of mineral deposits","authors":"Pengpeng Yu , Yuan Liu , Hanyu Wang , Xi Chen , Yi Zheng , Wei Cao , Yiqu Xiong , Hongxiang Shan","doi":"10.1016/j.gsf.2025.102011","DOIUrl":null,"url":null,"abstract":"<div><div>The application of machine learning for pyrite discrimination establishes a robust foundation for constructing the ore-forming history of multi-stage deposits; however, published models face challenges related to limited, imbalanced datasets and oversampling. In this study, the dataset was expanded to approximately 500 samples for each type, including 508 sedimentary, 573 orogenic gold, 548 sedimentary exhalative (SEDEX) deposits, and 364 volcanogenic massive sulfides (VMS) pyrites, utilizing random forest (RF) and support vector machine (SVM) methodologies to enhance the reliability of the classifier models. The RF classifier achieved an overall accuracy of 99.8%, and the SVM classifier attained an overall accuracy of 100%. The model was evaluated by a five-fold cross-validation approach with 93.8% accuracy for the RF and 94.9% for the SVM classifier. These results demonstrate the strong feasibility of pyrite classification, supported by a relatively large, balanced dataset and high accuracy rates. The classifier was employed to reveal the genesis of the controversial Keketale Pb-Zn deposit in NW China, which has been inconclusive among SEDEX, VMS, or a SEDEX-VMS transition. Petrographic investigations indicated that the deposit comprises early fine-grained layered pyrite (Py1) and late recrystallized pyrite (Py2). The majority voting classified Py1 as the VMS type, with an accuracy of RF and SVM being 72.2% and 75%, respectively, and confirmed Py2 as an orogenic type with 74.3% and 77.1% accuracy, respectively. The new findings indicated that the Keketale deposit originated from a submarine VMS mineralization system, followed by late orogenic-type overprinting of metamorphism and deformation, which is consistent with the geological and geochemical observations. This study further emphasizes the advantages of Machine learning (ML) methods in accurately and directly discriminating the deposit types and reconstructing the formation history of multi-stage deposits.</div></div>","PeriodicalId":12711,"journal":{"name":"Geoscience frontiers","volume":"16 3","pages":"Article 102011"},"PeriodicalIF":8.5000,"publicationDate":"2025-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Geoscience frontiers","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1674987125000118","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GEOSCIENCES, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
The application of machine learning for pyrite discrimination establishes a robust foundation for constructing the ore-forming history of multi-stage deposits; however, published models face challenges related to limited, imbalanced datasets and oversampling. In this study, the dataset was expanded to approximately 500 samples for each type, including 508 sedimentary, 573 orogenic gold, 548 sedimentary exhalative (SEDEX) deposits, and 364 volcanogenic massive sulfides (VMS) pyrites, utilizing random forest (RF) and support vector machine (SVM) methodologies to enhance the reliability of the classifier models. The RF classifier achieved an overall accuracy of 99.8%, and the SVM classifier attained an overall accuracy of 100%. The model was evaluated by a five-fold cross-validation approach with 93.8% accuracy for the RF and 94.9% for the SVM classifier. These results demonstrate the strong feasibility of pyrite classification, supported by a relatively large, balanced dataset and high accuracy rates. The classifier was employed to reveal the genesis of the controversial Keketale Pb-Zn deposit in NW China, which has been inconclusive among SEDEX, VMS, or a SEDEX-VMS transition. Petrographic investigations indicated that the deposit comprises early fine-grained layered pyrite (Py1) and late recrystallized pyrite (Py2). The majority voting classified Py1 as the VMS type, with an accuracy of RF and SVM being 72.2% and 75%, respectively, and confirmed Py2 as an orogenic type with 74.3% and 77.1% accuracy, respectively. The new findings indicated that the Keketale deposit originated from a submarine VMS mineralization system, followed by late orogenic-type overprinting of metamorphism and deformation, which is consistent with the geological and geochemical observations. This study further emphasizes the advantages of Machine learning (ML) methods in accurately and directly discriminating the deposit types and reconstructing the formation history of multi-stage deposits.
Geoscience frontiersEarth and Planetary Sciences-General Earth and Planetary Sciences
CiteScore
17.80
自引率
3.40%
发文量
147
审稿时长
35 days
期刊介绍:
Geoscience Frontiers (GSF) is the Journal of China University of Geosciences (Beijing) and Peking University. It publishes peer-reviewed research articles and reviews in interdisciplinary fields of Earth and Planetary Sciences. GSF covers various research areas including petrology and geochemistry, lithospheric architecture and mantle dynamics, global tectonics, economic geology and fuel exploration, geophysics, stratigraphy and paleontology, environmental and engineering geology, astrogeology, and the nexus of resources-energy-emissions-climate under Sustainable Development Goals. The journal aims to bridge innovative, provocative, and challenging concepts and models in these fields, providing insights on correlations and evolution.