Mehrdad Daviran , Abbas Maghsoudi , Reza Ghezelbash
{"title":"Optimized AI-MPM: Application of PSO for tuning the hyperparameters of SVM and RF algorithms","authors":"Mehrdad Daviran , Abbas Maghsoudi , Reza Ghezelbash","doi":"10.1016/j.cageo.2024.105785","DOIUrl":null,"url":null,"abstract":"<div><div>Modern computational techniques, particularly Support Vector Machines (SVM) and Random Forest (RF) models, are revolutionizing predictive mineral prospectivity mapping. These advanced systems excel at identifying prime resource locations but require meticulous fine-tuning of their internal settings to achieve peak performance. Careful calibration of these configurations during the learning phase significantly enhances their ability to detect promising deposits. The main goal of this study is to introduce a hybrid model called PSO -SVM and PSO-RF, which aim to combine particle swarm optimization (PSO) with SVM (with RBF kernel) and RF models. This hybrid model automatically adjusts the optimized hyperparameters of SVM and RF, resulting in highly accurate predictions and a wide range of applicability. The PSO algorithm has been applied to fine-tune two main parameters (<em>C</em> and <em>λ</em>) for SVM-RBF and three main parameters (<em>N</em><sub><em>T</em></sub>, <em>N</em><sub><em>S</em></sub>, and <em>d</em>) for RF, creating efficient models for both. The proposed hybrid model as well as the traditional versions of SVM and RF models, were tested using a geo-spatial dataset related to Cu mineralization in Kerman belt, SE Iran. Forecasting algorithms were developed by integrating diverse datasets: multi-element concentrations from stream samples, bedrock and fault line maps, indicators of hot fluid interaction, aeromagnetic survey results, coordinates of previously identified copper-rich igneous intrusions, and verified ore body positions as reference points. The models' performance was evaluated using four validation methods: Multi-round data partitioning (K-fold), error classification tables (confusion matrix), true-positive vs. false-positive graphical analysis ((ROC) curve), and P-A plot were used to assess algorithms and models performance. Tests revealed that the PSO-SVM surpassed all competitors. Impressively, this fine-tuned classifier identified prime target zones in merely one-seventh of the region (14%), yet these areas encompassed nearly all verified resource sites (97%).</div></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"195 ","pages":"Article 105785"},"PeriodicalIF":4.2000,"publicationDate":"2024-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Geosciences","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0098300424002681","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Modern computational techniques, particularly Support Vector Machines (SVM) and Random Forest (RF) models, are revolutionizing predictive mineral prospectivity mapping. These advanced systems excel at identifying prime resource locations but require meticulous fine-tuning of their internal settings to achieve peak performance. Careful calibration of these configurations during the learning phase significantly enhances their ability to detect promising deposits. The main goal of this study is to introduce a hybrid model called PSO -SVM and PSO-RF, which aim to combine particle swarm optimization (PSO) with SVM (with RBF kernel) and RF models. This hybrid model automatically adjusts the optimized hyperparameters of SVM and RF, resulting in highly accurate predictions and a wide range of applicability. The PSO algorithm has been applied to fine-tune two main parameters (C and λ) for SVM-RBF and three main parameters (NT, NS, and d) for RF, creating efficient models for both. The proposed hybrid model as well as the traditional versions of SVM and RF models, were tested using a geo-spatial dataset related to Cu mineralization in Kerman belt, SE Iran. Forecasting algorithms were developed by integrating diverse datasets: multi-element concentrations from stream samples, bedrock and fault line maps, indicators of hot fluid interaction, aeromagnetic survey results, coordinates of previously identified copper-rich igneous intrusions, and verified ore body positions as reference points. The models' performance was evaluated using four validation methods: Multi-round data partitioning (K-fold), error classification tables (confusion matrix), true-positive vs. false-positive graphical analysis ((ROC) curve), and P-A plot were used to assess algorithms and models performance. Tests revealed that the PSO-SVM surpassed all competitors. Impressively, this fine-tuned classifier identified prime target zones in merely one-seventh of the region (14%), yet these areas encompassed nearly all verified resource sites (97%).
期刊介绍:
Computers & Geosciences publishes high impact, original research at the interface between Computer Sciences and Geosciences. Publications should apply modern computer science paradigms, whether computational or informatics-based, to address problems in the geosciences.