Optimized AI-MPM: Application of PSO for tuning the hyperparameters of SVM and RF algorithms

IF 4.2 2区 地球科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Computers & Geosciences Pub Date : 2024-11-20 DOI:10.1016/j.cageo.2024.105785
Mehrdad Daviran , Abbas Maghsoudi , Reza Ghezelbash
{"title":"Optimized AI-MPM: Application of PSO for tuning the hyperparameters of SVM and RF algorithms","authors":"Mehrdad Daviran ,&nbsp;Abbas Maghsoudi ,&nbsp;Reza Ghezelbash","doi":"10.1016/j.cageo.2024.105785","DOIUrl":null,"url":null,"abstract":"<div><div>Modern computational techniques, particularly Support Vector Machines (SVM) and Random Forest (RF) models, are revolutionizing predictive mineral prospectivity mapping. These advanced systems excel at identifying prime resource locations but require meticulous fine-tuning of their internal settings to achieve peak performance. Careful calibration of these configurations during the learning phase significantly enhances their ability to detect promising deposits. The main goal of this study is to introduce a hybrid model called PSO -SVM and PSO-RF, which aim to combine particle swarm optimization (PSO) with SVM (with RBF kernel) and RF models. This hybrid model automatically adjusts the optimized hyperparameters of SVM and RF, resulting in highly accurate predictions and a wide range of applicability. The PSO algorithm has been applied to fine-tune two main parameters (<em>C</em> and <em>λ</em>) for SVM-RBF and three main parameters (<em>N</em><sub><em>T</em></sub>, <em>N</em><sub><em>S</em></sub>, and <em>d</em>) for RF, creating efficient models for both. The proposed hybrid model as well as the traditional versions of SVM and RF models, were tested using a geo-spatial dataset related to Cu mineralization in Kerman belt, SE Iran. Forecasting algorithms were developed by integrating diverse datasets: multi-element concentrations from stream samples, bedrock and fault line maps, indicators of hot fluid interaction, aeromagnetic survey results, coordinates of previously identified copper-rich igneous intrusions, and verified ore body positions as reference points. The models' performance was evaluated using four validation methods: Multi-round data partitioning (K-fold), error classification tables (confusion matrix), true-positive vs. false-positive graphical analysis ((ROC) curve), and P-A plot were used to assess algorithms and models performance. Tests revealed that the PSO-SVM surpassed all competitors. Impressively, this fine-tuned classifier identified prime target zones in merely one-seventh of the region (14%), yet these areas encompassed nearly all verified resource sites (97%).</div></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"195 ","pages":"Article 105785"},"PeriodicalIF":4.2000,"publicationDate":"2024-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Geosciences","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0098300424002681","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

Abstract

Modern computational techniques, particularly Support Vector Machines (SVM) and Random Forest (RF) models, are revolutionizing predictive mineral prospectivity mapping. These advanced systems excel at identifying prime resource locations but require meticulous fine-tuning of their internal settings to achieve peak performance. Careful calibration of these configurations during the learning phase significantly enhances their ability to detect promising deposits. The main goal of this study is to introduce a hybrid model called PSO -SVM and PSO-RF, which aim to combine particle swarm optimization (PSO) with SVM (with RBF kernel) and RF models. This hybrid model automatically adjusts the optimized hyperparameters of SVM and RF, resulting in highly accurate predictions and a wide range of applicability. The PSO algorithm has been applied to fine-tune two main parameters (C and λ) for SVM-RBF and three main parameters (NT, NS, and d) for RF, creating efficient models for both. The proposed hybrid model as well as the traditional versions of SVM and RF models, were tested using a geo-spatial dataset related to Cu mineralization in Kerman belt, SE Iran. Forecasting algorithms were developed by integrating diverse datasets: multi-element concentrations from stream samples, bedrock and fault line maps, indicators of hot fluid interaction, aeromagnetic survey results, coordinates of previously identified copper-rich igneous intrusions, and verified ore body positions as reference points. The models' performance was evaluated using four validation methods: Multi-round data partitioning (K-fold), error classification tables (confusion matrix), true-positive vs. false-positive graphical analysis ((ROC) curve), and P-A plot were used to assess algorithms and models performance. Tests revealed that the PSO-SVM surpassed all competitors. Impressively, this fine-tuned classifier identified prime target zones in merely one-seventh of the region (14%), yet these areas encompassed nearly all verified resource sites (97%).
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
优化的 AI-MPM:应用 PSO 调整 SVM 和 RF 算法的超参数
现代计算技术,尤其是支持向量机(SVM)和随机森林(RF)模型,正在彻底改变矿产远景预测绘图。这些先进的系统在确定主要资源位置方面表现出色,但需要对其内部设置进行细致的微调才能达到最佳性能。在学习阶段对这些配置进行仔细校准,可大大提高其探测有潜力矿藏的能力。本研究的主要目标是引入一种名为 PSO -SVM 和 PSO-RF 的混合模型,旨在将粒子群优化(PSO)与 SVM(带 RBF 内核)和 RF 模型相结合。这种混合模型可自动调整 SVM 和 RF 的优化超参数,从而实现高精度预测和广泛的适用性。PSO 算法被用于微调 SVM-RBF 的两个主要参数(C 和 λ)和 RF 的三个主要参数(NT、NS 和 d),为两者创建了高效模型。我们使用与伊朗东南部克尔曼矿带铜矿化相关的地理空间数据集对所提出的混合模型以及传统版本的 SVM 和 RF 模型进行了测试。预测算法是通过整合多种数据集而开发的:溪流样本中的多元素浓度、基岩和断层线图、热流体相互作用指标、航磁勘测结果、先前确定的富铜火成岩侵入体坐标以及作为参考点的已验证矿体位置。使用四种验证方法对模型的性能进行了评估:使用多轮数据分区(K-fold)、误差分类表(混淆矩阵)、真阳性与假阳性图形分析((ROC)曲线)和 P-A 图来评估算法和模型的性能。测试表明,PSO-SVM 超越了所有竞争对手。令人印象深刻的是,这种经过微调的分类器仅在七分之一的区域(14%)识别出了主要目标区,但这些区域几乎涵盖了所有经过验证的资源点(97%)。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Computers & Geosciences
Computers & Geosciences 地学-地球科学综合
CiteScore
9.30
自引率
6.80%
发文量
164
审稿时长
3.4 months
期刊介绍: Computers & Geosciences publishes high impact, original research at the interface between Computer Sciences and Geosciences. Publications should apply modern computer science paradigms, whether computational or informatics-based, to address problems in the geosciences.
期刊最新文献
Fire-Image-DenseNet (FIDN) for predicting wildfire burnt area using remote sensing data Efficient reservoir characterization using dimensionless ensemble smoother and multiple data assimilation in damaged multilayer systems Shear wave velocity prediction based on bayesian-optimized multi-head attention mechanism and CNN-BiLSTM Multivariate simulation using a locally varying coregionalization model Automatic variogram calculation and modeling
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1