首页 > 最新文献

Artificial Intelligence in Geosciences最新文献

英文 中文
Machine learning assisted enhancement of petrophysical property dataset of fractured Variscan granites of the Cornubian Batholith, SW UK 机器学习辅助增强了英国西南部Cornubian岩基裂缝Variscan花岗岩的岩石物性数据集
IF 4.2 Pub Date : 2025-12-01 Epub Date: 2025-08-22 DOI: 10.1016/j.aiig.2025.100151
A. Turan , E. Artun , I. Sass
Outcrop analogue studies play an important role in advancing our comprehension of reservoir architectures, offering insights into hidden reservoir rocks prior to drilling, in a cost-effective manner. These studies contribute to the delineation of the three-dimensional geometry of geological structures, the characterization of petro- and thermo-physical properties, and the structural geological aspects of reservoir rocks. Nevertheless, several challenges, including inaccessible sampling sites, limited resources, and the dimensional constraints of different laboratories hinder the acquisition of comprehensive datasets. In this study, we employ machine learning techniques to estimate missing data in a petrophysical dataset of fractured Variscan granites from the Cornubian Batholith in Southwest UK. The utilization of mean, k-nearest neighbors, and random forest imputation methods addresses the challenge of missing data, thereby revealing the effectiveness of random forest imputation in providing realistic estimations. Subsequently, supervised classification models are trained to classify samples according to their pluton origins, with promising accuracy achieved by models trained with imputed values. Variable importance ranking of the models showed that the choice of imputation method influences the inferred importance of specific petrophysical properties. While porosity (POR) and grain density (GD) were among important variables, variables with high missingness ratio were not among the top variables. This study demonstrates the value of machine learning in enhancing petrophysical datasets, while emphasizing the importance of careful method selection and model validation for reliable results. The findings contribute to a more informed decision-making process in geothermal exploration and reservoir characterization efforts, thereby demonstrating the potential of machine learning in advancing subsurface characterization techniques.
露头模拟研究在提高我们对储层结构的理解方面发挥着重要作用,以经济有效的方式在钻探前提供对隐藏储层岩石的深入了解。这些研究有助于描绘地质构造的三维几何形状,表征石油和热物理性质,以及储层岩石的构造地质方面。然而,一些挑战,包括难以进入的采样地点、有限的资源和不同实验室的尺寸限制,阻碍了全面数据集的获取。在这项研究中,我们使用机器学习技术来估计英国西南部Cornubian岩基中裂缝Variscan花岗岩的岩石物理数据集中的缺失数据。利用均值、k近邻和随机森林插值方法解决了数据缺失的挑战,从而揭示了随机森林插值在提供现实估计方面的有效性。随后,训练监督分类模型,根据样本的岩体起源对样本进行分类,用输入值训练的模型取得了很好的精度。模型的不同重要性排序表明,计算方法的选择影响了具体岩石物性的推断重要性。孔隙度(POR)和颗粒密度(GD)是重要变量,缺失率高的变量不在重要变量之列。该研究证明了机器学习在增强岩石物理数据集方面的价值,同时强调了仔细选择方法和模型验证以获得可靠结果的重要性。这些发现有助于在地热勘探和储层表征工作中做出更明智的决策过程,从而展示了机器学习在推进地下表征技术方面的潜力。
{"title":"Machine learning assisted enhancement of petrophysical property dataset of fractured Variscan granites of the Cornubian Batholith, SW UK","authors":"A. Turan ,&nbsp;E. Artun ,&nbsp;I. Sass","doi":"10.1016/j.aiig.2025.100151","DOIUrl":"10.1016/j.aiig.2025.100151","url":null,"abstract":"<div><div>Outcrop analogue studies play an important role in advancing our comprehension of reservoir architectures, offering insights into hidden reservoir rocks prior to drilling, in a cost-effective manner. These studies contribute to the delineation of the three-dimensional geometry of geological structures, the characterization of petro- and thermo-physical properties, and the structural geological aspects of reservoir rocks. Nevertheless, several challenges, including inaccessible sampling sites, limited resources, and the dimensional constraints of different laboratories hinder the acquisition of comprehensive datasets. In this study, we employ machine learning techniques to estimate missing data in a petrophysical dataset of fractured Variscan granites from the Cornubian Batholith in Southwest UK. The utilization of mean, k-nearest neighbors, and random forest imputation methods addresses the challenge of missing data, thereby revealing the effectiveness of random forest imputation in providing realistic estimations. Subsequently, supervised classification models are trained to classify samples according to their pluton origins, with promising accuracy achieved by models trained with imputed values. Variable importance ranking of the models showed that the choice of imputation method influences the inferred importance of specific petrophysical properties. While porosity (POR) and grain density (GD) were among important variables, variables with high missingness ratio were not among the top variables. This study demonstrates the value of machine learning in enhancing petrophysical datasets, while emphasizing the importance of careful method selection and model validation for reliable results. The findings contribute to a more informed decision-making process in geothermal exploration and reservoir characterization efforts, thereby demonstrating the potential of machine learning in advancing subsurface characterization techniques.</div></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"6 2","pages":"Article 100151"},"PeriodicalIF":4.2,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145009931","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Understanding hydrological responses through LULC analysis and predictive modelling (MLPNN-MC Model): A study of Bandu Sub-watershed (India) over three decades 通过LULC分析和预测模型(MLPNN-MC模型)理解水文响应:印度班杜小流域30年的研究
IF 4.2 Pub Date : 2025-12-01 Epub Date: 2025-08-19 DOI: 10.1016/j.aiig.2025.100152
Sudipto Halder , Somnath Mandal , Zarkheen Mukhtar , Debdas Ray , Gupinath Bhandari , Suman Paul
{"title":"Understanding hydrological responses through LULC analysis and predictive modelling (MLPNN-MC Model): A study of Bandu Sub-watershed (India) over three decades","authors":"Sudipto Halder ,&nbsp;Somnath Mandal ,&nbsp;Zarkheen Mukhtar ,&nbsp;Debdas Ray ,&nbsp;Gupinath Bhandari ,&nbsp;Suman Paul","doi":"10.1016/j.aiig.2025.100152","DOIUrl":"10.1016/j.aiig.2025.100152","url":null,"abstract":"","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"6 2","pages":"Article 100152"},"PeriodicalIF":4.2,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144887019","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Identification of major minerals in igneous rock microscopic images from thin sections through deep neural network analysis 利用深度神经网络分析方法识别火成岩薄片显微图像中的主要矿物
IF 4.2 Pub Date : 2025-12-01 Epub Date: 2025-09-24 DOI: 10.1016/j.aiig.2025.100157
Kouadio Krah , Sié Ouattara , Gbele Ouattara , Marc Euphrem Allialy , Alain Clement
Several socio-environmental needs (medicine, industry, engineering, orogenesis, genesis, etc.) require minerals to be more precisly defined and characterised. The identification of minerals plays a crucial role for researchers and is becoming an essential aspect of geological analysis. However, traditional methods relied heavily on expert knowledge and specialised equipment, making them labour-intensive, costly and time-consuming. This dependence is often labour-intensive, not to mention costly and time-consuming. To address this issue, some researchers have opted for machine learning algorithms to quickly identify a single mineral in a microscopic image of rocks. However this approch does not correspond to patterns of mineral distribution, where minerals are typically found in associations. These associations make it difficult to accurately identify minerals using conventional machine learning algorithms. This paper introduces a deep neural learning model based on multi-label classification, utilizing the problem adaptation method to analyse microscopic images of rock thin sections. The model is based on the ResNet50 architecture, which is designed to analyse minerals and generates the probability of a mineral presence in an image. This method provides a solution to the dependence between associated minerals. Experiments on many test images showed a model confidence, achieving average precision, recall and F1_score 97.15 %, 96.25 % and 96.69 %, respectively. Visualisation of the class activation mapping using the Grad-CAM algorithm indicates that our model is likely to locate the identified minerals effectively. In this way, the importance of each pixel with the class of interest can be assessed using heat maps. The recorded results, in terms of both performance and pixel_level evaluation, demonstrate the promising potential of the model used. It can therefore be considered for multi-labels image classification, particulary for images representing rock minerals. This approach serves as a valuable support tool for geological studies.
一些社会环境需求(医药、工业、工程、造山、成因等)要求对矿物进行更精确的定义和表征。矿物的鉴定对研究人员起着至关重要的作用,并正在成为地质分析的一个重要方面。然而,传统的方法严重依赖于专业知识和专用设备,这使得它们劳动密集,成本高昂且耗时。这种依赖往往是劳动密集型的,更不用说昂贵和耗时了。为了解决这个问题,一些研究人员选择了机器学习算法来快速识别岩石微观图像中的单一矿物。然而,这种方法并不符合矿物分布的模式,因为矿物通常是在组合中发现的。这些关联使得使用传统的机器学习算法难以准确识别矿物。介绍了一种基于多标签分类的深度神经学习模型,利用问题自适应方法对岩石薄片显微图像进行分析。该模型基于ResNet50架构,该架构旨在分析矿物质并生成图像中矿物质存在的概率。这种方法为伴生矿物之间的依赖性提供了一种解决方案。在多幅测试图像上的实验表明,模型置信度较好,平均准确率、召回率和F1_score分别达到97.15%、96.25%和96.69%。使用Grad-CAM算法的类激活映射可视化表明,我们的模型可能有效地定位已识别的矿物。通过这种方式,可以使用热图评估每个感兴趣类别像素的重要性。从性能和pixel_level评估两方面来看,记录的结果显示了所使用模型的良好潜力。因此,可以考虑对多标签图像进行分类,特别是对代表岩石矿物的图像。这种方法为地质研究提供了宝贵的支持工具。
{"title":"Identification of major minerals in igneous rock microscopic images from thin sections through deep neural network analysis","authors":"Kouadio Krah ,&nbsp;Sié Ouattara ,&nbsp;Gbele Ouattara ,&nbsp;Marc Euphrem Allialy ,&nbsp;Alain Clement","doi":"10.1016/j.aiig.2025.100157","DOIUrl":"10.1016/j.aiig.2025.100157","url":null,"abstract":"<div><div>Several socio-environmental needs (medicine, industry, engineering, orogenesis, genesis, etc.) require minerals to be more precisly defined and characterised. The identification of minerals plays a crucial role for researchers and is becoming an essential aspect of geological analysis. However, traditional methods relied heavily on expert knowledge and specialised equipment, making them labour-intensive, costly and time-consuming. This dependence is often labour-intensive, not to mention costly and time-consuming. To address this issue, some researchers have opted for machine learning algorithms to quickly identify a single mineral in a microscopic image of rocks. However this approch does not correspond to patterns of mineral distribution, where minerals are typically found in associations. These associations make it difficult to accurately identify minerals using conventional machine learning algorithms. This paper introduces a deep neural learning model based on multi-label classification, utilizing the problem adaptation method to analyse microscopic images of rock thin sections. The model is based on the ResNet50 architecture, which is designed to analyse minerals and generates the probability of a mineral presence in an image. This method provides a solution to the dependence between associated minerals. Experiments on many test images showed a model confidence, achieving average precision, recall and F1_score 97.15 %, 96.25 % and 96.69 %, respectively. Visualisation of the class activation mapping using the Grad-CAM algorithm indicates that our model is likely to locate the identified minerals effectively. In this way, the importance of each pixel with the class of interest can be assessed using heat maps. The <strong>recorded</strong> results, in terms of both performance and pixel_level evaluation, demonstrate the promising potential of the model used. It can therefore be considered for multi-labels image classification, particulary for images representing rock minerals. This approach serves as a valuable support tool for geological studies.</div></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"6 2","pages":"Article 100157"},"PeriodicalIF":4.2,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145219427","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improved estimation of two-phase capillary pressure with nuclear magnetic resonance measurements via machine learning 基于机器学习的核磁共振测量改进的两相毛细管压力估计
Pub Date : 2025-12-01 Epub Date: 2025-07-05 DOI: 10.1016/j.aiig.2025.100144
Oriyomi Raheem , Misael M. Morales , Wen Pan , Carlos Torres-Verdín
<div><div>Capillary pressure plays a crucial role in determining the spatial distribution of oil and gas, particularly in medium-to-low permeability reservoirs, where it is closely linked to the rock's pore structure and wettability. In these environments, pore structure is the primary factor influencing capillary pressure, with different pore types affecting fluid transport through varying degrees of hydrocarbon saturation. One of the main challenges in characterizing pore structure is how to use data from core plugs to establish a relationship with microscopic pore and throat properties, enabling more accurate predictions of capillary pressure. While special core analysis laboratory experiments are effective, they are time-consuming and expensive. In contrast, nuclear magnetic resonance (NMR) measurements, which provide information on pore body size distribution, are faster and can be leveraged to estimate capillary pressure using machine learning algorithms. Recently, artificial intelligence methods have also been applied to capillary pressure prediction (Qi et al., 2024).</div><div>Currently, no readily applicable predictive model exists for estimating an entire capillary pressure curve directly from standard petrophysical logs and core data. Although porescale imaging and network modeling techniques can compute capillary pressure from micro-CT rock images (Øren and Bakke, 2003; Valvatne and Blunt, 2004), these approaches are time-consuming, limited to small sample volumes, and not yet practical for routine reservoir evaluation. In this study, we introduce rock classification techniques and implement a data-driven machine learning (ML) method to estimate saturation-dependent capillary pressure from core petrophysical properties. The new model integrates cumulative NMR data and densely resampled core measurements as training data, with prediction errors quantified throughout the process. To approach the common condition of sparsely sampled training data, we transformed the prediction problem into an overdetermined one by applying composite fitting to both capillary pressure and pore throat size distribution, and Gaussian cumulative distribution fitting to the NMR <span><math><mrow><msub><mi>T</mi><mn>2</mn></msub></mrow></math></span> measurements, generating evenly sampled data points. Using these preprocessed input features, we performed classification based on the natural logarithm of the permeability-to-porosity ratio <span><math><mrow><mo>(</mo><mrow><mi>ln</mi><mrow><mo>(</mo><mrow><mi>k</mi><mo>/</mo><mi>ϕ</mi></mrow><mo>)</mo></mrow></mrow><mo>)</mo></mrow></math></span> to cluster distinct rock types. For each rock class, we applied regression techniques—such as random forest (RF), k-nearest neighbors (k-NN), extreme gradient boosting (XGB), and artificial neural networks (ANN)—to estimate the logarithm of capillary pressure. The methods were tested on blind core samples, and performance comparisons among different estimation methods
毛管压力对油气的空间分布起着至关重要的作用,特别是在中低渗透储层中,毛管压力与岩石的孔隙结构和润湿性密切相关。在这些环境中,孔隙结构是影响毛管压力的主要因素,不同孔隙类型通过不同程度的烃饱和度影响流体的输运。表征孔隙结构的主要挑战之一是如何利用岩心桥塞的数据建立微观孔隙和喉道特性的关系,从而更准确地预测毛管压力。虽然特殊的岩心分析实验室实验是有效的,但它们耗时且昂贵。相比之下,核磁共振(NMR)测量可以提供孔体大小分布的信息,速度更快,并且可以利用机器学习算法来估计毛细管压力。最近,人工智能方法也被应用于毛细管压力预测(Qi et al., 2024)。目前,还没有现成的预测模型可以直接从标准岩石物理测井和岩心数据中估计整个毛管压力曲线。虽然孔隙尺度成像和网络建模技术可以从微ct岩石图像中计算毛细压力(Øren和Bakke, 2003;Valvatne和Blunt, 2004),这些方法耗时长,仅限于小样本量,还不能用于常规储层评价。在这项研究中,我们引入了岩石分类技术,并实现了一种数据驱动的机器学习(ML)方法,通过岩心岩石物理性质来估计与饱和度相关的毛管压力。新模型将累积核磁共振数据和密集重采样的岩心测量数据作为训练数据,并在整个过程中量化预测误差。为了接近稀疏采样训练数据的常见情况,我们通过对毛细管压力和孔喉大小分布进行复合拟合,并对NMR T2测量值进行高斯累积分布拟合,将预测问题转化为过确定问题,生成均匀采样的数据点。利用这些预处理的输入特征,我们根据渗透率-孔隙度比(ln(k/ϕ))的自然对数进行分类,以聚类不同的岩石类型。对于每个岩石类别,我们应用回归技术——如随机森林(RF)、k近邻(k-NN)、极端梯度增强(XGB)和人工神经网络(ANN)——来估计毛细管压力的对数。对盲岩心样本进行了测试,并基于预测的相对标准误差对不同估计方法进行了性能比较。结果表明,核磁共振数据对岩石孔隙结构较为敏感,对毛细管压力和孔喉大小分布的预测有显著改善。对于毛细管压力和孔喉大小分布,极端梯度增强和随机森林模型的平均估计误差分别为5%和10%,表现最好。相比之下,当NMR T2数据被排除作为输入特征时,预测误差增加到25%。使用传统的高斯模型拟合和更高分辨率的重采样确保了训练数据覆盖了广泛的变异性。将核磁共振T2数据作为输入特征增强了模型捕捉非常规岩石中多峰的能力,使预测问题过度确定。通过向量输入特征预测向量函数,有效降低了预测误差。该解释工作流程可用于构建具有代表性的分类模型,并在广泛的饱和度范围内估计毛细管压力。
{"title":"Improved estimation of two-phase capillary pressure with nuclear magnetic resonance measurements via machine learning","authors":"Oriyomi Raheem ,&nbsp;Misael M. Morales ,&nbsp;Wen Pan ,&nbsp;Carlos Torres-Verdín","doi":"10.1016/j.aiig.2025.100144","DOIUrl":"10.1016/j.aiig.2025.100144","url":null,"abstract":"&lt;div&gt;&lt;div&gt;Capillary pressure plays a crucial role in determining the spatial distribution of oil and gas, particularly in medium-to-low permeability reservoirs, where it is closely linked to the rock's pore structure and wettability. In these environments, pore structure is the primary factor influencing capillary pressure, with different pore types affecting fluid transport through varying degrees of hydrocarbon saturation. One of the main challenges in characterizing pore structure is how to use data from core plugs to establish a relationship with microscopic pore and throat properties, enabling more accurate predictions of capillary pressure. While special core analysis laboratory experiments are effective, they are time-consuming and expensive. In contrast, nuclear magnetic resonance (NMR) measurements, which provide information on pore body size distribution, are faster and can be leveraged to estimate capillary pressure using machine learning algorithms. Recently, artificial intelligence methods have also been applied to capillary pressure prediction (Qi et al., 2024).&lt;/div&gt;&lt;div&gt;Currently, no readily applicable predictive model exists for estimating an entire capillary pressure curve directly from standard petrophysical logs and core data. Although porescale imaging and network modeling techniques can compute capillary pressure from micro-CT rock images (Øren and Bakke, 2003; Valvatne and Blunt, 2004), these approaches are time-consuming, limited to small sample volumes, and not yet practical for routine reservoir evaluation. In this study, we introduce rock classification techniques and implement a data-driven machine learning (ML) method to estimate saturation-dependent capillary pressure from core petrophysical properties. The new model integrates cumulative NMR data and densely resampled core measurements as training data, with prediction errors quantified throughout the process. To approach the common condition of sparsely sampled training data, we transformed the prediction problem into an overdetermined one by applying composite fitting to both capillary pressure and pore throat size distribution, and Gaussian cumulative distribution fitting to the NMR &lt;span&gt;&lt;math&gt;&lt;mrow&gt;&lt;msub&gt;&lt;mi&gt;T&lt;/mi&gt;&lt;mn&gt;2&lt;/mn&gt;&lt;/msub&gt;&lt;/mrow&gt;&lt;/math&gt;&lt;/span&gt; measurements, generating evenly sampled data points. Using these preprocessed input features, we performed classification based on the natural logarithm of the permeability-to-porosity ratio &lt;span&gt;&lt;math&gt;&lt;mrow&gt;&lt;mo&gt;(&lt;/mo&gt;&lt;mrow&gt;&lt;mi&gt;ln&lt;/mi&gt;&lt;mrow&gt;&lt;mo&gt;(&lt;/mo&gt;&lt;mrow&gt;&lt;mi&gt;k&lt;/mi&gt;&lt;mo&gt;/&lt;/mo&gt;&lt;mi&gt;ϕ&lt;/mi&gt;&lt;/mrow&gt;&lt;mo&gt;)&lt;/mo&gt;&lt;/mrow&gt;&lt;/mrow&gt;&lt;mo&gt;)&lt;/mo&gt;&lt;/mrow&gt;&lt;/math&gt;&lt;/span&gt; to cluster distinct rock types. For each rock class, we applied regression techniques—such as random forest (RF), k-nearest neighbors (k-NN), extreme gradient boosting (XGB), and artificial neural networks (ANN)—to estimate the logarithm of capillary pressure. The methods were tested on blind core samples, and performance comparisons among different estimation methods ","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"6 2","pages":"Article 100144"},"PeriodicalIF":0.0,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144633212","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Quantifying uncertainty in foraminifera classification: How deep learning methods compare to human experts 量化有孔虫分类中的不确定性:深度学习方法与人类专家的比较
Pub Date : 2025-12-01 Epub Date: 2025-07-16 DOI: 10.1016/j.aiig.2025.100145
Iver Martinsen , Steffen Aagaard Sørensen , Samuel Ortega , Fred Godtliebsen , Miguel Tejedor , Eirik Myrvoll-Nilsen
Foraminifera are shell-bearing microorganisms that are commonly found in marine deposits on the seabed. They are important indicators in many analyses, are used in climate change research, monitoring marine environments, evolutionary studies, and are also frequently used in the oil and gas industry. Although some research has focused on automating the classification of foraminifera images, few have addressed the uncertainty in these classifications. Although foraminifera classification is not a safety-critical task, estimating uncertainty is crucial to avoid misclassifications that could overlook rare and ecologically significant species that are informative indicators of the environment in which they lived. Uncertainty estimation in deep learning has gained significant attention and many methods have been developed. However, evaluating the performance of these methods in practical settings remains a challenge. To create a benchmark for uncertainty estimation in the classification of foraminifera, we administered a multiple choice questionnaire containing classification tasks to four senior geologists. By analyzing their responses, we generated human-derived uncertainty estimates for a test set of 260 images of foraminifera and sediment grains. These uncertainty estimates served as a baseline for comparison when training neural networks in classification. We then trained multiple deep neural networks using a range of uncertainty quantification methods to classify and state the uncertainty about the classifications. The results of the deep learning uncertainty quantification methods were then analyzed and compared with the human benchmark, to see how the methods performed individually and how the methods aligned with humans. Our results show that human-level performance can be achieved with deep learning and that test-time data augmentation and ensembling can help improve both uncertainty estimation and classification performance. Our results also show that human uncertainty estimates are helpful indicators for detecting classification errors and that deep learning-based uncertainty estimates can improve calibration and classification accuracy.
有孔虫是一种含壳微生物,通常在海底的海洋沉积物中发现。它们是许多分析中的重要指标,用于气候变化研究、海洋环境监测、进化研究,也经常用于石油和天然气工业。虽然一些研究集中在有孔虫图像的自动分类上,但很少有人解决这些分类中的不确定性。虽然有孔虫分类不是一项安全关键任务,但估计不确定性对于避免错误分类至关重要,因为错误分类可能会忽略稀有和具有生态意义的物种,这些物种是它们生活环境的信息指标。深度学习中的不确定性估计受到了广泛的关注,并开发了许多方法。然而,评估这些方法在实际环境中的性能仍然是一个挑战。为了在有孔虫分类中建立一个不确定性估计的基准,我们对四位高级地质学家进行了包含分类任务的多项选择问卷。通过分析他们的反应,我们对260张有孔虫和沉积物颗粒的测试图像产生了人为的不确定性估计。当训练神经网络进行分类时,这些不确定性估计作为比较的基线。然后,我们使用一系列不确定性量化方法训练多个深度神经网络来分类和说明分类的不确定性。然后对深度学习不确定性量化方法的结果进行分析,并与人类基准进行比较,以了解这些方法如何单独执行以及这些方法如何与人类一致。我们的研究结果表明,深度学习可以达到人类水平的性能,测试时间数据增强和集成可以帮助提高不确定性估计和分类性能。我们的研究结果还表明,人为的不确定性估计是检测分类错误的有用指标,基于深度学习的不确定性估计可以提高校准和分类精度。
{"title":"Quantifying uncertainty in foraminifera classification: How deep learning methods compare to human experts","authors":"Iver Martinsen ,&nbsp;Steffen Aagaard Sørensen ,&nbsp;Samuel Ortega ,&nbsp;Fred Godtliebsen ,&nbsp;Miguel Tejedor ,&nbsp;Eirik Myrvoll-Nilsen","doi":"10.1016/j.aiig.2025.100145","DOIUrl":"10.1016/j.aiig.2025.100145","url":null,"abstract":"<div><div>Foraminifera are shell-bearing microorganisms that are commonly found in marine deposits on the seabed. They are important indicators in many analyses, are used in climate change research, monitoring marine environments, evolutionary studies, and are also frequently used in the oil and gas industry. Although some research has focused on automating the classification of foraminifera images, few have addressed the uncertainty in these classifications. Although foraminifera classification is not a safety-critical task, estimating uncertainty is crucial to avoid misclassifications that could overlook rare and ecologically significant species that are informative indicators of the environment in which they lived. Uncertainty estimation in deep learning has gained significant attention and many methods have been developed. However, evaluating the performance of these methods in practical settings remains a challenge. To create a benchmark for uncertainty estimation in the classification of foraminifera, we administered a multiple choice questionnaire containing classification tasks to four senior geologists. By analyzing their responses, we generated human-derived uncertainty estimates for a test set of 260 images of foraminifera and sediment grains. These uncertainty estimates served as a baseline for comparison when training neural networks in classification. We then trained multiple deep neural networks using a range of uncertainty quantification methods to classify and state the uncertainty about the classifications. The results of the deep learning uncertainty quantification methods were then analyzed and compared with the human benchmark, to see how the methods performed individually and how the methods aligned with humans. Our results show that human-level performance can be achieved with deep learning and that test-time data augmentation and ensembling can help improve both uncertainty estimation and classification performance. Our results also show that human uncertainty estimates are helpful indicators for detecting classification errors and that deep learning-based uncertainty estimates can improve calibration and classification accuracy.</div></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"6 2","pages":"Article 100145"},"PeriodicalIF":0.0,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144694746","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Quantifying uncertainty of mineral prediction using a novel Bayesian deep learning framework 使用新的贝叶斯深度学习框架量化矿物预测的不确定性
IF 4.2 Pub Date : 2025-12-01 Epub Date: 2025-11-04 DOI: 10.1016/j.aiig.2025.100164
Yue Liu
Mineral resource exploration increasingly demands not only accurate prospectivity maps but also reliable measures of confidence to guide high-stakes decisions. In this study, a novel Bayesian deep learning (BDL) framework was introduced, which embeds probabilistic inference within a deep neural network to jointly predict mineralization potential and quantify uncertainty. Two posterior approximation strategies, Metropolis–Hastings (MH) sampling and variational inference (VI), are implemented to estimate model weights as distributions rather than as fixed values, enabling decomposition of predictive uncertainty into aleatoric and epistemic components. When applied to eleven ore-controlling features in the Nanling tungsten polymetallic region (China), both MH-based and VI-based BDL models demonstrate strong classification performance while revealing contrasting spatial patterns and uncertainty patterns. Correlation studies across probability bands confirm that MH sampling captures a broader spread of uncertainty at the cost of greater computational demand, while VI delivers greater efficiency but risks underestimating uncertainty. The results highlight trade-offs between accuracy, interpretability, and computational load, demonstrating that MH-based BDL offers more robust uncertainty assessments, whereas VI-based BDL places greater emphasis on efficiency. By providing spatially explicit probability and uncertainty maps, this framework advances risk-aware mineral exploration, enabling practitioners to target areas of high potential with low uncertainty and to identify regions warranting additional data acquisition.
矿产资源勘探不仅需要精确的勘探地图,而且需要可靠的信心措施来指导高风险的决策。在本研究中,引入了一种新的贝叶斯深度学习(BDL)框架,该框架将概率推理嵌入到深度神经网络中,以联合预测矿化潜力和量化不确定性。采用Metropolis-Hastings (MH)抽样和变分推理(VI)两种后验逼近策略,将模型权重估计为分布而不是固定值,从而将预测不确定性分解为任意分量和认知分量。将其应用于南岭钨多金属区11个控矿特征,结果表明,基于h和vi的BDL模型在揭示空间格局和不确定性格局的同时具有较强的分类能力。跨概率带的相关性研究证实,MH采样以更大的计算需求为代价,捕获了更广泛的不确定性,而VI提供了更高的效率,但存在低估不确定性的风险。结果强调了准确性、可解释性和计算负载之间的权衡,表明基于mh的BDL提供了更强大的不确定性评估,而基于vi的BDL更强调效率。通过提供空间上明确的概率和不确定性地图,该框架促进了风险意识的矿产勘探,使从业者能够以低不确定性瞄准高潜力区域,并确定需要额外数据采集的区域。
{"title":"Quantifying uncertainty of mineral prediction using a novel Bayesian deep learning framework","authors":"Yue Liu","doi":"10.1016/j.aiig.2025.100164","DOIUrl":"10.1016/j.aiig.2025.100164","url":null,"abstract":"<div><div>Mineral resource exploration increasingly demands not only accurate prospectivity maps but also reliable measures of confidence to guide high-stakes decisions. In this study, a novel Bayesian deep learning (BDL) framework was introduced, which embeds probabilistic inference within a deep neural network to jointly predict mineralization potential and quantify uncertainty. Two posterior approximation strategies, Metropolis–Hastings (MH) sampling and variational inference (VI), are implemented to estimate model weights as distributions rather than as fixed values, enabling decomposition of predictive uncertainty into aleatoric and epistemic components. When applied to eleven ore-controlling features in the Nanling tungsten polymetallic region (China), both MH-based and VI-based BDL models demonstrate strong classification performance while revealing contrasting spatial patterns and uncertainty patterns. Correlation studies across probability bands confirm that MH sampling captures a broader spread of uncertainty at the cost of greater computational demand, while VI delivers greater efficiency but risks underestimating uncertainty. The results highlight trade-offs between accuracy, interpretability, and computational load, demonstrating that MH-based BDL offers more robust uncertainty assessments, whereas VI-based BDL places greater emphasis on efficiency. By providing spatially explicit probability and uncertainty maps, this framework advances risk-aware mineral exploration, enabling practitioners to target areas of high potential with low uncertainty and to identify regions warranting additional data acquisition.</div></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"6 2","pages":"Article 100164"},"PeriodicalIF":4.2,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145519710","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On the application of machine learning algorithms in predicting the permeability of oil reservoirs 机器学习算法在油藏渗透率预测中的应用研究
Pub Date : 2025-12-01 Epub Date: 2025-06-03 DOI: 10.1016/j.aiig.2025.100126
Andrey V. Soromotin , Dmitriy A. Martyushev , João Luiz Junho Pereira
Permeability is one of the main oil reservoir characteristics. It affects potential oil production, well-completion technologies, the choice of enhanced oil recovery methods, and more. The methods used to determine and predict reservoir permeability have serious shortcomings. This article aims to refine and adapt machine learning techniques using historical data from hydrocarbon field development to evaluate and predict parameters such as the skin factor and permeability of the remote reservoir zone. The article analyzes data from 4045 wells tests in oil fields in the Perm Krai (Russia). An evaluation of the performance of different Machine Learning (ML) algorithms in the prediction of the well permeability is performed. Three different real datasets are used to train more than 20 machine learning regressors, whose hyperparameters are optimized using Bayesian Optimization (BO). The resulting models demonstrate significantly better predictive performance compared to traditional methods and the best ML model found is one that never was applied before to this problem. The permeability prediction model is characterized by a high R2 adjusted value of 0.799. A promising approach is the integration of machine learning methods and the use of pressure recovery curves to estimate permeability in real-time. The work is unique for its approach to predicting pressure recovery curves during well operation without stopping wells, providing primary data for interpretation. These innovations are exclusive and can improve the accuracy of permeability forecasts. It also reduces well downtime associated with traditional well-testing procedures. The proposed methods pave the way for more efficient and cost-effective reservoir development, ultimately supporting better decision-making and resource optimization in oil production.
渗透率是油藏的主要特征之一。它会影响潜在的产油量、完井技术、提高采收率方法的选择等。目前用于确定和预测储层渗透率的方法存在严重缺陷。本文旨在利用油气田开发的历史数据来改进和适应机器学习技术,以评估和预测偏远储层的表皮系数和渗透率等参数。本文分析了俄罗斯彼尔姆边疆区油田4045口试井的数据。对不同机器学习(ML)算法在预测井渗透率方面的性能进行了评估。使用三个不同的真实数据集训练20多个机器学习回归量,并使用贝叶斯优化(BO)对其超参数进行优化。与传统方法相比,结果模型显示出更好的预测性能,并且发现的最佳ML模型是以前从未应用于此问题的模型。渗透率预测模型具有较高的R2调整值(0.799)。一种很有前途的方法是结合机器学习方法和使用压力恢复曲线来实时估计渗透率。这项工作的独特之处在于,它可以在不停井的情况下预测井运行过程中的压力恢复曲线,为解释提供了原始数据。这些创新是独一无二的,可以提高渗透率预测的准确性。它还减少了与传统试井程序相关的井停工期。所提出的方法为更高效、更具成本效益的油藏开发铺平了道路,最终支持更好的石油生产决策和资源优化。
{"title":"On the application of machine learning algorithms in predicting the permeability of oil reservoirs","authors":"Andrey V. Soromotin ,&nbsp;Dmitriy A. Martyushev ,&nbsp;João Luiz Junho Pereira","doi":"10.1016/j.aiig.2025.100126","DOIUrl":"10.1016/j.aiig.2025.100126","url":null,"abstract":"<div><div>Permeability is one of the main oil reservoir characteristics. It affects potential oil production, well-completion technologies, the choice of enhanced oil recovery methods, and more. The methods used to determine and predict reservoir permeability have serious shortcomings. This article aims to refine and adapt machine learning techniques using historical data from hydrocarbon field development to evaluate and predict parameters such as the skin factor and permeability of the remote reservoir zone. The article analyzes data from 4045 wells tests in oil fields in the Perm Krai (Russia). An evaluation of the performance of different Machine Learning (ML) algorithms in the prediction of the well permeability is performed. Three different real datasets are used to train more than 20 machine learning regressors, whose hyperparameters are optimized using Bayesian Optimization (BO). The resulting models demonstrate significantly better predictive performance compared to traditional methods and the best ML model found is one that never was applied before to this problem. The permeability prediction model is characterized by a high R<sup>2</sup> adjusted value of 0.799. A promising approach is the integration of machine learning methods and the use of pressure recovery curves to estimate permeability in real-time. The work is unique for its approach to predicting pressure recovery curves during well operation without stopping wells, providing primary data for interpretation. These innovations are exclusive and can improve the accuracy of permeability forecasts. It also reduces well downtime associated with traditional well-testing procedures. The proposed methods pave the way for more efficient and cost-effective reservoir development, ultimately supporting better decision-making and resource optimization in oil production.</div></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"6 2","pages":"Article 100126"},"PeriodicalIF":0.0,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144330611","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AI-based approaches for wetland mapping and classification: A review of current practices and future perspectives 基于人工智能的湿地制图与分类方法:现状与展望
IF 4.2 Pub Date : 2025-12-01 Epub Date: 2025-11-05 DOI: 10.1016/j.aiig.2025.100165
Kai Cheng , Cong Zhang , Yaocheng Fan , Hongli Diao , Shibin Xia
Wetlands are critical ecosystems that provide essential ecological, hydrological, and socio-economic services, such as water purification, climate regulation, and biodiversity conservation. However, effective wetland management faces significant challenges, particularly in the analysis and classification of complex wetland environments. Traditional methods of wetland monitoring often suffer from limitations in spatial coverage, temporal resolution, and data processing efficiency. Recent advancements in artificial intelligence (AI), particularly machine learning and deep learning techniques, have been increasingly integrated with remote sensing technologies, offering a powerful solution to these challenges. AI has demonstrated significant potential in automating large-scale remote sensing data analysis, enabling the extraction of detailed spatial information, and enhancing the accuracy and efficiency of wetland mapping and classification. Bibliometric analysis indicates a growing body of research, with notable contributions from China and the United States, though regional disparities and a lack of diverse datasets remain key issues. Despite the success of AI in wetland monitoring, challenges persist in addressing environmental heterogeneity, mixed pixels, and data quality. This review synthesizes the current state of AI-based approaches in wetland mapping and classification, identifies trends and gaps, and outlines future research directions, emphasizing the need for interdisciplinary collaboration and integration of multi-source data to advance AI applications in wetland conservation.
湿地是重要的生态系统,提供必要的生态、水文和社会经济服务,如水净化、气候调节和生物多样性保护。然而,有效的湿地管理面临着重大挑战,特别是在复杂湿地环境的分析和分类方面。传统的湿地监测方法在空间覆盖、时间分辨率和数据处理效率等方面存在局限性。人工智能(AI)的最新进展,特别是机器学习和深度学习技术,已越来越多地与遥感技术相结合,为应对这些挑战提供了有力的解决方案。人工智能在自动化大规模遥感数据分析、提取详细空间信息以及提高湿地制图和分类的准确性和效率方面显示出巨大的潜力。文献计量分析表明,尽管区域差异和缺乏多样化的数据集仍然是关键问题,但中国和美国的研究成果正在不断增加。尽管人工智能在湿地监测方面取得了成功,但在解决环境异质性、混合像素和数据质量方面仍然存在挑战。本文综述了基于人工智能的湿地制图和分类方法的现状,指出了趋势和差距,并概述了未来的研究方向,强调需要跨学科合作和多源数据的整合来推进人工智能在湿地保护中的应用。
{"title":"AI-based approaches for wetland mapping and classification: A review of current practices and future perspectives","authors":"Kai Cheng ,&nbsp;Cong Zhang ,&nbsp;Yaocheng Fan ,&nbsp;Hongli Diao ,&nbsp;Shibin Xia","doi":"10.1016/j.aiig.2025.100165","DOIUrl":"10.1016/j.aiig.2025.100165","url":null,"abstract":"<div><div>Wetlands are critical ecosystems that provide essential ecological, hydrological, and socio-economic services, such as water purification, climate regulation, and biodiversity conservation. However, effective wetland management faces significant challenges, particularly in the analysis and classification of complex wetland environments. Traditional methods of wetland monitoring often suffer from limitations in spatial coverage, temporal resolution, and data processing efficiency. Recent advancements in artificial intelligence (AI), particularly machine learning and deep learning techniques, have been increasingly integrated with remote sensing technologies, offering a powerful solution to these challenges. AI has demonstrated significant potential in automating large-scale remote sensing data analysis, enabling the extraction of detailed spatial information, and enhancing the accuracy and efficiency of wetland mapping and classification. Bibliometric analysis indicates a growing body of research, with notable contributions from China and the United States, though regional disparities and a lack of diverse datasets remain key issues. Despite the success of AI in wetland monitoring, challenges persist in addressing environmental heterogeneity, mixed pixels, and data quality. This review synthesizes the current state of AI-based approaches in wetland mapping and classification, identifies trends and gaps, and outlines future research directions, emphasizing the need for interdisciplinary collaboration and integration of multi-source data to advance AI applications in wetland conservation.</div></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"6 2","pages":"Article 100165"},"PeriodicalIF":4.2,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145465152","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Intelligent identification of fractures and holes in ultrasonic logging images based on the improved YOLOv8 model 基于改进YOLOv8模型的超声测井图像缝孔智能识别
IF 4.2 Pub Date : 2025-12-01 Epub Date: 2025-11-11 DOI: 10.1016/j.aiig.2025.100167
Jingyi Han , Xiumei Zhang , Yujuan Qi , Lin Liu
Aiming to address the demand for intelligent recognition of geological features in whole-wellbore ultrasonic images, this paper integrates the YOLOv8 model with the Convolution Block Attention Module (CBAM). It proposes an intelligent method for detecting fractures and holes, as well as segmenting whole-wellbore images. Firstly, we develop a dataset sample of effective reservoir sections by integrating logging data and conducting data augmentation on fracture and hole samples in ultrasonic logging images. A standardized process procedure for the generation of new samples and model training has been proposed effectively. Subsequently, the improved YOLOv8 model undergoes a process of training and validation. The results indicate that the model achieves average accuracies of 0.910 and 0.884 in target detection and image segmentation tasks, respectively. These findings demonstrate a notable performance improvement compared to the original model. Furthermore, a sliding window strategy is proposed to tackle the challenges of high computational demands and insufficient accuracy in the intelligent processing of full-well ultrasonic images. To manage overlapping regions within the sliding window, we employ the Non-Maximum Suppression (NMS) principle for effective processing. Finally, the model has been tested on actual logging images and demonstrates an enhanced capability to identify irregular fractures and holes, which significantly improves the efficiency of geological feature recognition in the whole-well section ultrasonic logging images.
针对全井筒超声图像中地质特征的智能识别需求,本文将YOLOv8模型与卷积块注意模块(Convolution Block Attention Module, CBAM)相结合。提出了一种智能的裂缝、井眼检测及全井图像分割方法。首先,对测井资料进行整合,并对超声测井图像中的裂缝和孔样进行数据增强,得到有效储层剖面数据集样本;提出了一种用于新样本生成和模型训练的标准化流程。随后,改进的YOLOv8模型经历了一个训练和验证过程。结果表明,该模型在目标检测和图像分割任务中的平均准确率分别为0.910和0.884。这些发现表明,与原始模型相比,性能有了显著提高。此外,针对全井超声图像智能处理中计算量大、精度不高的问题,提出了滑动窗口策略。为了管理滑动窗口内的重叠区域,我们采用非最大抑制(NMS)原则进行有效的处理。最后,在实际测井图像上进行了验证,结果表明该模型对不规则裂缝和井眼的识别能力增强,显著提高了全井段超声测井图像的地质特征识别效率。
{"title":"Intelligent identification of fractures and holes in ultrasonic logging images based on the improved YOLOv8 model","authors":"Jingyi Han ,&nbsp;Xiumei Zhang ,&nbsp;Yujuan Qi ,&nbsp;Lin Liu","doi":"10.1016/j.aiig.2025.100167","DOIUrl":"10.1016/j.aiig.2025.100167","url":null,"abstract":"<div><div>Aiming to address the demand for intelligent recognition of geological features in whole-wellbore ultrasonic images, this paper integrates the YOLOv8 model with the Convolution Block Attention Module (CBAM). It proposes an intelligent method for detecting fractures and holes, as well as segmenting whole-wellbore images. Firstly, we develop a dataset sample of effective reservoir sections by integrating logging data and conducting data augmentation on fracture and hole samples in ultrasonic logging images. A standardized process procedure for the generation of new samples and model training has been proposed effectively. Subsequently, the improved YOLOv8 model undergoes a process of training and validation. The results indicate that the model achieves average accuracies of 0.910 and 0.884 in target detection and image segmentation tasks, respectively. These findings demonstrate a notable performance improvement compared to the original model. Furthermore, a sliding window strategy is proposed to tackle the challenges of high computational demands and insufficient accuracy in the intelligent processing of full-well ultrasonic images. To manage overlapping regions within the sliding window, we employ the Non-Maximum Suppression (NMS) principle for effective processing. Finally, the model has been tested on actual logging images and demonstrates an enhanced capability to identify irregular fractures and holes, which significantly improves the efficiency of geological feature recognition in the whole-well section ultrasonic logging images.</div></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"6 2","pages":"Article 100167"},"PeriodicalIF":4.2,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145519713","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparison of the performance of gradient boost, linear regression, decision tree, and voting algorithms to separate geochemical anomalies areas in the fractal environment 梯度增强、线性回归、决策树和投票算法在分形环境下分离地球化学异常区的性能比较
IF 4.2 Pub Date : 2025-12-01 Epub Date: 2025-09-24 DOI: 10.1016/j.aiig.2025.100156
Mirmahdi Seyedrahimi-Niaraq , Hossein Mahdiyanfar , Mohammad hossein Olyaee
In this investigation, the Gradient Boosting (GB), Linear Regression (LR), Decision Tree (DT), and Voting algorithms were applied to predict the distribution pattern of Au geochemical data. Trace and indicator elements, including Mo, Cu, Pb, Zn, Ag, Ni, Co, Mn, Fe, and As, were used with these machine learning algorithms (MLAs) to predict Au concentration values in the Doostbigloo porphyry Cu-Au-Mo mineralization area. The performance of the models was evaluated using the Mean Absolute Percentage Error (MAPE) and Root Mean Square Error (RMSE) metrics. The proposed ensemble Voting algorithm outperformed the other models, yielding more accurate predictions according to both metrics. The predicted data from the GB, LR, DT, and Voting MLAs were modeled using the Concentration-Area fractal method, and Au geochemical anomalies were mapped. To compare and validate the results, factors such as the location of the mineral deposits, their surface extent, and mineralization trend were considered. The results indicate that integrating hybrid MLAs with fractal modeling significantly improves geochemical prospectivity mapping. Among the four models, three (DT, GB, Voting) accurately identified both mineral deposits. The LR model, however, only identified Deposit I (central), and its mineralization trend diverged from the field data. The GB and Voting models produced similar results, with their final maps derived from fractal modeling showing the same anomalous areas. The anomaly boundaries identified by these two models are consistent with the two known reserves in the region. The results and plots related to prediction indicators and error rates for these two models also show high similarity, with lower error rates than the other models. Notably, the Voting model demonstrated superior performance in accurately delineating mineral deposit locations and identifying realistic mineralization trends while minimizing false anomalies.
采用梯度增强(GB)、线性回归(LR)、决策树(DT)和投票(Voting)算法预测Au地球化学数据的分布模式。利用Mo、Cu、Pb、Zn、Ag、Ni、Co、Mn、Fe、As等微量元素和指示元素,结合机器学习算法(MLAs)预测了Doostbigloo斑岩Cu-Au-Mo矿化区Au的富集值。使用平均绝对百分比误差(MAPE)和均方根误差(RMSE)指标评估模型的性能。所提出的集成投票算法优于其他模型,根据两个指标产生更准确的预测。采用浓度-面积分形方法对GB、LR、DT和Voting MLAs预测数据进行建模,绘制了Au地球化学异常图。为了比较和验证结果,考虑了矿床的位置、地表范围和矿化趋势等因素。结果表明,将混合MLAs与分形建模相结合,显著提高了地球化学找矿能力。四种模型中,DT、GB、Voting三种模型均能准确识别两个矿床。然而,LR模型只识别了1号矿床(中部),其成矿趋势与现场数据不符。GB和Voting模型产生了类似的结果,它们的最终地图源自分形模型,显示了相同的异常区域。这两种模型识别的异常边界与该地区已知的两个储量一致。两种模型的预测指标和错误率相关的结果和图也显示出较高的相似性,错误率低于其他模型。值得注意的是,Voting模型在准确描绘矿床位置和识别实际矿化趋势方面表现出色,同时最大限度地减少了虚假异常。
{"title":"Comparison of the performance of gradient boost, linear regression, decision tree, and voting algorithms to separate geochemical anomalies areas in the fractal environment","authors":"Mirmahdi Seyedrahimi-Niaraq ,&nbsp;Hossein Mahdiyanfar ,&nbsp;Mohammad hossein Olyaee","doi":"10.1016/j.aiig.2025.100156","DOIUrl":"10.1016/j.aiig.2025.100156","url":null,"abstract":"<div><div>In this investigation, the Gradient Boosting (GB), Linear Regression (LR), Decision Tree (DT), and Voting algorithms were applied to predict the distribution pattern of Au geochemical data. Trace and indicator elements, including Mo, Cu, Pb, Zn, Ag, Ni, Co, Mn, Fe, and As, were used with these machine learning algorithms (MLAs) to predict Au concentration values in the Doostbigloo porphyry Cu-Au-Mo mineralization area. The performance of the models was evaluated using the Mean Absolute Percentage Error (MAPE) and Root Mean Square Error (RMSE) metrics. The proposed ensemble Voting algorithm outperformed the other models, yielding more accurate predictions according to both metrics. The predicted data from the GB, LR, DT, and Voting MLAs were modeled using the Concentration-Area fractal method, and Au geochemical anomalies were mapped. To compare and validate the results, factors such as the location of the mineral deposits, their surface extent, and mineralization trend were considered. The results indicate that integrating hybrid MLAs with fractal modeling significantly improves geochemical prospectivity mapping. Among the four models, three (DT, GB, Voting) accurately identified both mineral deposits. The LR model, however, only identified Deposit I (central), and its mineralization trend diverged from the field data. The GB and Voting models produced similar results, with their final maps derived from fractal modeling showing the same anomalous areas. The anomaly boundaries identified by these two models are consistent with the two known reserves in the region. The results and plots related to prediction indicators and error rates for these two models also show high similarity, with lower error rates than the other models. Notably, the Voting model demonstrated superior performance in accurately delineating mineral deposit locations and identifying realistic mineralization trends while minimizing false anomalies.</div></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"6 2","pages":"Article 100156"},"PeriodicalIF":4.2,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145219420","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Artificial Intelligence in Geosciences
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1