首页 > 最新文献

Artificial Intelligence in Geosciences最新文献

英文 中文
Seismic facies characterization: Integrated subsurface-outcrop analysis for complex depositional systems in northeast India 地震相表征:印度东北部复杂沉积体系的综合地下-露头分析
IF 4.2 Pub Date : 2026-03-01 Epub Date: 2026-02-09 DOI: 10.1016/j.aiig.2026.100196
Priyadarshi Chinmoy Kumar , Heather Bedle , Jitender Kumar , Tapos Kumar Goswami , Kalachand Sain
Seismic facies analysis involves the interpretation of reflection patterns from seismic data to provide insights into subsurface sedimentary environments, depositional processes, and lithological variations, aiding georesources exploration. This study evaluates unsupervised machine learning (ML) models for seismic facies mapping within the Barail group from the Amguri region of the Upper Assam basin in northeast (NE) India. Utilizing high-quality three-dimensional seismic data, a comprehensive set of seismic attributes, including amplitude-based, instantaneous, spectral, geometric, and textural measures, is extracted, optimally selected, and integrated using two unsupervised models: the self-organizing map (SOM) and the generative topographic mapping (GTM). The two models are compared to identify the most effective approach for discerning seismic facies patterns within the Barail-Coal-Shale (BCS) and Barail-Main-Sand (BMS) units. Results indicate that GTM outperforms SOM by providing improved cluster separation, enhanced facies continuity, and greater geological consistency across the target interval in the study area. GTM-derived facies insights are validated with borehole and field data, facilitating an in-depth interpretation of the sedimentary environment. The BCS interval predominantly consists of coaly shale, coal-shale alternations, and minor sandstone facies, indicative of a swampy deltaic setting conducive to periodic vegetation accumulation. In contrast, the BMS interval primarily comprises sandstone with occasional shale and coal-shale intercalations, reflecting a fluvial-dominated environment. Post-depositional tectonic processes contributed to the deformation and structural complexity within these intervals. The proposed methodology (specifically data enhancement, attribute optimization, ML model comparison and integration of interpretations with different geoscientific data) demonstrates potential for application in other geological settings to enhance subsurface interpretation.
地震相分析包括从地震数据中解释反射模式,从而深入了解地下沉积环境、沉积过程和岩性变化,从而帮助勘探地质资源。本研究评估了印度东北部上阿萨姆邦盆地Amguri地区Barail组地震相测绘的无监督机器学习(ML)模型。利用高质量的三维地震数据,提取、优化选择了一套全面的地震属性,包括基于振幅的、瞬时的、光谱的、几何的和纹理的测量,并使用两种无监督模型进行整合:自组织地图(SOM)和生成式地形映射(GTM)。将这两种模型进行比较,以确定识别barail -煤-页岩(BCS)和barail -主-砂(BMS)单元地震相模式的最有效方法。结果表明,GTM优于SOM,在整个研究区域的目标层段中,GTM提供了更好的簇分离、增强的相连续性和更高的地质一致性。通过井眼和现场数据验证了gtm导出的相信息,有助于对沉积环境进行深入解释。BCS层段主要由煤页岩、煤页岩交替相和少量砂岩相组成,表明其为沼泽三角洲环境,有利于周期性植被聚集。相比之下,BMS层段主要由砂岩组成,间或有页岩和煤页岩夹层,反映了河流主导的环境。沉积后的构造作用增加了这些层段的变形和构造复杂性。所提出的方法(特别是数据增强、属性优化、ML模型比较和与不同地球科学数据的解释整合)显示了在其他地质环境中应用的潜力,以增强地下解释。
{"title":"Seismic facies characterization: Integrated subsurface-outcrop analysis for complex depositional systems in northeast India","authors":"Priyadarshi Chinmoy Kumar ,&nbsp;Heather Bedle ,&nbsp;Jitender Kumar ,&nbsp;Tapos Kumar Goswami ,&nbsp;Kalachand Sain","doi":"10.1016/j.aiig.2026.100196","DOIUrl":"10.1016/j.aiig.2026.100196","url":null,"abstract":"<div><div>Seismic facies analysis involves the interpretation of reflection patterns from seismic data to provide insights into subsurface sedimentary environments, depositional processes, and lithological variations, aiding georesources exploration. This study evaluates unsupervised machine learning (ML) models for seismic facies mapping within the Barail group from the Amguri region of the Upper Assam basin in northeast (NE) India. Utilizing high-quality three-dimensional seismic data, a comprehensive set of seismic attributes, including amplitude-based, instantaneous, spectral, geometric, and textural measures, is extracted, optimally selected, and integrated using two unsupervised models: the self-organizing map (SOM) and the generative topographic mapping (GTM). The two models are compared to identify the most effective approach for discerning seismic facies patterns within the Barail-Coal-Shale (BCS) and Barail-Main-Sand (BMS) units. Results indicate that GTM outperforms SOM by providing improved cluster separation, enhanced facies continuity, and greater geological consistency across the target interval in the study area. GTM-derived facies insights are validated with borehole and field data, facilitating an in-depth interpretation of the sedimentary environment. The BCS interval predominantly consists of coaly shale, coal-shale alternations, and minor sandstone facies, indicative of a swampy deltaic setting conducive to periodic vegetation accumulation. In contrast, the BMS interval primarily comprises sandstone with occasional shale and coal-shale intercalations, reflecting a fluvial-dominated environment. Post-depositional tectonic processes contributed to the deformation and structural complexity within these intervals. The proposed methodology (specifically data enhancement, attribute optimization, ML model comparison and integration of interpretations with different geoscientific data) demonstrates potential for application in other geological settings to enhance subsurface interpretation.</div></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"7 1","pages":"Article 100196"},"PeriodicalIF":4.2,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146188458","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing model parameterization with linearly constrained deep generative network for ensemble-based history matching 基于集成历史匹配的线性约束深度生成网络模型参数化改进
IF 4.2 Pub Date : 2026-03-01 Epub Date: 2026-03-03 DOI: 10.1016/j.aiig.2026.100201
Yanhui Zhang, Ibrahim Hoteit
Ensemble-based data assimilation methods have been widely used for history matching in subsurface reservoir modeling, but struggle to handle the complex nonlinear and non-Gaussian behaviors prevalent in real field applications. To address these limitations, this paper introduces a deep generative model-based parameterization that effectively reduces dimensionality, preserves non-Gaussian patterns, and enhances linearity with observational data. We propose a regularized variational autoencoder (RVAE) comprising three integrated components: (1) an encoder that projects high-dimensional reservoir model parameters into a low-dimensional latent space, capturing complex non-Gaussian distributions; (2) a decoder that maps latent variables back to the original model space, ensuring accurate and geologically consistent reconstruction; and (3) a lightweight linear subnetwork that imposes additional regularization on the latent space, enforcing a linear relationship with observational data. This RVAE framework strengthens ensemble-based methods by aligning the parameterization more closely with their linear-Gaussian assumptions, thereby enhancing compatibility and improving history matching accuracy. Given the high computational cost and time typically required for forward reservoir simulations, we adopt a semi-supervised learning approach, utilizing a dataset where only a small subset of the generated model realizations is paired with production data derived from these simulations. For the network design, the core architecture of the RVAE is based on a convolutional DenseNet, integrated with an attention mechanism to optimize feature representation. Experimental results demonstrate that the proposed approach effectively captures non-Gaussian patterns in permeability fields and significantly improves the assimilation of highly nonlinear production data.
基于集成的数据同化方法已广泛用于地下油藏建模的历史拟合,但难以处理实际油田应用中普遍存在的复杂非线性和非高斯行为。为了解决这些限制,本文引入了一种基于深度生成模型的参数化方法,该方法有效地降低了维数,保留了非高斯模式,并增强了观测数据的线性。我们提出了一种正则化变分自编码器(RVAE),包括三个组成部分:(1)将高维油藏模型参数投影到低维潜在空间,捕获复杂的非高斯分布的编码器;(2)将潜在变量映射回原始模型空间的解码器,确保重建的准确性和地质一致性;(3)一个轻量级的线性子网络,对潜在空间施加额外的正则化,加强与观测数据的线性关系。该RVAE框架通过将参数化与其线性高斯假设更紧密地对齐来加强基于集成的方法,从而增强兼容性并提高历史匹配精度。考虑到正向油藏模拟通常需要较高的计算成本和时间,我们采用了半监督学习方法,利用数据集,其中只有一小部分生成的模型实现与从这些模拟中获得的生产数据相匹配。在网络设计方面,RVAE的核心架构基于卷积DenseNet,并集成了一个关注机制来优化特征表示。实验结果表明,该方法有效地捕获了渗透率场中的非高斯模式,显著改善了高度非线性生产数据的同化。
{"title":"Enhancing model parameterization with linearly constrained deep generative network for ensemble-based history matching","authors":"Yanhui Zhang,&nbsp;Ibrahim Hoteit","doi":"10.1016/j.aiig.2026.100201","DOIUrl":"10.1016/j.aiig.2026.100201","url":null,"abstract":"<div><div>Ensemble-based data assimilation methods have been widely used for history matching in subsurface reservoir modeling, but struggle to handle the complex nonlinear and non-Gaussian behaviors prevalent in real field applications. To address these limitations, this paper introduces a deep generative model-based parameterization that effectively reduces dimensionality, preserves non-Gaussian patterns, and enhances linearity with observational data. We propose a regularized variational autoencoder (RVAE) comprising three integrated components: (1) an encoder that projects high-dimensional reservoir model parameters into a low-dimensional latent space, capturing complex non-Gaussian distributions; (2) a decoder that maps latent variables back to the original model space, ensuring accurate and geologically consistent reconstruction; and (3) a lightweight linear subnetwork that imposes additional regularization on the latent space, enforcing a linear relationship with observational data. This RVAE framework strengthens ensemble-based methods by aligning the parameterization more closely with their linear-Gaussian assumptions, thereby enhancing compatibility and improving history matching accuracy. Given the high computational cost and time typically required for forward reservoir simulations, we adopt a semi-supervised learning approach, utilizing a dataset where only a small subset of the generated model realizations is paired with production data derived from these simulations. For the network design, the core architecture of the RVAE is based on a convolutional DenseNet, integrated with an attention mechanism to optimize feature representation. Experimental results demonstrate that the proposed approach effectively captures non-Gaussian patterns in permeability fields and significantly improves the assimilation of highly nonlinear production data.</div></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"7 1","pages":"Article 100201"},"PeriodicalIF":4.2,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147396655","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Spatial mapping and modelling of soil organic carbon using random forest and remote sensing variables in part of Kaduna, Northern Nigeria 利用随机森林和遥感变量在尼日利亚北部卡杜纳部分地区进行土壤有机碳空间制图和建模
IF 4.2 Pub Date : 2026-03-01 Epub Date: 2026-03-04 DOI: 10.1016/j.aiig.2026.100198
Mays Taha Yaqub , Fatihu Kabir Sadiq , Mohammed Abdal-Mnam Hassan , AbdulKarem Ahmed Meklef Alalwany
Reliable and up-to-date digital soil data is crucial for achieving Sustainable Development Goal 13 (Climate Action) by enabling improved monitoring of soil carbon and land degradation, thereby supporting climate-smart agriculture and ensuring stable crop yields in sub-Saharan Africa. This study focuses on the spatial mapping of soil organic carbon (SOC) comparing predictive models that integrate Landsat 8 variables and DEM derivatives within a Random Forest framework. Three models were evaluated: Model A, which incorporates only Landsat 8 derivatives; Model B, based solely on DEM variables; and Model C, which integrates both Landsat 8 and DEM datasets. The results indicate that Model A achieved an RMSE of 0.15 (%) and an R2 of 0.67, while Model B achieved an RMSE of 0.19 (%) and an R2 of 0.54. Model C (the combined model) achieved the highest explanatory power with an R2 of 0.69. The findings highlight the significant influence of DEM-derived variables, such as profile and plan curvature, on SOC distribution, emphasizing the role of terrain attributes in SOC mapping. This study demonstrates the potential of RF modeling for SOC prediction, reinforcing the importance of integrating spectral and topographic variables to enhance accuracy. To achieve sustainable farming and resilient crop production in sub-Saharan Africa, accurate digital soil mapping is essential. These datasets empower climate action by tracking soil health and carbon sequestration, providing the necessary evidence base for effective land management strategies.
可靠和最新的数字土壤数据对于实现可持续发展目标13(气候行动)至关重要,因为它可以改善对土壤碳和土地退化的监测,从而支持气候智慧型农业并确保撒哈拉以南非洲地区的作物稳定产量。本研究的重点是在随机森林框架下比较整合Landsat 8变量和DEM衍生物的预测模型对土壤有机碳(SOC)的空间映射。评估了三种模型:模型A,仅包含Landsat 8衍生品;模型B,仅基于DEM变量;模型C集成了Landsat 8和DEM数据集。结果表明,模型A的RMSE为0.15 (%),R2为0.67;模型B的RMSE为0.19 (%),R2为0.54。模型C(组合模型)的解释能力最高,R2为0.69。研究结果强调了dem衍生变量(如剖面和平面曲率)对有机碳分布的显著影响,强调了地形属性在有机碳制图中的作用。这项研究证明了射频建模在SOC预测中的潜力,强调了整合光谱和地形变量以提高准确性的重要性。为了在撒哈拉以南非洲实现可持续农业和抗灾作物生产,精确的数字土壤测绘至关重要。这些数据集通过跟踪土壤健康和碳固存,增强了气候行动的能力,为有效的土地管理战略提供了必要的证据基础。
{"title":"Spatial mapping and modelling of soil organic carbon using random forest and remote sensing variables in part of Kaduna, Northern Nigeria","authors":"Mays Taha Yaqub ,&nbsp;Fatihu Kabir Sadiq ,&nbsp;Mohammed Abdal-Mnam Hassan ,&nbsp;AbdulKarem Ahmed Meklef Alalwany","doi":"10.1016/j.aiig.2026.100198","DOIUrl":"10.1016/j.aiig.2026.100198","url":null,"abstract":"<div><div>Reliable and up-to-date digital soil data is crucial for achieving Sustainable Development Goal 13 (Climate Action) by enabling improved monitoring of soil carbon and land degradation, thereby supporting climate-smart agriculture and ensuring stable crop yields in sub-Saharan Africa. This study focuses on the spatial mapping of soil organic carbon (SOC) comparing predictive models that integrate Landsat 8 variables and DEM derivatives within a Random Forest framework. Three models were evaluated: Model A, which incorporates only Landsat 8 derivatives; Model B, based solely on DEM variables; and Model C, which integrates both Landsat 8 and DEM datasets. The results indicate that Model A achieved an RMSE of 0.15 (%) and an R<sup>2</sup> of 0.67, while Model B achieved an RMSE of 0.19 (%) and an R<sup>2</sup> of 0.54. Model C (the combined model) achieved the highest explanatory power with an R<sup>2</sup> of 0.69. The findings highlight the significant influence of DEM-derived variables, such as profile and plan curvature, on SOC distribution, emphasizing the role of terrain attributes in SOC mapping. This study demonstrates the potential of RF modeling for SOC prediction, reinforcing the importance of integrating spectral and topographic variables to enhance accuracy. To achieve sustainable farming and resilient crop production in sub-Saharan Africa, accurate digital soil mapping is essential. These datasets empower climate action by tracking soil health and carbon sequestration, providing the necessary evidence base for effective land management strategies.</div></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"7 1","pages":"Article 100198"},"PeriodicalIF":4.2,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147396654","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hierarchical machine learning for the automatic classification of surface deformation from SAR observations 基于分层机器学习的SAR地表形变自动分类
IF 4.2 Pub Date : 2026-03-01 Epub Date: 2025-12-09 DOI: 10.1016/j.aiig.2025.100171
Jhonatan Rivera-Rivera , Héctor Aguilera , Marta Béjar-Pizarro , Carolina Guardiola-Albert , Pablo Ezquerro , Anna Barra
Ground deformation processes, such as landslides and subsidence, cause significant social, economic, and environmental impacts. This study aims to automatically classify ground deformation processes in Spain using a machine learning approach applied to InSAR-based datasets. The database integrates InSAR measurement points (MPs) from 20 case studies in Spain, obtained from various institutional sources, and 32 geoenvironmental variables related to ground deformation, morphometry, geology, climate, and land use. The proposed classification strategy follows a hierarchical structure with two levels: first, distinguishing between landslides and subsidence; then, identifying the specific type within each main class (mining landslide, environmental landslide, constructive subsidence, mining subsidence, and piezometric subsidence). Several machine learning algorithms (Naïve Bayes, Logistic Regression, Decision Tree, Random Forest, Extra Trees, Gradient Boosting Machine, XGBoost, LightGBM, and CatBoost) and data configurations were tested, combining different spatial resolutions and class balancing techniques. The best performance (Cohen's Kappa = 0.78) was achieved with the hierarchical approach using the 200 m grid dataset, applying XGBoost for the parental and landslide models, and CatBoost for the subsidence model. Using this approach, 70 % de test sites achieved over 88 % correctly classified cells, 20 % had between 50 % and 83 %, and only one test case was entirely misclassified. The analysis of the most relevant variables indicates that annual mean precipitation, mining activity, buildings, landslide susceptibility, and slope are key factors. These results demonstrate the potential of the hierarchical approach to improve classification and lay the groundwork for future application at national and European scales, incorporating new training cases, process types, and continental data sources. In conclusion, this study presents, for the first time, a hierarchical machine learning model capable of accurately classifying ground deformation processes in Spain, with the aim of supporting territorial management and geohazard mitigation.
地面变形过程,如滑坡和下沉,会造成重大的社会、经济和环境影响。本研究旨在使用应用于基于insar的数据集的机器学习方法对西班牙的地面变形过程进行自动分类。该数据库整合了来自西班牙20个案例研究的InSAR测量点(MPs),这些数据来自不同的机构来源,以及与地面变形、地貌测量、地质、气候和土地利用相关的32个地球环境变量。提出的分类策略遵循两个层次的分层结构:第一,区分滑坡和沉降;然后,在每个主要类别中确定具体类型(采矿滑坡、环境滑坡、建设性沉陷、采矿沉陷和压力沉降)。结合不同的空间分辨率和类平衡技术,测试了几种机器学习算法(Naïve Bayes, Logistic Regression, Decision Tree, Random Forest, Extra Trees, Gradient Boosting machine, XGBoost, LightGBM和CatBoost)和数据配置。使用200米网格数据集的分层方法获得了最佳性能(Cohen’s Kappa = 0.78),对亲代和滑坡模型应用XGBoost,对沉降模型应用CatBoost。使用这种方法,70%的测试站点实现了超过88%的正确分类单元,20%的站点在50%到83%之间,并且只有一个测试用例完全被错误分类。对最相关变量的分析表明,年平均降水、采矿活动、建筑物、滑坡易感性和坡度是关键因素。这些结果显示了层次方法改进分类的潜力,并为将来在国家和欧洲范围内的应用奠定了基础,结合了新的培训案例、过程类型和大陆数据源。总之,本研究首次提出了一种分层机器学习模型,能够准确地对西班牙的地面变形过程进行分类,目的是支持领土管理和减轻地质灾害。
{"title":"Hierarchical machine learning for the automatic classification of surface deformation from SAR observations","authors":"Jhonatan Rivera-Rivera ,&nbsp;Héctor Aguilera ,&nbsp;Marta Béjar-Pizarro ,&nbsp;Carolina Guardiola-Albert ,&nbsp;Pablo Ezquerro ,&nbsp;Anna Barra","doi":"10.1016/j.aiig.2025.100171","DOIUrl":"10.1016/j.aiig.2025.100171","url":null,"abstract":"<div><div>Ground deformation processes, such as landslides and subsidence, cause significant social, economic, and environmental impacts. This study aims to automatically classify ground deformation processes in Spain using a machine learning approach applied to InSAR-based datasets. The database integrates InSAR measurement points (MPs) from 20 case studies in Spain, obtained from various institutional sources, and 32 geoenvironmental variables related to ground deformation, morphometry, geology, climate, and land use. The proposed classification strategy follows a hierarchical structure with two levels: first, distinguishing between landslides and subsidence; then, identifying the specific type within each main class (mining landslide, environmental landslide, constructive subsidence, mining subsidence, and piezometric subsidence). Several machine learning algorithms (Naïve Bayes, Logistic Regression, Decision Tree, Random Forest, Extra Trees, Gradient Boosting Machine, XGBoost, LightGBM, and CatBoost) and data configurations were tested, combining different spatial resolutions and class balancing techniques. The best performance (Cohen's Kappa = 0.78) was achieved with the hierarchical approach using the 200 m grid dataset, applying XGBoost for the parental and landslide models, and CatBoost for the subsidence model. Using this approach, 70 % de test sites achieved over 88 % correctly classified cells, 20 % had between 50 % and 83 %, and only one test case was entirely misclassified. The analysis of the most relevant variables indicates that annual mean precipitation, mining activity, buildings, landslide susceptibility, and slope are key factors. These results demonstrate the potential of the hierarchical approach to improve classification and lay the groundwork for future application at national and European scales, incorporating new training cases, process types, and continental data sources. In conclusion, this study presents, for the first time, a hierarchical machine learning model capable of accurately classifying ground deformation processes in Spain, with the aim of supporting territorial management and geohazard mitigation.</div></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"7 1","pages":"Article 100171"},"PeriodicalIF":4.2,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145841219","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A hybrid unsupervised-supervised deep learning framework for sandstone thickness prediction from seismic data 基于地震数据预测砂岩厚度的混合无监督深度学习框架
IF 4.2 Pub Date : 2026-03-01 Epub Date: 2026-03-06 DOI: 10.1016/j.aiig.2026.100199
Yixue Xiong , Bing Tan , Qiannan Wang , Bing Li , Zegang Wang , Xiaoyi Zhou , Xingyu Liu , Wenqiang Ma , Lan Huang , Zhiguo Wang
Accurate sandstone thickness prediction from seismic data is vital for reservoir characterization and well placement optimization. However, conventional deep learning methods are often hindered by inefficient sequential processing or excessive computational costs when handling long seismic traces. To overcome these limitations, we propose a two-stage deep learning framework. First, an unsupervised feature extraction network derives high-dimensional latent representations directly from seismic data. Second, a novel reservoir sequence prediction network—utilizing efficient ProbSparse self-attention and self-attention distilling—maps these features to sandstone thickness, even with limited well-log training data. When applied to a field dataset with limited borehole control, our method resolved sandstone bodies thickness about 15∼20 m and achieved a Mean Absolute Percentage Error of just 3.7% at blind validation wells. This hybrid approach offers a robust, computationally efficient solution for high-precision reservoir prediction in data-constrained environments.
从地震数据中准确预测砂岩厚度对于储层表征和井位优化至关重要。然而,传统的深度学习方法在处理长地震轨迹时经常受到低效的顺序处理或过多的计算成本的阻碍。为了克服这些限制,我们提出了一个两阶段的深度学习框架。首先,无监督特征提取网络直接从地震数据中提取高维潜在表示。其次,利用高效的ProbSparse自关注和自关注提取的新型储层序列预测网络,即使在有限的测井训练数据下,也能将这些特征映射到砂岩厚度上。当应用于有限井眼控制的现场数据集时,我们的方法可以分辨出砂岩体厚度约为15 ~ 20 m,并且在盲验证井中实现了平均绝对百分比误差仅为3.7%。这种混合方法为数据受限环境下的高精度储层预测提供了一种强大的、计算效率高的解决方案。
{"title":"A hybrid unsupervised-supervised deep learning framework for sandstone thickness prediction from seismic data","authors":"Yixue Xiong ,&nbsp;Bing Tan ,&nbsp;Qiannan Wang ,&nbsp;Bing Li ,&nbsp;Zegang Wang ,&nbsp;Xiaoyi Zhou ,&nbsp;Xingyu Liu ,&nbsp;Wenqiang Ma ,&nbsp;Lan Huang ,&nbsp;Zhiguo Wang","doi":"10.1016/j.aiig.2026.100199","DOIUrl":"10.1016/j.aiig.2026.100199","url":null,"abstract":"<div><div>Accurate sandstone thickness prediction from seismic data is vital for reservoir characterization and well placement optimization. However, conventional deep learning methods are often hindered by inefficient sequential processing or excessive computational costs when handling long seismic traces. To overcome these limitations, we propose a two-stage deep learning framework. First, an unsupervised feature extraction network derives high-dimensional latent representations directly from seismic data. Second, a novel reservoir sequence prediction network—utilizing efficient ProbSparse self-attention and self-attention distilling—maps these features to sandstone thickness, even with limited well-log training data. When applied to a field dataset with limited borehole control, our method resolved sandstone bodies thickness about 15∼20 m and achieved a Mean Absolute Percentage Error of just 3.7% at blind validation wells. This hybrid approach offers a robust, computationally efficient solution for high-precision reservoir prediction in data-constrained environments.</div></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"7 1","pages":"Article 100199"},"PeriodicalIF":4.2,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147396656","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Fossil Frontier: An answer to the 3-billion fossil question 化石前沿:30亿化石问题的答案
IF 4.2 Pub Date : 2026-03-01 Epub Date: 2025-12-30 DOI: 10.1016/j.aiig.2025.100184
Iver Martinsen , Benjamin Ricaud , David Wade , Odd Kolbjørnsen , Fred Godtliebsen
Microfossil analysis is important in subsurface mapping, for example to match strata between wells. This analysis is currently conducted by specialist geoscientists who manually investigate large numbers of physical samples with the aim of identifying informative microfossil species and genera. The current digitalization of large volumes of microfossil samples that is being conducted by the Norwegian Offshore Directorate, paired with AI development, opens up new opportunities for automating parts of the analysis to help the geologist in the analysis. Unsupervised representation learning is a research area in Artificial Intelligence (AI) that lies at the core of this challenge, as this way of learning can create useful image representations by utilizing large volumes of data without requiring labels. Previous work has presented good results for the classification of a limited number of classes, but there are still challenges related to classification in realistic settings where additional unknown species are present. In this paper, we connect unsupervised representation learning and uncertainty estimation and create a comprehensive tool to automate microfossil analysis. We present our methodology and results in three parts. In the first part, we train several AI models from scratch using state-of-the-art self-supervised learning methods, obtaining excellent results compared against state-of-the-art foundation models for image classification and content-based image retrieval. In the second part, we develop a method based on conformal prediction which enables our classifier to handle a large pool of images containing both in-distribution and out-of-distribution data, while at the same time allowing us to create error estimates to control the uncertainty of the prediction sets. In the third part, we use our method to create distribution charts of fossils for a range of genera in multiple wells.
微化石分析在地下测绘中很重要,例如在井间匹配地层。这种分析目前是由专业的地球科学家进行的,他们手动调查大量的物理样本,目的是识别信息丰富的微化石物种和属。目前,挪威海上管理局正在对大量微化石样本进行数字化处理,再加上人工智能的发展,为自动化部分分析提供了新的机会,从而帮助地质学家进行分析。无监督表示学习是人工智能(AI)的一个研究领域,也是这一挑战的核心,因为这种学习方式可以通过利用大量数据而不需要标签来创建有用的图像表示。以前的工作已经为有限数量的分类提供了良好的结果,但是在存在额外未知物种的现实环境中,分类仍然存在挑战。在本文中,我们将无监督表示学习和不确定性估计联系起来,并创建了一个自动化微化石分析的综合工具。我们分三部分介绍我们的方法和结果。在第一部分中,我们使用最先进的自监督学习方法从头开始训练几个AI模型,与最先进的图像分类和基于内容的图像检索基础模型相比,获得了出色的结果。在第二部分中,我们开发了一种基于保形预测的方法,该方法使我们的分类器能够处理包含分布内和分布外数据的大量图像,同时允许我们创建误差估计来控制预测集的不确定性。在第三部分中,我们使用我们的方法在多个井中创建了一系列属的化石分布图。
{"title":"The Fossil Frontier: An answer to the 3-billion fossil question","authors":"Iver Martinsen ,&nbsp;Benjamin Ricaud ,&nbsp;David Wade ,&nbsp;Odd Kolbjørnsen ,&nbsp;Fred Godtliebsen","doi":"10.1016/j.aiig.2025.100184","DOIUrl":"10.1016/j.aiig.2025.100184","url":null,"abstract":"<div><div>Microfossil analysis is important in subsurface mapping, for example to match strata between wells. This analysis is currently conducted by specialist geoscientists who manually investigate large numbers of physical samples with the aim of identifying informative microfossil species and genera. The current digitalization of large volumes of microfossil samples that is being conducted by the Norwegian Offshore Directorate, paired with AI development, opens up new opportunities for automating parts of the analysis to help the geologist in the analysis. Unsupervised representation learning is a research area in Artificial Intelligence (AI) that lies at the core of this challenge, as this way of learning can create useful image representations by utilizing large volumes of data without requiring labels. Previous work has presented good results for the classification of a limited number of classes, but there are still challenges related to classification in realistic settings where additional unknown species are present. In this paper, we connect unsupervised representation learning and uncertainty estimation and create a comprehensive tool to automate microfossil analysis. We present our methodology and results in three parts. In the first part, we train several AI models from scratch using state-of-the-art self-supervised learning methods, obtaining excellent results compared against state-of-the-art foundation models for image classification and content-based image retrieval. In the second part, we develop a method based on conformal prediction which enables our classifier to handle a large pool of images containing both in-distribution and out-of-distribution data, while at the same time allowing us to create error estimates to control the uncertainty of the prediction sets. In the third part, we use our method to create distribution charts of fossils for a range of genera in multiple wells.</div></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"7 1","pages":"Article 100184"},"PeriodicalIF":4.2,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145939042","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DeepSeg-based noise reduction algorithm trained on a hybrid synthetic dataset for signals from acoustic logging-while-drilling 基于deepseg的降噪算法在混合合成数据集上进行训练,以获取随钻声波测井信号
IF 4.2 Pub Date : 2026-03-01 Epub Date: 2026-02-06 DOI: 10.1016/j.aiig.2026.100194
Xin Fu , Junyi Song , Yang Gou
Acoustic logging-while-drilling (ALWD) enables real-time acoustic measurements during drilling operations. However, challenging downhole conditions introduce considerable noise into ALWD signals. This study applied a numerical method to simulate clean ALWD array waveforms. A synthetic noisy dataset was subsequently generated by superimposing high-quality noise data acquired during laboratory measurements onto the simulated clean dataset. Time–frequency spectral representations of noisy signals were obtained via short-time discrete cosine transformation and were divided into past, present, and future intervals. These were used as input for a U-Net-based neural network—DeepSeg—that was employed for frequency-domain denoising. The trained model outputted denoised frequency segments for the intermediate current time interval. This sliding-window strategy applied to frequency slices substantially reduced the required dataset size. The network effectively removed complex downhole noise, even with limited training data, and demonstrated a strong generalization capability on field data. The network also significantly enhanced the quality of acoustic array signals and improved the accuracy of slowness-time-coherence processing in a logging-while-drilling application example. Requiring as little as one tenth of the amount of data used in methods from previous studies, the proposed method is particularly advantageous for ALWD applications, where data acquisition is challenging.
随钻声波测井(ALWD)可以在钻井过程中进行实时声波测量。然而,恶劣的井下环境会给ALWD信号带来相当大的噪声。本研究采用数值方法模拟干净ALWD阵列波形。随后,通过将实验室测量期间获得的高质量噪声数据叠加到模拟干净数据集上,生成合成噪声数据集。噪声信号的时频谱表示通过短时离散余弦变换获得,并分为过去,现在和未来的间隔。这些数据被用作基于u - net的神经网络deepseg的输入,该网络用于频域去噪。训练后的模型输出中间电流时间间隔的去噪频率段。这种用于频率切片的滑动窗口策略大大减少了所需的数据集大小。即使训练数据有限,该网络也能有效地去除复杂的井下噪声,并显示出对现场数据的强大泛化能力。在随钻测井应用实例中,该网络还显著提高了声阵列信号的质量,提高了慢速-时间相干处理的准确性。该方法所需的数据量仅为以往研究方法的十分之一,对于数据采集具有挑战性的随钻测井应用尤其有利。
{"title":"DeepSeg-based noise reduction algorithm trained on a hybrid synthetic dataset for signals from acoustic logging-while-drilling","authors":"Xin Fu ,&nbsp;Junyi Song ,&nbsp;Yang Gou","doi":"10.1016/j.aiig.2026.100194","DOIUrl":"10.1016/j.aiig.2026.100194","url":null,"abstract":"<div><div>Acoustic logging-while-drilling (ALWD) enables real-time acoustic measurements during drilling operations. However, challenging downhole conditions introduce considerable noise into ALWD signals. This study applied a numerical method to simulate clean ALWD array waveforms. A synthetic noisy dataset was subsequently generated by superimposing high-quality noise data acquired during laboratory measurements onto the simulated clean dataset. Time–frequency spectral representations of noisy signals were obtained via short-time discrete cosine transformation and were divided into past, present, and future intervals. These were used as input for a U-Net-based neural network—DeepSeg—that was employed for frequency-domain denoising. The trained model outputted denoised frequency segments for the intermediate current time interval. This sliding-window strategy applied to frequency slices substantially reduced the required dataset size. The network effectively removed complex downhole noise, even with limited training data, and demonstrated a strong generalization capability on field data. The network also significantly enhanced the quality of acoustic array signals and improved the accuracy of slowness-time-coherence processing in a logging-while-drilling application example. Requiring as little as one tenth of the amount of data used in methods from previous studies, the proposed method is particularly advantageous for ALWD applications, where data acquisition is challenging.</div></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"7 1","pages":"Article 100194"},"PeriodicalIF":4.2,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146188456","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Prediction of the soil–water retention curve of compacted clays using PSO–GA XGBoost 利用PSO-GA XGBoost预测压实粘土的土水保持曲线
IF 4.2 Pub Date : 2026-03-01 Epub Date: 2025-12-03 DOI: 10.1016/j.aiig.2025.100173
Reza Taherdangkoo , Thomas Nagel , Vladimir Tyurin , Chaofan Chen , Faramarz Doulati Ardejani , Christoph Butscher
Soil–water retention (SWR) is fundamental for understanding the hydro-mechanical behavior of unsaturated clay soils. The soil–water retention curve is typically obtained through extensive and costly laboratory testing. To offer a more efficient alternative, an extreme gradient boosting (XGBoost) model, optimized using a hybrid particle swarm optimization and genetic algorithm (PSO–GA), was developed. This hybrid model estimates the SWR across a broad suction range, accounting for both drying and wetting paths, along with key soil parameters. The performance of the model was evaluated through various statistical analyses and by comparing the predicted gravimetric water content with experimental data. A backward feature elimination method was employed to assess the impact of various input parameters on model accuracy and to offer a simplified model for scenarios with limited data availability. Additionally, Monte Carlo simulations were conducted to quantify the inherent uncertainties associated with the dataset, XGBoost hyperparameters, and model performance. The hybrid PSO–GA XGBoost model effectively estimates the water retention of clayey soils during both drying and wetting cycles, proving to be an alternative to traditional soil mechanics correlations.
土壤保水是理解非饱和粘土水力学特性的基础。土壤-水保持曲线通常是通过广泛和昂贵的实验室测试获得的。为了提供更有效的替代方案,开发了一种使用混合粒子群优化和遗传算法(PSO-GA)进行优化的极端梯度增强(XGBoost)模型。该混合模型估计了在广泛的吸力范围内的SWR,考虑了干燥和湿润路径,以及关键的土壤参数。通过各种统计分析,并将预测的重力含水率与实验数据进行比较,对模型的性能进行了评价。采用反向特征消去法评估各种输入参数对模型精度的影响,并为数据可用性有限的场景提供简化模型。此外,还进行了蒙特卡罗模拟,以量化与数据集、XGBoost超参数和模型性能相关的固有不确定性。混合PSO-GA XGBoost模型有效地估计了粘土在干湿循环中的保水性,证明是传统土壤力学相关性的替代方法。
{"title":"Prediction of the soil–water retention curve of compacted clays using PSO–GA XGBoost","authors":"Reza Taherdangkoo ,&nbsp;Thomas Nagel ,&nbsp;Vladimir Tyurin ,&nbsp;Chaofan Chen ,&nbsp;Faramarz Doulati Ardejani ,&nbsp;Christoph Butscher","doi":"10.1016/j.aiig.2025.100173","DOIUrl":"10.1016/j.aiig.2025.100173","url":null,"abstract":"<div><div>Soil–water retention (SWR) is fundamental for understanding the hydro-mechanical behavior of unsaturated clay soils. The soil–water retention curve is typically obtained through extensive and costly laboratory testing. To offer a more efficient alternative, an extreme gradient boosting (XGBoost) model, optimized using a hybrid particle swarm optimization and genetic algorithm (PSO–GA), was developed. This hybrid model estimates the SWR across a broad suction range, accounting for both drying and wetting paths, along with key soil parameters. The performance of the model was evaluated through various statistical analyses and by comparing the predicted gravimetric water content with experimental data. A backward feature elimination method was employed to assess the impact of various input parameters on model accuracy and to offer a simplified model for scenarios with limited data availability. Additionally, Monte Carlo simulations were conducted to quantify the inherent uncertainties associated with the dataset, XGBoost hyperparameters, and model performance. The hybrid PSO–GA XGBoost model effectively estimates the water retention of clayey soils during both drying and wetting cycles, proving to be an alternative to traditional soil mechanics correlations.</div></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"7 1","pages":"Article 100173"},"PeriodicalIF":4.2,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145747577","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Remote sensing estimation of rice chlorophyll content based on UAV image feature selection and PSO-optimized ensemble learning 基于无人机图像特征选择和pso优化集成学习的水稻叶绿素含量遥感估计
IF 4.2 Pub Date : 2026-03-01 Epub Date: 2026-01-23 DOI: 10.1016/j.aiig.2026.100190
Tuo Wang , Guijun Yang , Xingang Xu , Jiaqi Sun , Yang Meng , Xiaodong Yang , Haikuan Feng , Hanyu Xue , Xinlei Xu , Yuekun Song
Chlorophyll content is one crucial indicator of evaluating crop growth and physiological status. Rapid, accurate, and large-scale monitoring of chlorophyll content is vital for the precise diagnosis of crop nutritional status and developmental dynamics. In this study, rice chlorophyll content was estimated using remote sensing data derived from field samples collected under varying nitrogen application levels. The proposed monitoring framework integrated spectral feature selection methods, data augmentation techniques with ensemble learning models, using multi-temporal multispectral imagery acquired by unmanned aerial vehicles (UAVs). Firstly, one two-stage Otsu adaptive thresholding method was utilized to efficiently extract rice pixels from UAV images captured at different growth stages, thereby eliminating non-rice noise in paddy field imagery. Subsequently, spectral vegetation indices, and texture features of rice UAV images were extracted. Three feature selection methods—GRA, mRMR, and SHAP—were employed to identify chlorophyll-sensitive features. Finally, based on Gaussian Mixture Model (GMM)-based data augmentation strategy and the selected sensitive features, one two-layer ensemble learning framework optimized by using Particle Swarm Optimization (PSO) algorithm was proposed to achieve accurate and efficient estimation of rice chlorophyll content across different growth stages. The results demonstrate that the SHAP-based feature selection method consistently achieved the best prediction performance across all growth stages, highlighting its superiority in retaining key feature information and enhancing model stability. Compared with individual machine learning models, the SHAP–PSO-based ensemble learning model exhibited higher accuracy and robustness, with R2 of 0.641, 0.555, and 0.527, and corresponding rRMSE of 0.055, 0.099, and 0.104 across the different stages. These findings indicate that the proposed ensemble learning framework, which integrates SHAP-based feature selection, GMM-based data augmentation, and PSO optimization, possesses strong generalization capability and stability for the remote sensing estimation of rice chlorophyll content. Moreover, the methodology proposed in this study can serve as a valuable reference for remote sensing-based monitoring of physicochemical parameters in other crops.
叶绿素含量是评价作物生长和生理状况的重要指标之一。快速、准确、大规模地监测叶绿素含量对于准确诊断作物营养状况和发育动态至关重要。本研究利用不同施氮水平下稻田样品的遥感数据估算了水稻叶绿素含量。该监测框架利用无人机获取的多时相多光谱图像,将光谱特征选择方法、数据增强技术与集成学习模型相结合。首先,利用两阶段Otsu自适应阈值法从不同生长阶段的无人机图像中高效提取水稻像元,从而消除稻田图像中的非水稻噪声;随后,提取水稻无人机影像的光谱植被指数和纹理特征。采用gra、mRMR和shap三种特征选择方法鉴定叶绿素敏感特征。最后,基于高斯混合模型(Gaussian Mixture Model, GMM)的数据增强策略和选取的敏感特征,提出了一种基于粒子群优化(PSO)算法优化的两层集成学习框架,以实现水稻不同生长阶段叶绿素含量的准确高效估计。结果表明,基于shap的特征选择方法在所有生长阶段都能保持最佳的预测性能,突出了其在保留关键特征信息和增强模型稳定性方面的优势。与单个机器学习模型相比,基于shap - pso的集成学习模型具有更高的准确性和鲁棒性,不同阶段的R2分别为0.641、0.555和0.527,相应的rRMSE分别为0.055、0.099和0.104。综上所述,基于shap的特征选择、基于gmm的数据增强和PSO优化相结合的集成学习框架对水稻叶绿素含量的遥感估算具有较强的泛化能力和稳定性。此外,本文提出的方法可为其他作物理化参数的遥感监测提供有价值的参考。
{"title":"Remote sensing estimation of rice chlorophyll content based on UAV image feature selection and PSO-optimized ensemble learning","authors":"Tuo Wang ,&nbsp;Guijun Yang ,&nbsp;Xingang Xu ,&nbsp;Jiaqi Sun ,&nbsp;Yang Meng ,&nbsp;Xiaodong Yang ,&nbsp;Haikuan Feng ,&nbsp;Hanyu Xue ,&nbsp;Xinlei Xu ,&nbsp;Yuekun Song","doi":"10.1016/j.aiig.2026.100190","DOIUrl":"10.1016/j.aiig.2026.100190","url":null,"abstract":"<div><div>Chlorophyll content is one crucial indicator of evaluating crop growth and physiological status. Rapid, accurate, and large-scale monitoring of chlorophyll content is vital for the precise diagnosis of crop nutritional status and developmental dynamics. In this study, rice chlorophyll content was estimated using remote sensing data derived from field samples collected under varying nitrogen application levels. The proposed monitoring framework integrated spectral feature selection methods, data augmentation techniques with ensemble learning models, using multi-temporal multispectral imagery acquired by unmanned aerial vehicles (UAVs). Firstly, one two-stage Otsu adaptive thresholding method was utilized to efficiently extract rice pixels from UAV images captured at different growth stages, thereby eliminating non-rice noise in paddy field imagery. Subsequently, spectral vegetation indices, and texture features of rice UAV images were extracted. Three feature selection methods—GRA, mRMR, and SHAP—were employed to identify chlorophyll-sensitive features. Finally, based on Gaussian Mixture Model (GMM)-based data augmentation strategy and the selected sensitive features, one two-layer ensemble learning framework optimized by using Particle Swarm Optimization (PSO) algorithm was proposed to achieve accurate and efficient estimation of rice chlorophyll content across different growth stages. The results demonstrate that the SHAP-based feature selection method consistently achieved the best prediction performance across all growth stages, highlighting its superiority in retaining key feature information and enhancing model stability. Compared with individual machine learning models, the SHAP–PSO-based ensemble learning model exhibited higher accuracy and robustness, with <em>R</em><sup><em>2</em></sup> of 0.641, 0.555, and 0.527, and corresponding rRMSE of 0.055, 0.099, and 0.104 across the different stages. These findings indicate that the proposed ensemble learning framework, which integrates SHAP-based feature selection, GMM-based data augmentation, and PSO optimization, possesses strong generalization capability and stability for the remote sensing estimation of rice chlorophyll content. Moreover, the methodology proposed in this study can serve as a valuable reference for remote sensing-based monitoring of physicochemical parameters in other crops.</div></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"7 1","pages":"Article 100190"},"PeriodicalIF":4.2,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146188457","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Application of YOLOv11 deep learning model for classification and counting ice-rafted debris (IRD) in core sediments in the Arctic Ocean YOLOv11深度学习模型在北冰洋岩心沉积物冰筏碎片分类与计数中的应用
IF 4.2 Pub Date : 2026-03-01 Epub Date: 2026-01-29 DOI: 10.1016/j.aiig.2026.100191
Sunhwa Bang , Jae-Yoon Keum , Yoon Ji , Yang Jae Kang , Byung-Dal So , Jong Kuk Hong , Hyo-Im Kim
The classification and quantification of ice-rafted debris (IRD) in marine sediments are key to reconstructing glacial-interglacial dynamics and sediment provenance. However, traditional IRD analysis, based on manual grain identification under binocular microscopes, is time-consuming and dependent on expertise, which acted as a barrier to entry for IRD research. Here, we present a deep learning-based framework for the automated detection and lithological classification of IRD from high-resolution microscopic images with grains from natural Arctic sediments. Using the YOLOv11 algorithm, we designed a two-stage model: an instance segmentation model (Model 1) that detects individual IRD from multi-grain images, and a classification model (Model 2) that categorizes each grain into one of four lithological types (i.e., quartz, detrital carbonate, clastic, and crystalline). The dataset comprises 110 images containing 9642 grains from the Chukchi Sea sediment core ARA14C-ST12. Model 1 achieved the detection performance with precision = 0.95, recall = 0.97, mAP50 = 0.98, mAP50-95 = 0.85, and F1 score = 0.96, demonstrating high model performance to detect the complex morphology of grain. The evaluation metric of Model 2, used to identify lithological classes, showed average Top-1 accuracy of 0.87 and 0.75 on the validation and test sets, respectively. The classification model showed reliable recognition for quartz and detrital carbonate, with moderate confusion between clastic and crystalline grains. These results demonstrate that the proposed YOLOv11-based approach enables rapid, reproducible, and objective lithological classification of IRD grains, providing an efficient alternative to conventional manual counting.
海洋沉积物中浮冰碎屑(IRD)的分类和量化是重建冰期-间冰期动力学和沉积物物源的关键。然而,传统的IRD分析方法是在双目显微镜下进行人工颗粒鉴定,费时且依赖专业知识,这对IRD研究的进入构成了障碍。在这里,我们提出了一个基于深度学习的框架,用于从自然北极沉积物颗粒的高分辨率显微图像中自动检测和岩性分类IRD。使用YOLOv11算法,我们设计了一个两阶段模型:一个实例分割模型(模型1),用于从多颗粒图像中检测单个IRD,一个分类模型(模型2),用于将每个颗粒分类为四种岩性类型(即石英、碎屑碳酸盐岩、碎屑和晶体)中的一种。该数据集包含110幅图像,其中包含9642颗来自楚科奇海沉积岩心ARA14C-ST12的颗粒。模型1的检测性能为precision = 0.95, recall = 0.97, mAP50 = 0.98, mAP50-95 = 0.85, F1得分= 0.96,显示出较高的模型检测谷物复杂形态的性能。模型2的评价指标用于识别岩性类别,在验证集和测试集上的平均Top-1精度分别为0.87和0.75。该分类模型对石英和碎屑碳酸盐岩的识别较为可靠,对碎屑和结晶颗粒的区分较为模糊。这些结果表明,基于yolov11的方法可以快速、可重复、客观地对IRD颗粒进行岩性分类,为传统的人工计数提供了有效的替代方法。
{"title":"Application of YOLOv11 deep learning model for classification and counting ice-rafted debris (IRD) in core sediments in the Arctic Ocean","authors":"Sunhwa Bang ,&nbsp;Jae-Yoon Keum ,&nbsp;Yoon Ji ,&nbsp;Yang Jae Kang ,&nbsp;Byung-Dal So ,&nbsp;Jong Kuk Hong ,&nbsp;Hyo-Im Kim","doi":"10.1016/j.aiig.2026.100191","DOIUrl":"10.1016/j.aiig.2026.100191","url":null,"abstract":"<div><div>The classification and quantification of ice-rafted debris (IRD) in marine sediments are key to reconstructing glacial-interglacial dynamics and sediment provenance. However, traditional IRD analysis, based on manual grain identification under binocular microscopes, is time-consuming and dependent on expertise, which acted as a barrier to entry for IRD research. Here, we present a deep learning-based framework for the automated detection and lithological classification of IRD from high-resolution microscopic images with grains from natural Arctic sediments. Using the YOLOv11 algorithm, we designed a two-stage model: an instance segmentation model (Model 1) that detects individual IRD from multi-grain images, and a classification model (Model 2) that categorizes each grain into one of four lithological types (i.e., quartz, detrital carbonate, clastic, and crystalline). The dataset comprises 110 images containing 9642 grains from the Chukchi Sea sediment core ARA14C-ST12. Model 1 achieved the detection performance with precision = 0.95, recall = 0.97, <em>m</em>AP50 = 0.98, <em>m</em>AP50-95 = 0.85, and F1 score = 0.96, demonstrating high model performance to detect the complex morphology of grain. The evaluation metric of Model 2, used to identify lithological classes, showed average Top-1 accuracy of 0.87 and 0.75 on the validation and test sets, respectively. The classification model showed reliable recognition for quartz and detrital carbonate, with moderate confusion between clastic and crystalline grains. These results demonstrate that the proposed YOLOv11-based approach enables rapid, reproducible, and objective lithological classification of IRD grains, providing an efficient alternative to conventional manual counting.</div></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"7 1","pages":"Article 100191"},"PeriodicalIF":4.2,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146188455","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Artificial Intelligence in Geosciences
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1