首页 > 最新文献

Artificial Intelligence in Geosciences最新文献

英文 中文
Online learning to accelerate nonlinear PDE solvers: Applied to multiphase porous media flow 在线学习加速非线性偏微分方程求解:应用于多相多孔介质流
Pub Date : 2025-07-23 DOI: 10.1016/j.aiig.2025.100146
Vinicius L.S. Silva , Pablo Salinas , Claire E. Heaney , Matthew D. Jackson , Christopher C. Pain
We propose a novel type of nonlinear solver acceleration for systems of nonlinear partial differential equations (PDEs) that is based on online/adaptive learning. It is applied in the context of multiphase flow in porous media. The proposed method rely on four pillars: (i) dimensionless numbers as input parameters for the machine learning model, (ii) simplified numerical model (two-dimensional) for the offline training, (iii) dynamic control of a nonlinear solver tuning parameter (numerical relaxation), (iv) and online learning for real-time improvement of the machine learning model. This strategy decreases the number of nonlinear iterations by dynamically modifying a single global parameter, the relaxation factor, and by adaptively learning the attributes of each numerical model on-the-run. Furthermore, this work performs a sensitivity study in the dimensionless parameters (machine learning features), assess the efficacy of various machine learning models, demonstrate a decrease in nonlinear iterations using our method in more intricate, realistic three-dimensional models, and fully couple a machine learning model into an open-source multiphase flow simulator achieving up to 85% reduction in computational time.
我们提出了一种基于在线/自适应学习的非线性偏微分方程(PDEs)系统的非线性求解器加速算法。它适用于多孔介质中多相流的研究。所提出的方法依赖于四个支柱:(i)无量纲数字作为机器学习模型的输入参数,(ii)用于离线训练的简化数值模型(二维),(iii)非线性求解器调谐参数的动态控制(数值松弛),(iv)以及用于实时改进机器学习模型的在线学习。该策略通过动态修改单个全局参数、松弛因子和自适应学习运行中的每个数值模型的属性来减少非线性迭代的次数。此外,这项工作对无维度参数(机器学习特征)进行了敏感性研究,评估了各种机器学习模型的有效性,在更复杂、更逼真的三维模型中使用我们的方法证明了非线性迭代的减少,并将机器学习模型完全耦合到开源多相流模拟器中,实现了高达85%的计算时间减少。
{"title":"Online learning to accelerate nonlinear PDE solvers: Applied to multiphase porous media flow","authors":"Vinicius L.S. Silva ,&nbsp;Pablo Salinas ,&nbsp;Claire E. Heaney ,&nbsp;Matthew D. Jackson ,&nbsp;Christopher C. Pain","doi":"10.1016/j.aiig.2025.100146","DOIUrl":"10.1016/j.aiig.2025.100146","url":null,"abstract":"<div><div>We propose a novel type of nonlinear solver acceleration for systems of nonlinear partial differential equations (PDEs) that is based on online/adaptive learning. It is applied in the context of multiphase flow in porous media. The proposed method rely on four pillars: (i) dimensionless numbers as input parameters for the machine learning model, (ii) simplified numerical model (two-dimensional) for the offline training, (iii) dynamic control of a nonlinear solver tuning parameter (numerical relaxation), (iv) and online learning for real-time improvement of the machine learning model. This strategy decreases the number of nonlinear iterations by dynamically modifying a single global parameter, the relaxation factor, and by adaptively learning the attributes of each numerical model on-the-run. Furthermore, this work performs a sensitivity study in the dimensionless parameters (machine learning features), assess the efficacy of various machine learning models, demonstrate a decrease in nonlinear iterations using our method in more intricate, realistic three-dimensional models, and fully couple a machine learning model into an open-source multiphase flow simulator achieving up to 85% reduction in computational time.</div></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"6 2","pages":"Article 100146"},"PeriodicalIF":0.0,"publicationDate":"2025-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144702140","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Prediction of groundwater level in Indonesian tropical peatland forest plantations using machine learning 利用机器学习预测印度尼西亚热带泥炭地森林人工林的地下水位
Pub Date : 2025-07-21 DOI: 10.1016/j.aiig.2025.100148
Kazuo Yonekura , Sota Miyazaki , Masaatsu Aichi , Takafumi Nishizu , Masao Hasegawa , Katsuyuki Suzuki
Maintaining high groundwater level (GWL) is important for preventing fires in peatlands. This study proposes GWL prediction using machine learning methods for forest plantations in Indonesian tropical peatlands. Deep neural networks (DNN) have been used for prediction; however, they have not been applied to groundwater prediction in Indonesian peatlands. Tropical peatland is characterized by high permeability and forest plantations are surrounded by several canals. By predicting daily differences in GWL, the GWL can be predicted with high accuracy. DNNs, random forests, support vector regression, and XGBoost were compared, all of which indicated similar errors. The SHAP value revealed that the precipitation falling on the hill rapidly seeps into the soil and flows into the canals, which agrees with the fact that the soil has high permeability. These findings can potentially be used to alleviate and manage future fires in peatlands.
保持高地下水位对泥炭地火灾的防治具有重要意义。本研究提出使用机器学习方法对印度尼西亚热带泥炭地的森林种植园进行GWL预测。深度神经网络(DNN)已被用于预测;然而,它们尚未应用于印度尼西亚泥炭地的地下水预测。热带泥炭地的特点是高渗透性,森林人工林被几条运河包围。通过预测GWL的日差值,可以较准确地预测GWL。将dnn、随机森林、支持向量回归和XGBoost进行比较,结果显示误差相似。SHAP值表明,落在山上的降水迅速渗入土壤并流入渠道,这与土壤具有高渗透性的事实相一致。这些发现可能用于缓解和管理泥炭地未来的火灾。
{"title":"Prediction of groundwater level in Indonesian tropical peatland forest plantations using machine learning","authors":"Kazuo Yonekura ,&nbsp;Sota Miyazaki ,&nbsp;Masaatsu Aichi ,&nbsp;Takafumi Nishizu ,&nbsp;Masao Hasegawa ,&nbsp;Katsuyuki Suzuki","doi":"10.1016/j.aiig.2025.100148","DOIUrl":"10.1016/j.aiig.2025.100148","url":null,"abstract":"<div><div>Maintaining high groundwater level (GWL) is important for preventing fires in peatlands. This study proposes GWL prediction using machine learning methods for forest plantations in Indonesian tropical peatlands. Deep neural networks (DNN) have been used for prediction; however, they have not been applied to groundwater prediction in Indonesian peatlands. Tropical peatland is characterized by high permeability and forest plantations are surrounded by several canals. By predicting daily differences in GWL, the GWL can be predicted with high accuracy. DNNs, random forests, support vector regression, and XGBoost were compared, all of which indicated similar errors. The SHAP value revealed that the precipitation falling on the hill rapidly seeps into the soil and flows into the canals, which agrees with the fact that the soil has high permeability. These findings can potentially be used to alleviate and manage future fires in peatlands.</div></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"6 2","pages":"Article 100148"},"PeriodicalIF":0.0,"publicationDate":"2025-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144714577","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Quantification of greenhouse gas emissions from livestock using remote sensing & artificial intelligence 利用遥感和人工智能对牲畜温室气体排放进行量化
Pub Date : 2025-07-18 DOI: 10.1016/j.aiig.2025.100147
Evet Naturinda , Fortunate Kemigyisha , Anthony Gidudu , Isa Kabenge , Emmanuel Omia , Jackline Aboth
Greenhouse gases (GHGs) from agriculture in Africa are among the world's fastest-growing emissions, with the livestock sector as the primary contributor. However, the methods for quantifying these emissions rely on manual and outdated data collection and processing approaches. Therefore, there is a need to develop more accurate and efficient methods of quantifying GHGs from livestock. This research developed a remote sensing and Artificial Intelligence (AI) based approach to quantify GHG emissions from cattle in the Kisombwa Ranching Scheme in Mubende District, central Uganda.
We trained a deep learning algorithm, You Only Look Once (YOLO) v4, to detect cattle from the Unmanned Aerial Vehicle (UAV) images of the study area and applied the Simple Online Real-time Tracker (SORT) algorithm for automated counting. Methane (CH4) and Nitrous Oxide (N2O) emissions from manure management and enteric fermentation were estimated using the number of cattle and the Tier 1 guidelines from the Intergovernmental Panel on Climate Change (IPCC). The total estimated emissions were 321,121.34 kg carbon dioxide equivalent (CO2eq) per year, with CH4 at 282,282.96 kg CO2eq per year (88 %) and N2O at 38,838.38 kg CO2eq per year (12 %). Enteric fermentation contributed the highest emissions, about 99 % of the total CH4 emissions and 87 % of the total GHGs.
The proposed remote sensing and AI-driven method achieved an average F1 score of 88.9 %, average precision of 97 %, and average recall of 82.9 % on the testing set of images. Therefore, these research findings demonstrate that remote sensing and AI are a more potent and efficient approach to upscale quantifying and reporting animal population and livestock GHG emissions for sustainable agriculture and climate change mitigation.
非洲农业排放的温室气体是世界上增长最快的排放源之一,其中畜牧业是主要排放源。然而,量化这些排放的方法依赖于人工和过时的数据收集和处理方法。因此,有必要开发更准确和有效的方法来量化牲畜的温室气体。这项研究开发了一种基于遥感和人工智能(AI)的方法,用于量化乌干达中部Mubende地区Kisombwa牧场计划中牛的温室气体排放。我们训练了一个深度学习算法You Only Look Once (YOLO) v4,从研究区域的无人机(UAV)图像中检测牛,并应用Simple Online Real-time Tracker (SORT)算法进行自动计数。利用牛的数量和政府间气候变化专门委员会(IPCC)的一级指南,估算了粪便管理和肠道发酵产生的甲烷(CH4)和氧化亚氮(N2O)排放。估计总排放量为每年321,121.34千克二氧化碳当量(CO2eq),其中CH4为每年282,282.96千克二氧化碳当量(88%),N2O为每年38,838.38千克二氧化碳当量(12%)。肠道发酵的排放最高,约占CH4排放总量的99%和温室气体排放总量的87%。本文提出的遥感和人工智能驱动方法在图像测试集上的平均F1分数为88.9%,平均精度为97%,平均召回率为82.9%。因此,这些研究结果表明,遥感和人工智能是对可持续农业和减缓气候变化的动物种群和牲畜温室气体排放进行高级量化和报告的更有力和有效的方法。
{"title":"Quantification of greenhouse gas emissions from livestock using remote sensing & artificial intelligence","authors":"Evet Naturinda ,&nbsp;Fortunate Kemigyisha ,&nbsp;Anthony Gidudu ,&nbsp;Isa Kabenge ,&nbsp;Emmanuel Omia ,&nbsp;Jackline Aboth","doi":"10.1016/j.aiig.2025.100147","DOIUrl":"10.1016/j.aiig.2025.100147","url":null,"abstract":"<div><div>Greenhouse gases (GHGs) from agriculture in Africa are among the world's fastest-growing emissions, with the livestock sector as the primary contributor. However, the methods for quantifying these emissions rely on manual and outdated data collection and processing approaches. Therefore, there is a need to develop more accurate and efficient methods of quantifying GHGs from livestock. This research developed a remote sensing and Artificial Intelligence (AI) based approach to quantify GHG emissions from cattle in the Kisombwa Ranching Scheme in Mubende District, central Uganda.</div><div>We trained a deep learning algorithm, You Only Look Once (YOLO) v4, to detect cattle from the Unmanned Aerial Vehicle (UAV) images of the study area and applied the Simple Online Real-time Tracker (SORT) algorithm for automated counting. Methane (CH<sub>4)</sub> and Nitrous Oxide (N<sub>2</sub>O) emissions from manure management and enteric fermentation were estimated using the number of cattle and the Tier 1 guidelines from the Intergovernmental Panel on Climate Change (IPCC). The total estimated emissions were 321,121.34 kg carbon dioxide equivalent (CO<sub>2</sub>eq) per year, with CH<sub>4</sub> at 282,282.96 kg CO<sub>2</sub>eq per year (88 %) and N<sub>2</sub>O at 38,838.38 kg CO<sub>2</sub>eq per year (12 %). Enteric fermentation contributed the highest emissions, about 99 % of the total CH<sub>4</sub> emissions and 87 % of the total GHGs.</div><div>The proposed remote sensing and AI-driven method achieved an average F1 score of 88.9 %, average precision of 97 %, and average recall of 82.9 % on the testing set of images. Therefore, these research findings demonstrate that remote sensing and AI are a more potent and efficient approach to upscale quantifying and reporting animal population and livestock GHG emissions for sustainable agriculture and climate change mitigation.</div></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"6 2","pages":"Article 100147"},"PeriodicalIF":0.0,"publicationDate":"2025-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144702139","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Quantifying uncertainty in foraminifera classification: How deep learning methods compare to human experts 量化有孔虫分类中的不确定性:深度学习方法与人类专家的比较
Pub Date : 2025-07-16 DOI: 10.1016/j.aiig.2025.100145
Iver Martinsen , Steffen Aagaard Sørensen , Samuel Ortega , Fred Godtliebsen , Miguel Tejedor , Eirik Myrvoll-Nilsen
Foraminifera are shell-bearing microorganisms that are commonly found in marine deposits on the seabed. They are important indicators in many analyses, are used in climate change research, monitoring marine environments, evolutionary studies, and are also frequently used in the oil and gas industry. Although some research has focused on automating the classification of foraminifera images, few have addressed the uncertainty in these classifications. Although foraminifera classification is not a safety-critical task, estimating uncertainty is crucial to avoid misclassifications that could overlook rare and ecologically significant species that are informative indicators of the environment in which they lived. Uncertainty estimation in deep learning has gained significant attention and many methods have been developed. However, evaluating the performance of these methods in practical settings remains a challenge. To create a benchmark for uncertainty estimation in the classification of foraminifera, we administered a multiple choice questionnaire containing classification tasks to four senior geologists. By analyzing their responses, we generated human-derived uncertainty estimates for a test set of 260 images of foraminifera and sediment grains. These uncertainty estimates served as a baseline for comparison when training neural networks in classification. We then trained multiple deep neural networks using a range of uncertainty quantification methods to classify and state the uncertainty about the classifications. The results of the deep learning uncertainty quantification methods were then analyzed and compared with the human benchmark, to see how the methods performed individually and how the methods aligned with humans. Our results show that human-level performance can be achieved with deep learning and that test-time data augmentation and ensembling can help improve both uncertainty estimation and classification performance. Our results also show that human uncertainty estimates are helpful indicators for detecting classification errors and that deep learning-based uncertainty estimates can improve calibration and classification accuracy.
有孔虫是一种含壳微生物,通常在海底的海洋沉积物中发现。它们是许多分析中的重要指标,用于气候变化研究、海洋环境监测、进化研究,也经常用于石油和天然气工业。虽然一些研究集中在有孔虫图像的自动分类上,但很少有人解决这些分类中的不确定性。虽然有孔虫分类不是一项安全关键任务,但估计不确定性对于避免错误分类至关重要,因为错误分类可能会忽略稀有和具有生态意义的物种,这些物种是它们生活环境的信息指标。深度学习中的不确定性估计受到了广泛的关注,并开发了许多方法。然而,评估这些方法在实际环境中的性能仍然是一个挑战。为了在有孔虫分类中建立一个不确定性估计的基准,我们对四位高级地质学家进行了包含分类任务的多项选择问卷。通过分析他们的反应,我们对260张有孔虫和沉积物颗粒的测试图像产生了人为的不确定性估计。当训练神经网络进行分类时,这些不确定性估计作为比较的基线。然后,我们使用一系列不确定性量化方法训练多个深度神经网络来分类和说明分类的不确定性。然后对深度学习不确定性量化方法的结果进行分析,并与人类基准进行比较,以了解这些方法如何单独执行以及这些方法如何与人类一致。我们的研究结果表明,深度学习可以达到人类水平的性能,测试时间数据增强和集成可以帮助提高不确定性估计和分类性能。我们的研究结果还表明,人为的不确定性估计是检测分类错误的有用指标,基于深度学习的不确定性估计可以提高校准和分类精度。
{"title":"Quantifying uncertainty in foraminifera classification: How deep learning methods compare to human experts","authors":"Iver Martinsen ,&nbsp;Steffen Aagaard Sørensen ,&nbsp;Samuel Ortega ,&nbsp;Fred Godtliebsen ,&nbsp;Miguel Tejedor ,&nbsp;Eirik Myrvoll-Nilsen","doi":"10.1016/j.aiig.2025.100145","DOIUrl":"10.1016/j.aiig.2025.100145","url":null,"abstract":"<div><div>Foraminifera are shell-bearing microorganisms that are commonly found in marine deposits on the seabed. They are important indicators in many analyses, are used in climate change research, monitoring marine environments, evolutionary studies, and are also frequently used in the oil and gas industry. Although some research has focused on automating the classification of foraminifera images, few have addressed the uncertainty in these classifications. Although foraminifera classification is not a safety-critical task, estimating uncertainty is crucial to avoid misclassifications that could overlook rare and ecologically significant species that are informative indicators of the environment in which they lived. Uncertainty estimation in deep learning has gained significant attention and many methods have been developed. However, evaluating the performance of these methods in practical settings remains a challenge. To create a benchmark for uncertainty estimation in the classification of foraminifera, we administered a multiple choice questionnaire containing classification tasks to four senior geologists. By analyzing their responses, we generated human-derived uncertainty estimates for a test set of 260 images of foraminifera and sediment grains. These uncertainty estimates served as a baseline for comparison when training neural networks in classification. We then trained multiple deep neural networks using a range of uncertainty quantification methods to classify and state the uncertainty about the classifications. The results of the deep learning uncertainty quantification methods were then analyzed and compared with the human benchmark, to see how the methods performed individually and how the methods aligned with humans. Our results show that human-level performance can be achieved with deep learning and that test-time data augmentation and ensembling can help improve both uncertainty estimation and classification performance. Our results also show that human uncertainty estimates are helpful indicators for detecting classification errors and that deep learning-based uncertainty estimates can improve calibration and classification accuracy.</div></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"6 2","pages":"Article 100145"},"PeriodicalIF":0.0,"publicationDate":"2025-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144694746","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Explaining machine learning models trained to predict Copernicus DEM errors in different land cover environments 解释机器学习模型训练预测哥白尼DEM误差在不同的土地覆盖环境
Pub Date : 2025-07-15 DOI: 10.1016/j.aiig.2025.100141
Michael Meadows, Karin Reinke, Simon Jones
Machine learning models are increasingly used to correct the vertical biases (mainly due to vegetation and buildings) in global Digital Elevation Models (DEMs), for downstream applications which need “bare earth” elevations. The predictive accuracy of these models has improved significantly as more flexible model architectures are developed and new explanatory datasets produced, leading to the recent release of three model-corrected DEMs (FABDEM, DiluviumDEM and FathomDEM). However, there has been relatively little focus so far on explaining or interrogating these models, especially important in this context given their downstream impact on many other applications (including natural hazard simulations). In this study we train five separate models (by land cover environment) to correct vertical biases in the Copernicus DEM and then explain them using SHapley Additive exPlanation (SHAP) values. Comparing the models, we find significant variation in terms of the specific input variables selected and their relative importance, suggesting that an ensemble of models (specialising by land cover) is likely preferable to a general model applied everywhere. Visualising the patterns learned by the models (using SHAP dependence plots) provides further insights, building confidence in some cases (where patterns are consistent with domain knowledge and past studies) and highlighting potentially problematic variables in others (such as proxy relationships which may not apply in new application sites). Our results have implications for future DEM error prediction studies, particularly in evaluating a very wide range of potential input variables (160 candidates) drawn from topographic, multispectral, Synthetic Aperture Radar, vegetation, climate and urbanisation datasets.
机器学习模型越来越多地用于纠正全球数字高程模型(dem)中的垂直偏差(主要是由于植被和建筑物),用于需要“裸地”高程的下游应用。随着更灵活的模型架构的开发和新的解释性数据集的产生,这些模型的预测精度得到了显著提高,导致最近发布了三种模型修正的dem (FABDEM, DiluviumDEM和FathomDEM)。然而,到目前为止,对这些模型的解释或质疑相对较少,特别是考虑到它们对许多其他应用(包括自然灾害模拟)的下游影响,这些模型在这种情况下尤为重要。在这项研究中,我们训练了五个独立的模型(按土地覆盖环境)来纠正哥白尼DEM中的垂直偏差,然后使用SHapley加性解释(SHAP)值对它们进行解释。比较这些模型,我们发现在选择的特定输入变量及其相对重要性方面存在显著差异,这表明模型的集合(按土地覆盖专门划分)可能比到处应用的一般模型更可取。可视化模型学习的模式(使用SHAP依赖图)提供了进一步的见解,在某些情况下(模式与领域知识和过去的研究一致)建立信心,并突出显示其他情况下潜在的问题变量(例如可能不适用于新应用程序站点的代理关系)。我们的研究结果对未来的DEM误差预测研究具有重要意义,特别是在评估从地形、多光谱、合成孔径雷达、植被、气候和城市化数据集提取的非常广泛的潜在输入变量(160个候选变量)方面。
{"title":"Explaining machine learning models trained to predict Copernicus DEM errors in different land cover environments","authors":"Michael Meadows,&nbsp;Karin Reinke,&nbsp;Simon Jones","doi":"10.1016/j.aiig.2025.100141","DOIUrl":"10.1016/j.aiig.2025.100141","url":null,"abstract":"<div><div>Machine learning models are increasingly used to correct the vertical biases (mainly due to vegetation and buildings) in global Digital Elevation Models (DEMs), for downstream applications which need “bare earth” elevations. The predictive accuracy of these models has improved significantly as more flexible model architectures are developed and new explanatory datasets produced, leading to the recent release of three model-corrected DEMs (FABDEM, DiluviumDEM and FathomDEM). However, there has been relatively little focus so far on explaining or interrogating these models, especially important in this context given their downstream impact on many other applications (including natural hazard simulations). In this study we train five separate models (by land cover environment) to correct vertical biases in the Copernicus DEM and then explain them using SHapley Additive exPlanation (SHAP) values. Comparing the models, we find significant variation in terms of the specific input variables selected and their relative importance, suggesting that an ensemble of models (specialising by land cover) is likely preferable to a general model applied everywhere. Visualising the patterns learned by the models (using SHAP dependence plots) provides further insights, building confidence in some cases (where patterns are consistent with domain knowledge and past studies) and highlighting potentially problematic variables in others (such as proxy relationships which may not apply in new application sites). Our results have implications for future DEM error prediction studies, particularly in evaluating a very wide range of potential input variables (160 candidates) drawn from topographic, multispectral, Synthetic Aperture Radar, vegetation, climate and urbanisation datasets.</div></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"6 2","pages":"Article 100141"},"PeriodicalIF":0.0,"publicationDate":"2025-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144663596","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Generating high-resolution climate data in the Andes using artificial intelligence: A lightweight alternative to the WRF model 使用人工智能在安第斯山脉生成高分辨率气候数据:WRF模型的轻量级替代方案
Pub Date : 2025-07-13 DOI: 10.1016/j.aiig.2025.100143
Christian Carhuancho , Edwin Villanueva , Christian Yarleque , Romel Erick Principe , Marcia Castromonte
In weather forecasting, generating atmospheric variables for regions with complex topography, such as the Andean regions with peaks reaching 6500 m above sea level, poses significant challenges. Traditional regional climate models often struggle to accurately represent the atmospheric behavior in such areas. Furthermore, the capability to produce high spatio-temporal resolution data (less than 27 km and hourly) is limited to a few institutions globally due to the substantial computational resources required. This study presents the results of atmospheric data generated using a new type of artificial intelligence (AI) models, aimed to reduce the computational cost of generating downscaled climate data using climate regional models like the Weather Research and Forecasting (WRF) model over the Andes. The WRF model was selected for this comparison due to its frequent use in simulating atmospheric variables in the Andes.
Our results demonstrate a higher downscaling performance for the four target weather variables studied (temperature, relative humidity, zonal and meridional wind) over coastal, mountain, and jungle regions. Moreover, this AI model offers several advantages, including lower computational costs compared to dynamic models like WRF and continuous improvement potential with additional training data.
在天气预报中,为地形复杂的地区(如海拔6500米的安第斯地区)生成大气变量是一项重大挑战。传统的区域气候模式往往难以准确地表示这些地区的大气行为。此外,由于需要大量的计算资源,产生高时空分辨率数据(每小时少于27公里)的能力仅限于全球少数机构。本研究展示了使用新型人工智能(AI)模型生成的大气数据的结果,旨在降低使用气候区域模型(如安第斯山脉的天气研究与预报(WRF)模型)生成缩小比例气候数据的计算成本。之所以选择WRF模式进行比较,是因为它经常用于模拟安第斯山脉的大气变量。我们的研究结果表明,在沿海、山区和丛林地区,研究的四个目标天气变量(温度、相对湿度、纬向风和经向风)具有更高的降尺度性能。此外,该人工智能模型具有几个优势,包括与WRF等动态模型相比,计算成本更低,并且具有额外训练数据的持续改进潜力。
{"title":"Generating high-resolution climate data in the Andes using artificial intelligence: A lightweight alternative to the WRF model","authors":"Christian Carhuancho ,&nbsp;Edwin Villanueva ,&nbsp;Christian Yarleque ,&nbsp;Romel Erick Principe ,&nbsp;Marcia Castromonte","doi":"10.1016/j.aiig.2025.100143","DOIUrl":"10.1016/j.aiig.2025.100143","url":null,"abstract":"<div><div>In weather forecasting, generating atmospheric variables for regions with complex topography, such as the Andean regions with peaks reaching 6500 m above sea level, poses significant challenges. Traditional regional climate models often struggle to accurately represent the atmospheric behavior in such areas. Furthermore, the capability to produce high spatio-temporal resolution data (less than 27 km and hourly) is limited to a few institutions globally due to the substantial computational resources required. This study presents the results of atmospheric data generated using a new type of artificial intelligence (AI) models, aimed to reduce the computational cost of generating downscaled climate data using climate regional models like the Weather Research and Forecasting (WRF) model over the Andes. The WRF model was selected for this comparison due to its frequent use in simulating atmospheric variables in the Andes.</div><div>Our results demonstrate a higher downscaling performance for the four target weather variables studied (temperature, relative humidity, zonal and meridional wind) over coastal, mountain, and jungle regions. Moreover, this AI model offers several advantages, including lower computational costs compared to dynamic models like WRF and continuous improvement potential with additional training data.</div></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"6 2","pages":"Article 100143"},"PeriodicalIF":0.0,"publicationDate":"2025-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144653503","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Machine learning assisted estimation of total solids content of drilling fluids 机器学习辅助估计钻井液的总固体含量
Pub Date : 2025-07-05 DOI: 10.1016/j.aiig.2025.100138
B.T. Gunel , Y.D. Pak , A.Ö. Herekeli , S. Gül , B. Kulga , E. Artun
Characterization and optimization of physical and chemical properties of drilling fluids are critical for the efficiency and success of drilling operations. In particular, maintaining the optimal levels of solids content is essential for achieving the most effective fluid performance. Proper management of solids content also reduces the risk of tool failures. Traditional solids content analysis methods, such as retort analysis, require substantial human intervention and time, which can lead to inaccuracies, time-management issues, and increased operational risks. In contrast to human-intensive methods, machine learning may offer a viable alternative for solids content estimation due to its pattern-recognition capability. In this study, a large set of laboratory reports of drilling-fluid analyses from 130 oil wells around the world were compiled to construct a comprehensive data set. The relationships among various rheological parameters were analyzed using statistical methods and machine learning algorithms. Several machine learning algorithms of diverse classes, namely linear (linear regression, ridge regression, and ElasticNet regression), kernel-based (support vector machine) and ensemble tree-based (gradient boosting, XGBoost, and random forests) algorithms, were trained and tuned to estimate solids content from other readily available drilling fluid properties. Input variables were kept consistent across all models for interpretation and comparison purposes. In the final stage, different evaluation metrics were employed to evaluate and compare the performance of different classes of machine learning models. Among all algorithms tested, random forests algorithm was found to be the best predictive model resulting in consistently high accuracy. Further optimization of the random forests model resulted in a mean absolute percentage error (MAPE) of 3.9% and 9.6% and R2 of 0.99 and 0.93 for the training and testing sets, respectively. Analysis of residuals, their histograms and Q-Q normality plots showed Gaussian distributions with residuals that are scattered around a mean of zero within error ranges of ±1% and ±4%, for training and testing, respectively. The selected model was further validated by applying the rheological measurements from mud samples taken from an offshore well from the Gulf of Mexico. The model was able to estimate total solids content in those four mud samples with an average absolute error of 1.08% of total solids content. The model was then used to develop a web-based graphical-user-interface (GUI) application, which can be practically used at the rig site by engineers to optimize drilling fluid programs. The proposed model can complement automation workflows that are designed to measure fundamental rheological properties in real time during drilling operations. While a st
钻井液物理化学性质的表征和优化对钻井作业的效率和成功至关重要。特别是,保持最佳固体含量水平对于实现最有效的流体性能至关重要。适当的固体含量管理也降低了工具故障的风险。传统的固体含量分析方法,如蒸馏器分析,需要大量的人工干预和时间,这可能导致不准确,时间管理问题,并增加操作风险。与人工密集型方法相比,机器学习由于其模式识别能力,可以为固体含量估计提供可行的替代方案。在这项研究中,收集了大量来自世界各地130口油井的钻井液分析实验室报告,构建了一个全面的数据集。利用统计方法和机器学习算法分析了各流变参数之间的关系。几种不同类型的机器学习算法,即线性(线性回归、脊回归和ElasticNet回归)、基于核(支持向量机)和基于集成树(梯度增强、XGBoost和随机森林)算法,经过训练和调整,可以从其他可用的钻井液性质中估计固体含量。为了解释和比较的目的,所有模型的输入变量保持一致。在最后阶段,采用不同的评估指标来评估和比较不同类别的机器学习模型的性能。在所有被测试的算法中,随机森林算法是最好的预测模型,具有较高的准确率。进一步优化随机森林模型,训练集和测试集的平均绝对百分比误差(MAPE)分别为3.9%和9.6%,R2分别为0.99和0.93。残差分析,其直方图和Q-Q正态图显示高斯分布,残差分散在平均值零附近,误差范围分别为±1%和±4%,用于训练和测试。通过对墨西哥湾海上油井的泥浆样品进行流变测量,进一步验证了所选模型的有效性。该模型能够估计出这4种泥浆样品中的总固体含量,平均绝对误差为总固体含量的1.08%。然后,该模型被用于开发基于web的图形用户界面(GUI)应用程序,工程师可以在钻井现场实际使用该应用程序来优化钻井液方案。所提出的模型可以补充自动化工作流程,旨在实时测量钻井作业中的基本流变性能。虽然标准的油罐测试在钻井现场大约需要2小时,但这种实时评估可以帮助钻井人员及时优化钻井液,单台钻机每年可节省2920个工时。
{"title":"Machine learning assisted estimation of total solids content of drilling fluids","authors":"B.T. Gunel ,&nbsp;Y.D. Pak ,&nbsp;A.Ö. Herekeli ,&nbsp;S. Gül ,&nbsp;B. Kulga ,&nbsp;E. Artun","doi":"10.1016/j.aiig.2025.100138","DOIUrl":"10.1016/j.aiig.2025.100138","url":null,"abstract":"<div><div>Characterization and optimization of physical and chemical properties of drilling fluids are critical for the efficiency and success of drilling operations. In particular, maintaining the optimal levels of solids content is essential for achieving the most effective fluid performance. Proper management of solids content also reduces the risk of tool failures. Traditional solids content analysis methods, such as retort analysis, require substantial human intervention and time, which can lead to inaccuracies, time-management issues, and increased operational risks. In contrast to human-intensive methods, machine learning may offer a viable alternative for solids content estimation due to its pattern-recognition capability. In this study, a large set of laboratory reports of drilling-fluid analyses from 130 oil wells around the world were compiled to construct a comprehensive data set. The relationships among various rheological parameters were analyzed using statistical methods and machine learning algorithms. Several machine learning algorithms of diverse classes, namely linear (linear regression, ridge regression, and ElasticNet regression), kernel-based (support vector machine) and ensemble tree-based (gradient boosting, XGBoost, and random forests) algorithms, were trained and tuned to estimate solids content from other readily available drilling fluid properties. Input variables were kept consistent across all models for interpretation and comparison purposes. In the final stage, different evaluation metrics were employed to evaluate and compare the performance of different classes of machine learning models. Among all algorithms tested, random forests algorithm was found to be the best predictive model resulting in consistently high accuracy. Further optimization of the random forests model resulted in a mean absolute percentage error (MAPE) of 3.9% and 9.6% and R<span><math><msup><mrow></mrow><mrow><mn>2</mn></mrow></msup></math></span> of 0.99 and 0.93 for the training and testing sets, respectively. Analysis of residuals, their histograms and Q-Q normality plots showed Gaussian distributions with residuals that are scattered around a mean of zero within error ranges of <span><math><mo>±</mo></math></span>1% and <span><math><mo>±</mo></math></span>4%, for training and testing, respectively. The selected model was further validated by applying the rheological measurements from mud samples taken from an offshore well from the Gulf of Mexico. The model was able to estimate total solids content in those four mud samples with an average absolute error of 1.08% of total solids content. The model was then used to develop a web-based graphical-user-interface (GUI) application, which can be practically used at the rig site by engineers to optimize drilling fluid programs. The proposed model can complement automation workflows that are designed to measure fundamental rheological properties in real time during drilling operations. While a st","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"6 2","pages":"Article 100138"},"PeriodicalIF":0.0,"publicationDate":"2025-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144581266","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Improved estimation of two-phase capillary pressure with nuclear magnetic resonance measurements via machine learning 基于机器学习的核磁共振测量改进的两相毛细管压力估计
Pub Date : 2025-07-05 DOI: 10.1016/j.aiig.2025.100144
Oriyomi Raheem , Misael M. Morales , Wen Pan , Carlos Torres-Verdín
Capillary pressure plays a crucial role in determining the spatial distribution of oil and gas, particularly in medium-to-low permeability reservoirs, where it is closely linked to the rock's pore structure and wettability. In these environments, pore structure is the primary factor influencing capillary pressure, with different pore types affecting fluid transport through varying degrees of hydrocarbon saturation. One of the main challenges in characterizing pore structure is how to use data from core plugs to establish a relationship with microscopic pore and throat properties, enabling more accurate predictions of capillary pressure. While special core analysis laboratory experiments are effective, they are time-consuming and expensive. In contrast, nuclear magnetic resonance (NMR) measurements, which provide information on pore body size distribution, are faster and can be leveraged to estimate capillary pressure using machine learning algorithms. Recently, artificial intelligence methods have also been applied to capillary pressure prediction (Qi et al., 2024).
Currently, no readily applicable predictive model exists for estimating an entire capillary pressure curve directly from standard petrophysical logs and core data. Although porescale imaging and network modeling techniques can compute capillary pressure from micro-CT rock images (Øren and Bakke, 2003; Valvatne and Blunt, 2004), these approaches are time-consuming, limited to small sample volumes, and not yet practical for routine reservoir evaluation. In this study, we introduce rock classification techniques and implement a data-driven machine learning (ML) method to estimate saturation-dependent capillary pressure from core petrophysical properties. The new model integrates cumulative NMR data and densely resampled core measurements as training data, with prediction errors quantified throughout the process. To approach the common condition of sparsely sampled training data, we transformed the prediction problem into an overdetermined one by applying composite fitting to both capillary pressure and pore throat size distribution, and Gaussian cumulative distribution fitting to the NMR T2 measurements, generating evenly sampled data points. Using these preprocessed input features, we performed classification based on the natural logarithm of the permeability-to-porosity ratio (ln(k/ϕ)) to cluster distinct rock types. For each rock class, we applied regression techniques—such as random forest (RF), k-nearest neighbors (k-NN), extreme gradient boosting (XGB), and artificial neural networks (ANN)—to estimate the logarithm of capillary pressure. The methods were tested on blind core samples, and performance comparisons among different estimation methods
毛管压力对油气的空间分布起着至关重要的作用,特别是在中低渗透储层中,毛管压力与岩石的孔隙结构和润湿性密切相关。在这些环境中,孔隙结构是影响毛管压力的主要因素,不同孔隙类型通过不同程度的烃饱和度影响流体的输运。表征孔隙结构的主要挑战之一是如何利用岩心桥塞的数据建立微观孔隙和喉道特性的关系,从而更准确地预测毛管压力。虽然特殊的岩心分析实验室实验是有效的,但它们耗时且昂贵。相比之下,核磁共振(NMR)测量可以提供孔体大小分布的信息,速度更快,并且可以利用机器学习算法来估计毛细管压力。最近,人工智能方法也被应用于毛细管压力预测(Qi et al., 2024)。目前,还没有现成的预测模型可以直接从标准岩石物理测井和岩心数据中估计整个毛管压力曲线。虽然孔隙尺度成像和网络建模技术可以从微ct岩石图像中计算毛细压力(Øren和Bakke, 2003;Valvatne和Blunt, 2004),这些方法耗时长,仅限于小样本量,还不能用于常规储层评价。在这项研究中,我们引入了岩石分类技术,并实现了一种数据驱动的机器学习(ML)方法,通过岩心岩石物理性质来估计与饱和度相关的毛管压力。新模型将累积核磁共振数据和密集重采样的岩心测量数据作为训练数据,并在整个过程中量化预测误差。为了接近稀疏采样训练数据的常见情况,我们通过对毛细管压力和孔喉大小分布进行复合拟合,并对NMR T2测量值进行高斯累积分布拟合,将预测问题转化为过确定问题,生成均匀采样的数据点。利用这些预处理的输入特征,我们根据渗透率-孔隙度比(ln(k/ϕ))的自然对数进行分类,以聚类不同的岩石类型。对于每个岩石类别,我们应用回归技术——如随机森林(RF)、k近邻(k-NN)、极端梯度增强(XGB)和人工神经网络(ANN)——来估计毛细管压力的对数。对盲岩心样本进行了测试,并基于预测的相对标准误差对不同估计方法进行了性能比较。结果表明,核磁共振数据对岩石孔隙结构较为敏感,对毛细管压力和孔喉大小分布的预测有显著改善。对于毛细管压力和孔喉大小分布,极端梯度增强和随机森林模型的平均估计误差分别为5%和10%,表现最好。相比之下,当NMR T2数据被排除作为输入特征时,预测误差增加到25%。使用传统的高斯模型拟合和更高分辨率的重采样确保了训练数据覆盖了广泛的变异性。将核磁共振T2数据作为输入特征增强了模型捕捉非常规岩石中多峰的能力,使预测问题过度确定。通过向量输入特征预测向量函数,有效降低了预测误差。该解释工作流程可用于构建具有代表性的分类模型,并在广泛的饱和度范围内估计毛细管压力。
{"title":"Improved estimation of two-phase capillary pressure with nuclear magnetic resonance measurements via machine learning","authors":"Oriyomi Raheem ,&nbsp;Misael M. Morales ,&nbsp;Wen Pan ,&nbsp;Carlos Torres-Verdín","doi":"10.1016/j.aiig.2025.100144","DOIUrl":"10.1016/j.aiig.2025.100144","url":null,"abstract":"<div><div>Capillary pressure plays a crucial role in determining the spatial distribution of oil and gas, particularly in medium-to-low permeability reservoirs, where it is closely linked to the rock's pore structure and wettability. In these environments, pore structure is the primary factor influencing capillary pressure, with different pore types affecting fluid transport through varying degrees of hydrocarbon saturation. One of the main challenges in characterizing pore structure is how to use data from core plugs to establish a relationship with microscopic pore and throat properties, enabling more accurate predictions of capillary pressure. While special core analysis laboratory experiments are effective, they are time-consuming and expensive. In contrast, nuclear magnetic resonance (NMR) measurements, which provide information on pore body size distribution, are faster and can be leveraged to estimate capillary pressure using machine learning algorithms. Recently, artificial intelligence methods have also been applied to capillary pressure prediction (Qi et al., 2024).</div><div>Currently, no readily applicable predictive model exists for estimating an entire capillary pressure curve directly from standard petrophysical logs and core data. Although porescale imaging and network modeling techniques can compute capillary pressure from micro-CT rock images (Øren and Bakke, 2003; Valvatne and Blunt, 2004), these approaches are time-consuming, limited to small sample volumes, and not yet practical for routine reservoir evaluation. In this study, we introduce rock classification techniques and implement a data-driven machine learning (ML) method to estimate saturation-dependent capillary pressure from core petrophysical properties. The new model integrates cumulative NMR data and densely resampled core measurements as training data, with prediction errors quantified throughout the process. To approach the common condition of sparsely sampled training data, we transformed the prediction problem into an overdetermined one by applying composite fitting to both capillary pressure and pore throat size distribution, and Gaussian cumulative distribution fitting to the NMR <span><math><mrow><msub><mi>T</mi><mn>2</mn></msub></mrow></math></span> measurements, generating evenly sampled data points. Using these preprocessed input features, we performed classification based on the natural logarithm of the permeability-to-porosity ratio <span><math><mrow><mo>(</mo><mrow><mi>ln</mi><mrow><mo>(</mo><mrow><mi>k</mi><mo>/</mo><mi>ϕ</mi></mrow><mo>)</mo></mrow></mrow><mo>)</mo></mrow></math></span> to cluster distinct rock types. For each rock class, we applied regression techniques—such as random forest (RF), k-nearest neighbors (k-NN), extreme gradient boosting (XGB), and artificial neural networks (ANN)—to estimate the logarithm of capillary pressure. The methods were tested on blind core samples, and performance comparisons among different estimation methods ","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"6 2","pages":"Article 100144"},"PeriodicalIF":0.0,"publicationDate":"2025-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144633212","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep learning approaches for estimating maximum wall deflection in excavations with inconsistent clay stratigraphy 不一致粘土地层条件下挖掘最大壁挠度估计的深度学习方法
Pub Date : 2025-07-04 DOI: 10.1016/j.aiig.2025.100140
Vinh V. Le , HongGiang Nguyen , Nguyen Huu Ngu
This paper presents a deep learning architecture combined with exploratory data analysis to estimate maximum wall deflection in deep excavations. Six major geotechnical parameters were studied. Statistical methods, such as pair plots and Pearson correlation, highlighted excavation depth (correlation coefficient = 0.82) as the most significant factor. For method prediction, five deep learning models (CNN, LSTM, BiLSTM, CNN-LSTM, and CNN-BiLSTM) were built. The CNN-BiLSTM model excelled in training performance (R2 = 0.98, RMSE = 0.02), while BiLSTM reached superior testing results (R2 = 0.85, RMSE = 0.06), suggesting greater generalization ability. Based on the feature importance analysis from model weights, excavation depth, stiffness ratio, and bracing spacing were ranked as the highest contributors. This point verified a lack of prediction bias on residual plots and high model agreement with measured values on Taylor diagrams (correlation coefficient 0.92). The effectiveness of integrated techniques was reliably assured for predicting wall deformation. This approach facilitates more accurate and efficient geotechnical design and provides engineers with improved tools for risk evaluation and decision-making in deep excavation projects.
本文提出了一种结合探索性数据分析的深度学习体系结构,用于估计深基坑中墙体的最大挠度。研究了6个主要岩土参数。通过配对图和Pearson相关等统计方法,挖掘深度(相关系数= 0.82)是最显著的影响因素。在方法预测方面,建立了CNN、LSTM、BiLSTM、CNN-LSTM和CNN-BiLSTM五个深度学习模型。CNN-BiLSTM模型具有较好的训练性能(R2 = 0.98, RMSE = 0.02),而BiLSTM模型具有较好的测试结果(R2 = 0.85, RMSE = 0.06),具有较强的泛化能力。基于模型权重特征重要性分析,挖掘深度、刚度比和支撑间距是影响最大的因素。这一点证实了残差图上没有预测偏差,模型与泰勒图上的实测值高度吻合(相关系数0.92)。综合技术的有效性为预测围岩变形提供了可靠的保证。该方法有助于提高岩土工程设计的准确性和效率,为深基坑工程的风险评估和决策提供了改进的工具。
{"title":"Deep learning approaches for estimating maximum wall deflection in excavations with inconsistent clay stratigraphy","authors":"Vinh V. Le ,&nbsp;HongGiang Nguyen ,&nbsp;Nguyen Huu Ngu","doi":"10.1016/j.aiig.2025.100140","DOIUrl":"10.1016/j.aiig.2025.100140","url":null,"abstract":"<div><div>This paper presents a deep learning architecture combined with exploratory data analysis to estimate maximum wall deflection in deep excavations. Six major geotechnical parameters were studied. Statistical methods, such as pair plots and Pearson correlation, highlighted excavation depth (correlation coefficient = 0.82) as the most significant factor. For method prediction, five deep learning models (CNN, LSTM, BiLSTM, CNN-LSTM, and CNN-BiLSTM) were built. The CNN-BiLSTM model excelled in training performance (R<sup>2</sup> = 0.98, RMSE = 0.02), while BiLSTM reached superior testing results (R<sup>2</sup> = 0.85, RMSE = 0.06), suggesting greater generalization ability. Based on the feature importance analysis from model weights, excavation depth, stiffness ratio, and bracing spacing were ranked as the highest contributors. This point verified a lack of prediction bias on residual plots and high model agreement with measured values on Taylor diagrams (correlation coefficient 0.92). The effectiveness of integrated techniques was reliably assured for predicting wall deformation. This approach facilitates more accurate and efficient geotechnical design and provides engineers with improved tools for risk evaluation and decision-making in deep excavation projects.</div></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"6 2","pages":"Article 100140"},"PeriodicalIF":0.0,"publicationDate":"2025-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144581267","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cellular automata models for simulation and prediction of urban land use change: Development and prospects 城市土地利用变化模拟与预测的元胞自动机模型:发展与展望
Pub Date : 2025-06-30 DOI: 10.1016/j.aiig.2025.100142
Baoling Gui, Anshuman Bhardwaj, Lydia Sam
Rapid urbanization and land-use changes are placing immense pressure on resources, infrastructure, and environmental sustainability. To address these, accurate urban simulation models are essential for sustainable development and governance. Among them, Cellular Automata (CA) models have become key tools for predicting urban expansion, optimizing land-use planning, and supporting data-driven decision-making. This review provides a comprehensive examination of the development of urban cellular automata (UCA) models, presenting a new framework to enhance individual UCA sub-modules within the context of emerging technologies, sustainable environments, and public governance. By addressing gaps in prior UCA modelling reviews—particularly in the integration and optimization of UCA sub-module technologies—this framework is designed to simplify UCA model understanding and development. We systematically review pioneering case studies, deconstruct current UCA operational processes, and explore modern technologies, such as big data and artificial intelligence, to optimize these sub-modules further. We discuss current limitations within UCA models and propose future pathways, emphasizing the necessity of comprehensive analyses for effective UCA simulations. Proposed solutions include strengthening our understanding of urban growth mechanisms, examining spatial positioning and temporal evolution dynamics, and enhancing urban geographic simulations with deep learning techniques to support sustainable transitions in public governance. These improvements offer data-driven decision support for environmental management, advancing policies that foster sustainable urban development.
快速城市化和土地利用变化给资源、基础设施和环境可持续性带来巨大压力。为了解决这些问题,精确的城市模拟模型对于可持续发展和治理至关重要。其中,元胞自动机(CA)模型已成为预测城市扩张、优化土地利用规划和支持数据驱动决策的关键工具。这篇综述对城市元胞自动机(UCA)模型的发展进行了全面的研究,提出了一个新的框架,在新兴技术、可持续环境和公共治理的背景下增强单个UCA子模块。通过解决先前UCA建模审查中的差距,特别是在UCA子模块技术的集成和优化方面,该框架旨在简化UCA模型的理解和开发。我们系统地回顾开创性的案例研究,解构当前UCA的操作流程,并探索现代技术,如大数据和人工智能,以进一步优化这些子模块。我们讨论了当前UCA模型的局限性,并提出了未来的途径,强调了对有效的UCA模拟进行综合分析的必要性。建议的解决方案包括加强我们对城市增长机制的理解,研究空间定位和时间演变动态,以及利用深度学习技术加强城市地理模拟,以支持公共治理的可持续转型。这些改进为环境管理提供了数据驱动的决策支持,推进了促进可持续城市发展的政策。
{"title":"Cellular automata models for simulation and prediction of urban land use change: Development and prospects","authors":"Baoling Gui,&nbsp;Anshuman Bhardwaj,&nbsp;Lydia Sam","doi":"10.1016/j.aiig.2025.100142","DOIUrl":"10.1016/j.aiig.2025.100142","url":null,"abstract":"<div><div>Rapid urbanization and land-use changes are placing immense pressure on resources, infrastructure, and environmental sustainability. To address these, accurate urban simulation models are essential for sustainable development and governance. Among them, Cellular Automata (CA) models have become key tools for predicting urban expansion, optimizing land-use planning, and supporting data-driven decision-making. This review provides a comprehensive examination of the development of urban cellular automata (UCA) models, presenting a new framework to enhance individual UCA sub-modules within the context of emerging technologies, sustainable environments, and public governance. By addressing gaps in prior UCA modelling reviews—particularly in the integration and optimization of UCA sub-module technologies—this framework is designed to simplify UCA model understanding and development. We systematically review pioneering case studies, deconstruct current UCA operational processes, and explore modern technologies, such as big data and artificial intelligence, to optimize these sub-modules further. We discuss current limitations within UCA models and propose future pathways, emphasizing the necessity of comprehensive analyses for effective UCA simulations. Proposed solutions include strengthening our understanding of urban growth mechanisms, examining spatial positioning and temporal evolution dynamics, and enhancing urban geographic simulations with deep learning techniques to support sustainable transitions in public governance. These improvements offer data-driven decision support for environmental management, advancing policies that foster sustainable urban development.</div></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"6 2","pages":"Article 100142"},"PeriodicalIF":0.0,"publicationDate":"2025-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144518220","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Artificial Intelligence in Geosciences
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1