Pub Date : 2025-06-01Epub Date: 2025-05-17DOI: 10.1016/j.aiig.2025.100118
Stalyn Paucar, Christian Mejia-Escobar, Victor Collaguazo
The identification and characterization of rock types is a core activity in geology and related fields, including mining, petroleum, environmental science, industry, and construction. Traditionally, this task is performed by human specialists who analyze and describe the type, composition, texture, shape, and other properties of rock samples, whether collected in-situ or prepared in a laboratory. However, the process is subjective, dependent on the specialist’s experience, and time-consuming. This study proposes an artificial intelligence-based approach that combines computer vision and natural language processing to generate both textual and verbal descriptions from images of rock thin sections. A dataset of images and corresponding textual descriptions is used to train a hybrid deep learning model. Features extracted from the images using EfficientNetB7 are processed by a Transformer network to generate textual descriptions, which are then converted into verbal responses using a speech synthesis service. The experimental results show an accuracy of 0.892 and a BLEU score of 0.71. This model offers potential utility for research, professional, and academic applications and has been deployed as a web application for public use.
{"title":"Automatic description of rock thin sections: A web application","authors":"Stalyn Paucar, Christian Mejia-Escobar, Victor Collaguazo","doi":"10.1016/j.aiig.2025.100118","DOIUrl":"10.1016/j.aiig.2025.100118","url":null,"abstract":"<div><div>The identification and characterization of rock types is a core activity in geology and related fields, including mining, petroleum, environmental science, industry, and construction. Traditionally, this task is performed by human specialists who analyze and describe the type, composition, texture, shape, and other properties of rock samples, whether collected in-situ or prepared in a laboratory. However, the process is subjective, dependent on the specialist’s experience, and time-consuming. This study proposes an artificial intelligence-based approach that combines computer vision and natural language processing to generate both textual and verbal descriptions from images of rock thin sections. A dataset of images and corresponding textual descriptions is used to train a hybrid deep learning model. Features extracted from the images using EfficientNetB7 are processed by a Transformer network to generate textual descriptions, which are then converted into verbal responses using a speech synthesis service. The experimental results show an accuracy of 0.892 and a BLEU score of 0.71. This model offers potential utility for research, professional, and academic applications and has been deployed as a web application for public use.</div></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"6 1","pages":"Article 100118"},"PeriodicalIF":0.0,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144212094","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-06-01Epub Date: 2025-02-26DOI: 10.1016/j.aiig.2025.100110
A. Bolève, R. Eddies, M. Staring, Y. Benboudiaf, H. Pournaki, M. Nepveaux
Geotechnical parameters derived from an intrusive cone penetration test (CPT) are used to asses mechanical properties to inform the design phase of infrastructure projects. However, local, in situ 1D measurements can fail to capture 3D subsurface variations, which could mean less than optimal design decisions for foundation engineering. By coupling the localised measurements from CPTs with more global 3D measurements derived from geophysical methods, a higher fidelity 3D overview of the subsurface can be obtained. Machine Learning (ML) may offer an effective means to capture all types of geophysical information associated with CPT data at a site scale to build a 2D or 3D ground model. In this paper, we present an ML approach to build a 3D ground model of cone resistance and sleeve friction by combining several CPT measurements with Multichannel Analysis of Surface Waves (MASW) and Electrical Resistivity Tomography (ERT) data on a land site characterisation project in the United Arab Emirates (UAE). To avoid a potential overfitting problem inherent to the use of machine learning and a lack of data at certain locations, we explore the possibility of using a prior Geo-Statistical (GS) approach that attempts to constrain the overfitting process by “artificially” increasing the amount of input data. A sensitivity study is also performed on input features used to train the ML algorithm to better define the optimal combination of input features for the prediction. Our results showed that ERT data were not useful in capturing 3D variations of geotechnical properties compared to Vs due to the geographical location of the site (200 m east from the Oman Gulf) and the possible effect of saline water intrusion. Additionally, we demonstrate that the use of a prior GS phase could be a promising and interesting means to make the prediction of ground properties more robust, especially for this specific case study described in this paper. Looking ahead, better representation of the subsurface can lead to a number of benefits for stakeholders involved in developing assets. Better ground/geotechnical models mean better site calibration of design methods and fewer design assumptions for reliability-based design, creating an opportunity for value engineering in the form of lighter construction without compromising safety, shorter construction timelines, and reduced resource requirements.
{"title":"Innovative cone resistance and sleeve friction prediction from geophysics based on a coupled geo-statistical and machine learning process","authors":"A. Bolève, R. Eddies, M. Staring, Y. Benboudiaf, H. Pournaki, M. Nepveaux","doi":"10.1016/j.aiig.2025.100110","DOIUrl":"10.1016/j.aiig.2025.100110","url":null,"abstract":"<div><div>Geotechnical parameters derived from an intrusive cone penetration test (CPT) are used to asses mechanical properties to inform the design phase of infrastructure projects. However, local, in situ 1D measurements can fail to capture 3D subsurface variations, which could mean less than optimal design decisions for foundation engineering. By coupling the localised measurements from CPTs with more global 3D measurements derived from geophysical methods, a higher fidelity 3D overview of the subsurface can be obtained. Machine Learning (ML) may offer an effective means to capture all types of geophysical information associated with CPT data at a site scale to build a 2D or 3D ground model. In this paper, we present an ML approach to build a 3D ground model of cone resistance and sleeve friction by combining several CPT measurements with Multichannel Analysis of Surface Waves (MASW) and Electrical Resistivity Tomography (ERT) data on a land site characterisation project in the United Arab Emirates (UAE). To avoid a potential overfitting problem inherent to the use of machine learning and a lack of data at certain locations, we explore the possibility of using a prior Geo-Statistical (GS) approach that attempts to constrain the overfitting process by “artificially” increasing the amount of input data. A sensitivity study is also performed on input features used to train the ML algorithm to better define the optimal combination of input features for the prediction. Our results showed that ERT data were not useful in capturing 3D variations of geotechnical properties compared to V<sub>s</sub> due to the geographical location of the site (200 m east from the Oman Gulf) and the possible effect of saline water intrusion. Additionally, we demonstrate that the use of a prior GS phase could be a promising and interesting means to make the prediction of ground properties more robust, especially for this specific case study described in this paper. Looking ahead, better representation of the subsurface can lead to a number of benefits for stakeholders involved in developing assets. Better ground/geotechnical models mean better site calibration of design methods and fewer design assumptions for reliability-based design, creating an opportunity for value engineering in the form of lighter construction without compromising safety, shorter construction timelines, and reduced resource requirements.</div></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"6 1","pages":"Article 100110"},"PeriodicalIF":0.0,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143526930","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-06-01Epub Date: 2025-05-20DOI: 10.1016/j.aiig.2025.100122
Gamze Maden Muftuoglu , Kaveh Dehghanian
Liquefaction is one of the prominent factors leading to damage to soil and structures. In this study, the relationship between liquefaction potential and soil parameters is determined by applying feature importance methods to Random Forest (RF), Logistic Regression (LR), Multilayer Perceptron (MLP), Support Vector Machine (SVM) and eXtreme Gradient Boosting (XGBoost) algorithms. Feature importance methods consist of permutation and Shapley Additive exPlanations (SHAP) importances along with the used model's built-in feature importance method if it exists. These suggested approaches incorporate an extensive dataset of geotechnical parameters, historical liquefaction events, and soil properties. The feature set comprises 18 parameters that are gathered from 161 field cases. Algorithms are used to determine the optimum performance feature set. Compared to other approaches, the study assesses how well these algorithms predict soil liquefaction potential. Early findings show that the algorithms perform well, demonstrating their capacity to identify non-linear connections and improve prediction accuracy. Among the feature set, σ,v (psf), MSF, CSRσ,v, FC%, Vs∗,40ft(f ps) and N1,60,CS are the ones that have the highest deterministic power on the result. The study's contribution is that, in the absence of extensive data for liquefaction assessment, the proposed method estimates the liquefaction potential using five parameters with promising accuracy.
{"title":"Soil liquefaction assessment using machine learning","authors":"Gamze Maden Muftuoglu , Kaveh Dehghanian","doi":"10.1016/j.aiig.2025.100122","DOIUrl":"10.1016/j.aiig.2025.100122","url":null,"abstract":"<div><div>Liquefaction is one of the prominent factors leading to damage to soil and structures. In this study, the relationship between liquefaction potential and soil parameters is determined by applying feature importance methods to Random Forest (RF), Logistic Regression (LR), Multilayer Perceptron (MLP), Support Vector Machine (SVM) and eXtreme Gradient Boosting (XGBoost) algorithms. Feature importance methods consist of permutation and Shapley Additive exPlanations (SHAP) importances along with the used model's built-in feature importance method if it exists. These suggested approaches incorporate an extensive dataset of geotechnical parameters, historical liquefaction events, and soil properties. The feature set comprises 18 parameters that are gathered from 161 field cases. Algorithms are used to determine the optimum performance feature set. Compared to other approaches, the study assesses how well these algorithms predict soil liquefaction potential. Early findings show that the algorithms perform well, demonstrating their capacity to identify non-linear connections and improve prediction accuracy. Among the feature set, <em>σ</em><sup><em>,</em></sup><sub><em>v</em></sub> (<em>psf</em>), MSF, <em>CSR</em><sub><em>σ,</em></sub> <sub><em>v</em></sub>, FC%, V<sub>s∗,40f</sub> <sub>t</sub>(f ps) and <em>N</em><sub>1<em>,</em>60<em>,CS</em></sub> are the ones that have the highest deterministic power on the result. The study's contribution is that, in the absence of extensive data for liquefaction assessment, the proposed method estimates the liquefaction potential using five parameters with promising accuracy.</div></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"6 1","pages":"Article 100122"},"PeriodicalIF":0.0,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144134957","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-06-01Epub Date: 2025-06-02DOI: 10.1016/j.aiig.2025.100123
Omar M. Saad , Matteo Ravasi , Tariq Alkhalifah
Seismic data denoising is a critical process usually applied at various stages of the seismic processing workflow, as our ability to mitigate noise in seismic data affects the quality of our subsequent analyses. However, finding an optimal balance between preserving seismic signals and effectively reducing seismic noise presents a substantial challenge. In this study, we introduce a multi-stage deep learning model, trained in a self-supervised manner, designed specifically to suppress seismic noise while minimizing signal leakage. This model operates as a patch-based approach, extracting overlapping patches from the noisy data and converting them into 1D vectors for input. It consists of two identical sub-networks, each configured differently. Inspired by the transformer architecture, each sub-network features an embedded block that comprises two fully connected layers, which are utilized for feature extraction from the input patches. After reshaping, a multi-head attention module enhances the model’s focus on significant features by assigning higher attention weights to them. The key difference between the two sub-networks lies in the number of neurons within their fully connected layers. The first sub-network serves as a strong denoiser with a small number of neurons, effectively attenuating seismic noise; in contrast, the second sub-network functions as a signal-add-back model, using a larger number of neurons to retrieve some of the signal that was not preserved in the output of the first sub-network. The proposed model produces two outputs, each corresponding to one of the sub-networks, and both sub-networks are optimized simultaneously using the noisy data as the label for both outputs. Evaluations conducted on both synthetic and field data demonstrate the model’s effectiveness in suppressing seismic noise with minimal signal leakage, outperforming some benchmark methods.
{"title":"Self-supervised multi-stage deep learning network for seismic data denoising","authors":"Omar M. Saad , Matteo Ravasi , Tariq Alkhalifah","doi":"10.1016/j.aiig.2025.100123","DOIUrl":"10.1016/j.aiig.2025.100123","url":null,"abstract":"<div><div>Seismic data denoising is a critical process usually applied at various stages of the seismic processing workflow, as our ability to mitigate noise in seismic data affects the quality of our subsequent analyses. However, finding an optimal balance between preserving seismic signals and effectively reducing seismic noise presents a substantial challenge. In this study, we introduce a multi-stage deep learning model, trained in a self-supervised manner, designed specifically to suppress seismic noise while minimizing signal leakage. This model operates as a patch-based approach, extracting overlapping patches from the noisy data and converting them into 1D vectors for input. It consists of two identical sub-networks, each configured differently. Inspired by the transformer architecture, each sub-network features an embedded block that comprises two fully connected layers, which are utilized for feature extraction from the input patches. After reshaping, a multi-head attention module enhances the model’s focus on significant features by assigning higher attention weights to them. The key difference between the two sub-networks lies in the number of neurons within their fully connected layers. The first sub-network serves as a strong denoiser with a small number of neurons, effectively attenuating seismic noise; in contrast, the second sub-network functions as a signal-add-back model, using a larger number of neurons to retrieve some of the signal that was not preserved in the output of the first sub-network. The proposed model produces two outputs, each corresponding to one of the sub-networks, and both sub-networks are optimized simultaneously using the noisy data as the label for both outputs. Evaluations conducted on both synthetic and field data demonstrate the model’s effectiveness in suppressing seismic noise with minimal signal leakage, outperforming some benchmark methods.</div></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"6 1","pages":"Article 100123"},"PeriodicalIF":0.0,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144243298","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-06-01Epub Date: 2025-03-14DOI: 10.1016/j.aiig.2025.100111
Suraparb Keawsawasvong , Jim Shiau , Nhat Tan Duong , Thanachon Promwichai , Rungkhun Banyong , Van Qui Lai
This study examines the stability of three-dimensional rectangular tunnel headings in drained c-ϕ soils, incorporating surcharge effects using 3D Finite Element Limit Analysis (FELA). It focuses on the upper and lower bound solutions for three stability factors: cohesion, surcharge, and soil unit weight (Nc, Ns, and Nγ). Based on Terzaghi's principle of superposition, the analysis evaluates tunnel stability under varying parameters, such as cover-depth ratio (H/D), width-depth ratio (B/D), and friction angle (ϕ). The results align closely with previous studies, and practical design charts are provided for calculating minimum support pressures. Additionally, machine learning models (ANN and XGBoost) are used to develop accurate correlations between input parameters and stability results. A relative importance index analysis is conducted to assess the impact of these parameters. This research enhances understanding of tunnel stability and offers practical insights for tunnel design.
{"title":"Enhancing understanding of 3D rectangular tunnel heading stability in c-φ soils with surcharge loading: A comprehensive FELA analysis using three stability factors and machine learning","authors":"Suraparb Keawsawasvong , Jim Shiau , Nhat Tan Duong , Thanachon Promwichai , Rungkhun Banyong , Van Qui Lai","doi":"10.1016/j.aiig.2025.100111","DOIUrl":"10.1016/j.aiig.2025.100111","url":null,"abstract":"<div><div>This study examines the stability of three-dimensional rectangular tunnel headings in drained <em>c-ϕ</em> soils, incorporating surcharge effects using 3D Finite Element Limit Analysis (FELA). It focuses on the upper and lower bound solutions for three stability factors: cohesion, surcharge, and soil unit weight (Nc, Ns, and Nγ). Based on Terzaghi's principle of superposition, the analysis evaluates tunnel stability under varying parameters, such as cover-depth ratio (<em>H/D</em>), width-depth ratio (<em>B/D</em>), and friction angle (<em>ϕ</em>). The results align closely with previous studies, and practical design charts are provided for calculating minimum support pressures. Additionally, machine learning models (ANN and XGBoost) are used to develop accurate correlations between input parameters and stability results. A relative importance index analysis is conducted to assess the impact of these parameters. This research enhances understanding of tunnel stability and offers practical insights for tunnel design.</div></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"6 1","pages":"Article 100111"},"PeriodicalIF":0.0,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143637661","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-06-01Epub Date: 2025-03-01DOI: 10.1016/j.aiig.2025.100107
Jiaqi Yan , Li Ma , Tianqi Jiang , Jing Zheng , Dewei Li , Xingzhi Teng
This paper proposed a moment tensor regression prediction technology based on ResNet for microseismic events. Taking the great advantages of deep networks in classification and regression tasks, it can realize the great potential of fast and accurate inversion of microseismic moment tensors after the network trained. This ResNet-based moment tensor prediction technology, whose input is raw recordings, does not require the extraction of data features in advance. First, we tested the network using synthetic data and performed a quantitative assessment of the errors. The results demonstrate that the network exhibits high accuracy and efficiency during the prediction phase. Next, we tested the network using real microseismic data and compared the results with those from traditional inversion methods. The error in the results was relatively small compared to traditional methods. However, the network operates more efficiently without requiring manual intervention, making it highly valuable for near-real-time monitoring applications.
{"title":"Microseismic moment tensor inversion based on ResNet model","authors":"Jiaqi Yan , Li Ma , Tianqi Jiang , Jing Zheng , Dewei Li , Xingzhi Teng","doi":"10.1016/j.aiig.2025.100107","DOIUrl":"10.1016/j.aiig.2025.100107","url":null,"abstract":"<div><div>This paper proposed a moment tensor regression prediction technology based on ResNet for microseismic events. Taking the great advantages of deep networks in classification and regression tasks, it can realize the great potential of fast and accurate inversion of microseismic moment tensors after the network trained. This ResNet-based moment tensor prediction technology, whose input is raw recordings, does not require the extraction of data features in advance. First, we tested the network using synthetic data and performed a quantitative assessment of the errors. The results demonstrate that the network exhibits high accuracy and efficiency during the prediction phase. Next, we tested the network using real microseismic data and compared the results with those from traditional inversion methods. The error in the results was relatively small compared to traditional methods. However, the network operates more efficiently without requiring manual intervention, making it highly valuable for near-real-time monitoring applications.</div></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"6 1","pages":"Article 100107"},"PeriodicalIF":0.0,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143551571","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-06-01Epub Date: 2025-06-06DOI: 10.1016/j.aiig.2025.100127
M.A. Dalhat, Sami A. Osman
This study employed convolutional neural networks (CNNs) for the classification of rock minerals based on 3179 RGB-scale original microstructural images of undisturbed broken surfaces. The image dataset covers 40 distinct rock mineral-types. Three CNN architectures (Simple model, SqueezeNet, and Xception) were evaluated to compare their performance and feature extraction capabilities. Gradient-weighted Class Activation Mapping (Grad-CAM) was employed to visualize the features influencing model predictions, providing insights into how each model distinguishes between mineral classes. Key discriminative attributes included texture, grain size, pattern, and color variations. Texture and grain boundaries were identified as the most critical features, as they were strongly activated regions by the best model. Patterns such as banding and chromatic contrasts further enhanced classification accuracy. Performance analysis revealed that the Simple model had limited ability to isolate fine-grained details, producing broad and less specific activations (0.84 test accuracy). SqueezeNet demonstrated improved localization of discriminative features but occasionally missed finer textural details (0.95 test accuracy). The Xception model outperformed the others, achieving the highest classification accuracy (0.98 test accuracy) by exhibiting precise and tightly focused activations, capturing intricate textures and subtle chromatic variations. Its superior performance can be attributed to its deep architecture and efficient depth-wise separable convolutions, which enabled hierarchical and detailed feature extraction. Results underscores the importance of texture, pattern, and chromatic features in accurate mineral classification and highlights the suitability of deep, efficient architectures like Xception for such tasks. These findings demonstrate the potential of CNNs in geoscience research, offering a framework for automated mineral identification in industrial and scientific applications.
{"title":"Deep learning based identification of rock minerals from un-processed digital microscopic images of undisturbed broken-surfaces","authors":"M.A. Dalhat, Sami A. Osman","doi":"10.1016/j.aiig.2025.100127","DOIUrl":"10.1016/j.aiig.2025.100127","url":null,"abstract":"<div><div>This study employed convolutional neural networks (CNNs) for the classification of rock minerals based on 3179 RGB-scale original microstructural images of undisturbed broken surfaces. The image dataset covers 40 distinct rock mineral-types. Three CNN architectures (Simple model, SqueezeNet, and Xception) were evaluated to compare their performance and feature extraction capabilities. Gradient-weighted Class Activation Mapping (Grad-CAM) was employed to visualize the features influencing model predictions, providing insights into how each model distinguishes between mineral classes. Key discriminative attributes included texture, grain size, pattern, and color variations. Texture and grain boundaries were identified as the most critical features, as they were strongly activated regions by the best model. Patterns such as banding and chromatic contrasts further enhanced classification accuracy. Performance analysis revealed that the Simple model had limited ability to isolate fine-grained details, producing broad and less specific activations (0.84 test accuracy). SqueezeNet demonstrated improved localization of discriminative features but occasionally missed finer textural details (0.95 test accuracy). The Xception model outperformed the others, achieving the highest classification accuracy (0.98 test accuracy) by exhibiting precise and tightly focused activations, capturing intricate textures and subtle chromatic variations. Its superior performance can be attributed to its deep architecture and efficient depth-wise separable convolutions, which enabled hierarchical and detailed feature extraction. Results underscores the importance of texture, pattern, and chromatic features in accurate mineral classification and highlights the suitability of deep, efficient architectures like Xception for such tasks. These findings demonstrate the potential of CNNs in geoscience research, offering a framework for automated mineral identification in industrial and scientific applications.</div></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"6 1","pages":"Article 100127"},"PeriodicalIF":0.0,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144255175","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-06-01Epub Date: 2025-04-03DOI: 10.1016/j.aiig.2025.100114
{"title":"Thank you reviewers!","authors":"","doi":"10.1016/j.aiig.2025.100114","DOIUrl":"10.1016/j.aiig.2025.100114","url":null,"abstract":"","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"6 1","pages":"Article 100114"},"PeriodicalIF":0.0,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144366260","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-06-01Epub Date: 2025-05-09DOI: 10.1016/j.aiig.2025.100115
Mohammad H. Taufik, Tariq Alkhalifah
Physics-informed neural networks (PINNs) are promising to replace conventional mesh-based partial differential equation (PDE) solvers by offering more accurate and flexible PDE solutions. However, PINNs are hampered by the relatively slow convergence and the need to perform additional, potentially expensive training for new PDE parameters. To solve this limitation, we introduce LatentPINN, a framework that utilizes latent representations of the PDE parameters as additional (to the coordinates) inputs into PINNs and allows for training over the distribution of these parameters. Motivated by the recent progress on generative models, we promote using latent diffusion models to learn compressed latent representations of the distribution of PDE parameters as they act as input parameters for NN functional solutions. We use a two-stage training scheme in which, in the first stage, we learn the latent representations for the distribution of PDE parameters. In the second stage, we train a physics-informed neural network over inputs given by randomly drawn samples from the coordinate space within the solution domain and samples from the learned latent representation of the PDE parameters. Considering their importance in capturing evolving interfaces and fronts in various fields, we test the approach on a class of level set equations given, for example, by the nonlinear Eikonal equation. We share results corresponding to three Eikonal parameters (velocity models) sets. The proposed method performs well on new phase velocity models without the need for any additional training.
{"title":"LatentPINNs: Generative physics-informed neural networks via a latent representation learning","authors":"Mohammad H. Taufik, Tariq Alkhalifah","doi":"10.1016/j.aiig.2025.100115","DOIUrl":"10.1016/j.aiig.2025.100115","url":null,"abstract":"<div><div>Physics-informed neural networks (PINNs) are promising to replace conventional mesh-based partial differential equation (PDE) solvers by offering more accurate and flexible PDE solutions. However, PINNs are hampered by the relatively slow convergence and the need to perform additional, potentially expensive training for new PDE parameters. To solve this limitation, we introduce LatentPINN, a framework that utilizes latent representations of the PDE parameters as additional (to the coordinates) inputs into PINNs and allows for training over the distribution of these parameters. Motivated by the recent progress on generative models, we promote using latent diffusion models to learn compressed latent representations of the distribution of PDE parameters as they act as input parameters for NN functional solutions. We use a two-stage training scheme in which, in the first stage, we learn the latent representations for the distribution of PDE parameters. In the second stage, we train a physics-informed neural network over inputs given by randomly drawn samples from the coordinate space within the solution domain and samples from the learned latent representation of the PDE parameters. Considering their importance in capturing evolving interfaces and fronts in various fields, we test the approach on a class of level set equations given, for example, by the nonlinear Eikonal equation. We share results corresponding to three Eikonal parameters (velocity models) sets. The proposed method performs well on new phase velocity models without the need for any additional training.</div></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"6 1","pages":"Article 100115"},"PeriodicalIF":0.0,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143948290","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-06-01Epub Date: 2025-06-18DOI: 10.1016/j.aiig.2025.100129
Kun Chen , Meng Li , Xiaolian Li , Guangzhi Cui , Jia Tian , JiaLe Li , RuoYao Mu , JunJie Zhu
Microseismic monitoring is essential for understanding subsurface dynamics and optimizing oil and gas production. However, traditional methods for the automatic detection of microseismic events rely heavily on characteristic functions and human intervention, often resulting in suboptimal performance when dealing with complex and noisy data. In this study, we propose a novel approach that leverages deep learning frame to extract multiscale features from microseismic data using a TransUNet neural network. Our model integrates the advantages of Transformer and UNet architectures to achieve high accuracy in multivariate image segmentation and precise picking of P-wave and S-wave first arrivals simultaneously. We validate our approach using both synthetic and field microseismic datasets recorded from gas storage monitoring and roof fracturing in a coal seam. The robustness of the proposed method has been verified in the testing of synthetic data with various levels of Gaussian and real background noises extracted from field data. The comparisons of the proposed method with UNet and SwinUNet in terms of the model architecture and classification performance demonstrate the TransUNet achieves the optimal balance in its architecture and inference speed. With relatively low inference time and network complexity, it operates effectively in high-precision microseismic phase pickings. This advancement holds significant promise for enhancing microseismic monitoring technology in hydraulic fracturing and reservoir monitoring applications.
{"title":"Enhancing microseismic event detection with TransUNet: A deep learning approach for simultaneous pickings of P-wave and S-wave first arrivals","authors":"Kun Chen , Meng Li , Xiaolian Li , Guangzhi Cui , Jia Tian , JiaLe Li , RuoYao Mu , JunJie Zhu","doi":"10.1016/j.aiig.2025.100129","DOIUrl":"10.1016/j.aiig.2025.100129","url":null,"abstract":"<div><div>Microseismic monitoring is essential for understanding subsurface dynamics and optimizing oil and gas production. However, traditional methods for the automatic detection of microseismic events rely heavily on characteristic functions and human intervention, often resulting in suboptimal performance when dealing with complex and noisy data. In this study, we propose a novel approach that leverages deep learning frame to extract multiscale features from microseismic data using a TransUNet neural network. Our model integrates the advantages of Transformer and UNet architectures to achieve high accuracy in multivariate image segmentation and precise picking of P-wave and S-wave first arrivals simultaneously. We validate our approach using both synthetic and field microseismic datasets recorded from gas storage monitoring and roof fracturing in a coal seam. The robustness of the proposed method has been verified in the testing of synthetic data with various levels of Gaussian and real background noises extracted from field data. The comparisons of the proposed method with UNet and SwinUNet in terms of the model architecture and classification performance demonstrate the TransUNet achieves the optimal balance in its architecture and inference speed. With relatively low inference time and network complexity, it operates effectively in high-precision microseismic phase pickings. This advancement holds significant promise for enhancing microseismic monitoring technology in hydraulic fracturing and reservoir monitoring applications.</div></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"6 1","pages":"Article 100129"},"PeriodicalIF":0.0,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144314011","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}