Pub Date : 2024-09-06DOI: 10.1016/j.cageo.2024.105715
Fuyan Zhao , Peng Hong , Chuanshan Dai , Guiling Wang , Haiyan Lei
In this study, we propose an improved lattice Boltzmann flux solver (LBFS) to simulate the thermal-hydraulic (TH) processes within fractured porous media. In LBFS, the flux at cell interfaces is calculated using a locally reconstructed lattice Boltzmann model (LBM). Unlike conventional methods that use direct mathematical approximations, LBFS can suppress the oscillation of solutions and has better accuracy. However, when simulating two-dimensional fractured porous media problems, the rock matrix is divided into surface cells, while fractures are usually divided into line cells. This increases the complexity of implementing the LBFS, as the reconstruction of interface flux in different dimensions requires the use of discrete velocity models (DmQn) in different dimensions. To address this challenge, we introduce an innovative interpolation scheme based on the improved D1Q3 model, thereby establishing a dimensionally independent approach for the reconstruction of the interface flux. This approach greatly reduces the complexity of applying the LBFS to hybrid dimensional problems and simplifies the computational process. The present method is validated by simulating three typical cases and the results show good agreement with the reference solutions. Finally, the improved LBFS is applied to analyze the TH coupling behavior in fractured porous media with a single fracture and a more complex scenario involving two intersected fractures.
{"title":"A lattice Boltzmann flux solver with the 1D-link interpolation scheme for simulating fluid flow and heat transfer in fractured porous media","authors":"Fuyan Zhao , Peng Hong , Chuanshan Dai , Guiling Wang , Haiyan Lei","doi":"10.1016/j.cageo.2024.105715","DOIUrl":"10.1016/j.cageo.2024.105715","url":null,"abstract":"<div><p>In this study, we propose an improved lattice Boltzmann flux solver (LBFS) to simulate the thermal-hydraulic (TH) processes within fractured porous media. In LBFS, the flux at cell interfaces is calculated using a locally reconstructed lattice Boltzmann model (LBM). Unlike conventional methods that use direct mathematical approximations, LBFS can suppress the oscillation of solutions and has better accuracy. However, when simulating two-dimensional fractured porous media problems, the rock matrix is divided into surface cells, while fractures are usually divided into line cells. This increases the complexity of implementing the LBFS, as the reconstruction of interface flux in different dimensions requires the use of discrete velocity models (DmQn) in different dimensions. To address this challenge, we introduce an innovative interpolation scheme based on the improved D1Q3 model, thereby establishing a dimensionally independent approach for the reconstruction of the interface flux. This approach greatly reduces the complexity of applying the LBFS to hybrid dimensional problems and simplifies the computational process. The present method is validated by simulating three typical cases and the results show good agreement with the reference solutions. Finally, the improved LBFS is applied to analyze the TH coupling behavior in fractured porous media with a single fracture and a more complex scenario involving two intersected fractures.</p></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"193 ","pages":"Article 105715"},"PeriodicalIF":4.2,"publicationDate":"2024-09-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142241968","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-09-05DOI: 10.1016/j.cageo.2024.105712
Min Jun Kim , Yongchae Cho
Well log data provide key subsurface information, which is crucial for lithology evaluation and reservoir characterization. However, due to technical issues, well log data may contain missing values at certain depth intervals, which can be detrimental for data analysis. The best method is to reacquire the missing data by relogging, but this increases operational costs. Thus, a cost-efficient method for restoring the lost data is needed to overcome this issue. We propose an imputation method for missing well log data using collaborative filtering, a widely used algorithm for making new item recommendations to users. Although collaborative filtering is mainly used in recommendation systems, its fundamental principle allows us to utilize it to help make predictions for missing log data. The method is applied to a well log dataset obtained from the North Sea near Norway. The results show that the collaborative filtering algorithm has the potential to be a powerful imputation method for missing well log data, but there are some limitations that need to be addressed.
{"title":"Imputation of missing values in well log data using k-nearest neighbor collaborative filtering","authors":"Min Jun Kim , Yongchae Cho","doi":"10.1016/j.cageo.2024.105712","DOIUrl":"10.1016/j.cageo.2024.105712","url":null,"abstract":"<div><p>Well log data provide key subsurface information, which is crucial for lithology evaluation and reservoir characterization. However, due to technical issues, well log data may contain missing values at certain depth intervals, which can be detrimental for data analysis. The best method is to reacquire the missing data by relogging, but this increases operational costs. Thus, a cost-efficient method for restoring the lost data is needed to overcome this issue. We propose an imputation method for missing well log data using collaborative filtering, a widely used algorithm for making new item recommendations to users. Although collaborative filtering is mainly used in recommendation systems, its fundamental principle allows us to utilize it to help make predictions for missing log data. The method is applied to a well log dataset obtained from the North Sea near Norway. The results show that the collaborative filtering algorithm has the potential to be a powerful imputation method for missing well log data, but there are some limitations that need to be addressed.</p></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"193 ","pages":"Article 105712"},"PeriodicalIF":4.2,"publicationDate":"2024-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142242006","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-09-05DOI: 10.1016/j.cageo.2024.105714
Fábio Corrêa Cordeiro , Patrícia Ferreira da Silva , Alexandre Tessarollo , Cláudia Freitas , Elvis de Souza , Diogo da Silva Magalhaes Gomes , Renato Rocha Souza , Flávio Codeço Coelho
Most companies struggle to find and extract relevant information from their technical documents. In particular, the Oil and Gas (O&G) industry faces the challenge of dealing with large amounts of data hidden within old and new geoscientific reports collected over decades of operation. Making this information available in a structured format can unlock valuable information among these mountains of data, which is crucial to support a wide range of industrial and academic applications. However, most natural language processing resources were built from general domain corpora extracted from the Internet and primarily written in English. This paper presents Petro NLP, a comprehensive set of natural language processing and information extraction resources for the oil and gas industry in Portuguese.
We connected an interdisciplinary team of geoscientists, linguists, computer scientists, petroleum engineers, librarians, and ontologists to build a knowledge graph and several annotated corpora. The Petro NLP resources comprise: (i) Petro KGraph– a knowledge graph populated with entities and relations commonly found on technical reports; and (ii) Petrolês, PetroGold, PetroNER, and PetroRE– sets of corpora containing raw text and documents annotated with morphosyntactic labels, named entities, and relations. These resources are fundamental infrastructure for future research in natural language processing and information extraction in the oil industry. Our ongoing research uses these datasets to train and enhance pre-trained machine learning models that automatically extract information from geoscientific technical documents.
大多数公司都在努力从技术文件中查找和提取相关信息。特别是,石油和天然气(O&G)行业面临的挑战是如何处理几十年来收集的新旧地球科学报告中隐藏的大量数据。以结构化的格式提供这些信息可以从堆积如山的数据中挖掘出有价值的信息,这对支持广泛的工业和学术应用至关重要。然而,大多数自然语言处理资源都是从互联网上提取的通用领域语料库中建立的,而且主要是用英语编写的。我们将一个由地球科学家、语言学家、计算机科学家、石油工程师、图书馆员和本体论专家组成的跨学科团队联系起来,构建了一个知识图谱和若干注释语料库。Petro NLP 资源包括:(i) Petro KGraph--一个知识图谱,其中包含技术报告中常见的实体和关系;(ii) Petrolês、PetroGold、PetroNER 和 PetroRE--包含原始文本和文档的语料集,其中标注了语态句法标签、命名实体和关系。这些资源是未来石油工业自然语言处理和信息提取研究的基础架构。我们正在进行的研究利用这些数据集来训练和增强预训练的机器学习模型,这些模型可自动从地球科学技术文档中提取信息。
{"title":"Petro NLP: Resources for natural language processing and information extraction for the oil and gas industry","authors":"Fábio Corrêa Cordeiro , Patrícia Ferreira da Silva , Alexandre Tessarollo , Cláudia Freitas , Elvis de Souza , Diogo da Silva Magalhaes Gomes , Renato Rocha Souza , Flávio Codeço Coelho","doi":"10.1016/j.cageo.2024.105714","DOIUrl":"10.1016/j.cageo.2024.105714","url":null,"abstract":"<div><p>Most companies struggle to find and extract relevant information from their technical documents. In particular, the Oil and Gas (O&G) industry faces the challenge of dealing with large amounts of data hidden within old and new geoscientific reports collected over decades of operation. Making this information available in a structured format can unlock valuable information among these <em>mountains</em> of data, which is crucial to support a wide range of industrial and academic applications. However, most natural language processing resources were built from general domain corpora extracted from the Internet and primarily written in English. This paper presents <span>Petro NLP</span>, a comprehensive set of natural language processing and information extraction resources for the oil and gas industry in Portuguese.</p><p>We connected an interdisciplinary team of geoscientists, linguists, computer scientists, petroleum engineers, librarians, and ontologists to build a knowledge graph and several annotated corpora. The <span>Petro NLP</span> resources comprise: (i) <span>Petro KGraph</span>– a knowledge graph populated with entities and relations commonly found on technical reports; and (ii) <span>Petrolês</span>, <span>PetroGold</span>, <span>PetroNER</span>, and <span>PetroRE</span>– sets of corpora containing raw text and documents annotated with morphosyntactic labels, named entities, and relations. These resources are fundamental infrastructure for future research in natural language processing and information extraction in the oil industry. Our ongoing research uses these datasets to train and enhance pre-trained machine learning models that automatically extract information from geoscientific technical documents.</p></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"193 ","pages":"Article 105714"},"PeriodicalIF":4.2,"publicationDate":"2024-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142242005","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
THEPORE (THErmo-POro-Elastic solutions) is an open source software to perform forward and inverse modeling of ground displacements induced by thermo-poro-elastic sources. The software, implemented in MATLAB, offers a library of analytical and semi-analytical solutions to compute ground displacements induced by thermo-poro-elastic deformation sources of different geometries, embedded in an elastic, homogeneous and isotropic half-space. The solutions have been verified against finite-element simulations. THEPORE includes also an inversion procedure of the deformation data to constrain the source parameters that better fit the observed signals.
The software's functionality is showcased by inverting the GPS deformation data recorded on Vulcano Island at the onset of the 2021 unrest, in order to estimate the position and volume change of the source responsible for the observed deformations. The results encourage to consider THEPORE as a practical tool suitable for a fast preliminary estimation of the deformation source during a volcanic crisis.
{"title":"THEPORE: A software package for modeling THErmo-PORo-elastic displacements","authors":"Gilda Currenti, Rosalba Napoli, Santina Chiara Stissi","doi":"10.1016/j.cageo.2024.105716","DOIUrl":"10.1016/j.cageo.2024.105716","url":null,"abstract":"<div><p>THEPORE (THErmo-POro-Elastic solutions) is an open source software to perform forward and inverse modeling of ground displacements induced by thermo-poro-elastic sources. The software, implemented in MATLAB, offers a library of analytical and semi-analytical solutions to compute ground displacements induced by thermo-poro-elastic deformation sources of different geometries, embedded in an elastic, homogeneous and isotropic half-space. The solutions have been verified against finite-element simulations. THEPORE includes also an inversion procedure of the deformation data to constrain the source parameters that better fit the observed signals.</p><p>The software's functionality is showcased by inverting the GPS deformation data recorded on Vulcano Island at the onset of the 2021 unrest, in order to estimate the position and volume change of the source responsible for the observed deformations. The results encourage to consider THEPORE as a practical tool suitable for a fast preliminary estimation of the deformation source during a volcanic crisis.</p></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"192 ","pages":"Article 105716"},"PeriodicalIF":4.2,"publicationDate":"2024-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0098300424001997/pdfft?md5=21b8a591b1181c37092880b66377dcc3&pid=1-s2.0-S0098300424001997-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142150799","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-09-02DOI: 10.1016/j.cageo.2024.105706
Jichen Wang , Jing Li , Kun Li , Zerui Li , Yu Kang , Ji Chang , Wenjun Lv
Geophysical logging is a geo-scientific instrument that detects information such as electric, acoustic, and radioactive properties of a well. Its data plays a vital role in interpreting subsurface geology. However, since logging data is an indirect reflection of rocks, it requires the construction of a logging interpretation model in combination with core samples. Obtaining and analysing all core samples in a well is not practical due to their enormous cost, leading to the problem of scarce core sample labels. This problem can be addressed using semi-supervised learning. Existing studies on lithology identification using logging data mostly utilize graph-based semi-supervised learning, which requires known features to establish a graph Laplacian matrix. Therefore, these methods often use logging values at certain depths to construct feature vectors and cannot learn the shape information of logging curves. In this paper, we propose a semi-supervised learning method with feature learning capability based on semi-supervised generative adversarial network (SSGAN) to learn the shape information of logging curves while utilizing unlabelled logging curves. Additionally, considering the problem of insufficient use of labels when dividing a validation set in extremely scarce-label situations, we propose a strategy of weighted averaging of three sub-models, which effectively improves model performance. We verify the effectiveness of our proposed method on five wells and demonstrate the mechanism of semi-supervised learning using adversarial learning through extensive visualization methods.
{"title":"Borehole lithology modelling with scarce labels by deep transductive learning","authors":"Jichen Wang , Jing Li , Kun Li , Zerui Li , Yu Kang , Ji Chang , Wenjun Lv","doi":"10.1016/j.cageo.2024.105706","DOIUrl":"10.1016/j.cageo.2024.105706","url":null,"abstract":"<div><p>Geophysical logging is a geo-scientific instrument that detects information such as electric, acoustic, and radioactive properties of a well. Its data plays a vital role in interpreting subsurface geology. However, since logging data is an indirect reflection of rocks, it requires the construction of a logging interpretation model in combination with core samples. Obtaining and analysing all core samples in a well is not practical due to their enormous cost, leading to the problem of scarce core sample labels. This problem can be addressed using semi-supervised learning. Existing studies on lithology identification using logging data mostly utilize graph-based semi-supervised learning, which requires known features to establish a graph Laplacian matrix. Therefore, these methods often use logging values at certain depths to construct feature vectors and cannot learn the shape information of logging curves. In this paper, we propose a semi-supervised learning method with feature learning capability based on semi-supervised generative adversarial network (SSGAN) to learn the shape information of logging curves while utilizing unlabelled logging curves. Additionally, considering the problem of insufficient use of labels when dividing a validation set in extremely scarce-label situations, we propose a strategy of weighted averaging of three sub-models, which effectively improves model performance. We verify the effectiveness of our proposed method on five wells and demonstrate the mechanism of semi-supervised learning using adversarial learning through extensive visualization methods.</p></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"192 ","pages":"Article 105706"},"PeriodicalIF":4.2,"publicationDate":"2024-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142136833","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
In this study, we proposed a general workflow that aims to enhance the ML-based geothermobarometer modelling. Our workflow focuses on three key areas. Firstly, we developed a robust pre-processing pipeline that addresses data imbalance, feature engineering, and data augmentation. Secondly, we assessed modelling errors using a Monte Carlo approach to quantify the impact of analytical uncertainties on the final pressure and temperature estimates. Thirdly, we implemented a robust strategy to validate and test the ML models to avoid over- and under-fitting issues while correcting biases associated with the application of specific ML models (i.e., tree-based ensembles).
To facilitate the use of our workflow, we have developed a web app (https://bit.ly/ml-pt-web) and a Python module (https://bit.ly/ml-pt-py). The robustness of this strategy has been tested on two calibrations: clinopyroxene (cpx) and clinopyroxene-liquid (cpx-liq). Our results show a significant reduction in errors compared to the baseline model, as well as good generalization ability on an independent external dataset. The Root Mean Squared Errors are 57 °C and 2.5 kbar for the cpx calibration, and 36 °C and 2.1 kbar for the cpx-liq calibration. Finally, our models show improved outcomes on the external dataset compared to existing ML and classical cpx and cpx-liq thermobarometers.
在本研究中,我们提出了一个通用工作流程,旨在增强基于 ML 的地温热压计建模。我们的工作流程侧重于三个关键领域。首先,我们开发了一个强大的预处理管道,以解决数据不平衡、特征工程和数据增强等问题。其次,我们使用蒙特卡罗方法评估建模误差,量化分析不确定性对最终压力和温度估计值的影响。第三,我们实施了一种稳健的策略来验证和测试 ML 模型,以避免过度拟合和拟合不足的问题,同时纠正与应用特定 ML 模型(即基于树的集合)相关的偏差。为了方便使用我们的工作流程,我们开发了一个网络应用程序 (https://bit.ly/ml-pt-web) 和一个 Python 模块 (https://bit.ly/ml-pt-py)。我们在两个定标中测试了这一策略的稳健性:clinopyroxene (cpx) 和 clinopyroxene-liquid (cpx-liq)。结果表明,与基线模型相比,误差明显减少,而且在独立的外部数据集上具有良好的泛化能力。cpx 标定的均方根误差为 57 ℃ 和 2.5 千巴,cpx-liq 标定的均方根误差为 36 ℃ 和 2.1 千巴。最后,与现有的 ML 和经典 cpx 和 cpx-liq 温度计相比,我们的模型在外部数据集上显示出更好的结果。
{"title":"Enhancing machine learning thermobarometry for clinopyroxene-bearing magmas","authors":"Mónica Ágreda-López , Valerio Parodi , Alessandro Musu , Corin Jorgenson , Alessandro Carfì , Fulvio Mastrogiovanni , Luca Caricchi , Diego Perugini , Maurizio Petrelli","doi":"10.1016/j.cageo.2024.105707","DOIUrl":"10.1016/j.cageo.2024.105707","url":null,"abstract":"<div><p>In this study, we proposed a general workflow that aims to enhance the ML-based geothermobarometer modelling. Our workflow focuses on three key areas. Firstly, we developed a robust pre-processing pipeline that addresses data imbalance, feature engineering, and data augmentation. Secondly, we assessed modelling errors using a Monte Carlo approach to quantify the impact of analytical uncertainties on the final pressure and temperature estimates. Thirdly, we implemented a robust strategy to validate and test the ML models to avoid over- and under-fitting issues while correcting biases associated with the application of specific ML models (i.e., tree-based ensembles).</p><p>To facilitate the use of our workflow, we have developed a web app (<span><span>https://bit.ly/ml-pt-web</span><svg><path></path></svg></span>) and a Python module (<span><span>https://bit.ly/ml-pt-py</span><svg><path></path></svg></span>). The robustness of this strategy has been tested on two calibrations: clinopyroxene (cpx) and clinopyroxene-liquid (cpx-liq). Our results show a significant reduction in errors compared to the baseline model, as well as good generalization ability on an independent external dataset. The Root Mean Squared Errors are 57 °C and 2.5 kbar for the cpx calibration, and 36 °C and 2.1 kbar for the cpx-liq calibration. Finally, our models show improved outcomes on the external dataset compared to existing ML and classical cpx and cpx-liq thermobarometers.</p></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"193 ","pages":"Article 105707"},"PeriodicalIF":4.2,"publicationDate":"2024-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0098300424001900/pdfft?md5=35a76aa189a72d9015dd976686c4e57f&pid=1-s2.0-S0098300424001900-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142173049","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-31DOI: 10.1016/j.cageo.2024.105708
Xingli Zhang, Zihan Zhang, Ruisheng Jia, Xinming Lu
The present study proposes a double-branch classification network, DPNet (Double Path Net), for the classification and identification of microseismic and blasting signals based on multimodal feature extraction. The vibration signals’ one-dimensional spectrogram and two-dimensional wavelet time–frequency graph are inputted into the double branch network. Subsequently, convolutional neural networks and ResNet are employed to extract the one-dimensional frequency features and two-dimensional time–frequency features of the vibration signals, respectively. Experimental results demonstrate that our proposed method achieves outstanding classification performance with an accuracy of 97.34% for microseismic signals and blasting signals. This research not only provides innovative solutions to practical problems but also introduces a novel idea of multimodal feature extraction at a theoretical level. By successfully applying it to efficiently classify complex signals in mining engineering, we offer a feasible solution that holds promising prospects for practical applications in this field.
{"title":"Research on microseismic signal identification through data fusion","authors":"Xingli Zhang, Zihan Zhang, Ruisheng Jia, Xinming Lu","doi":"10.1016/j.cageo.2024.105708","DOIUrl":"10.1016/j.cageo.2024.105708","url":null,"abstract":"<div><p>The present study proposes a double-branch classification network, DPNet (Double Path Net), for the classification and identification of microseismic and blasting signals based on multimodal feature extraction. The vibration signals’ one-dimensional spectrogram and two-dimensional wavelet time–frequency graph are inputted into the double branch network. Subsequently, convolutional neural networks and ResNet are employed to extract the one-dimensional frequency features and two-dimensional time–frequency features of the vibration signals, respectively. Experimental results demonstrate that our proposed method achieves outstanding classification performance with an accuracy of 97.34% for microseismic signals and blasting signals. This research not only provides innovative solutions to practical problems but also introduces a novel idea of multimodal feature extraction at a theoretical level. By successfully applying it to efficiently classify complex signals in mining engineering, we offer a feasible solution that holds promising prospects for practical applications in this field.</p></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"192 ","pages":"Article 105708"},"PeriodicalIF":4.2,"publicationDate":"2024-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142122340","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-30DOI: 10.1016/j.cageo.2024.105710
Xiaolei Tu, Esteban Jeremy Bowles-Martinez, Adam Schultz
Numerical modeling of electromagnetic (EM) fields in a conductive marine environment is crucial for marine EM data interpretation. During marine controlled-source electromagnetic (MCSEM) surveys, a variety of transmitter locations are used to introduce electric currents. The resulting electric and magnetic fields are then concurrently logged by a network of receivers. The forward simulation of MCSEM data for a subsea structure whose electrical properties vary in all three dimensions is computationally intensive. We demonstrate how such computations may be substantially accelerated by adapting algorithms to operate efficiently on modern GPUs with many core architectures. The algorithm we present features a hybrid MPI-CUDA programming model suitable for multi-GPU computers and consists of three levels of parallelism. We design the optimal kernels for different components to minimize redundant memory accesses. We have tested the algorithm on NVIDIA Kepler architecture and achieved up to 105 × speedup compared with the serial code version. We further showcased the algorithm's performance advantages through its application to a realistic marine model featuring complex geological structures. Our algorithm's significant efficiency increase opens the possibility of 3D MCSEM data interpretation based on probabilistic or machine learning approaches, which require tens of thousands of forward simulations for every survey.
{"title":"Massively parallel modeling of electromagnetic field in conductive media: An MPI-CUDA implementation on Multi-GPU computers","authors":"Xiaolei Tu, Esteban Jeremy Bowles-Martinez, Adam Schultz","doi":"10.1016/j.cageo.2024.105710","DOIUrl":"10.1016/j.cageo.2024.105710","url":null,"abstract":"<div><p>Numerical modeling of electromagnetic (EM) fields in a conductive marine environment is crucial for marine EM data interpretation. During marine controlled-source electromagnetic (MCSEM) surveys, a variety of transmitter locations are used to introduce electric currents. The resulting electric and magnetic fields are then concurrently logged by a network of receivers. The forward simulation of MCSEM data for a subsea structure whose electrical properties vary in all three dimensions is computationally intensive. We demonstrate how such computations may be substantially accelerated by adapting algorithms to operate efficiently on modern GPUs with many core architectures. The algorithm we present features a hybrid MPI-CUDA programming model suitable for multi-GPU computers and consists of three levels of parallelism. We design the optimal kernels for different components to minimize redundant memory accesses. We have tested the algorithm on NVIDIA Kepler architecture and achieved up to 105 × speedup compared with the serial code version. We further showcased the algorithm's performance advantages through its application to a realistic marine model featuring complex geological structures. Our algorithm's significant efficiency increase opens the possibility of 3D MCSEM data interpretation based on probabilistic or machine learning approaches, which require tens of thousands of forward simulations for every survey.</p></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"192 ","pages":"Article 105710"},"PeriodicalIF":4.2,"publicationDate":"2024-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142157755","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-30DOI: 10.1016/j.cageo.2024.105709
Adam Lurka
Mine induced seismic events are a major safety concern in mining and require careful monitoring and management to reduce their effects. Therefore, an essential step in assessing seismic and rock burst hazards is the analysis of mine seismicity. Recently, deep neural networks have been used to automatically determine seismic wave arrival times, surpassing human performance and allowing their use in seismic data analysis such as seismic event location and seismic energy calculation. In order to properly automate the rockburst and seismic hazard assessment deep neural network phase picker and a spatio-temporal clustering method were utilized. Seismic and rockburst hazards were statistically quantified using two-way contingency tables for two categorical variables: seismic energy level of mine tremors and number of clusters. Correlations between several spatio-temporal clusters and a statistical association between two categorical variables: seismic energy level and cluster number indicate an increase of seismic hazard in the Marcel hard coal mine in Poland. A new automated tool has been elaborated to automatically identify high-stress areas in mines in the form of spatio-temporal clusters.
{"title":"Combining deep neural network and spatio-temporal clustering to automatically assess rockburst and seismic hazard – Case study from Marcel coal mine in Upper Silesian Basin, Poland","authors":"Adam Lurka","doi":"10.1016/j.cageo.2024.105709","DOIUrl":"10.1016/j.cageo.2024.105709","url":null,"abstract":"<div><p>Mine induced seismic events are a major safety concern in mining and require careful monitoring and management to reduce their effects. Therefore, an essential step in assessing seismic and rock burst hazards is the analysis of mine seismicity. Recently, deep neural networks have been used to automatically determine seismic wave arrival times, surpassing human performance and allowing their use in seismic data analysis such as seismic event location and seismic energy calculation. In order to properly automate the rockburst and seismic hazard assessment deep neural network phase picker and a spatio-temporal clustering method were utilized. Seismic and rockburst hazards were statistically quantified using two-way contingency tables for two categorical variables: seismic energy level of mine tremors and number of clusters. Correlations between several spatio-temporal clusters and a statistical association between two categorical variables: seismic energy level and cluster number indicate an increase of seismic hazard in the Marcel hard coal mine in Poland. A new automated tool has been elaborated to automatically identify high-stress areas in mines in the form of spatio-temporal clusters.</p></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"192 ","pages":"Article 105709"},"PeriodicalIF":4.2,"publicationDate":"2024-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0098300424001924/pdfft?md5=2bf4c7ce15a9d7979aa62ba8147334ed&pid=1-s2.0-S0098300424001924-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142136832","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-23DOI: 10.1016/j.cageo.2024.105687
Wei Chen , Yangkang Chen
Some target seismic signals in the earthquake data can be very weak compared with interfering phases, and are thus difficult to detect, which further hinders the effective usage of these weak phases for subsequent high-resolution imaging of earth interiors. The strong ambient noise makes this situation even more troublesome since the weak signals can be mostly buried in the noise. Here, we present an open-source package for uncovering the weak phases from global seismograms. We adopt a two-step scheme to reconstruct and denoise array data. The first step is weighted average interpolation which puts the data into irregular grids. The second step adopts the weighted projection-onto-convex sets based on damped rank-reduction to further interpolate and denoise for the binned data. Taking the complexity of the weak signal into consideration, we adopt the automatic strategy to select an appropriate rank in different localized windows. We conduct several synthetic tests to carefully investigate the performance regarding effectiveness, robustness, and efficiency, and compare the algorithm with the frequency–wavenumber-domain projection onto convex sets method that is already used in the global seismology literature. Finally, the proposed framework is validated via a recorded array data set of the 1995 May 5 Philippines earthquake.
{"title":"DRRGlobal: Uncovering the weak phases from global seismograms using the damped rank-reduction method","authors":"Wei Chen , Yangkang Chen","doi":"10.1016/j.cageo.2024.105687","DOIUrl":"10.1016/j.cageo.2024.105687","url":null,"abstract":"<div><p>Some target seismic signals in the earthquake data can be very weak compared with interfering phases, and are thus difficult to detect, which further hinders the effective usage of these weak phases for subsequent high-resolution imaging of earth interiors. The strong ambient noise makes this situation even more troublesome since the weak signals can be mostly buried in the noise. Here, we present an open-source package for uncovering the weak phases from global seismograms. We adopt a two-step scheme to reconstruct and denoise array data. The first step is weighted average interpolation which puts the data into irregular grids. The second step adopts the weighted projection-onto-convex sets based on damped rank-reduction to further interpolate and denoise for the binned data. Taking the complexity of the weak signal into consideration, we adopt the automatic strategy to select an appropriate rank in different localized windows. We conduct several synthetic tests to carefully investigate the performance regarding effectiveness, robustness, and efficiency, and compare the algorithm with the frequency–wavenumber-domain projection onto convex sets method that is already used in the global seismology literature. Finally, the proposed framework is validated via a recorded array data set of the 1995 May 5 Philippines earthquake.</p></div>","PeriodicalId":55221,"journal":{"name":"Computers & Geosciences","volume":"192 ","pages":"Article 105687"},"PeriodicalIF":4.2,"publicationDate":"2024-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142050260","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}