Applied Computing and Geosciences最新文献_第10页

Contributions of machine learning to quantitative and real-time mud gas data analysis: A critical review 机器学习对定量和实时泥浆气体数据分析的贡献:综述

IF 3.4 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Applied Computing and Geosciences

Pub Date : 2022-12-01 DOI: 10.1016/j.acags.2022.100095

Fatai Anifowose, Mokhles Mezghani, Saleh Badawood, Javed Ismail

The current utility of mud gas data is typically limited to geological and petrophysical correlation, formation evaluation, and fluid typing. A critical and comprehensive review of the literature on mud gas data revealed that the mud gas data is abundantly acquired during drilling but not sufficiently utilized in real time. There is the need to leverage the current advances in machine learning technology and the race towards the digital transformation of the petroleum industry to create new opportunities for more extensive utility of mud gas data. Now that data is the new “oil” or “gold”, the utility of the rich and abundant mud gas data could be explored for real-time applications. Such new possibilities are capable of adding more value to the reservoir characterization workflow ahead of geophysical logging, geological core data analysis, and well testing. Achieving this will facilitate early decision-making, improve safety, reduce nonproductive time, and ultimately accelerate the attainment of the digital transformation objective of the petroleum industry. We conclude with identifying possible future directions for the ultimate attainment of maximizing the utility of mud gas data through real-time and more advanced applications.

目前，泥浆气数据的应用通常局限于地质和岩石物理对比、地层评价和流体类型。对泥浆气数据的文献进行了批判性和全面的回顾，发现在钻井过程中获得了大量的泥浆气数据，但没有得到充分的实时利用。因此，有必要利用当前机器学习技术的进步和石油行业数字化转型的竞争，为更广泛地利用泥浆气数据创造新的机会。如今，数据已成为新的“石油”或“黄金”，丰富的泥气数据可以用于实时应用。这些新的可能性能够在地球物理测井、地质岩心数据分析和试井之前为储层描述工作流程增加更多价值。实现这一目标将有助于早期决策，提高安全性，减少非生产时间，并最终加速实现石油行业的数字化转型目标。最后，我们确定了未来可能的方向，通过实时和更先进的应用，最终实现最大限度地利用泥浆气数据。

{"title":"Contributions of machine learning to quantitative and real-time mud gas data analysis: A critical review","authors":"Fatai Anifowose, Mokhles Mezghani, Saleh Badawood, Javed Ismail","doi":"10.1016/j.acags.2022.100095","DOIUrl":"10.1016/j.acags.2022.100095","url":null,"abstract":"<div><p>The current utility of mud gas data is typically limited to geological and petrophysical correlation, formation evaluation, and fluid typing. A critical and comprehensive review of the literature on mud gas data revealed that the mud gas data is abundantly acquired during drilling but not sufficiently utilized in real time. There is the need to leverage the current advances in machine learning technology and the race towards the digital transformation of the petroleum industry to create new opportunities for more extensive utility of mud gas data. Now that data is the new “oil” or “gold”, the utility of the rich and abundant mud gas data could be explored for real-time applications. Such new possibilities are capable of adding more value to the reservoir characterization workflow ahead of geophysical logging, geological core data analysis, and well testing. Achieving this will facilitate early decision-making, improve safety, reduce nonproductive time, and ultimately accelerate the attainment of the digital transformation objective of the petroleum industry. We conclude with identifying possible future directions for the ultimate attainment of maximizing the utility of mud gas data through real-time and more advanced applications.</p></div>","PeriodicalId":33804,"journal":{"name":"Applied Computing and Geosciences","volume":"16 ","pages":"Article 100095"},"PeriodicalIF":3.4,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2590197422000179/pdfft?md5=05e1d93af07412f49d3f45e78cc62d49&pid=1-s2.0-S2590197422000179-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44840264","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

Applying machine learning methods to predict geology using soil sample geochemistry 应用机器学习方法利用土壤样本地球化学预测地质

IF 3.4 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Applied Computing and Geosciences

Pub Date : 2022-12-01 DOI: 10.1016/j.acags.2022.100094

Timothy C.C. Lui , Daniel D. Gregory , Marek Anderson , Well-Shen Lee , Sharon A. Cowling

In this study we compared various machine learning techniques that used soil geochemistry to aid in geologic mapping. We tested six different sampling methods (undersample, oversample, Synthetic Minority Oversampling Technique (SMOTE), Adaptive Synthetic Sampling (ADASYN), SMOTE and Edited Nearest Neighbor (SMOTEENN), and SMOTE and Tomek links (SMOTETomek)). SMOTE performed best with ADASYN and SMOTETomek having slightly lower effectiveness. Nine machine learning algorithms (naïve Bayes, logistic regression, quadratic discriminant analysis, nearest neighbors, radial basis function support-vector machine, artificial neural network, random forest, AdaBoost classifier, and gradient boosting classifier) were compared and AdaBoost classifiers and gradient boosting classifiers were found to be most effective. Finally, we experimented with multiple classifier systems (MCS) testing different combinations of algorithms and various combinatorial functions. It was found that MCS can outperform individual models, and the best MCS combined nearest neighbors, radial basis function support-vector machine, artificial neural network, random forest, AdaBoost classifiers, and gradient boosting classifier, then applied a logistic regression to the probabilities output by the models. Ultimately, we created a tool that is able to adequately predict underlying geology in the study area using soil geochemistry.

在这项研究中，我们比较了各种机器学习技术，这些技术使用土壤地球化学来帮助进行地质制图。我们测试了六种不同的采样方法(欠采样，过采样，合成少数过采样技术(SMOTE)，自适应合成采样(ADASYN)， SMOTE和编辑最近邻(SMOTEENN)，以及SMOTE和Tomek链接(SMOTETomek))。SMOTE表现最好，ADASYN和SMOTETomek的效果稍低。比较了9种机器学习算法(naïve贝叶斯、逻辑回归、二次判别分析、最近邻、径向基函数支持向量机、人工神经网络、随机森林、AdaBoost分类器和梯度增强分类器)，发现AdaBoost分类器和梯度增强分类器最有效。最后，我们用多个分类器系统(MCS)进行了实验，测试了不同的算法组合和各种组合函数。结果表明，最优MCS将最近邻、径向基函数支持向量机、人工神经网络、随机森林、AdaBoost分类器和梯度增强分类器组合在一起，对模型输出的概率进行逻辑回归。最终，我们创造了一种工具，能够利用土壤地球化学充分预测研究区域的潜在地质情况。

{"title":"Applying machine learning methods to predict geology using soil sample geochemistry","authors":"Timothy C.C. Lui , Daniel D. Gregory , Marek Anderson , Well-Shen Lee , Sharon A. Cowling","doi":"10.1016/j.acags.2022.100094","DOIUrl":"10.1016/j.acags.2022.100094","url":null,"abstract":"<div><p>In this study we compared various machine learning techniques that used soil geochemistry to aid in geologic mapping. We tested six different sampling methods (undersample, oversample, Synthetic Minority Oversampling Technique (SMOTE), Adaptive Synthetic Sampling (ADASYN), SMOTE and Edited Nearest Neighbor (SMOTEENN), and SMOTE and Tomek links (SMOTETomek)). SMOTE performed best with ADASYN and SMOTETomek having slightly lower effectiveness. Nine machine learning algorithms (naïve Bayes, logistic regression, quadratic discriminant analysis, nearest neighbors, radial basis function support-vector machine, artificial neural network, random forest, AdaBoost classifier, and gradient boosting classifier) were compared and AdaBoost classifiers and gradient boosting classifiers were found to be most effective. Finally, we experimented with multiple classifier systems (MCS) testing different combinations of algorithms and various combinatorial functions. It was found that MCS can outperform individual models, and the best <span>MCS</span> combined nearest neighbors, radial basis function support-vector machine, artificial neural network, random forest, AdaBoost classifiers, and gradient boosting classifier, then applied a logistic regression to the probabilities output by the models. Ultimately, we created a tool that is able to adequately predict underlying geology in the study area using soil geochemistry.</p></div>","PeriodicalId":33804,"journal":{"name":"Applied Computing and Geosciences","volume":"16 ","pages":"Article 100094"},"PeriodicalIF":3.4,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2590197422000167/pdfft?md5=b4237ad5984255315f0bafb7d1c87083&pid=1-s2.0-S2590197422000167-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49360901","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 4

Enhanced machine learning tree classifiers for lithology identification using Bayesian optimization 使用贝叶斯优化的增强机器学习树分类器用于岩性识别

IF 3.4 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Applied Computing and Geosciences

Pub Date : 2022-12-01 DOI: 10.1016/j.acags.2022.100100

Solomon Asante-Okyere , Chuanbo Shen , Harrison Osei

Lithology identification is a fundamental activity in oil and gas exploration. The application of artificial intelligence (AI) is currently being adopted as a state-of-the-art means of automating lithology identification. One aspect of this AI approach is the application of population search algorithms to optimise hyperparameters for enhanced prediction performance. For the first time, Bayesian optimization is deployed to determine the optimal learning parameters for more accurate tree and tree ensemble lithology classifiers. The aim is to rely on the ability of Bayesian optimization to consider previous classification results to improve the output of decision and ensemble tree lithology models using well logs as inputs. The proposed Bayesian optimised decision tree (BODT) generated the best classification accuracy of 89.8% as compared to 86.9%, 83.3% and 81.2% for fine, medium and coarse trees. For the ensembled trees, the Bayesian optimised AdaBoost (BO-AdaBoost) classifier generated the highest improved prediction accuracy of 94.2% while Bayesian optimised Bagged (BO-Bagged) and Bayesian optimised RUSBoost (BO-RUSBoost) had a lower accuracy rate of 94.0% and 77.1% respectively. Additionally, the performance of the Bayesian optimised classifiers offered higher reliability when compared with particle swarm optimization-based artificial neural networks (PSO-ANN). Hence, incorporating Bayesian optimization as a hyperparameter search algorithm will improve litholofacies recognition, leading to a higher accuracy rate and thereby provide an improved alternative for intelligent lithology identification.

岩性识别是油气勘探的一项基础性工作。人工智能(AI)的应用目前正被采用为自动化岩性识别的最先进手段。这种人工智能方法的一个方面是应用种群搜索算法来优化超参数以增强预测性能。第一次，贝叶斯优化被用于确定更准确的树和树集合岩性分类器的最佳学习参数。其目的是依靠贝叶斯优化的能力来考虑之前的分类结果，以提高决策的输出和使用测井作为输入的集成树岩性模型的输出。所提出的贝叶斯优化决策树(BODT)的分类准确率为89.8%，而细树、中树和粗树的分类准确率分别为86.9%、83.3%和81.2%。对于集成树，贝叶斯优化AdaBoost (BO-AdaBoost)分类器的预测准确率最高，达到94.2%，而贝叶斯优化Bagged (BO-Bagged)和贝叶斯优化RUSBoost (BO-RUSBoost)的准确率较低，分别为94.0%和77.1%。此外，与基于粒子群优化的人工神经网络(PSO-ANN)相比，贝叶斯优化分类器的性能具有更高的可靠性。因此，将贝叶斯优化作为一种超参数搜索算法，可以提高岩性识别的准确率，从而为智能岩性识别提供一种改进的选择。

{"title":"Enhanced machine learning tree classifiers for lithology identification using Bayesian optimization","authors":"Solomon Asante-Okyere , Chuanbo Shen , Harrison Osei","doi":"10.1016/j.acags.2022.100100","DOIUrl":"10.1016/j.acags.2022.100100","url":null,"abstract":"<div><p>Lithology identification is a fundamental activity in oil and gas exploration. The application of artificial intelligence (AI) is currently being adopted as a state-of-the-art means of automating lithology identification. One aspect of this AI approach is the application of population search algorithms to optimise hyperparameters for enhanced prediction performance. For the first time, Bayesian optimization is deployed to determine the optimal learning parameters for more accurate tree and tree ensemble lithology classifiers. The aim is to rely on the ability of Bayesian optimization to consider previous classification results to improve the output of decision and ensemble tree lithology models using well logs as inputs. The proposed Bayesian optimised decision tree (BODT) generated the best classification accuracy of 89.8% as compared to 86.9%, 83.3% and 81.2% for fine, medium and coarse trees. For the ensembled trees, the Bayesian optimised AdaBoost (BO-AdaBoost) classifier generated the highest improved prediction accuracy of 94.2% while Bayesian optimised Bagged (BO-Bagged) and Bayesian optimised RUSBoost (BO-RUSBoost) had a lower accuracy rate of 94.0% and 77.1% respectively. Additionally, the performance of the Bayesian optimised classifiers offered higher reliability when compared with particle swarm optimization-based artificial neural networks (PSO-ANN). Hence, incorporating Bayesian optimization as a hyperparameter search algorithm will improve litholofacies recognition, leading to a higher accuracy rate and thereby provide an improved alternative for intelligent lithology identification.</p></div>","PeriodicalId":33804,"journal":{"name":"Applied Computing and Geosciences","volume":"16 ","pages":"Article 100100"},"PeriodicalIF":3.4,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2590197422000222/pdfft?md5=70b41f43c359f6a72c8f285b2d646140&pid=1-s2.0-S2590197422000222-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48858608","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

Automated detection of microfossil fish teeth from slide images using combined deep learning models 结合深度学习模型从幻灯片图像中自动检测微化石鱼牙齿

IF 3.4 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Applied Computing and Geosciences

Pub Date : 2022-12-01 DOI: 10.1016/j.acags.2022.100092

Kazuhide Mimura , Shugo Minabe , Kentaro Nakamura , Kazutaka Yasukawa , Junichiro Ohta , Yasuhiro Kato

Microfossil fish teeth, known as ichthyoliths, provide a key constraint on the depositional age and environment of deep-sea sediments, especially pelagic clays where siliceous and calcareous microfossils are rarely observed. However, traditional methods for the observation of ichthyoliths require considerable time and manual labor, which can hinder their wider application. In this study, we constructed a system to automatically detect ichthyoliths in microscopic images by combining two open source deep learning models. First, the regions for ichthyoliths within the microscopic images are predicted by the instance segmentation model Mask R–CNN. All the detected regions are then re-classified using the image classification model EfficientNet-V2 to determine the classes more accurately. Compared with only using the Mask R–CNN model, the combined system offers significantly higher performance (89.0% precision, 78.6% recall, and an F1 score of 83.5%), demonstrating the utility of the system. Our system can also predict the lengths of the teeth that have been detected, with more than 90% of the predicted lengths being within ±20% of measured length. This system provides a novel, automated, and reliable approach for the detection and length measurement of ichthyoliths from microscope images that can be applied in a range of paleoceanographic and paleoecological contexts.

微化石鱼牙，被称为鱼石，为研究深海沉积物的沉积时代和环境提供了关键的约束条件，特别是在很少观察到硅质和钙质微化石的远洋粘土中。然而，传统的观察鱼鳞石的方法需要大量的时间和体力劳动，这阻碍了它们的广泛应用。在这项研究中，我们结合两个开源的深度学习模型构建了一个系统来自动检测微观图像中的鱼鳞石。首先，通过实例分割模型Mask R-CNN预测显微图像中鱼石体的区域。然后使用图像分类模型EfficientNet-V2对所有检测到的区域进行重新分类，以更准确地确定类别。与仅使用Mask R-CNN模型相比，组合系统的性能显著提高(准确率为89.0%，召回率为78.6%，F1得分为83.5%)，证明了系统的实用性。我们的系统还可以预测已经检测到的牙齿的长度，超过90%的预测长度在测量长度的±20%以内。该系统提供了一种新颖、自动化和可靠的方法，用于从显微镜图像中检测和测量鱼石的长度，可应用于一系列古海洋学和古生态学背景。

{"title":"Automated detection of microfossil fish teeth from slide images using combined deep learning models","authors":"Kazuhide Mimura , Shugo Minabe , Kentaro Nakamura , Kazutaka Yasukawa , Junichiro Ohta , Yasuhiro Kato","doi":"10.1016/j.acags.2022.100092","DOIUrl":"10.1016/j.acags.2022.100092","url":null,"abstract":"<div><p>Microfossil fish teeth, known as ichthyoliths, provide a key constraint on the depositional age and environment of deep-sea sediments, especially pelagic clays where siliceous and calcareous microfossils are rarely observed. However, traditional methods for the observation of ichthyoliths require considerable time and manual labor, which can hinder their wider application. In this study, we constructed a system to automatically detect ichthyoliths in microscopic images by combining two open source deep learning models. First, the regions for ichthyoliths within the microscopic images are predicted by the instance segmentation model Mask R–CNN. All the detected regions are then re-classified using the image classification model EfficientNet-V2 to determine the classes more accurately. Compared with only using the Mask R–CNN model, the combined system offers significantly higher performance (89.0% precision, 78.6% recall, and an F1 score of 83.5%), demonstrating the utility of the system. Our system can also predict the lengths of the teeth that have been detected, with more than 90% of the predicted lengths being within ±20% of measured length. This system provides a novel, automated, and reliable approach for the detection and length measurement of ichthyoliths from microscope images that can be applied in a range of paleoceanographic and paleoecological contexts.</p></div>","PeriodicalId":33804,"journal":{"name":"Applied Computing and Geosciences","volume":"16 ","pages":"Article 100092"},"PeriodicalIF":3.4,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2590197422000143/pdfft?md5=4c5b722dfdb0bf4e0c4d6459e827ee6b&pid=1-s2.0-S2590197422000143-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43365348","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Random forest rock type classification with integration of geochemical and photographic data 结合地球化学和摄影资料的随机森林岩石类型分类

IF 3.4 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Applied Computing and Geosciences

Pub Date : 2022-09-01 DOI: 10.1016/j.acags.2022.100090

McLean Trott , Matthew Leybourne , Lindsay Hall , Daniel Layton-Matthews

Systematic manual and algorithmic classification workflows to characterize rock types are increasingly applied in the mineral exploration and mining industry, leveraging large systematically collected datasets. The aim of these are robust and repeatable classifications to aid more traditional visual logging practices. This study uses random forest algorithms to examine the impacts of integrating distinct datasets with complementary characteristics; chemistry to enable compositional distinctions, and photography to enable textural distinctions. We use a random forest classifier to examine the accuracy metrics of models producing rock type classifications using these two data types independently and integrated together. Prediction accuracy, measured using 10-fold cross validation, was 87% for geochemical-only inputs, 85% for photographic-only inputs, and 90% for mixed inputs from both datasets. A mining and exploration project in the Late Miocene to early Pliocene porphyry belt in Chile is the site of this case study, where datasets were systematically acquired using in-field methods on historical drill-cores. Results indicate that classification of lithology is improved by integration of photography-based and composition-based feature inputs. We infer that the benefits of integration would increase in proportion with increasing compositional similarity between rock types. This approach might also be applied to similar geological problems, such as alteration or metallurgical classifications; and with somewhat distinct datatypes, such as geochemical interval data and photographic metric extraction from coincident intervals in core photos.

利用大量系统收集的数据集，在矿产勘探和采矿业中越来越多地应用系统的人工和算法分类工作流程来表征岩石类型。这些分类的目的是健壮和可重复的分类，以帮助更传统的可视化日志记录实践。本研究使用随机森林算法来检验整合具有互补特征的不同数据集的影响;化学可以区分成分，摄影可以区分材质。我们使用随机森林分类器来检查使用这两种数据类型独立和集成在一起产生岩石类型分类的模型的精度指标。使用10倍交叉验证测量的预测精度，仅地球化学输入为87%，仅摄影输入为85%，两个数据集的混合输入为90%。智利晚中新世至上新世早期斑岩带的一个采矿和勘探项目是本案例研究的地点，在该项目中，使用历史钻孔岩心的现场方法系统地获取了数据集。结果表明，将基于照片和成分的特征输入相结合，改进了岩性分类。我们推断，整合的好处将随着岩石类型之间成分相似性的增加而成比例地增加。这种方法也可适用于类似的地质问题，例如蚀变或冶金分类;并且数据类型有所不同，如地球化学层段数据和岩心照片中重合层段的摄影度量提取。

{"title":"Random forest rock type classification with integration of geochemical and photographic data","authors":"McLean Trott , Matthew Leybourne , Lindsay Hall , Daniel Layton-Matthews","doi":"10.1016/j.acags.2022.100090","DOIUrl":"10.1016/j.acags.2022.100090","url":null,"abstract":"<div><p>Systematic manual and algorithmic classification workflows to characterize rock types are increasingly applied in the mineral exploration and mining industry, leveraging large systematically collected datasets. The aim of these are robust and repeatable classifications to aid more traditional visual logging practices. This study uses random forest algorithms to examine the impacts of integrating distinct datasets with complementary characteristics; chemistry to enable compositional distinctions, and photography to enable textural distinctions. We use a random forest classifier to examine the accuracy metrics of models producing rock type classifications using these two data types independently and integrated together. Prediction accuracy, measured using 10-fold cross validation, was 87% for geochemical-only inputs, 85% for photographic-only inputs, and 90% for mixed inputs from both datasets. A mining and exploration project in the Late Miocene to early Pliocene porphyry belt in Chile is the site of this case study, where datasets were systematically acquired using in-field methods on historical drill-cores. Results indicate that classification of lithology is improved by integration of photography-based and composition-based feature inputs. We infer that the benefits of integration would increase in proportion with increasing compositional similarity between rock types. This approach might also be applied to similar geological problems, such as alteration or metallurgical classifications; and with somewhat distinct datatypes, such as geochemical interval data and photographic metric extraction from coincident intervals in core photos.</p></div>","PeriodicalId":33804,"journal":{"name":"Applied Computing and Geosciences","volume":"15 ","pages":"Article 100090"},"PeriodicalIF":3.4,"publicationDate":"2022-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S259019742200012X/pdfft?md5=9f3745cefef8413c910837602f704efe&pid=1-s2.0-S259019742200012X-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"53922094","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

Estimating size of finite fracture networks in layered reservoirs 层状储层有限裂缝网络大小的估计

IF 3.4 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Applied Computing and Geosciences

Pub Date : 2022-09-01 DOI: 10.1016/j.acags.2022.100089

Sait I. Ozkaya , M.M. Al-Fahmi

Conductive fractures in petroleum reservoirs may be totally isolated or fully interconnected. There is an intermediate state between the two extremities. Partially fractured reservoirs include finite fracture networks (FFN), which are bundles of interconnected fractures embedded in a sea of isolated fractures. Devising measures for sizes of FFNs is crucial in estimating critical engineering aspects such as productivity index, production decline rate and expected ultimate recovery of wells especially in reservoirs with low matrix porosity and permeability. Here, we present results of statistical evaluation of FFN size in relation to fracture connectivity which is in essence the number of fracture intersections per fracture. The analysis is based on a large number of stochastic 2-dimensional (2D) Poisson models of sub-vertical layer bound fractures. Fracture length in the models has log normal or truncated power distribution and fracture strike has circular normal distribution. The models may have single or multiple fracture sets and various truncation modes with different probabilities.

The analysis shows that number of fracture intersections per fracture can be accurately estimated by a fracture connectivity index, which is defined as product of facture scan-line density, average fracture length and sine of strike standard deviation. The statistically significant finding of this study is that the number of fractures within a FFN is an exponential function of fracture connectivity index. All three fracture properties defining the index can be measured on borehole image logs. Hence it should be possible to estimate fracture connectivity and corresponding FFN size from borehole image data. The analysis pertains to 2D fracture connectivity which is always the lower bound of number of fracture intersections in 3-dimensions. Therefore the exponential relationships must also hold for actual 3-dimensional layer-bound fractures with variable dips.

油藏中的导电性裂缝可以是完全隔离的，也可以是完全连通的。在这两个极端之间有一个中间状态。部分裂缝性油藏包括有限裂缝网络(FFN)，它们是嵌在孤立裂缝海洋中的相互连接的裂缝束。设计出ffn尺寸的测量方法对于估算关键的工程方面至关重要，例如产能指数、产量递减率和油井的预期最终采收率，特别是在基质孔隙度和渗透率较低的油藏中。在这里，我们给出了与裂缝连通性相关的FFN大小的统计评估结果，裂缝连通性本质上是每个裂缝的裂缝相交数量。该分析是基于大量的随机二维泊松模型的亚垂直层界裂缝。模型中裂缝长度呈对数正态分布或截断功率分布，裂缝走向呈圆正态分布。模型可以有单个或多个断裂集，也可以有不同概率的截断模式。分析表明，利用裂缝连通性指数(裂缝扫描线密度、平均裂缝长度和走向标准差正弦的乘积)可以准确估计每条裂缝的相交数。本研究有统计学意义的发现是，一个FFN内的裂缝数量是裂缝连通性指数的指数函数。定义该指数的所有三种裂缝属性都可以在井眼图像测井上测量。因此，应该可以从井眼图像数据中估计裂缝连通性和相应的FFN大小。该分析属于二维裂缝连通性，而二维裂缝连通性通常是三维裂缝相交数的下界。因此，指数关系也必须适用于具有可变倾角的实际三维层状裂缝。

{"title":"Estimating size of finite fracture networks in layered reservoirs","authors":"Sait I. Ozkaya , M.M. Al-Fahmi","doi":"10.1016/j.acags.2022.100089","DOIUrl":"10.1016/j.acags.2022.100089","url":null,"abstract":"<div><p>Conductive fractures in petroleum reservoirs may be totally isolated or fully interconnected. There is an intermediate state between the two extremities. Partially fractured reservoirs include finite fracture networks (FFN), which are bundles of interconnected fractures embedded in a sea of isolated fractures. Devising measures for sizes of FFNs is crucial in estimating critical engineering aspects such as productivity index, production decline rate and expected ultimate recovery of wells especially in reservoirs with low matrix porosity and permeability. Here, we present results of statistical evaluation of FFN size in relation to fracture connectivity which is in essence the number of fracture intersections per fracture. The analysis is based on a large number of stochastic 2-dimensional (2D) Poisson models of sub-vertical layer bound fractures. Fracture length in the models has log normal or truncated power distribution and fracture strike has circular normal distribution. The models may have single or multiple fracture sets and various truncation modes with different probabilities.</p><p>The analysis shows that number of fracture intersections per fracture can be accurately estimated by a fracture connectivity index, which is defined as product of facture scan-line density, average fracture length and sine of strike standard deviation. The statistically significant finding of this study is that the number of fractures within a FFN is an exponential function of fracture connectivity index. All three fracture properties defining the index can be measured on borehole image logs. Hence it should be possible to estimate fracture connectivity and corresponding FFN size from borehole image data. The analysis pertains to 2D fracture connectivity which is always the lower bound of number of fracture intersections in 3-dimensions. Therefore the exponential relationships must also hold for actual 3-dimensional layer-bound fractures with variable dips.</p></div>","PeriodicalId":33804,"journal":{"name":"Applied Computing and Geosciences","volume":"15 ","pages":"Article 100089"},"PeriodicalIF":3.4,"publicationDate":"2022-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2590197422000118/pdfft?md5=c965145e7845eaea78b57cf39d69d86d&pid=1-s2.0-S2590197422000118-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46065297","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Multivariate imputation via chained equations for elastic well log imputation and prediction 基于链式方程的弹性测井资料多元输入与预测

IF 3.4 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Applied Computing and Geosciences

Pub Date : 2022-06-01 DOI: 10.1016/j.acags.2022.100083

Antony Hallam , Debajoy Mukherjee , Romain Chassagne

Well logging is essential in studies which require an understanding of the subsurface geology and depositional conditions. Unfortunately, the measurements are rarely complete and missing data intervals are common due to operational issues or malfunction of the logging equipment. Therefore the imputation of missing data from down-hole well logs is a common problem in subsurface workflows. Recently, many different approaches have been used for imputation but they are often manual or data set specific. Machine learning has reignited interest in this field with promises of a more generic and simpler approach. We explore whether the chaining of machine learning for mutli-log imputation improves results by overcoming disparities in the patterns of missing data. Our research interest is primarily petroleum geophysics and therefore this study focuses on the elastic logs of compressional (DT) and shear (DTS) sonic along with the bulk density (RHOB). However, the method may be applied to all sufficiently large well log data sets in any industry.

在需要了解地下地质和沉积条件的研究中，测井是必不可少的。不幸的是，测量很少完成，由于操作问题或测井设备故障，丢失数据间隔很常见。因此，井下测井数据缺失的补全是井下工作流程中的一个常见问题。最近，有许多不同的方法被用来进行估算，但它们通常是手动的或特定于数据集的。机器学习重新点燃了人们对这一领域的兴趣，因为它有望提供一种更通用、更简单的方法。我们探讨了多对数输入的机器学习链是否通过克服缺失数据模式的差异来改善结果。我们的研究兴趣主要是石油地球物理，因此本研究的重点是压缩(DT)和剪切(DTS)声波以及体积密度(RHOB)的弹性测井。然而，该方法可以应用于任何行业的所有足够大的测井数据集。

{"title":"Multivariate imputation via chained equations for elastic well log imputation and prediction","authors":"Antony Hallam , Debajoy Mukherjee , Romain Chassagne","doi":"10.1016/j.acags.2022.100083","DOIUrl":"10.1016/j.acags.2022.100083","url":null,"abstract":"<div><p>Well logging is essential in studies which require an understanding of the subsurface geology and depositional conditions. Unfortunately, the measurements are rarely complete and missing data intervals are common due to operational issues or malfunction of the logging equipment. Therefore the imputation of missing data from down-hole well logs is a common problem in subsurface workflows. Recently, many different approaches have been used for imputation but they are often manual or data set specific. Machine learning has reignited interest in this field with promises of a more generic and simpler approach. We explore whether the chaining of machine learning for mutli-log imputation improves results by overcoming disparities in the patterns of missing data. Our research interest is primarily petroleum geophysics and therefore this study focuses on the elastic logs of compressional (DT) and shear (DTS) sonic along with the bulk density (RHOB). However, the method may be applied to all sufficiently large well log data sets in any industry.</p></div>","PeriodicalId":33804,"journal":{"name":"Applied Computing and Geosciences","volume":"14 ","pages":"Article 100083"},"PeriodicalIF":3.4,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2590197422000052/pdfft?md5=75bc3e9371817b929126e095a3078aca&pid=1-s2.0-S2590197422000052-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42143016","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Evolution of the stress and strain field in the tyra field during the Post-Chalk Deposition and seismic inversion of fault zone using informed-proposal Monte Carlo 白垩后沉积期tyra场应力场演化及断裂带信息建议蒙特卡罗地震反演

IF 3.4 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Applied Computing and Geosciences

Pub Date : 2022-06-01 DOI: 10.1016/j.acags.2022.100085

Sarouyeh Khoshkholgh, Ivanka Orozova-Bekkevold, Klaus Mosegaard

When hydrocarbon reservoirs are used as a CO2 storage facility, an accurate uncertainty analysis and risk assessment is essential. An integration of information from geological knowledge, geological modelling, well log data, and geophysical data provides the basis for this analysis. Modelling the time development of stress/strain changes in the overburden provides prior knowledge about fault and fracture probability in the reservoir, which in turn is used in seismic inversion to constrain models of faulting and fracturing. One main problem in solving large scale seismic inverse problems is high computational cost and inefficiency. We use a newly introduced methodology - Informed-proposal Monte Carlo (IPMC) - to deal with this problem, and to carry out a conceptual study based on real data from the Danish North Sea. The result outlines a methodology for evaluating the risk of having sub-seismic faulting in the overburden that potentially compromises the CO2 storage of the reservoir.

当油气储层被用作二氧化碳储存设施时，准确的不确定性分析和风险评估至关重要。地质知识、地质建模、测井数据和地球物理数据的综合信息为这一分析提供了基础。对覆盖层应力/应变变化的时间发展进行建模，可以提供有关储层断层和裂缝概率的先验知识，进而用于地震反演，以约束断层和压裂模型。求解大规模地震反演问题的一个主要问题是计算成本高、效率低。我们使用了一种新引入的方法——知情建议蒙特卡罗(IPMC)来处理这个问题，并根据丹麦北海的实际数据进行了概念性研究。该结果概述了一种评估上覆层中存在亚地震断层的风险的方法，这种断层可能会损害储层的二氧化碳储存。

{"title":"Evolution of the stress and strain field in the tyra field during the Post-Chalk Deposition and seismic inversion of fault zone using informed-proposal Monte Carlo","authors":"Sarouyeh Khoshkholgh, Ivanka Orozova-Bekkevold, Klaus Mosegaard","doi":"10.1016/j.acags.2022.100085","DOIUrl":"10.1016/j.acags.2022.100085","url":null,"abstract":"<div><p>When hydrocarbon reservoirs are used as a CO2 storage facility, an accurate uncertainty analysis and risk assessment is essential. An integration of information from geological knowledge, geological modelling, well log data, and geophysical data provides the basis for this analysis. Modelling the time development of stress/strain changes in the overburden provides prior knowledge about fault and fracture probability in the reservoir, which in turn is used in seismic inversion to constrain models of faulting and fracturing. One main problem in solving large scale seismic inverse problems is high computational cost and inefficiency. We use a newly introduced methodology - Informed-proposal Monte Carlo (IPMC) - to deal with this problem, and to carry out a conceptual study based on real data from the Danish North Sea. The result outlines a methodology for evaluating the risk of having sub-seismic faulting in the overburden that potentially compromises the CO2 storage of the reservoir.</p></div>","PeriodicalId":33804,"journal":{"name":"Applied Computing and Geosciences","volume":"14 ","pages":"Article 100085"},"PeriodicalIF":3.4,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2590197422000076/pdfft?md5=f2aeb16d5a833110a0c3e557c58c7ab4&pid=1-s2.0-S2590197422000076-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43700141","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Estimating the seven transformational parameters between two geodetic datums using the steepest descent algorithm of machine learning 利用机器学习的最陡下降算法估计两个大地测量基准之间的七个变换参数

IF 3.4 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Applied Computing and Geosciences

Pub Date : 2022-06-01 DOI: 10.1016/j.acags.2022.100086

Ikechukwu Kalu , Christopher E. Ndehedehe , Onuwa Okwuashi , Aniekan E. Eyoh

This study evaluates the steepest descent algorithm as a tool for root mean square (RMS) error optimization in geodetic reference systems to improve the integrity of transformation. With an initial RMS error estimate of 0.01830m, the negative gradient direction was applied through the steepest optimization leading to a final RMS error estimate of 0.00051m. Using the exact line search mode with a one-point step size of 0.1, we achieved the minimum values in less than sixty iterations, regardless of the slow convergence rate of the steepest descent algorithm.

本研究评估了最陡下降算法作为大地测量参考系统中均方根误差优化的工具，以提高变换的完整性。初始RMS误差估计为0.01830m，通过最陡优化应用负梯度方向，最终RMS误差估计为0.00051m。使用精确的直线搜索模式，单点步长为0.1，我们在不到60次迭代中获得了最小值，而不考虑最速下降算法的缓慢收敛速度。

引用次数: 0

Geoscience language models and their intrinsic evaluation 地球科学语言模型及其内在评价

IF 3.4 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Applied Computing and Geosciences

Pub Date : 2022-06-01 DOI: 10.1016/j.acags.2022.100084

Christopher J.M. Lawley , Stefania Raimondo , Tianyi Chen , Lindsay Brin , Anton Zakharov , Daniel Kur , Jenny Hui , Glen Newton , Sari L. Burgoyne , Geneviève Marquis

Geoscientists use observations and descriptions of the rock record to study the origins and history of our planet, which has resulted in a vast volume of scientific literature. Recent progress in natural language processing (NLP) has the potential to parse through and extract knowledge from unstructured text, but there has, so far, been only limited work on the concepts and vocabularies that are specific to geoscience. Herein we harvest and process public geoscientific reports (i.e., Canadian federal and provincial geological survey publications databases) and a subset of open access and peer-reviewed publications to train new, geoscience-specific language models to address that knowledge gap. Language model performance is validated using a series of new geoscience-specific NLP tasks (i.e., analogies, clustering, relatedness, and nearest neighbour analysis) that were developed as part of the current study. The raw and processed national geological survey corpora, language models, and evaluation criteria are all made public for the first time. We demonstrate that non-contextual (i.e., Global Vectors for Word Representation, GloVe) and contextual (i.e., Bidirectional Encoder Representations from Transformers, BERT) language models updated using the geoscientific corpora outperform the generic versions of these models for each of the evaluation criteria. Principal component analysis further demonstrates that word embeddings trained on geoscientific text capture meaningful semantic relationships, including rock classifications, mineral properties and compositions, and the geochemical behaviour of elements. Semantic relationships that emerge from the vector space have the potential to unlock latent knowledge within unstructured text, and perhaps more importantly, also highlight the potential for other downstream geoscience-focused NLP tasks (e.g., keyword prediction, document similarity, recommender systems, rock and mineral classification).

地球科学家通过对岩石记录的观察和描述来研究地球的起源和历史，这导致了大量的科学文献。自然语言处理(NLP)的最新进展有可能从非结构化文本中解析和提取知识，但到目前为止，在地球科学特定的概念和词汇方面的工作有限。在这里，我们收集和处理公共地球科学报告(即加拿大联邦和省级地质调查出版物数据库)以及开放获取和同行评审出版物的子集，以培训新的，地球科学特定的语言模型，以解决知识差距。使用一系列新的地球科学特定的NLP任务(即类比，聚类，相关性和最近邻分析)来验证语言模型的性能，这些任务是作为当前研究的一部分开发的。国家地质调查原始语料库、语言模型、评价标准首次公开。我们证明了使用地球科学语料库更新的非上下文(即用于单词表示的全局向量，GloVe)和上下文(即来自Transformers的双向编码器表示，BERT)语言模型在每个评估标准上都优于这些模型的通用版本。主成分分析进一步表明，在地球科学文本上训练的词嵌入捕获有意义的语义关系，包括岩石分类、矿物性质和成分，以及元素的地球化学行为。从向量空间中出现的语义关系有可能解锁非结构化文本中的潜在知识，也许更重要的是，也突出了其他下游以地球科学为重点的NLP任务的潜力(例如，关键字预测、文档相似性、推荐系统、岩石和矿物分类)。

{"title":"Geoscience language models and their intrinsic evaluation","authors":"Christopher J.M. Lawley , Stefania Raimondo , Tianyi Chen , Lindsay Brin , Anton Zakharov , Daniel Kur , Jenny Hui , Glen Newton , Sari L. Burgoyne , Geneviève Marquis","doi":"10.1016/j.acags.2022.100084","DOIUrl":"10.1016/j.acags.2022.100084","url":null,"abstract":"<div><p>Geoscientists use observations and descriptions of the rock record to study the origins and history of our planet, which has resulted in a vast volume of scientific literature. Recent progress in natural language processing (NLP) has the potential to parse through and extract knowledge from unstructured text, but there has, so far, been only limited work on the concepts and vocabularies that are specific to geoscience. Herein we harvest and process public geoscientific reports (i.e., Canadian federal and provincial geological survey publications databases) and a subset of open access and peer-reviewed publications to train new, geoscience-specific language models to address that knowledge gap. Language model performance is validated using a series of new geoscience-specific NLP tasks (i.e., analogies, clustering, relatedness, and nearest neighbour analysis) that were developed as part of the current study. The raw and processed national geological survey corpora, language models, and evaluation criteria are all made public for the first time. We demonstrate that non-contextual (i.e., Global Vectors for Word Representation, GloVe) and contextual (i.e., Bidirectional Encoder Representations from Transformers, BERT) language models updated using the geoscientific corpora outperform the generic versions of these models for each of the evaluation criteria. Principal component analysis further demonstrates that word embeddings trained on geoscientific text capture meaningful semantic relationships, including rock classifications, mineral properties and compositions, and the geochemical behaviour of elements. Semantic relationships that emerge from the vector space have the potential to unlock latent knowledge within unstructured text, and perhaps more importantly, also highlight the potential for other downstream geoscience-focused NLP tasks (e.g., keyword prediction, document similarity, recommender systems, rock and mineral classification).</p></div>","PeriodicalId":33804,"journal":{"name":"Applied Computing and Geosciences","volume":"14 ","pages":"Article 100084"},"PeriodicalIF":3.4,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2590197422000064/pdfft?md5=fd1994f61f32c322ea2f0108221236be&pid=1-s2.0-S2590197422000064-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42112365","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 5