Enhancing lithofacies machine learning predictions with gamma-ray attributes for boreholes with limited diversity of recorded well logs

Artificial Intelligence in Geosciences Pub Date : 2021-12-01 DOI:10.1016/j.aiig.2022.02.007

David A. Wood

{"title":"Enhancing lithofacies machine learning predictions with gamma-ray attributes for boreholes with limited diversity of recorded well logs","authors":"David A. Wood","doi":"10.1016/j.aiig.2022.02.007","DOIUrl":null,"url":null,"abstract":"<div><p>Derivative and volatility attributes can be usefully calculated from recorded gamma ray (GR) data to enhance lithofacies classification in wellbores penetrating multiple lithologies. Such attributes extract information about the log curve shape that cannot be readily discerned from the recorded well log data. A logged wellbore section for which 8911 data records are available for the three recorded logs (GR, sonic (DT) and bulk density (PB)) is evaluated. That section demonstrates the value of the GR attributes for machine learning (ML) lithofacies predictions. Five feature selection configurations are considered. The 9-var configuration including GR, DT, PB and six GR attributes, and the 7-var configuration of GR and the six GR attributes, provide the most accurate and reproducible lithofacies predictions. The other three feature configurations evaluated do not include the GR attributes but just one to three of the recorded log features. The results of seven ML models and two regression models reveal that K-nearest neighbor (KNN), random forest (RF) and extreme gradient boosting (XGB) are the best performing models. They generate between 14 and 23 misclassification from 8911 data records for the 9-var model. Multi-layer perceptron (MLP) and support vector classification (SVC) do not perform well with the 7-var model which lacks the PB feature displaying the highest correlation with facies class. Annotated confusion matrices reveal that KNN, RF and XGB models can effectively distinguish all facies classes for the 9-var and 7-var configurations (that includes the GR attributes), whereas none of the models can achieve that outcome for the 3-var configuration (that excludes the GR attributes). Accurately distinguishing lithofacies using well-log data in sedimentary sections is an important objective in applied geoscience. The straightforward, GR-attribute method proposed works to improve confidence in ML-lithofacies classifications based on limited recorded well-log data.</p></div>","PeriodicalId":100124,"journal":{"name":"Artificial Intelligence in Geosciences","volume":"2 ","pages":"Pages 148-164"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666544122000077/pdfft?md5=2bf3b12ae35a11a62a8a749a700d3504&pid=1-s2.0-S2666544122000077-main.pdf","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial Intelligence in Geosciences","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666544122000077","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 8

Abstract

Derivative and volatility attributes can be usefully calculated from recorded gamma ray (GR) data to enhance lithofacies classification in wellbores penetrating multiple lithologies. Such attributes extract information about the log curve shape that cannot be readily discerned from the recorded well log data. A logged wellbore section for which 8911 data records are available for the three recorded logs (GR, sonic (DT) and bulk density (PB)) is evaluated. That section demonstrates the value of the GR attributes for machine learning (ML) lithofacies predictions. Five feature selection configurations are considered. The 9-var configuration including GR, DT, PB and six GR attributes, and the 7-var configuration of GR and the six GR attributes, provide the most accurate and reproducible lithofacies predictions. The other three feature configurations evaluated do not include the GR attributes but just one to three of the recorded log features. The results of seven ML models and two regression models reveal that K-nearest neighbor (KNN), random forest (RF) and extreme gradient boosting (XGB) are the best performing models. They generate between 14 and 23 misclassification from 8911 data records for the 9-var model. Multi-layer perceptron (MLP) and support vector classification (SVC) do not perform well with the 7-var model which lacks the PB feature displaying the highest correlation with facies class. Annotated confusion matrices reveal that KNN, RF and XGB models can effectively distinguish all facies classes for the 9-var and 7-var configurations (that includes the GR attributes), whereas none of the models can achieve that outcome for the 3-var configuration (that excludes the GR attributes). Accurately distinguishing lithofacies using well-log data in sedimentary sections is an important objective in applied geoscience. The straightforward, GR-attribute method proposed works to improve confidence in ML-lithofacies classifications based on limited recorded well-log data.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

利用伽马射线属性对测井记录有限的井眼进行岩相机器学习预测

从记录的伽马射线(GR)数据中可以有效地计算导数和挥发性属性，以增强穿透多种岩性的井的岩相分类。这些属性提取了测井曲线形状的信息，这些信息无法从记录的测井数据中轻易识别出来。对一段测井井段进行了评估，其中有8911条数据记录可用于三种测井(GR、声波(DT)和体积密度(PB))。该部分展示了GR属性在机器学习(ML)岩相预测中的价值。考虑了五种特征选择配置。包括GR、DT、PB和6种GR属性在内的9变量配置，以及GR和6种GR属性在内的7变量配置，提供了最准确、可重复性最好的岩相预测。评估的其他三个特性配置不包括GR属性，而只包括记录的日志特性中的一到三个。7个ML模型和2个回归模型的结果表明，k最近邻模型(KNN)、随机森林模型(RF)和极端梯度增强模型(XGB)是表现最好的模型。他们为9变量模型从8911个数据记录中产生了14到23个错误分类。多层感知器(MLP)和支持向量分类(SVC)在缺少与相类相关性最高的PB特征的7变量模型上表现不佳。带注释的混淆矩阵显示，KNN、RF和XGB模型可以有效区分9-var和7-var配置(包括GR属性)的所有相类，而对于3-var配置(不包括GR属性)，没有一个模型可以实现这一结果。利用沉积剖面测井资料准确区分岩相是应用地球科学的重要目标。所提出的简单的gr属性方法可以提高基于有限记录测井数据的ml岩相分类的可信度。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助