利用“TM”尼日尔三角洲井数据预测含水饱和度的机器学习方法

IF 3.3 Q2 MULTIDISCIPLINARY SCIENCES Scientific African Pub Date : 2025-03-01 Epub Date: 2025-02-25 DOI:10.1016/j.sciaf.2025.e02596
Oluwakemi Y. Adeogun, Mukthar O. Abdulwaheed, Lukumon Adeoti, Olawale J. Allo, Olawunmi O. Fasakin, Oluwafemi O. Okunowo
{"title":"利用“TM”尼日尔三角洲井数据预测含水饱和度的机器学习方法","authors":"Oluwakemi Y. Adeogun,&nbsp;Mukthar O. Abdulwaheed,&nbsp;Lukumon Adeoti,&nbsp;Olawale J. Allo,&nbsp;Olawunmi O. Fasakin,&nbsp;Oluwafemi O. Okunowo","doi":"10.1016/j.sciaf.2025.e02596","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate estimation of water saturation (Sw) is critical for hydrocarbon exploration reservoir management, as it reveals the proportion of pore spaces filled with hydrocarbons and water. Determining Sw could be challenging in the absence of core data or resistivity logs. This informed the use of machine learning (ML) techniques to predict Sw in the \"TM\" Field, Niger Delta, where missing resistivity log data poses a challenge. Five ML models (XGBoost, AdaBoost, CatBoost, LightGBM, and Gradient Boost) were deployed using well log data (caliper, gamma-ray, neutron, porosity, density, and shale volume) to estimate Sw at “TM” Field, Niger-Delta. The dataset includes 61,253 observations, which were split into training (70%) and testing (30%) sets. After preprocessing and correcting inconsistencies in the data, the five ML models were trained and hyperparameters tuned to optimize performance. The models were evaluated using standard statistical metrics: Root Mean Square Error (RMSE), Mean Squared Error (MSE), Mean Absolute Error (MAE), and R-squared (R²). To validate the performance of these models, predicted Sw values were compared with those estimated from resistivity logs. Likewise the predicted Sw from the five ML models were plotted against the Sw estimated from the resistivity data not used in the prediction process to validate and determine the quality of the predicted Sw from the ML models. Among the five ML models tested, XGBoost exhibited the best performance, with the highest R² value of 0.9992 and the lowest RMSE of 0.0071. Other models, such as CatBoost, LightGBM, and Gradient Boost, showed strong performance with correlation coefficients of 0.9785, 0.9732, and 0.9299, respectively, but were less accurate than XGBoost. AdaBoost, on the other hand, demonstrated the poorest performance with a correlation coefficient of 0.4381 and the highest RMSE of 0.2082. The cross plot of the predicted Sw from XGBoost's model and actual Sw from Archie's equation had the highest correlation coefficient of 0.9, providing quality prediction thereby aligning with the statistical metrics. Hence, this study has identified Xgboost to be a promising ML tool that could be used to efficiently predict Sw without the use of resistivity data at “TM” Field Niger Delta and this could be applied in other similar geological settings.</div></div>","PeriodicalId":21690,"journal":{"name":"Scientific African","volume":"27 ","pages":"Article e02596"},"PeriodicalIF":3.3000,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Machine learning approach in predicting water saturation using well data at “TM” Niger Delta\",\"authors\":\"Oluwakemi Y. Adeogun,&nbsp;Mukthar O. Abdulwaheed,&nbsp;Lukumon Adeoti,&nbsp;Olawale J. Allo,&nbsp;Olawunmi O. Fasakin,&nbsp;Oluwafemi O. Okunowo\",\"doi\":\"10.1016/j.sciaf.2025.e02596\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Accurate estimation of water saturation (Sw) is critical for hydrocarbon exploration reservoir management, as it reveals the proportion of pore spaces filled with hydrocarbons and water. Determining Sw could be challenging in the absence of core data or resistivity logs. This informed the use of machine learning (ML) techniques to predict Sw in the \\\"TM\\\" Field, Niger Delta, where missing resistivity log data poses a challenge. Five ML models (XGBoost, AdaBoost, CatBoost, LightGBM, and Gradient Boost) were deployed using well log data (caliper, gamma-ray, neutron, porosity, density, and shale volume) to estimate Sw at “TM” Field, Niger-Delta. The dataset includes 61,253 observations, which were split into training (70%) and testing (30%) sets. After preprocessing and correcting inconsistencies in the data, the five ML models were trained and hyperparameters tuned to optimize performance. The models were evaluated using standard statistical metrics: Root Mean Square Error (RMSE), Mean Squared Error (MSE), Mean Absolute Error (MAE), and R-squared (R²). To validate the performance of these models, predicted Sw values were compared with those estimated from resistivity logs. Likewise the predicted Sw from the five ML models were plotted against the Sw estimated from the resistivity data not used in the prediction process to validate and determine the quality of the predicted Sw from the ML models. Among the five ML models tested, XGBoost exhibited the best performance, with the highest R² value of 0.9992 and the lowest RMSE of 0.0071. Other models, such as CatBoost, LightGBM, and Gradient Boost, showed strong performance with correlation coefficients of 0.9785, 0.9732, and 0.9299, respectively, but were less accurate than XGBoost. AdaBoost, on the other hand, demonstrated the poorest performance with a correlation coefficient of 0.4381 and the highest RMSE of 0.2082. The cross plot of the predicted Sw from XGBoost's model and actual Sw from Archie's equation had the highest correlation coefficient of 0.9, providing quality prediction thereby aligning with the statistical metrics. Hence, this study has identified Xgboost to be a promising ML tool that could be used to efficiently predict Sw without the use of resistivity data at “TM” Field Niger Delta and this could be applied in other similar geological settings.</div></div>\",\"PeriodicalId\":21690,\"journal\":{\"name\":\"Scientific African\",\"volume\":\"27 \",\"pages\":\"Article e02596\"},\"PeriodicalIF\":3.3000,\"publicationDate\":\"2025-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Scientific African\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2468227625000663\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/2/25 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q2\",\"JCRName\":\"MULTIDISCIPLINARY SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Scientific African","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2468227625000663","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/25 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

摘要

准确估计含水饱和度(Sw)对于油气勘探和油藏管理至关重要,因为它揭示了油气和水充满孔隙空间的比例。在没有岩心数据或电阻率测井的情况下,确定Sw可能具有挑战性。这使得机器学习(ML)技术应用于尼日尔三角洲TM油田的Sw预测,该油田缺乏电阻率测井数据是一个挑战。利用测井数据(井径仪、伽马射线、中子、孔隙度、密度和页岩体积),在尼日尔三角洲“TM”油田部署了5种ML模型(XGBoost、AdaBoost、CatBoost、LightGBM和Gradient Boost)来估算Sw。该数据集包括61253个观测值,分为训练集(70%)和测试集(30%)。在预处理和纠正数据中的不一致性之后,对五个ML模型进行训练并调整超参数以优化性能。采用标准统计指标评估模型:均方根误差(RMSE)、均方误差(MSE)、平均绝对误差(MAE)和R平方(R²)。为了验证这些模型的性能,将预测的Sw值与电阻率测井估计的Sw值进行了比较。同样,将5个ML模型预测的Sw与预测过程中未使用的电阻率数据估计的Sw进行对比,以验证和确定ML模型预测Sw的质量。在5个ML模型中,XGBoost表现最好,R²值最高为0.9992,RMSE最低为0.0071。CatBoost、LightGBM和Gradient Boost等模型的相关系数分别为0.9785、0.9732和0.9299,但精度低于XGBoost。AdaBoost表现最差,相关系数为0.4381,RMSE最高,为0.2082。XGBoost模型的预测Sw与Archie方程的实际Sw的交叉图的相关系数最高,为0.9,提供了高质量的预测,从而与统计指标保持一致。因此,这项研究已经确定Xgboost是一种很有前途的机器学习工具,可以在不使用尼日尔三角洲TM油田电阻率数据的情况下有效地预测Sw,这可以应用于其他类似的地质环境。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Machine learning approach in predicting water saturation using well data at “TM” Niger Delta
Accurate estimation of water saturation (Sw) is critical for hydrocarbon exploration reservoir management, as it reveals the proportion of pore spaces filled with hydrocarbons and water. Determining Sw could be challenging in the absence of core data or resistivity logs. This informed the use of machine learning (ML) techniques to predict Sw in the "TM" Field, Niger Delta, where missing resistivity log data poses a challenge. Five ML models (XGBoost, AdaBoost, CatBoost, LightGBM, and Gradient Boost) were deployed using well log data (caliper, gamma-ray, neutron, porosity, density, and shale volume) to estimate Sw at “TM” Field, Niger-Delta. The dataset includes 61,253 observations, which were split into training (70%) and testing (30%) sets. After preprocessing and correcting inconsistencies in the data, the five ML models were trained and hyperparameters tuned to optimize performance. The models were evaluated using standard statistical metrics: Root Mean Square Error (RMSE), Mean Squared Error (MSE), Mean Absolute Error (MAE), and R-squared (R²). To validate the performance of these models, predicted Sw values were compared with those estimated from resistivity logs. Likewise the predicted Sw from the five ML models were plotted against the Sw estimated from the resistivity data not used in the prediction process to validate and determine the quality of the predicted Sw from the ML models. Among the five ML models tested, XGBoost exhibited the best performance, with the highest R² value of 0.9992 and the lowest RMSE of 0.0071. Other models, such as CatBoost, LightGBM, and Gradient Boost, showed strong performance with correlation coefficients of 0.9785, 0.9732, and 0.9299, respectively, but were less accurate than XGBoost. AdaBoost, on the other hand, demonstrated the poorest performance with a correlation coefficient of 0.4381 and the highest RMSE of 0.2082. The cross plot of the predicted Sw from XGBoost's model and actual Sw from Archie's equation had the highest correlation coefficient of 0.9, providing quality prediction thereby aligning with the statistical metrics. Hence, this study has identified Xgboost to be a promising ML tool that could be used to efficiently predict Sw without the use of resistivity data at “TM” Field Niger Delta and this could be applied in other similar geological settings.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Scientific African
Scientific African Multidisciplinary-Multidisciplinary
CiteScore
5.60
自引率
3.40%
发文量
332
审稿时长
10 weeks
期刊最新文献
A nonparametric framework for linear–circular regression: Applications in environmental and biological sciences A unified H∞ dynamic observer for vehicle state and road profile estimation in half-vehicle models Powering growth or stunting futures? Energy poverty and long-term child nutrition in Malawi A new family of generalized distributions based on Modi-G transformation, with applications in Epidemiology, Engineering, and Environmental Science Efficiency assessment of two pyrolyzer models for converting waste plastics into fuels
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1