Sand Production Prediction with Machine Learning using Input Variables from Geological and Operational Conditions in the Karazhanbas Oilfield, Kazakhstan
{"title":"Sand Production Prediction with Machine Learning using Input Variables from Geological and Operational Conditions in the Karazhanbas Oilfield, Kazakhstan","authors":"Ainash Shabdirova, Ashirgul Kozhagulova, Yernazar Samenov, Nguyen Minh, Yong Zhao","doi":"10.1007/s11053-024-10389-3","DOIUrl":null,"url":null,"abstract":"<p>This paper describes a comprehensive approach to predict sand production in the Karazhanbas oilfield using machine learning (ML) techniques. By analyzing data from 2000 wells, the research uncovered the complex dynamics of sand production and emphasized the critical need for accurately predicting the peak sand mass and its occurrence time. ML techniques can have a significant impact on prediction of sand production and on the optimization of oilfield operation, which can be improved with the combined use of enriched training data and domain-specific knowledge. The research underscored the influence of geological factors, especially fault proximity, on prediction accuracy. Domain and field knowledge is needed to formulate different production scenarios for prediction purposes such that the relevant data can be selected for the training of ML models. Moreover, new metrics are needed to evaluate model performance as the applied method is tailored for different operational strategies. As the peak sand mass is considered a pivotal event in field operation, new metrics in terms of peak prediction accuracy and peak time prediction accuracy were introduced to evaluate the performance of ML models. A suite of ML algorithms was employed in the study, which demonstrated notable accuracy in the classification of sand-producing wells.</p>","PeriodicalId":54284,"journal":{"name":"Natural Resources Research","volume":"72 1","pages":""},"PeriodicalIF":4.8000,"publicationDate":"2024-08-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Natural Resources Research","FirstCategoryId":"89","ListUrlMain":"https://doi.org/10.1007/s11053-024-10389-3","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GEOSCIENCES, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
This paper describes a comprehensive approach to predict sand production in the Karazhanbas oilfield using machine learning (ML) techniques. By analyzing data from 2000 wells, the research uncovered the complex dynamics of sand production and emphasized the critical need for accurately predicting the peak sand mass and its occurrence time. ML techniques can have a significant impact on prediction of sand production and on the optimization of oilfield operation, which can be improved with the combined use of enriched training data and domain-specific knowledge. The research underscored the influence of geological factors, especially fault proximity, on prediction accuracy. Domain and field knowledge is needed to formulate different production scenarios for prediction purposes such that the relevant data can be selected for the training of ML models. Moreover, new metrics are needed to evaluate model performance as the applied method is tailored for different operational strategies. As the peak sand mass is considered a pivotal event in field operation, new metrics in terms of peak prediction accuracy and peak time prediction accuracy were introduced to evaluate the performance of ML models. A suite of ML algorithms was employed in the study, which demonstrated notable accuracy in the classification of sand-producing wells.
本文介绍了一种利用机器学习(ML)技术预测卡拉赞巴斯油田产砂量的综合方法。通过分析 2000 口油井的数据,研究揭示了产砂的复杂动态,并强调了准确预测峰值砂量及其出现时间的迫切需要。结合使用丰富的训练数据和特定领域的知识,ML 技术可以对预测产砂量和优化油田运营产生重大影响。研究强调了地质因素(尤其是断层邻近性)对预测精度的影响。需要领域和现场知识来制定不同的预测生产方案,以便选择相关数据来训练 ML 模型。此外,还需要新的指标来评估模型的性能,因为所应用的方法是为不同的操作策略量身定制的。由于峰值砂量被认为是油田作业中的关键事件,因此引入了峰值预测精度和峰值时间预测精度等新指标来评估 ML 模型的性能。研究中采用了一套 ML 算法,这些算法在产砂井分类中表现出了显著的准确性。
期刊介绍:
This journal publishes quantitative studies of natural (mainly but not limited to mineral) resources exploration, evaluation and exploitation, including environmental and risk-related aspects. Typical articles use geoscientific data or analyses to assess, test, or compare resource-related aspects. NRR covers a wide variety of resources including minerals, coal, hydrocarbon, geothermal, water, and vegetation. Case studies are welcome.