Rahul Prajapati, Bappa Mukherjee, Upendra K Singh, Kalachand Sain
{"title":"使用地球物理测井的机器学习辅助岩性预测:柬埔寨盆地案例研究","authors":"Rahul Prajapati, Bappa Mukherjee, Upendra K Singh, Kalachand Sain","doi":"10.1007/s12040-024-02326-y","DOIUrl":null,"url":null,"abstract":"<h3 data-test=\"abstract-sub-heading\">Abstract</h3><p>Identification and characterisation of reservoir facies is a prime factor in delimiting the hydrocarbon potential zones of a reservoir for hydrocarbon exploration. The geophysical logs, which are physical parameters of reservoir facies measured in the vicinity of boreholes, play a crucial role in the interpretation of reservoir facies. The present study deals with the identification of the lithology of the Limbodara oil field in the Cambay basin using machine learning (ML) techniques on geophysical logs. The supervised techniques of machine learning, such as support vector machines (SVM), artificial neural networks (ANN), and k-nearest neighbours (kNN), are used as nonlinear classifiers for the identification of lithology from nonlinear geophysical logs. The hyperparameters of the ML model are optimised using the grid search cross-validation (CV) method to increase the performance of the model, as evaluated by confusion matrix, area under receiver operating characteristics curve (AUC), precision, recall, and F1 score. The ML model used five geophysical parameters of two wells with four known distinguished lithologies (Class-A, Class-B, Class-C, and Class-D) for optimisation and training of the model. The optimised and trained model for each lithology for kNN, SVM, and ANN shows an overall correct prediction of true values with 85.4, 87.0, and 88.9%, respectively, from the confusion matrix. Apart from this, the receiver operative characteristics (ROC) also show that the overall area under the curve for each lithology is greater than 90%, and other evaluation parameters such as precision, recall, and F1 score show accuracy greater than 84%, except for the cases of Class C and Class D from SVM and ANN. Thus, the accuracy of each model from evaluation parameters suggests that the combined analysis of different ML models offers to select the optimised ML model for better results and validation to achieve and model the lithology with better precision.</p><h3 data-test=\"abstract-sub-heading\">Highlights</h3><ul>\n<li>\n<p>A way out for obtaining litholog supplements at uncored section in boreholes</p>\n</li>\n<li>\n<p>Established ML assisted mapping function between wireline logs and lithologs</p>\n</li>\n<li>\n<p>Predicted litholog sequence with secure level of accuracy (>80%)</p>\n</li>\n</ul>","PeriodicalId":15609,"journal":{"name":"Journal of Earth System Science","volume":null,"pages":null},"PeriodicalIF":1.3000,"publicationDate":"2024-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Machine learning assisted lithology prediction using geophysical logs: A case study from Cambay basin\",\"authors\":\"Rahul Prajapati, Bappa Mukherjee, Upendra K Singh, Kalachand Sain\",\"doi\":\"10.1007/s12040-024-02326-y\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<h3 data-test=\\\"abstract-sub-heading\\\">Abstract</h3><p>Identification and characterisation of reservoir facies is a prime factor in delimiting the hydrocarbon potential zones of a reservoir for hydrocarbon exploration. The geophysical logs, which are physical parameters of reservoir facies measured in the vicinity of boreholes, play a crucial role in the interpretation of reservoir facies. The present study deals with the identification of the lithology of the Limbodara oil field in the Cambay basin using machine learning (ML) techniques on geophysical logs. The supervised techniques of machine learning, such as support vector machines (SVM), artificial neural networks (ANN), and k-nearest neighbours (kNN), are used as nonlinear classifiers for the identification of lithology from nonlinear geophysical logs. The hyperparameters of the ML model are optimised using the grid search cross-validation (CV) method to increase the performance of the model, as evaluated by confusion matrix, area under receiver operating characteristics curve (AUC), precision, recall, and F1 score. The ML model used five geophysical parameters of two wells with four known distinguished lithologies (Class-A, Class-B, Class-C, and Class-D) for optimisation and training of the model. The optimised and trained model for each lithology for kNN, SVM, and ANN shows an overall correct prediction of true values with 85.4, 87.0, and 88.9%, respectively, from the confusion matrix. Apart from this, the receiver operative characteristics (ROC) also show that the overall area under the curve for each lithology is greater than 90%, and other evaluation parameters such as precision, recall, and F1 score show accuracy greater than 84%, except for the cases of Class C and Class D from SVM and ANN. Thus, the accuracy of each model from evaluation parameters suggests that the combined analysis of different ML models offers to select the optimised ML model for better results and validation to achieve and model the lithology with better precision.</p><h3 data-test=\\\"abstract-sub-heading\\\">Highlights</h3><ul>\\n<li>\\n<p>A way out for obtaining litholog supplements at uncored section in boreholes</p>\\n</li>\\n<li>\\n<p>Established ML assisted mapping function between wireline logs and lithologs</p>\\n</li>\\n<li>\\n<p>Predicted litholog sequence with secure level of accuracy (>80%)</p>\\n</li>\\n</ul>\",\"PeriodicalId\":15609,\"journal\":{\"name\":\"Journal of Earth System Science\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":1.3000,\"publicationDate\":\"2024-05-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Earth System Science\",\"FirstCategoryId\":\"89\",\"ListUrlMain\":\"https://doi.org/10.1007/s12040-024-02326-y\",\"RegionNum\":4,\"RegionCategory\":\"地球科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"GEOSCIENCES, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Earth System Science","FirstCategoryId":"89","ListUrlMain":"https://doi.org/10.1007/s12040-024-02326-y","RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"GEOSCIENCES, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
摘要
摘要储油层面的识别和特征描述是油气勘探中划分储油层油气潜力区的首要因素。地球物理测井记录是在钻孔附近测量的储层岩相物理参数,在解释储层岩相方面起着至关重要的作用。本研究涉及利用地球物理测井的机器学习(ML)技术识别柬埔寨盆地林博达拉油田的岩性。机器学习的监督技术,如支持向量机(SVM)、人工神经网络(ANN)和 k-近邻(kNN),被用作非线性分类器,用于从非线性地球物理测井记录中识别岩性。使用网格搜索交叉验证(CV)方法对 ML 模型的超参数进行了优化,以提高模型的性能,评估指标包括混淆矩阵、接收者工作特性曲线下面积(AUC)、精确度、召回率和 F1 分数。ML 模型使用两口井的五个地球物理参数和四种已知的不同岩性(A 类、B 类、C 类和 D 类)来优化和训练模型。从混淆矩阵来看,kNN、SVM 和 ANN 针对每种岩性优化和训练的模型对真实值的预测正确率分别为 85.4%、87.0% 和 88.9%。此外,接受者操作特征(ROC)也显示,除 SVM 和 ANN 的 C 类和 D 类外,各岩性的整体曲线下面积均大于 90%,精度、召回率和 F1 分数等其他评价参数的准确度均大于 84%。因此,从评价参数来看,每个模型的精确度都表明,综合分析不同的 ML 模型,可以选择优化的 ML 模型,以获得更好的结果和验证,从而以更高的精确度实现岩性建模。 亮点获取钻孔无刻蚀段岩性补充的出路建立了线性测井和岩性之间的 ML 辅助绘图功能预测岩性序列的精确度达到了安全水平(80%)。
Machine learning assisted lithology prediction using geophysical logs: A case study from Cambay basin
Abstract
Identification and characterisation of reservoir facies is a prime factor in delimiting the hydrocarbon potential zones of a reservoir for hydrocarbon exploration. The geophysical logs, which are physical parameters of reservoir facies measured in the vicinity of boreholes, play a crucial role in the interpretation of reservoir facies. The present study deals with the identification of the lithology of the Limbodara oil field in the Cambay basin using machine learning (ML) techniques on geophysical logs. The supervised techniques of machine learning, such as support vector machines (SVM), artificial neural networks (ANN), and k-nearest neighbours (kNN), are used as nonlinear classifiers for the identification of lithology from nonlinear geophysical logs. The hyperparameters of the ML model are optimised using the grid search cross-validation (CV) method to increase the performance of the model, as evaluated by confusion matrix, area under receiver operating characteristics curve (AUC), precision, recall, and F1 score. The ML model used five geophysical parameters of two wells with four known distinguished lithologies (Class-A, Class-B, Class-C, and Class-D) for optimisation and training of the model. The optimised and trained model for each lithology for kNN, SVM, and ANN shows an overall correct prediction of true values with 85.4, 87.0, and 88.9%, respectively, from the confusion matrix. Apart from this, the receiver operative characteristics (ROC) also show that the overall area under the curve for each lithology is greater than 90%, and other evaluation parameters such as precision, recall, and F1 score show accuracy greater than 84%, except for the cases of Class C and Class D from SVM and ANN. Thus, the accuracy of each model from evaluation parameters suggests that the combined analysis of different ML models offers to select the optimised ML model for better results and validation to achieve and model the lithology with better precision.
Highlights
A way out for obtaining litholog supplements at uncored section in boreholes
Established ML assisted mapping function between wireline logs and lithologs
Predicted litholog sequence with secure level of accuracy (>80%)
期刊介绍:
The Journal of Earth System Science, an International Journal, was earlier a part of the Proceedings of the Indian Academy of Sciences – Section A begun in 1934, and later split in 1978 into theme journals. This journal was published as Proceedings – Earth and Planetary Sciences since 1978, and in 2005 was renamed ‘Journal of Earth System Science’.
The journal is highly inter-disciplinary and publishes scholarly research – new data, ideas, and conceptual advances – in Earth System Science. The focus is on the evolution of the Earth as a system: manuscripts describing changes of anthropogenic origin in a limited region are not considered unless they go beyond describing the changes to include an analysis of earth-system processes. The journal''s scope includes the solid earth (geosphere), the atmosphere, the hydrosphere (including cryosphere), and the biosphere; it also addresses related aspects of planetary and space sciences. Contributions pertaining to the Indian sub- continent and the surrounding Indian-Ocean region are particularly welcome. Given that a large number of manuscripts report either observations or model results for a limited domain, manuscripts intended for publication in JESS are expected to fulfill at least one of the following three criteria.
The data should be of relevance and should be of statistically significant size and from a region from where such data are sparse. If the data are from a well-sampled region, the data size should be considerable and advance our knowledge of the region.
A model study is carried out to explain observations reported either in the same manuscript or in the literature.
The analysis, whether of data or with models, is novel and the inferences advance the current knowledge.