M. Popescu, Rebecca Head, T. Ferriday, K. Evans, J. Montero, Jiazuo Zhang, Gwynfor Jones, G. Kaeng
{"title":"利用监督式机器学习算法对电缆测井数据进行自动岩性预测","authors":"M. Popescu, Rebecca Head, T. Ferriday, K. Evans, J. Montero, Jiazuo Zhang, Gwynfor Jones, G. Kaeng","doi":"10.2118/208559-ms","DOIUrl":null,"url":null,"abstract":"\n This paper presents advancements in machine learning and cloud deployment that enable rapid and accurate automated lithology interpretation. A supervised machine learning technique is described that enables rapid, consistent, and accurate lithology prediction alongside quantitative uncertainty from large wireline or logging-while-drilling (LWD) datasets.\n To leverage supervised machine learning, a team of geoscientists and petrophysicists made detailed lithology interpretations of wells to generate a comprehensive training dataset. Lithology interpretations were based on applying determinist cross-plotting by utilizing and combining various raw logs. This training dataset was used to develop a model and test a machine learning pipeline. The pipeline was applied to a dataset previously unseen by the algorithm, to predict lithology. A quality checking process was performed by a petrophysicist to validate new predictions delivered by the pipeline against human interpretations.\n Confidence in the interpretations was assessed in two ways. The prior probability was calculated, a measure of confidence in the input data being recognized by the model. Posterior probability was calculated, which quantifies the likelihood that a specified depth interval comprises a given lithology.\n The supervised machine learning algorithm ensured that the wells were interpreted consistently by removing interpreter biases and inconsistencies. The scalability of cloud computing enabled a large log dataset to be interpreted rapidly; >100 wells were interpreted consistently in five minutes, yielding >70% lithological match to the human petrophysical interpretation.\n Supervised machine learning methods have strong potential for classifying lithology from log data because: 1) they can automatically define complex, non-parametric, multi-variate relationships across several input logs; and 2) they allow classifications to be quantified confidently. Furthermore, this approach captured the knowledge and nuances of an interpreter's decisions by training the algorithm using human-interpreted labels.\n In the hydrocarbon industry, the quantity of generated data is predicted to increase by >300% between 2018 and 2023 (IDC, Worldwide Global DataSphere Forecast, 2019–2023). Additionally, the industry holds vast legacy data. This supervised machine learning approach can unlock the potential of some of these datasets by providing consistent lithology interpretations rapidly, allowing resources to be used more effectively.","PeriodicalId":11215,"journal":{"name":"Day 2 Wed, November 24, 2021","volume":"38 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-11-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Using Supervised Machine Learning Algorithms for Automated Lithology Prediction from Wireline Log Data\",\"authors\":\"M. Popescu, Rebecca Head, T. Ferriday, K. Evans, J. Montero, Jiazuo Zhang, Gwynfor Jones, G. Kaeng\",\"doi\":\"10.2118/208559-ms\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\n This paper presents advancements in machine learning and cloud deployment that enable rapid and accurate automated lithology interpretation. A supervised machine learning technique is described that enables rapid, consistent, and accurate lithology prediction alongside quantitative uncertainty from large wireline or logging-while-drilling (LWD) datasets.\\n To leverage supervised machine learning, a team of geoscientists and petrophysicists made detailed lithology interpretations of wells to generate a comprehensive training dataset. Lithology interpretations were based on applying determinist cross-plotting by utilizing and combining various raw logs. This training dataset was used to develop a model and test a machine learning pipeline. The pipeline was applied to a dataset previously unseen by the algorithm, to predict lithology. A quality checking process was performed by a petrophysicist to validate new predictions delivered by the pipeline against human interpretations.\\n Confidence in the interpretations was assessed in two ways. The prior probability was calculated, a measure of confidence in the input data being recognized by the model. Posterior probability was calculated, which quantifies the likelihood that a specified depth interval comprises a given lithology.\\n The supervised machine learning algorithm ensured that the wells were interpreted consistently by removing interpreter biases and inconsistencies. The scalability of cloud computing enabled a large log dataset to be interpreted rapidly; >100 wells were interpreted consistently in five minutes, yielding >70% lithological match to the human petrophysical interpretation.\\n Supervised machine learning methods have strong potential for classifying lithology from log data because: 1) they can automatically define complex, non-parametric, multi-variate relationships across several input logs; and 2) they allow classifications to be quantified confidently. Furthermore, this approach captured the knowledge and nuances of an interpreter's decisions by training the algorithm using human-interpreted labels.\\n In the hydrocarbon industry, the quantity of generated data is predicted to increase by >300% between 2018 and 2023 (IDC, Worldwide Global DataSphere Forecast, 2019–2023). Additionally, the industry holds vast legacy data. This supervised machine learning approach can unlock the potential of some of these datasets by providing consistent lithology interpretations rapidly, allowing resources to be used more effectively.\",\"PeriodicalId\":11215,\"journal\":{\"name\":\"Day 2 Wed, November 24, 2021\",\"volume\":\"38 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Day 2 Wed, November 24, 2021\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2118/208559-ms\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Day 2 Wed, November 24, 2021","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2118/208559-ms","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
摘要
本文介绍了机器学习和云部署方面的进展,这些进展可以实现快速、准确的自动岩性解释。介绍了一种监督式机器学习技术,该技术可以快速、一致、准确地预测岩性,同时消除大型电缆或随钻测井(LWD)数据集的定量不确定性。为了利用监督式机器学习,一个由地球科学家和岩石物理学家组成的团队对油井进行了详细的岩性解释,以生成一个全面的训练数据集。岩性解释是利用和组合各种原始测井资料,采用确定性交叉标绘方法进行的。该训练数据集用于开发模型和测试机器学习管道。该管道被应用于算法以前未见过的数据集,以预测岩性。一名岩石物理学家进行了质量检查,以验证管道提供的新预测与人类解释的不同。人们用两种方式评估对这些解释的信心。计算先验概率,即模型识别输入数据的置信度。计算后验概率,量化指定深度区间包含给定岩性的可能性。监督式机器学习算法通过消除解释器的偏差和不一致性,确保了井的解释一致性。云计算的可扩展性使大型日志数据集能够快速解释;在5分钟内连续解释了100口井,与人类岩石物理解释的岩性匹配度超过70%。有监督机器学习方法在从测井数据中分类岩性方面具有很大的潜力,因为:1)它们可以自动定义多个输入测井数据之间复杂的、非参数的、多变量的关系;2)它们允许分类被自信地量化。此外,这种方法通过使用人工解释的标签训练算法来捕获解释器决策的知识和细微差别。在油气行业,预计从2018年到2023年,生成的数据量将增加300%以上(IDC, Worldwide Global DataSphere Forecast, 2019-2023)。此外,该行业还拥有大量遗留数据。这种有监督的机器学习方法可以通过快速提供一致的岩性解释来释放其中一些数据集的潜力,从而更有效地利用资源。
Using Supervised Machine Learning Algorithms for Automated Lithology Prediction from Wireline Log Data
This paper presents advancements in machine learning and cloud deployment that enable rapid and accurate automated lithology interpretation. A supervised machine learning technique is described that enables rapid, consistent, and accurate lithology prediction alongside quantitative uncertainty from large wireline or logging-while-drilling (LWD) datasets.
To leverage supervised machine learning, a team of geoscientists and petrophysicists made detailed lithology interpretations of wells to generate a comprehensive training dataset. Lithology interpretations were based on applying determinist cross-plotting by utilizing and combining various raw logs. This training dataset was used to develop a model and test a machine learning pipeline. The pipeline was applied to a dataset previously unseen by the algorithm, to predict lithology. A quality checking process was performed by a petrophysicist to validate new predictions delivered by the pipeline against human interpretations.
Confidence in the interpretations was assessed in two ways. The prior probability was calculated, a measure of confidence in the input data being recognized by the model. Posterior probability was calculated, which quantifies the likelihood that a specified depth interval comprises a given lithology.
The supervised machine learning algorithm ensured that the wells were interpreted consistently by removing interpreter biases and inconsistencies. The scalability of cloud computing enabled a large log dataset to be interpreted rapidly; >100 wells were interpreted consistently in five minutes, yielding >70% lithological match to the human petrophysical interpretation.
Supervised machine learning methods have strong potential for classifying lithology from log data because: 1) they can automatically define complex, non-parametric, multi-variate relationships across several input logs; and 2) they allow classifications to be quantified confidently. Furthermore, this approach captured the knowledge and nuances of an interpreter's decisions by training the algorithm using human-interpreted labels.
In the hydrocarbon industry, the quantity of generated data is predicted to increase by >300% between 2018 and 2023 (IDC, Worldwide Global DataSphere Forecast, 2019–2023). Additionally, the industry holds vast legacy data. This supervised machine learning approach can unlock the potential of some of these datasets by providing consistent lithology interpretations rapidly, allowing resources to be used more effectively.