苹果对苹果:通过大数据和机器学习对钻井技术进行公正评估

D. Khvostichenko, Greg Skoff, Y. Arevalo, S. Makarychev-Mikhailov
{"title":"苹果对苹果:通过大数据和机器学习对钻井技术进行公正评估","authors":"D. Khvostichenko, Greg Skoff, Y. Arevalo, S. Makarychev-Mikhailov","doi":"10.2118/212446-ms","DOIUrl":null,"url":null,"abstract":"\n Ensuring a proper apple to apple comparison is a challenge in drilling performance evaluation. When assessing the effect of a particular drilling technology, such as bit, bottomhole assembly (BHA) or mud type, on the rate of penetration (ROP) or other drilling performance criteria, all other factors must be fixed to truly isolate the effect. Traditionally, performance evaluation starts with manual identification of reasonably similar entities, such as drilling runs or well sections by means of numerous selection criteria; e.g., location, depths, inclinations, drilling conditions, tools, etc. The selected drilling performance metrics are then compared using statistical analysis techniques with various extents of thoroughness. Such analyses are laborious and are usually limited to just a handful of cases due to practical reasons and time constraints. Furthermore, the analyses are difficult to apply to large data sets of hundreds or thousands of wells, and there is always a risk of missing an important combination of factors where the effect is important. Therefore, conclusions based on these analyses may well be insufficiently justified or even confirmation biased, leading to suboptimal technical and business decisions.\n This paper presents a combined machine learning and statistical analysis workflow addressing these challenges. The workflow a) discovers similar entities (wells, intervals, runs) in big datasets; b) extracts subsets of similar entities (i.e., \"apples\") for evaluation; c) applies rigorous statistical tests to quantify the effect (mud type, BHA type, bit type) on a metric (ROP, success rate) and its statistical significance; and, finally, d) returns information on areas, sets of conditions where the effect is pronounced (or not). In the statistical analysis workflow, the user first specifies the drilling technology of interest and drilling performance metrics, and then defines factors and parameters to be fixed to better isolate the effect of the drilling technology. The historical data on thousands of entities are then preprocessed, and the entities are clustered by similarities in the multitude of factors by the k-means algorithm. Statistical tests are performed automatically on each cluster, quantifying the magnitude of technology effect on performance criteria, and calculating p-values as the measure of statistical significance of the effect. The results are presented in a series of clustering observations that summarize the effects and allow for zooming into the clusters to review drilling parameters and to perform further in-depth analysis, if necessary.\n All steps of the workflow are presented in this paper, including data processing details, and reasons for selecting specific clustering algorithms and statistical tests. Several examples of the successful applications of the workflow to actual drilling data for thousands of wells are provided, focusing on the effects of BHA, steering tools, and drilling muds on drilling performance. This unique approach can be used to improve other drilling performance evaluation workflows.","PeriodicalId":255336,"journal":{"name":"Day 3 Thu, March 09, 2023","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Apples to Apples: Impartial Assessment of Drilling Technologies through Big Data and Machine Learning\",\"authors\":\"D. Khvostichenko, Greg Skoff, Y. Arevalo, S. Makarychev-Mikhailov\",\"doi\":\"10.2118/212446-ms\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\n Ensuring a proper apple to apple comparison is a challenge in drilling performance evaluation. When assessing the effect of a particular drilling technology, such as bit, bottomhole assembly (BHA) or mud type, on the rate of penetration (ROP) or other drilling performance criteria, all other factors must be fixed to truly isolate the effect. Traditionally, performance evaluation starts with manual identification of reasonably similar entities, such as drilling runs or well sections by means of numerous selection criteria; e.g., location, depths, inclinations, drilling conditions, tools, etc. The selected drilling performance metrics are then compared using statistical analysis techniques with various extents of thoroughness. Such analyses are laborious and are usually limited to just a handful of cases due to practical reasons and time constraints. Furthermore, the analyses are difficult to apply to large data sets of hundreds or thousands of wells, and there is always a risk of missing an important combination of factors where the effect is important. Therefore, conclusions based on these analyses may well be insufficiently justified or even confirmation biased, leading to suboptimal technical and business decisions.\\n This paper presents a combined machine learning and statistical analysis workflow addressing these challenges. The workflow a) discovers similar entities (wells, intervals, runs) in big datasets; b) extracts subsets of similar entities (i.e., \\\"apples\\\") for evaluation; c) applies rigorous statistical tests to quantify the effect (mud type, BHA type, bit type) on a metric (ROP, success rate) and its statistical significance; and, finally, d) returns information on areas, sets of conditions where the effect is pronounced (or not). In the statistical analysis workflow, the user first specifies the drilling technology of interest and drilling performance metrics, and then defines factors and parameters to be fixed to better isolate the effect of the drilling technology. The historical data on thousands of entities are then preprocessed, and the entities are clustered by similarities in the multitude of factors by the k-means algorithm. Statistical tests are performed automatically on each cluster, quantifying the magnitude of technology effect on performance criteria, and calculating p-values as the measure of statistical significance of the effect. The results are presented in a series of clustering observations that summarize the effects and allow for zooming into the clusters to review drilling parameters and to perform further in-depth analysis, if necessary.\\n All steps of the workflow are presented in this paper, including data processing details, and reasons for selecting specific clustering algorithms and statistical tests. Several examples of the successful applications of the workflow to actual drilling data for thousands of wells are provided, focusing on the effects of BHA, steering tools, and drilling muds on drilling performance. This unique approach can be used to improve other drilling performance evaluation workflows.\",\"PeriodicalId\":255336,\"journal\":{\"name\":\"Day 3 Thu, March 09, 2023\",\"volume\":\"6 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-03-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Day 3 Thu, March 09, 2023\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2118/212446-ms\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Day 3 Thu, March 09, 2023","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2118/212446-ms","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

在钻井性能评估中,确保适当的同类比较是一个挑战。在评估特定钻井技术(如钻头、底部钻具组合(BHA)或泥浆类型)对机械钻速(ROP)或其他钻井性能标准的影响时,必须确定所有其他因素,以真正隔离影响。传统上,性能评估是从人工识别合理相似的实体开始的,例如通过许多选择标准来识别钻井班次或井段;例如,位置、深度、斜度、钻井条件、工具等。然后使用统计分析技术对选定的钻井性能指标进行比较,这些指标具有不同程度的彻底性。这样的分析是费力的,并且由于实际原因和时间限制,通常仅限于少数几个案例。此外,这些分析很难应用于数百或数千口井的大型数据集,并且总是存在遗漏重要因素组合的风险,而这些因素的影响很重要。因此,基于这些分析的结论很可能是不充分的,甚至是有偏见的,从而导致次优的技术和业务决策。本文提出了一种结合机器学习和统计分析的工作流程来解决这些挑战。工作流a)在大数据集中发现相似的实体(井、井段、井段);B)提取相似实体(即“苹果”)的子集进行评估;c)采用严格的统计测试来量化(泥浆类型、底部钻具组合类型、钻头类型)对指标(ROP、成功率)的影响及其统计显著性;最后,d)返回有关区域的信息,即效果明显(或不明显)的条件集。在统计分析工作流程中,用户首先指定感兴趣的钻井技术和钻井性能指标,然后定义要固定的因素和参数,以便更好地隔离钻井技术的效果。然后对数千个实体的历史数据进行预处理,并通过k-means算法根据众多因素的相似性对实体进行聚类。在每个集群上自动执行统计测试,量化技术对性能标准的影响程度,并计算p值作为影响的统计显著性的度量。结果以一系列聚类观察的形式呈现,这些聚类观察总结了影响,并允许放大聚类以查看钻井参数,并在必要时进行进一步的深入分析。本文介绍了工作流的所有步骤,包括数据处理的细节,以及选择特定聚类算法和统计测试的原因。本文给出了将该工作流程成功应用于数千口井的实际钻井数据的几个例子,重点介绍了BHA、转向工具和钻井泥浆对钻井性能的影响。这种独特的方法可用于改进其他钻井性能评估工作流程。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Apples to Apples: Impartial Assessment of Drilling Technologies through Big Data and Machine Learning
Ensuring a proper apple to apple comparison is a challenge in drilling performance evaluation. When assessing the effect of a particular drilling technology, such as bit, bottomhole assembly (BHA) or mud type, on the rate of penetration (ROP) or other drilling performance criteria, all other factors must be fixed to truly isolate the effect. Traditionally, performance evaluation starts with manual identification of reasonably similar entities, such as drilling runs or well sections by means of numerous selection criteria; e.g., location, depths, inclinations, drilling conditions, tools, etc. The selected drilling performance metrics are then compared using statistical analysis techniques with various extents of thoroughness. Such analyses are laborious and are usually limited to just a handful of cases due to practical reasons and time constraints. Furthermore, the analyses are difficult to apply to large data sets of hundreds or thousands of wells, and there is always a risk of missing an important combination of factors where the effect is important. Therefore, conclusions based on these analyses may well be insufficiently justified or even confirmation biased, leading to suboptimal technical and business decisions. This paper presents a combined machine learning and statistical analysis workflow addressing these challenges. The workflow a) discovers similar entities (wells, intervals, runs) in big datasets; b) extracts subsets of similar entities (i.e., "apples") for evaluation; c) applies rigorous statistical tests to quantify the effect (mud type, BHA type, bit type) on a metric (ROP, success rate) and its statistical significance; and, finally, d) returns information on areas, sets of conditions where the effect is pronounced (or not). In the statistical analysis workflow, the user first specifies the drilling technology of interest and drilling performance metrics, and then defines factors and parameters to be fixed to better isolate the effect of the drilling technology. The historical data on thousands of entities are then preprocessed, and the entities are clustered by similarities in the multitude of factors by the k-means algorithm. Statistical tests are performed automatically on each cluster, quantifying the magnitude of technology effect on performance criteria, and calculating p-values as the measure of statistical significance of the effect. The results are presented in a series of clustering observations that summarize the effects and allow for zooming into the clusters to review drilling parameters and to perform further in-depth analysis, if necessary. All steps of the workflow are presented in this paper, including data processing details, and reasons for selecting specific clustering algorithms and statistical tests. Several examples of the successful applications of the workflow to actual drilling data for thousands of wells are provided, focusing on the effects of BHA, steering tools, and drilling muds on drilling performance. This unique approach can be used to improve other drilling performance evaluation workflows.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Geothermal Drilling – Collaborative and Holistic Applications Specific Solutions Drive Consistent Improvements in Drilling Efficiency and Operational Costs Downhole Trajectory Automation of RSS Tools: Autonomous Drilling Becomes Reality While-Drilling Pore Pressure Surveillance Using Machine Learning Drilling Three-Mile Laterals Tighter and Safer with a New Magnetic Reference Technique Continuous or Stacked Cement Bond Logs – Does It Matter?
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1