A. Shahkarami, Kimberly L. Ayers, Guochang Wang, Alivia Ayers
{"title":"应用机器学习算法优化Marcellus页岩未来产量,以宾夕法尼亚州西南部为例","authors":"A. Shahkarami, Kimberly L. Ayers, Guochang Wang, Alivia Ayers","doi":"10.2118/191827-18ERM-MS","DOIUrl":null,"url":null,"abstract":"\n The Marcellus Shale has more than a decade of development history. However, there are many questions that still remain unanswered. What is the best inter-well spacing? What are the optimum stage length, proppant loading, and cluster spacing? What are the ideal combinations of these completion parameters? And how can we maximize the rate return on our investment? This study proposes innovative tools that allow researchers to answer these questions. We build these set of tools by utilizing the pattern recognition abilities of machine learning algorithms and public data from the Southwestern Pennsylvania region of the Marcellus Shale.\n By means of artificial intelligence and data mining techniques, we studied a database that includes public data from more than 2,000 wells producing from the aforementioned study area. The database contained completion, drilling, and production history information from various operators active in Allegheny, Greene, Fayette, Washington, and Westmoreland counties located in the Southwestern Pennsylvania. Extensive preprocessing and data cleansing steps were involved to prepare the database. Various machine learning techniques (Linear Regression (LR), Support Vector Machines (SVMs), Artificial Neural Networks (ANNs), and Gaussian Processes (GP)) were applied to understand the non-linear patterns in the data. The objective was to develop predictive models that were trained and validated based on the current database. The predictive models were validated using information originating from numerous wells in the area. Once validated, the model could be used in reservoir management decision-making workflows to answer questions such as what are the best drilling scenarios, the optimum hydraulic fracturing design, the initial production rate, and the estimated ultimate recovery (EUR). The workflow is purely based on field data and free of any cognitive human bias. As soon as more data is available, the model could be updated. The core data in this workflow is sourced from public domains, and therefore, intensive preprocessing efforts were necessary.","PeriodicalId":298489,"journal":{"name":"Day 4 Wed, October 10, 2018","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":"{\"title\":\"Application of Machine Learning Algorithms for Optimizing Future Production in Marcellus Shale, Case Study of Southwestern Pennsylvania\",\"authors\":\"A. Shahkarami, Kimberly L. Ayers, Guochang Wang, Alivia Ayers\",\"doi\":\"10.2118/191827-18ERM-MS\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"\\n The Marcellus Shale has more than a decade of development history. However, there are many questions that still remain unanswered. What is the best inter-well spacing? What are the optimum stage length, proppant loading, and cluster spacing? What are the ideal combinations of these completion parameters? And how can we maximize the rate return on our investment? This study proposes innovative tools that allow researchers to answer these questions. 
We build these set of tools by utilizing the pattern recognition abilities of machine learning algorithms and public data from the Southwestern Pennsylvania region of the Marcellus Shale.\\n By means of artificial intelligence and data mining techniques, we studied a database that includes public data from more than 2,000 wells producing from the aforementioned study area. The database contained completion, drilling, and production history information from various operators active in Allegheny, Greene, Fayette, Washington, and Westmoreland counties located in the Southwestern Pennsylvania. Extensive preprocessing and data cleansing steps were involved to prepare the database. Various machine learning techniques (Linear Regression (LR), Support Vector Machines (SVMs), Artificial Neural Networks (ANNs), and Gaussian Processes (GP)) were applied to understand the non-linear patterns in the data. The objective was to develop predictive models that were trained and validated based on the current database. The predictive models were validated using information originating from numerous wells in the area. Once validated, the model could be used in reservoir management decision-making workflows to answer questions such as what are the best drilling scenarios, the optimum hydraulic fracturing design, the initial production rate, and the estimated ultimate recovery (EUR). The workflow is purely based on field data and free of any cognitive human bias. As soon as more data is available, the model could be updated. The core data in this workflow is sourced from public domains, and therefore, intensive preprocessing efforts were necessary.\",\"PeriodicalId\":298489,\"journal\":{\"name\":\"Day 4 Wed, October 10, 2018\",\"volume\":\"7 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-10-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Day 4 Wed, October 10, 2018\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2118/191827-18ERM-MS\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Day 4 Wed, October 10, 2018","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2118/191827-18ERM-MS","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Application of Machine Learning Algorithms for Optimizing Future Production in Marcellus Shale, Case Study of Southwestern Pennsylvania
The Marcellus Shale has more than a decade of development history. However, many questions still remain unanswered. What is the best inter-well spacing? What are the optimum stage length, proppant loading, and cluster spacing? What are the ideal combinations of these completion parameters? And how can we maximize the rate of return on our investment? This study proposes innovative tools that allow researchers to answer these questions. We build this set of tools by utilizing the pattern-recognition abilities of machine learning algorithms and public data from the Southwestern Pennsylvania region of the Marcellus Shale.
Using artificial intelligence and data mining techniques, we studied a database that includes public data from more than 2,000 wells producing in the aforementioned study area. The database contained completion, drilling, and production history information from various operators active in Allegheny, Greene, Fayette, Washington, and Westmoreland counties in Southwestern Pennsylvania. Extensive preprocessing and data-cleansing steps were required to prepare the database. Several machine learning techniques (Linear Regression (LR), Support Vector Machines (SVMs), Artificial Neural Networks (ANNs), and Gaussian Processes (GP)) were applied to capture the non-linear patterns in the data. The objective was to develop predictive models trained and validated on the current database. The predictive models were validated using information from numerous wells in the area. Once validated, the models can be used in reservoir management decision-making workflows to identify the best drilling scenarios and the optimum hydraulic fracturing design, and to estimate the initial production rate and the estimated ultimate recovery (EUR). The workflow is based purely on field data and is free of cognitive human bias, and the models can be updated as soon as more data become available. Because the core data in this workflow are sourced from public domains, intensive preprocessing efforts were necessary.
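As a rough illustration of the modeling step described in the abstract, the sketch below trains and compares the four regressor families named (LR, SVM, ANN, GP) on a cleaned well-level table. It is a minimal sketch, not the authors' implementation: the scikit-learn toolkit, the CSV file name, the feature columns, and the 12-month cumulative gas target used as an EUR proxy are all assumptions for illustration.

```python
# Minimal sketch (assumed setup): compare LR, SVM, ANN, and GP regressors on a
# hypothetical cleaned table of Marcellus well completion and production data.
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from sklearn.pipeline import make_pipeline
from sklearn.linear_model import LinearRegression
from sklearn.svm import SVR
from sklearn.neural_network import MLPRegressor
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.metrics import r2_score

# Hypothetical completion/drilling features and production target after preprocessing.
features = ["lateral_length_ft", "stage_count", "proppant_lbs_per_ft",
            "cluster_spacing_ft", "well_spacing_ft", "tvd_ft"]
target = "cum_gas_12mo_mcf"

df = pd.read_csv("marcellus_sw_pa_wells_clean.csv")  # hypothetical cleaned dataset
X_train, X_test, y_train, y_test = train_test_split(
    df[features], df[target], test_size=0.2, random_state=42)

# One pipeline per technique family named in the abstract; inputs are standardized
# so the SVM, neural network, and Gaussian process behave reasonably out of the box.
models = {
    "LR":  make_pipeline(StandardScaler(), LinearRegression()),
    "SVM": make_pipeline(StandardScaler(), SVR(kernel="rbf", C=100.0)),
    "ANN": make_pipeline(StandardScaler(),
                         MLPRegressor(hidden_layer_sizes=(32, 16),
                                      max_iter=2000, random_state=42)),
    "GP":  make_pipeline(StandardScaler(),
                         GaussianProcessRegressor(normalize_y=True)),
}

# Fit each model on the training wells and score it on held-out wells,
# mirroring the train/validate split described in the abstract.
for name, model in models.items():
    model.fit(X_train, y_train)
    print(f"{name}: hold-out R^2 = {r2_score(y_test, model.predict(X_test)):.3f}")
```

In a workflow like the one described, the best-scoring model could then be queried across candidate completion designs (stage length, proppant loading, cluster spacing, well spacing) to rank scenarios before drilling, and refit whenever new public well data are added.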