Improved Information Retrieval From Well Related Documents Using Supervised Learning

Glenn Miers, M. Czernuszenko, Brian Hughes
{"title":"Improved Information Retrieval From Well Related Documents Using Supervised Learning","authors":"Glenn Miers, M. Czernuszenko, Brian Hughes","doi":"10.2118/210146-ms","DOIUrl":null,"url":null,"abstract":"\n We introduce a system for rapid retrieval of relevant well related information from a corpus of over 20 million documents. This allows for exploration workers to retrieve important business data more quickly. Tracking down all of the information required to make complex business decisions is a time consuming and error prone process. This poses a direct risk of expensive miscalculations and missed opportunities. A first version of this system is currently undergoing tests with select users. As the work here represents the first version of the system, it is expected that improvements will be made. This is a system that can be used at enterprise scale to enable searches to more easily yield usable information to workers.\n This system uses a supervised learning model to identify well related documents from several categories. Examples of these categories include (but are not limited to) formation evaluation and well completion reports. A machine learning model was trained to classify documents according to input from a well document expert. This input came in the form of a set of labeled documents compiled by said expert. This model was then applied to over 20 million documents that are deemed relevant to the exploration process. The inferred classifications for each document were stored in a search engine in order to facilitate retrieval of documents by each of the labels from above. The benefits of this system are twofold. First, it reduces the number of documents that come back for a given search of a large corpus of documents. Second, it allows users without technical experience in well-related work to more easily find documents.","PeriodicalId":113697,"journal":{"name":"Day 2 Tue, October 04, 2022","volume":"18 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Day 2 Tue, October 04, 2022","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2118/210146-ms","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

We introduce a system for rapid retrieval of relevant well related information from a corpus of over 20 million documents. This allows for exploration workers to retrieve important business data more quickly. Tracking down all of the information required to make complex business decisions is a time consuming and error prone process. This poses a direct risk of expensive miscalculations and missed opportunities. A first version of this system is currently undergoing tests with select users. As the work here represents the first version of the system, it is expected that improvements will be made. This is a system that can be used at enterprise scale to enable searches to more easily yield usable information to workers. This system uses a supervised learning model to identify well related documents from several categories. Examples of these categories include (but are not limited to) formation evaluation and well completion reports. A machine learning model was trained to classify documents according to input from a well document expert. This input came in the form of a set of labeled documents compiled by said expert. This model was then applied to over 20 million documents that are deemed relevant to the exploration process. The inferred classifications for each document were stored in a search engine in order to facilitate retrieval of documents by each of the labels from above. The benefits of this system are twofold. First, it reduces the number of documents that come back for a given search of a large corpus of documents. Second, it allows users without technical experience in well-related work to more easily find documents.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用监督学习改进相关文档的信息检索
我们介绍了一个系统,快速检索相关的相关信息,从语料库超过2000万的文件。这使得勘探人员能够更快地检索重要的业务数据。跟踪做出复杂业务决策所需的所有信息是一个耗时且容易出错的过程。这带来了代价高昂的误判和错失机会的直接风险。该系统的第一个版本目前正在选定用户中进行测试。由于这里的工作是该系统的第一个版本,预计还会有所改进。这是一个可以在企业规模上使用的系统,使搜索能够更容易地为工作人员提供可用的信息。该系统使用监督学习模型从几个类别中识别出相关良好的文档。这些类别的例子包括(但不限于)地层评价和完井报告。训练机器学习模型,根据井文档专家的输入对文档进行分类。该输入以上述专家编写的一组标记文档的形式出现。该模型随后被应用于超过2000万份被认为与勘探过程相关的文件。每个文档的推断分类存储在搜索引擎中,以便通过上面的每个标签检索文档。这个系统的好处是双重的。首先,它减少了对大型文档语料库进行给定搜索时返回的文档数量。其次,它允许没有相关工作技术经验的用户更容易找到文档。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Economic Yardsticks for the End of Economic Life: Holdback and Its Adjuncts Gas Transport Modeling in Organic-Rich Shales with Nonequilibrium Sorption Kinetics Team Coaching in Oil and Gas Fluid-Pipe Interaction in Horizontal Gas-Liquid Flow A Robust Workflow for Optimizing Drilling/Completion/Frac Design Using Machine Learning and Artificial Intelligence
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1