Integrated Cloud Computing Environment for Upstream Geoscience Workflows

M. Al-Habib, Yasser Al-Ghamdi
{"title":"Integrated Cloud Computing Environment for Upstream Geoscience Workflows","authors":"M. Al-Habib, Yasser Al-Ghamdi","doi":"10.2118/204848-ms","DOIUrl":null,"url":null,"abstract":"\n Extensive computing resources are required to leverage todays advanced geoscience workflows that are used to explore and characterize giant petroleum resources. In these cases, high-performance workstations are often unable to adequately handle the scale of computing required. The workflows typically utilize complex and massive data sets, which require advanced computing resources to store, process, manage, and visualize various forms of the data throughout the various lifecycles. This work describes a large-scale geoscience end-to-end interpretation platform customized to run on a cluster-based remote visualization environment.\n A team of computing infrastructure and geoscience workflow experts was established to collaborate on the deployment, which was broken down into separate phases. Initially, an evaluation and analysis phase was conducted to analyze computing requirements and assess potential solutions. A testing environment was then designed, implemented and benchmarked. The third phase used the test environment to determine the scale of infrastructure required for the production environment. Finally, the full-scale customized production environment was deployed for end users.\n During testing phase, aspects such as connectivity, stability, interactivity, functionality, and performance were investigated using the largest available geoscience datasets. Multiple computing configurations were benchmarked until optimal performance was achieved, under applicable corporate information security guidelines.\n It was observed that the customized production environment was able to execute workflows that were unable to run on local user workstations. For example, while conducting connectivity, stability and interactivity benchmarking, the test environment was operated for extended periods to ensure stability for workflows that require multiple days to run.\n To estimate the scale of the required production environment, varying categories of users’ portfolio were determined based on data type, scale and workflow. Continuous monitoring of system resources and utilization enabled continuous improvements to the final solution.\n The utilization of a fit-for-purpose, customized remote visualization solution may reduce or ultimately eliminate the need to deploy high-end workstations to all end users. Rather, a shared, scalable and reliable cluster-based solution can serve a much larger user community in a highly performant manner.","PeriodicalId":11024,"journal":{"name":"Day 4 Wed, December 01, 2021","volume":"92 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Day 4 Wed, December 01, 2021","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2118/204848-ms","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Extensive computing resources are required to leverage todays advanced geoscience workflows that are used to explore and characterize giant petroleum resources. In these cases, high-performance workstations are often unable to adequately handle the scale of computing required. The workflows typically utilize complex and massive data sets, which require advanced computing resources to store, process, manage, and visualize various forms of the data throughout the various lifecycles. This work describes a large-scale geoscience end-to-end interpretation platform customized to run on a cluster-based remote visualization environment. A team of computing infrastructure and geoscience workflow experts was established to collaborate on the deployment, which was broken down into separate phases. Initially, an evaluation and analysis phase was conducted to analyze computing requirements and assess potential solutions. A testing environment was then designed, implemented and benchmarked. The third phase used the test environment to determine the scale of infrastructure required for the production environment. Finally, the full-scale customized production environment was deployed for end users. During testing phase, aspects such as connectivity, stability, interactivity, functionality, and performance were investigated using the largest available geoscience datasets. Multiple computing configurations were benchmarked until optimal performance was achieved, under applicable corporate information security guidelines. It was observed that the customized production environment was able to execute workflows that were unable to run on local user workstations. For example, while conducting connectivity, stability and interactivity benchmarking, the test environment was operated for extended periods to ensure stability for workflows that require multiple days to run. To estimate the scale of the required production environment, varying categories of users’ portfolio were determined based on data type, scale and workflow. Continuous monitoring of system resources and utilization enabled continuous improvements to the final solution. The utilization of a fit-for-purpose, customized remote visualization solution may reduce or ultimately eliminate the need to deploy high-end workstations to all end users. Rather, a shared, scalable and reliable cluster-based solution can serve a much larger user community in a highly performant manner.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
地球科学上游工作流程集成云计算环境
为了利用当今先进的地球科学工作流程来勘探和描述巨大的石油资源,需要大量的计算资源。在这些情况下,高性能工作站通常无法充分处理所需的计算规模。工作流通常利用复杂和大量的数据集,这需要先进的计算资源来存储、处理、管理和可视化整个生命周期中各种形式的数据。这项工作描述了一个大规模的地球科学端到端解释平台,该平台是定制的,可在基于集群的远程可视化环境中运行。一个由计算基础设施和地球科学工作流程专家组成的团队在部署过程中进行协作,并将其分成不同的阶段。最初,进行了评估和分析阶段,以分析计算需求并评估潜在的解决方案。然后设计、实现和基准测试一个测试环境。第三阶段使用测试环境来确定生产环境所需的基础设施的规模。最后,为最终用户部署了全面定制的生产环境。在测试阶段,使用最大的可用地球科学数据集对连接性、稳定性、交互性、功能和性能等方面进行了调查。在适用的公司信息安全指导方针下,对多个计算配置进行基准测试,直到达到最佳性能。据观察,定制的生产环境能够执行无法在本地用户工作站上运行的工作流。例如,在进行连接性、稳定性和交互性基准测试时,测试环境的运行时间延长,以确保需要多天运行的工作流的稳定性。为了估计所需生产环境的规模,根据数据类型、规模和工作流确定用户组合的不同类别。对系统资源和利用率的持续监控使得最终解决方案得到持续改进。使用适合用途的定制远程可视化解决方案可以减少或最终消除为所有最终用户部署高端工作站的需要。相反,基于集群的共享、可扩展和可靠的解决方案可以以高性能的方式为更大的用户社区提供服务。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Condensate Banking Removal Using Slow Release of In-Situ Energized Fluid Full-Field History-Matching of Commingling Stacked Reservoirs: A Case Study of an Oman Southern Asset Recovery Improvement Using Geological, Technical and Operational Factors of Field Development That Influence the Character of Inflow Profiles in Horizontal Laterals Unlocking Growth Opportunities Through Saturation Evaluation Behind Complex Completion by Applying State-of-Art Pulsed Neutron Technology Development of New Models to Determine the Rheological Parameters of Water-based Drilling Fluid Using Artificial Intelligence
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1