Day 1 Tue, October 26, 2021最新文献_第3页

Adding GPU Acceleration to an Industrial CPU-Based Simulator, Development Strategy and Results 将GPU加速添加到基于工业cpu的模拟器，开发策略和结果

Day 1 Tue, October 26, 2021

Pub Date : 2021-10-19 DOI: 10.2118/203936-ms

H. Cao, Rustem Zaydullin, Terrence Liao, N. Gohaud, E. Obi, G. Darche

Running multi-million cell simulation problems in minutes has been a dream for reservoir engineers for decades. Today, with the advancement of Graphic Processing Unit (GPU), we have a real chance to make this dream a reality. Here we present our experience in the step-by-step transformation of a fully developed industrial CPU-based simulator into a fully functional GPU-based simulator. We also demonstrate significant accelerations achieved through the use of GPU technology. To achieve the best performance possible, we choose to use CUDA (NVIDIA GPU’s native language), and offload as much computations to GPU as possible. Our CUDA implementation covers all reservoir computes, which include property calculation, linearization, linear solver, etc. The well and Field Management still reside on CPU and need minor changes for their interaction with GPU-based reservoir. Importantly, there is no change to the nonlinear logic. The GPU and CPU parts are overlapped, fully utilizing the asynchronous nature of GPU operations. Each reservoir computation can be run in three modes, CPU_only (existing one), GPU_only, CPU followed by GPU. The latter is only used for result checking and debugging. In early 2019, we prototyped two reservoir linearization operations (mass accumulation and mass flux) in CUDA; both showed very strong runtime speed-up of several hundred times, 1 P100-GPU (NVIDIA) vs 1 POWER8NVL CPU core rated at 2.8 GHz (IBM). Encouraged by this success, we moved into linear solver development and managed to move the entire linear solver module into GPU. Again, strong speed-up of ~50 times was achieved (1 GPU vs 1 CPU). The focus for 2019 has been on standard Black-Oil cases. Our implementation was tested with multiple "million-cell range" models (SPE10 and other real field cases). In early 2020, we managed to put SPE10 fully on GPU, and finished the entire 2000 day time-stepping in ~35 sec with a single P100 card. After that our effort has switched to compositional AIM (Adaptive Implicit Method), with focus on compositional flash and AIM implementation for reservoir linearization and linear solver, both show early promising results. GPU-based reservoir simulation is a future trend for HPC. The development of a reservoir simulator is complex, multi-discipline and time-consuming work. Our paper demonstrates a clear strategy to add tremendous GPU acceleration into an existing CPU-based simulator. Our approach fully utilizes the strength of the existing CPU simulator and minimizes the GPU development effort. This paper is also the first publication targeting GPU acceleration for compositional AIM models.

几十年来，在几分钟内运行数百万个单元模拟问题一直是油藏工程师的梦想。今天，随着图形处理单元(GPU)的进步，我们有真正的机会让这个梦想成为现实。在这里，我们介绍了我们在逐步将完全开发的基于工业cpu的模拟器转变为功能齐全的基于gpu的模拟器的经验。我们还演示了通过使用GPU技术实现的显著加速。为了实现最佳性能，我们选择使用CUDA (NVIDIA GPU的原生语言)，并尽可能多地将计算卸载到GPU上。我们的CUDA实现涵盖了所有油藏计算，包括属性计算，线性化，线性求解等。井和现场管理仍然驻留在CPU上，需要对其与基于gpu的储层的交互进行微小的更改。重要的是，非线性逻辑没有改变。GPU和CPU部分重叠，充分利用了GPU操作的异步特性。每个储层计算可以在CPU_only(已有)、GPU_only、CPU、GPU三种模式下运行。后者仅用于结果检查和调试。2019年初，我们在CUDA中原型化了两种油藏线性化操作(质量积累和质量通量);1个P100-GPU (NVIDIA) vs 1个2.8 GHz的POWER8NVL CPU核心(IBM)，两者都显示出数百倍的强大运行时加速。受到这一成功的鼓舞，我们转向线性求解器开发，并设法将整个线性求解器模块转移到GPU中。同样，实现了约50倍的强大加速(1个GPU vs 1个CPU)。2019年的重点是标准的黑油案例。我们的实现用多个“百万单元范围”模型(SPE10和其他实际现场案例)进行了测试。在2020年初，我们成功地将SPE10完全放在GPU上，并在35秒内完成了整个2000天的时间步进。之后，我们的工作转向了组合AIM(自适应隐式方法)，重点研究了组合flash和AIM在油藏线性化和线性求解中的实现，两者都显示出了早期有希望的结果。基于gpu的油藏模拟是高性能计算的未来发展趋势。油藏模拟器的开发是一项复杂、多学科、耗时的工作。我们的论文展示了一种清晰的策略，将巨大的GPU加速添加到现有的基于cpu的模拟器中。我们的方法充分利用了现有CPU模拟器的优势，并最大限度地减少了GPU的开发工作量。这篇论文也是第一篇针对合成AIM模型的GPU加速的论文。

{"title":"Adding GPU Acceleration to an Industrial CPU-Based Simulator, Development Strategy and Results","authors":"H. Cao, Rustem Zaydullin, Terrence Liao, N. Gohaud, E. Obi, G. Darche","doi":"10.2118/203936-ms","DOIUrl":"https://doi.org/10.2118/203936-ms","url":null,"abstract":"\u0000 Running multi-million cell simulation problems in minutes has been a dream for reservoir engineers for decades. Today, with the advancement of Graphic Processing Unit (GPU), we have a real chance to make this dream a reality. Here we present our experience in the step-by-step transformation of a fully developed industrial CPU-based simulator into a fully functional GPU-based simulator. We also demonstrate significant accelerations achieved through the use of GPU technology.\u0000 To achieve the best performance possible, we choose to use CUDA (NVIDIA GPU’s native language), and offload as much computations to GPU as possible. Our CUDA implementation covers all reservoir computes, which include property calculation, linearization, linear solver, etc. The well and Field Management still reside on CPU and need minor changes for their interaction with GPU-based reservoir. Importantly, there is no change to the nonlinear logic. The GPU and CPU parts are overlapped, fully utilizing the asynchronous nature of GPU operations. Each reservoir computation can be run in three modes, CPU_only (existing one), GPU_only, CPU followed by GPU. The latter is only used for result checking and debugging.\u0000 In early 2019, we prototyped two reservoir linearization operations (mass accumulation and mass flux) in CUDA; both showed very strong runtime speed-up of several hundred times, 1 P100-GPU (NVIDIA) vs 1 POWER8NVL CPU core rated at 2.8 GHz (IBM). Encouraged by this success, we moved into linear solver development and managed to move the entire linear solver module into GPU. Again, strong speed-up of ~50 times was achieved (1 GPU vs 1 CPU). The focus for 2019 has been on standard Black-Oil cases. Our implementation was tested with multiple \"million-cell range\" models (SPE10 and other real field cases). In early 2020, we managed to put SPE10 fully on GPU, and finished the entire 2000 day time-stepping in ~35 sec with a single P100 card. After that our effort has switched to compositional AIM (Adaptive Implicit Method), with focus on compositional flash and AIM implementation for reservoir linearization and linear solver, both show early promising results.\u0000 GPU-based reservoir simulation is a future trend for HPC. The development of a reservoir simulator is complex, multi-discipline and time-consuming work. Our paper demonstrates a clear strategy to add tremendous GPU acceleration into an existing CPU-based simulator. Our approach fully utilizes the strength of the existing CPU simulator and minimizes the GPU development effort. This paper is also the first publication targeting GPU acceleration for compositional AIM models.","PeriodicalId":11146,"journal":{"name":"Day 1 Tue, October 26, 2021","volume":"8 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89985867","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Optimization of Water-Alternating-CO2 Injection Field Operations Using a Machine-Learning-Assisted Workflow 利用机器学习辅助工作流程优化水-交变co2注入现场作业

Day 1 Tue, October 26, 2021

Pub Date : 2021-10-19 DOI: 10.2118/203913-ms

You Junyu, Ampomah William, Sun Qian

This paper will present a robust workflow to address multi-objective optimization (MOO) of CO2-EOR-sequestration projects with a large number of operational control parameters. Farnsworth Unit (FWU) field, a mature oil reservoir undergoing CO2 alternating water injection (CO2-WAG) enhanced oil recovery (EOR), will be used as a field case to validate the proposed optimization protocol. The expected outcome of this work would be a repository of Pareto-optimal solutions of multiple objective functions, including oil recovery, carbon storage volume, and project economics. FWU's numerical model is employed to demonstrate the proposed optimization workflow. Since using MOO requires computationally intensive procedures, machine-learning-based proxies are introduced to substitute for the high-fidelity model, thus reducing the total computation overhead. The vector machine regression combined with the Gaussian kernel (Gaussian -SVR) is utilized to construct proxies. An iterative self-adjusting process prepares the training knowledgebase to develop robust proxies and minimizes computational time. The proxies’ hyperparameters will be optimally designed using Bayesian Optimization to achieve better generalization performance. Trained proxies will be coupled with Multi-objective Particle Swarm Optimization (MOPSO) protocol to construct the Pareto-front solution repository. The outcomes of this workflow will be a repository containing Pareto-optimal solutions of multiple objectives considered in the CO2-WAG project. The proposed optimization workflow will be compared with another established methodology employing a multi-layer neural network to validate its feasibility in handling MOO with a large number of parameters to control. Optimization parameters used include operational variables that might be used to control the CO2-WAG process, such as the duration of the water/gas injection period, producer bottomhole pressure (BHP) control, and water injection rate of each well included in the numerical model. It is proven that the workflow coupling Gaussian -SVR proxies and the iterative self-adjusting protocol is more computationally efficient. The MOO process is made more rapid by squeezing the size of the required training knowledgebase while maintaining the high accuracy of the optimized results. The outcomes of the optimization study show promising results in successfully establishing the solution repository considering multiple objective functions. Results are also verified by validating the Pareto fronts with simulation results using obtained optimized control parameters. The outcome from this work could provide field operators an opportunity to design a CO2-WAG project using as many inputs as possible from the reservoir models. The proposed work introduces a novel concept that couples Gaussian -SVR proxies with a self-adjusting protocol to increase the computational efficiency of the proposed workflow and to guarantee the high accuracy of the obtained

本文将提出一个强大的工作流来解决具有大量操作控制参数的二氧化碳- eor封存项目的多目标优化(MOO)。Farnsworth Unit (FWU)油田是一个成熟的油藏，正在进行二氧化碳交替注水(CO2- wag)提高采收率(EOR)，将作为现场案例来验证所提出的优化方案。这项工作的预期结果将是一个多目标函数的帕累托最优解库，包括石油采收率、碳储量和项目经济性。采用FWU的数值模型对所提出的优化流程进行了验证。由于使用MOO需要计算密集型的过程，因此引入了基于机器学习的代理来替代高保真模型，从而减少了总计算开销。利用向量机回归与高斯核(Gaussian -SVR)相结合的方法构建代理。迭代的自调整过程使训练知识库能够开发健壮的代理并最大限度地减少计算时间。采用贝叶斯优化方法对代理的超参数进行优化设计，以获得更好的泛化性能。将训练好的代理与多目标粒子群优化(MOPSO)协议相结合，构建Pareto-front解库。该工作流的结果将是一个存储库，其中包含CO2-WAG项目中考虑的多个目标的帕累托最优解。将提出的优化工作流程与另一种采用多层神经网络的方法进行比较，以验证其在处理具有大量参数控制的MOO时的可行性。所使用的优化参数包括可用于控制CO2-WAG过程的操作变量，例如注水/注气周期的持续时间、生产井底压力(BHP)控制以及数值模型中包含的每口井的注水速度。结果表明，耦合高斯-SVR代理和迭代自调整协议的工作流计算效率更高。通过压缩所需训练知识库的大小，使mooo过程更加快速，同时保持优化结果的高准确性。优化研究结果表明，在成功建立考虑多目标函数的解决方案库方面取得了良好的效果。利用得到的优化控制参数，将Pareto front与仿真结果进行了验证。这项工作的结果可以为油田运营商提供一个机会，利用尽可能多的油藏模型输入来设计CO2-WAG项目。本文提出了一种新的概念，将高斯-SVR代理与自调整协议相结合，以提高所提出工作流的计算效率，并保证所获得的优化结果的高准确性。更重要的是，该工作流可以优化复杂CO2-WAG过程中使用的大量控制参数，这大大扩展了其在解决具有相似期望结果的各种项目中的大规模多目标优化问题方面的实用性。

{"title":"Optimization of Water-Alternating-CO2 Injection Field Operations Using a Machine-Learning-Assisted Workflow","authors":"You Junyu, Ampomah William, Sun Qian","doi":"10.2118/203913-ms","DOIUrl":"https://doi.org/10.2118/203913-ms","url":null,"abstract":"\u0000 This paper will present a robust workflow to address multi-objective optimization (MOO) of CO2-EOR-sequestration projects with a large number of operational control parameters. Farnsworth Unit (FWU) field, a mature oil reservoir undergoing CO2 alternating water injection (CO2-WAG) enhanced oil recovery (EOR), will be used as a field case to validate the proposed optimization protocol. The expected outcome of this work would be a repository of Pareto-optimal solutions of multiple objective functions, including oil recovery, carbon storage volume, and project economics.\u0000 FWU's numerical model is employed to demonstrate the proposed optimization workflow. Since using MOO requires computationally intensive procedures, machine-learning-based proxies are introduced to substitute for the high-fidelity model, thus reducing the total computation overhead. The vector machine regression combined with the Gaussian kernel (Gaussian -SVR) is utilized to construct proxies. An iterative self-adjusting process prepares the training knowledgebase to develop robust proxies and minimizes computational time. The proxies’ hyperparameters will be optimally designed using Bayesian Optimization to achieve better generalization performance. Trained proxies will be coupled with Multi-objective Particle Swarm Optimization (MOPSO) protocol to construct the Pareto-front solution repository.\u0000 The outcomes of this workflow will be a repository containing Pareto-optimal solutions of multiple objectives considered in the CO2-WAG project. The proposed optimization workflow will be compared with another established methodology employing a multi-layer neural network to validate its feasibility in handling MOO with a large number of parameters to control. Optimization parameters used include operational variables that might be used to control the CO2-WAG process, such as the duration of the water/gas injection period, producer bottomhole pressure (BHP) control, and water injection rate of each well included in the numerical model. It is proven that the workflow coupling Gaussian -SVR proxies and the iterative self-adjusting protocol is more computationally efficient. The MOO process is made more rapid by squeezing the size of the required training knowledgebase while maintaining the high accuracy of the optimized results. The outcomes of the optimization study show promising results in successfully establishing the solution repository considering multiple objective functions. Results are also verified by validating the Pareto fronts with simulation results using obtained optimized control parameters. The outcome from this work could provide field operators an opportunity to design a CO2-WAG project using as many inputs as possible from the reservoir models.\u0000 The proposed work introduces a novel concept that couples Gaussian -SVR proxies with a self-adjusting protocol to increase the computational efficiency of the proposed workflow and to guarantee the high accuracy of the obtained","PeriodicalId":11146,"journal":{"name":"Day 1 Tue, October 26, 2021","volume":"151 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"77588830","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

High-Order Adaptive Scheme for Reactive Transport in Heterogeneous Porous Media 非均质多孔介质中反应输运的高阶自适应方案

Day 1 Tue, October 26, 2021

Pub Date : 2021-10-19 DOI: 10.2118/203972-ms

Ricardo H. Deucher, H. Tchelepi

Subsurface sequestration of carbon dioxide, contaminant transport, and enhanced oil recovery processes often involve complex reaction dynamics. The rock-fluid interactions span a very wide range of length and time scales, and it is important for the numerical solutions to resolve these scales properly. To address these challenges, we extend the adaptive transport scheme for the simulation of reactive transport in heterogeneous porous media developed previously (Deucher and Tchelepi, 2021) to account for (a) higher-order approximation of the convective fluxes and (b) coupling with a chemical solver connected to geochemical databases. The numerical results demonstrate that adaptivity is more effective when a higher-order approximation of the fluxes is used. This is because of lower levels of numerical dispersion compared with low-order approximations, which helps resolve the displacement fronts more accurately. As a result, the regions that experience significant concentration and saturation gradients are more confined, and that leads to improvements in the computational efficiency of the adaptive scheme. The robustness of the approach is demonstrated using a highly heterogeneous two-phase case with multiple wells and a variable total liquid-rate. Due to the modularity of the adaptive scheme, coupling with a chemical solver module is straightforward. The scheme is tested for a three-dimensional case that considers injection of carbonated water in a reservoir matrix of calcite. The results show that the adaptive scheme leads to an accurate representation of the reference concentration distributions of the six reactive components throughout the simulation and leads to a large reduction in the number of cell updates required to achieve the solution.

二氧化碳的地下封存、污染物输送和提高采收率的过程往往涉及复杂的反应动力学。岩石-流体相互作用跨越了非常广泛的长度和时间尺度，对于数值解来说，正确地解决这些尺度是非常重要的。为了应对这些挑战，我们扩展了以前开发的用于模拟非均质多孔介质中反应性输移的自适应输移方案(Deucher和Tchelepi, 2021)，以考虑(a)对流通量的高阶近似和(b)与连接到地球化学数据库的化学求解器的耦合。数值结果表明，当采用高阶近似时，自适应更有效。这是因为与低阶近似相比，数值色散水平较低，这有助于更准确地解决位移前沿。因此，经历显著浓度和饱和度梯度的区域更受限制，这导致自适应方案的计算效率提高。该方法的鲁棒性通过具有多井和可变总液率的高度非均质两相情况得到了验证。由于自适应方案的模块化，与化学求解器模块的耦合是直接的。在方解石储层基质中注入碳酸水的三维情况下，对该方案进行了测试。结果表明，该自适应方案在整个模拟过程中准确地表示了六种反应组分的参考浓度分布，并大大减少了实现该解决方案所需的单元更新次数。

{"title":"High-Order Adaptive Scheme for Reactive Transport in Heterogeneous Porous Media","authors":"Ricardo H. Deucher, H. Tchelepi","doi":"10.2118/203972-ms","DOIUrl":"https://doi.org/10.2118/203972-ms","url":null,"abstract":"\u0000 Subsurface sequestration of carbon dioxide, contaminant transport, and enhanced oil recovery processes often involve complex reaction dynamics. The rock-fluid interactions span a very wide range of length and time scales, and it is important for the numerical solutions to resolve these scales properly. To address these challenges, we extend the adaptive transport scheme for the simulation of reactive transport in heterogeneous porous media developed previously (Deucher and Tchelepi, 2021) to account for (a) higher-order approximation of the convective fluxes and (b) coupling with a chemical solver connected to geochemical databases.\u0000 The numerical results demonstrate that adaptivity is more effective when a higher-order approximation of the fluxes is used. This is because of lower levels of numerical dispersion compared with low-order approximations, which helps resolve the displacement fronts more accurately. As a result, the regions that experience significant concentration and saturation gradients are more confined, and that leads to improvements in the computational efficiency of the adaptive scheme. The robustness of the approach is demonstrated using a highly heterogeneous two-phase case with multiple wells and a variable total liquid-rate.\u0000 Due to the modularity of the adaptive scheme, coupling with a chemical solver module is straightforward. The scheme is tested for a three-dimensional case that considers injection of carbonated water in a reservoir matrix of calcite. The results show that the adaptive scheme leads to an accurate representation of the reference concentration distributions of the six reactive components throughout the simulation and leads to a large reduction in the number of cell updates required to achieve the solution.","PeriodicalId":11146,"journal":{"name":"Day 1 Tue, October 26, 2021","volume":"134 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"88909512","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

A GPU-Based Integrated Simulation Framework for Modelling of Complex Subsurface Applications 基于gpu的复杂地下应用建模集成仿真框架

Day 1 Tue, October 26, 2021

Pub Date : 2021-10-19 DOI: 10.2118/204000-ms

M. Khait, D. Voskov

Alternative to CPU computing architectures, such as GPU, continue to evolve increasing the gap in peak memory bandwidth achievable on a conventional workstation or laptop. Such architectures are attractive for reservoir simulation, which performance is generally bounded by system memory bandwidth. However, to harvest the benefit of a new architecture, the source code has to be inevitably rewritten, sometimes almost completely. One of the biggest challenges here is to refactor the Jacobian assembly which typically involves large volumes of code and complex data processing. We demonstrate an effective and general way to simplify the linearization stage extracting complex physics-related computations from the main simulation loop and leaving only an algebraic multi-linear interpolation kernel instead. In this work, we provide the detailed description of simulation performance benefits from execution of the entire nonlinear loop on the GPU platform. We evaluate the computational performance of Delft Advanced Research Terra Simulator (DARTS) for various subsurface applications of practical interest on both CPU and GPU platforms, comparing particular workflow phases including Jacobian assembly and linear system solution with both stages of the Constraint Pressure Residual preconditioner.

CPU计算架构的替代方案，如GPU，不断发展，增加了传统工作站或笔记本电脑上可实现的峰值内存带宽的差距。这种架构对油藏模拟很有吸引力，因为油藏模拟的性能通常受到系统内存带宽的限制。然而，为了获得新架构的好处，源代码必须不可避免地重写，有时几乎是完全重写。这里最大的挑战之一是重构雅可比集合，这通常涉及大量代码和复杂的数据处理。我们展示了一种有效和通用的方法来简化线性化阶段，从主模拟环路中提取复杂的物理相关计算，而只留下一个代数多线性插值核。在这项工作中，我们详细描述了在GPU平台上执行整个非线性回路所带来的仿真性能优势。我们评估了Delft Advanced Research Terra Simulator (DARTS)在CPU和GPU平台上各种实际应用的计算性能，比较了特定的工作流程阶段，包括雅可比装配和线性系统解决方案与约束压力剩余预调节器的两个阶段。

{"title":"A GPU-Based Integrated Simulation Framework for Modelling of Complex Subsurface Applications","authors":"M. Khait, D. Voskov","doi":"10.2118/204000-ms","DOIUrl":"https://doi.org/10.2118/204000-ms","url":null,"abstract":"\u0000 Alternative to CPU computing architectures, such as GPU, continue to evolve increasing the gap in peak memory bandwidth achievable on a conventional workstation or laptop. Such architectures are attractive for reservoir simulation, which performance is generally bounded by system memory bandwidth. However, to harvest the benefit of a new architecture, the source code has to be inevitably rewritten, sometimes almost completely. One of the biggest challenges here is to refactor the Jacobian assembly which typically involves large volumes of code and complex data processing. We demonstrate an effective and general way to simplify the linearization stage extracting complex physics-related computations from the main simulation loop and leaving only an algebraic multi-linear interpolation kernel instead. In this work, we provide the detailed description of simulation performance benefits from execution of the entire nonlinear loop on the GPU platform. We evaluate the computational performance of Delft Advanced Research Terra Simulator (DARTS) for various subsurface applications of practical interest on both CPU and GPU platforms, comparing particular workflow phases including Jacobian assembly and linear system solution with both stages of the Constraint Pressure Residual preconditioner.","PeriodicalId":11146,"journal":{"name":"Day 1 Tue, October 26, 2021","volume":"3 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87757233","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Streamline Tracing and Applications in Dual Porosity Dual Permeability Models 流线示踪及其在双孔双渗模型中的应用

Day 1 Tue, October 26, 2021

Pub Date : 2021-10-19 DOI: 10.2118/203993-ms

Tsubasa Onishi, Hongquan Chen, Jiang Xie, Shusei Tanaka, D. Kam, Zhiming Wang, X. Wen, A. Datta-Gupta

Streamline-based methods have proven to be effective for various subsurface flow and transport modeling problems. However, the applications are limited in dual-porosity and dual-permeability (DPDK) system due to the difficulty in describing interactions between matrix and fracture during streamline tracing. In this work, we present a robust streamline tracing algorithm for DPDK models and apply the new algorithm to rate allocation optimization in a waterflood reservoir. In the proposed method, streamlines are traced in both fracture and matrix domains. The inter-fluxes between fracture and matrix are described by switching streamlines from one domain to another using a probability computed based on the inter-fluxes. The approach is fundamentally similar to the existing streamline tracing technique and can be utilized in streamline-assisted applications, such as flow diagnostics, history matching, and production optimization. The proposed method is benchmarked with a finite-volume based approach where grid-based time-of-flight was obtained by solving the stationary transport equation. We first validated our method using simple examples. Visual time-of-flight comparisons as well as tracer concentration and allocation factors at wells show good agreement. Next, we applied the proposed method to field scale models to demonstrate the robustness. The results show that our method offers reduced numerical artifacts and better represents reservoir heterogeneity and well connectivity with sub-grid resolutions. The proposed method is then used for rate allocation optimization in DPDK models. A streamline-based gradient free algorithm is used to optimize net present value by adjusting both injection and production well rates under operational constraints. The results show that the optimized schedule offers significant improvement in recovery factor, net present value, and sweep efficiency compared to the base scenario using equal rate injection and production. The optimization algorithm is computationally efficient as it requires only a few forward reservoir simulations.

基于流线的方法已被证明对各种地下流动和输运建模问题是有效的。然而，由于在流线示踪过程中难以描述基质与裂缝之间的相互作用，该技术在双孔双渗(DPDK)体系中的应用受到限制。在这项工作中，我们提出了一种针对DPDK模型的鲁棒流线跟踪算法，并将新算法应用于注水油藏的费率分配优化。在该方法中，在裂缝和矩阵域中都可以跟踪流线。裂缝与基体之间的互流是用基于互流计算的概率从一个域切换到另一个域来描述的。该方法与现有的流线跟踪技术基本相似，可用于流线辅助应用，如流体诊断、历史匹配和生产优化。该方法以基于有限体积的方法为基准，通过求解静止输运方程获得基于网格的飞行时间。我们首先用简单的例子验证了我们的方法。目视飞行时间对比以及井中示踪剂浓度和分配因子显示出良好的一致性。接下来，我们将提出的方法应用于现场尺度模型以证明其鲁棒性。结果表明，该方法减少了数值伪影，更好地代表了储层非均质性和井连通性的亚网格分辨率。然后将该方法用于DPDK模型的速率分配优化。一种基于流线的无梯度算法通过在操作约束下调整注入井和生产井的速度来优化净现值。结果表明，与使用等量注入和生产的基本方案相比，优化后的方案在采收率、净现值和波及效率方面都有显著提高。该优化算法只需要少量的正向油藏模拟，计算效率高。

{"title":"Streamline Tracing and Applications in Dual Porosity Dual Permeability Models","authors":"Tsubasa Onishi, Hongquan Chen, Jiang Xie, Shusei Tanaka, D. Kam, Zhiming Wang, X. Wen, A. Datta-Gupta","doi":"10.2118/203993-ms","DOIUrl":"https://doi.org/10.2118/203993-ms","url":null,"abstract":"Streamline-based methods have proven to be effective for various subsurface flow and transport modeling problems. However, the applications are limited in dual-porosity and dual-permeability (DPDK) system due to the difficulty in describing interactions between matrix and fracture during streamline tracing. In this work, we present a robust streamline tracing algorithm for DPDK models and apply the new algorithm to rate allocation optimization in a waterflood reservoir.\u0000 In the proposed method, streamlines are traced in both fracture and matrix domains. The inter-fluxes between fracture and matrix are described by switching streamlines from one domain to another using a probability computed based on the inter-fluxes. The approach is fundamentally similar to the existing streamline tracing technique and can be utilized in streamline-assisted applications, such as flow diagnostics, history matching, and production optimization.\u0000 The proposed method is benchmarked with a finite-volume based approach where grid-based time-of-flight was obtained by solving the stationary transport equation. We first validated our method using simple examples. Visual time-of-flight comparisons as well as tracer concentration and allocation factors at wells show good agreement. Next, we applied the proposed method to field scale models to demonstrate the robustness. The results show that our method offers reduced numerical artifacts and better represents reservoir heterogeneity and well connectivity with sub-grid resolutions. The proposed method is then used for rate allocation optimization in DPDK models. A streamline-based gradient free algorithm is used to optimize net present value by adjusting both injection and production well rates under operational constraints. The results show that the optimized schedule offers significant improvement in recovery factor, net present value, and sweep efficiency compared to the base scenario using equal rate injection and production. The optimization algorithm is computationally efficient as it requires only a few forward reservoir simulations.","PeriodicalId":11146,"journal":{"name":"Day 1 Tue, October 26, 2021","volume":"11 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"90393387","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 3

An Optimization-Based Facility Network Solver for Well Allocation in Reservoir Simulation 基于优化的油藏模拟配井设施网络求解器

Day 1 Tue, October 26, 2021

Pub Date : 2021-10-19 DOI: 10.2118/203954-ms

K. Wiegand, Y. Zaretskiy, K. Mukundakrishnan, L. Patacchini

When coupling reservoir simulators to surface network solvers, an often used strategy is to perform a rule or priority-driven allocation based on individual well and group constraints, augmented by back-pressure constraints computed periodically by the network solver. The allocation algorithm uses an iteration that applies well-established heuristics in a sequential manner until all constraints are met. The rationale for this approach is simply to maximize performance and simulation throughput; one of its drawbacks is that the computed allocation may not be feasible with respect to the overall network balance, especially in cases where not all wells can be choked individually. In the work presented here, the authors integrate the well allocation process into the network flow solver, in the form of an optimization engine, to ensure that the solution conforms to the network rate and pressure balance equations. Results for three stand-alone test cases are discussed.

当将油藏模拟器与地面网络求解器耦合时，一种常用的策略是根据单井和井群约束执行规则或优先级驱动的分配，并辅以网络求解器周期性计算的背压约束。分配算法使用迭代，以顺序的方式应用已建立的启发式，直到满足所有约束。这种方法的基本原理只是为了最大化性能和模拟吞吐量;它的一个缺点是，计算分配对于整个网络平衡来说可能不可行，特别是在并非所有井都可以单独堵塞的情况下。在本文中，作者将井分配过程以优化引擎的形式集成到网络流量求解器中，以确保解符合网络速率和压力平衡方程。讨论了三个独立测试用例的结果。

引用次数: 0

A Physics-Based Proxy for Surface and Subsurface Coupled Simulation Models 基于物理的地表和地下耦合模拟模型代理

Day 1 Tue, October 26, 2021

Pub Date : 2021-10-19 DOI: 10.2118/204004-ms

Changdong Yang, Jincong He, S. Du, Zhenzhen Wang, Tsubasa Onishi, X. Guan, Jianping Chen, X. Wen

Full-physics subsurface simulation models coupled with surface network can be computationally expensive. In this paper, we propose a physics-based subsurface model proxy that significantly reduces the run-time of the coupled model to enable rapid decision-making for reservoir management. In the coupled model the subsurface reservoir simulator generates well inflow performance relationship (IPR) curves which are used by the surface network model to determine well rates that satisfy surface constraints. In the proposed proxy model, the CPU intensive reservoir simulation is replaced with an IPR database constructed from a data pool of one or multiple simulation runs. The IPR database captures well performance that represents subsurface reservoir dynamics. The proxy model can then be used to predict the production performance of new scenarios – for example new drilling sequence – by intelligently looking up the appropriate IPR curves for oil, gas and water phases for each well and solving it with the surface network. All necessary operational events in the surface network and field management logic (such as facility constraints, well conditional shut-in, and group guide rate balancing) for the full-coupled model can be implemented and honored. In the proposed proxy model, while the reservoir simulation component is eliminated for efficiency. The entirety of the surface network model is retained, which offers certain advantages. It is particularly suitable for investigating the impact of different surface operations, such as maintenance schedule and production routing changes, with the aim of minimizing production capacity off-line due to maintenance. Replacing the computationally intensive subsurface simulation with the appropriate IPR significantly improves the run time of the coupled model while preserving the essential physics of the reservoir. The accuracy depends on the difference between the scenarios that the proxy is trained on and the scenarios being evaluated. Initial testing with a complex reservoir with more than 300 wells showed the accuracy of the proxy model to be more than 95%. The computation speedup could be an order of magnitude, depending largely on complexity of the surface network model. Prior work exists in the literature that uses decline curves to replicate subsurface model performance. The use of the multi-phase IPR database and the intelligent lookup mechanism in the proposed method allows it to be more accurate and flexible in handling complexities such as multi-phase flow and interference in the surface network.

与地表网络相结合的全物理场地下模拟模型在计算上是非常昂贵的。在本文中，我们提出了一种基于物理的地下模型代理，可以显着减少耦合模型的运行时间，从而为油藏管理提供快速决策。在耦合模型中，地下油藏模拟器生成井流入动态关系曲线(IPR)，这些曲线被地面网络模型用于确定满足地面约束的井速。在提出的代理模型中，CPU密集型油藏模拟被由一个或多个模拟运行的数据池构建的IPR数据库所取代。IPR数据库捕获了代表地下储层动态的油井动态。然后，代理模型可以通过智能查找每口井的油、气和水相的IPR曲线，并与地面网络一起求解，来预测新情况下的生产动态，例如新的钻井顺序。对于全耦合模型，地面网络和现场管理逻辑(如设施约束、井条件关井和组导向速率平衡)中所有必要的操作事件都可以实现和执行。在提出的代理模型中，为了提高效率，省去了油藏模拟部分。保留了表面网络模型的整体性，具有一定的优势。它特别适用于研究不同地面作业的影响，例如维护计划和生产路线的变化，目的是最大限度地减少由于维护而导致的离线生产能力。用适当的IPR取代计算密集型的地下模拟，显著提高了耦合模型的运行时间，同时保留了油藏的基本物理特性。准确性取决于代理所训练的场景和正在评估的场景之间的差异。在300多口井的复杂油藏中进行的初步测试表明，代理模型的准确性超过95%。计算速度可能是一个数量级，这主要取决于表面网络模型的复杂性。先前的文献中存在使用衰减曲线来复制地下模型性能的工作。该方法利用多相IPR数据库和智能查找机制，在处理多相流和地面网络干扰等复杂问题时更加准确和灵活。

{"title":"A Physics-Based Proxy for Surface and Subsurface Coupled Simulation Models","authors":"Changdong Yang, Jincong He, S. Du, Zhenzhen Wang, Tsubasa Onishi, X. Guan, Jianping Chen, X. Wen","doi":"10.2118/204004-ms","DOIUrl":"https://doi.org/10.2118/204004-ms","url":null,"abstract":"\u0000 Full-physics subsurface simulation models coupled with surface network can be computationally expensive. In this paper, we propose a physics-based subsurface model proxy that significantly reduces the run-time of the coupled model to enable rapid decision-making for reservoir management.\u0000 In the coupled model the subsurface reservoir simulator generates well inflow performance relationship (IPR) curves which are used by the surface network model to determine well rates that satisfy surface constraints. In the proposed proxy model, the CPU intensive reservoir simulation is replaced with an IPR database constructed from a data pool of one or multiple simulation runs. The IPR database captures well performance that represents subsurface reservoir dynamics. The proxy model can then be used to predict the production performance of new scenarios – for example new drilling sequence – by intelligently looking up the appropriate IPR curves for oil, gas and water phases for each well and solving it with the surface network. All necessary operational events in the surface network and field management logic (such as facility constraints, well conditional shut-in, and group guide rate balancing) for the full-coupled model can be implemented and honored.\u0000 In the proposed proxy model, while the reservoir simulation component is eliminated for efficiency. The entirety of the surface network model is retained, which offers certain advantages. It is particularly suitable for investigating the impact of different surface operations, such as maintenance schedule and production routing changes, with the aim of minimizing production capacity off-line due to maintenance. Replacing the computationally intensive subsurface simulation with the appropriate IPR significantly improves the run time of the coupled model while preserving the essential physics of the reservoir. The accuracy depends on the difference between the scenarios that the proxy is trained on and the scenarios being evaluated. Initial testing with a complex reservoir with more than 300 wells showed the accuracy of the proxy model to be more than 95%. The computation speedup could be an order of magnitude, depending largely on complexity of the surface network model.\u0000 Prior work exists in the literature that uses decline curves to replicate subsurface model performance. The use of the multi-phase IPR database and the intelligent lookup mechanism in the proposed method allows it to be more accurate and flexible in handling complexities such as multi-phase flow and interference in the surface network.","PeriodicalId":11146,"journal":{"name":"Day 1 Tue, October 26, 2021","volume":"18 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"84855728","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

Modelling Methane Extraction from Stimulated Coalbed Influenced by Multidomain and Thermal Effects 多域效应和热效应影响下煤层气模拟开采

Day 1 Tue, October 26, 2021

Pub Date : 2021-10-19 DOI: 10.2118/203990-ms

Wai Li, Jishan Liu, J. Zeng, Y. Leong, D. Elsworth, Jianwei Tian

The process of extracting coalbed methane (CBM) is not only of significance for unconventional energy supply but also important in mine safety. The recent advance in fracking techniques, such as carbon dioxide (CO2) fracking, intensifies the complexity of stimulated coalbeds. This work focuses on developing a fully coupled multidomain model to describe and get insight into the process of CBM extraction, particularly from those compound-fractured coalbeds. A group of partial differential equations (PDEs) are derived to characterize gas transport from matrix to fractures and borehole. A stimulated coalbed is defined as an assembly of three interacting porous media: matrix, continuous fractures (CF) and radial primary hydraulic fracture (RF). Matrix and CF constitute a dual-porosity-dual-permeability system, while RF is simplified as an 1-D cracked medium. These media further form three distinct domains: non-stimulated reservoir domain (NSRD), stimulated reservoir domain (SRD) and RF. The effects of coal deformation, heat transfer, and non-thermal sorption are coupled into the model to reflect the multiple processes in CBM extraction. The finite element method is employed to numerically solve the PDEs. The proposed model is verified by comparing its simulation results to a set of well production data from Southern Qinshui Basin in Shanxi Province, China. Great consistency is observed, showing the satisfactory accuracy of the model for CBM extraction. After that, the difference between various stimulation patterns is presented by simulating the CBM extraction process with different stimulation patterns including (1) unstimulated coalbed; (2) double-wing fracture + NSRD; (3) multiple RFs + NSRD; (4) SRD + NSRD and (5) multiple RFs + SRD + NSRD. The results suggest that Pattern (5) (often formed by CO2 fracking) boosts the efficiency of CBM extraction because it generates a complex fracture network at various scales by both increasing the number of radial fractures and activating the micro-fractures in coal blocks. Sensitivity analysis is also performed to understand the influences of key factors on gas extraction from a stimulated coalbed with multiple domains. It is found that the distinct properties of different domains originate various evolutions, which in turn influences the CBM production. Ignoring thermal effects in CBM extraction will either overestimate or underestimate the production, which is the net effect of thermal strain and non-isothermal sorption. The proposed model provides a useful approach to accurately evaluate CBM extraction by taking the complex evolutions of coalbed properties and the interactions between different components and domains into account. The importance of multidomain and thermal effects for CBM reservoir simulation is also highlighted.

煤层气开采过程不仅对非常规能源供应具有重要意义，而且对矿井安全也具有重要意义。压裂技术的最新进展，如二氧化碳(CO2)压裂，加剧了受激煤层的复杂性。这项工作的重点是开发一个完全耦合的多域模型，以描述和深入了解煤层气开采过程，特别是那些复合裂缝煤层。导出了一组偏微分方程(PDEs)来描述气体从基质到裂缝和井眼的输运。煤层气被定义为三种相互作用的多孔介质的组合:基质、连续裂缝(CF)和径向原生水力裂缝(RF)。基质和CF构成双孔隙度-双渗透率体系，RF简化为一维裂缝介质。这些介质进一步形成了三个不同的区域:非增产储层区域(NSRD)、增产储层区域(SRD)和RF。将煤体变形、换热和非热吸附的影响耦合到模型中，以反映煤层气抽采过程的多过程。采用有限元法对其进行数值求解。通过与山西沁水盆地南部一组井的生产数据对比，验证了该模型的有效性。结果表明，该模型具有较好的煤层气提取精度。在此基础上，通过对不同煤层气开采模式的模拟分析，得出了不同煤层气开采模式之间的差异。(2)双翼断裂+ NSRD;(3)多个RFs + NSRD;(4) SRD + NSRD和(5)多个RFs + SRD + NSRD。结果表明，模式(5)(通常由CO2压裂形成)既增加了径向裂缝数量，又激活了煤块内的微裂缝，形成了不同尺度的复杂裂缝网络，从而提高了煤层气开采效率。通过敏感性分析，了解了多域煤层气开采过程中关键因素对煤层气开采的影响。研究发现，不同结构域的不同性质导致不同的演化，进而影响煤层气的产生。忽略煤层气开采过程中的热效应将高估或低估煤层气的产量，这是热应变和非等温吸附的净效应。该模型考虑了煤层性质的复杂演变以及不同组分和区域之间的相互作用，为准确评价煤层气开采提供了一种有用的方法。强调了多域效应和热效应对煤层气储层模拟的重要性。

{"title":"Modelling Methane Extraction from Stimulated Coalbed Influenced by Multidomain and Thermal Effects","authors":"Wai Li, Jishan Liu, J. Zeng, Y. Leong, D. Elsworth, Jianwei Tian","doi":"10.2118/203990-ms","DOIUrl":"https://doi.org/10.2118/203990-ms","url":null,"abstract":"\u0000 The process of extracting coalbed methane (CBM) is not only of significance for unconventional energy supply but also important in mine safety. The recent advance in fracking techniques, such as carbon dioxide (CO2) fracking, intensifies the complexity of stimulated coalbeds. This work focuses on developing a fully coupled multidomain model to describe and get insight into the process of CBM extraction, particularly from those compound-fractured coalbeds. A group of partial differential equations (PDEs) are derived to characterize gas transport from matrix to fractures and borehole. A stimulated coalbed is defined as an assembly of three interacting porous media: matrix, continuous fractures (CF) and radial primary hydraulic fracture (RF). Matrix and CF constitute a dual-porosity-dual-permeability system, while RF is simplified as an 1-D cracked medium. These media further form three distinct domains: non-stimulated reservoir domain (NSRD), stimulated reservoir domain (SRD) and RF. The effects of coal deformation, heat transfer, and non-thermal sorption are coupled into the model to reflect the multiple processes in CBM extraction. The finite element method is employed to numerically solve the PDEs. The proposed model is verified by comparing its simulation results to a set of well production data from Southern Qinshui Basin in Shanxi Province, China. Great consistency is observed, showing the satisfactory accuracy of the model for CBM extraction. After that, the difference between various stimulation patterns is presented by simulating the CBM extraction process with different stimulation patterns including (1) unstimulated coalbed; (2) double-wing fracture + NSRD; (3) multiple RFs + NSRD; (4) SRD + NSRD and (5) multiple RFs + SRD + NSRD. The results suggest that Pattern (5) (often formed by CO2 fracking) boosts the efficiency of CBM extraction because it generates a complex fracture network at various scales by both increasing the number of radial fractures and activating the micro-fractures in coal blocks. Sensitivity analysis is also performed to understand the influences of key factors on gas extraction from a stimulated coalbed with multiple domains. It is found that the distinct properties of different domains originate various evolutions, which in turn influences the CBM production. Ignoring thermal effects in CBM extraction will either overestimate or underestimate the production, which is the net effect of thermal strain and non-isothermal sorption. The proposed model provides a useful approach to accurately evaluate CBM extraction by taking the complex evolutions of coalbed properties and the interactions between different components and domains into account. The importance of multidomain and thermal effects for CBM reservoir simulation is also highlighted.","PeriodicalId":11146,"journal":{"name":"Day 1 Tue, October 26, 2021","volume":"227 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"78841948","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Adaptive Time Stepping, Linearization and a Posteriori Error Control for Multiphase Flow with Wells 多相流井的自适应时间步进、线性化和后验误差控制

Day 1 Tue, October 26, 2021

Pub Date : 2021-10-19 DOI: 10.2118/203974-ms

E. Ahmed, Ø. Klemetsdal, X. Raynaud, O. Møyner, H. Nilsen

We present in this paper a-posteriori error estimators for multiphase flow with singular well sources. The estimators are fully and locally computable, distinguish the various error components, and target the singular effects of wells. On the basis of these estimators we design an adaptive fully-implicit solver that yields optimal nonlinear iterations and efficient time-stepping, while maintaining the accuracy of the solution. A key point is that the singular nature of the solution in the near-well region is explicitly captured and efficiently estimated using the adequate norms. Numerical experiments illustrate the efficiency of our estimates and the performance of the adaptive algorithm.

本文给出了具有奇异井源的多相流的后验误差估计。该估计器是完全和局部可计算的，能够区分各种误差分量，并针对井的奇异效应。在这些估计量的基础上，我们设计了一个自适应全隐式求解器，在保持解的准确性的同时，产生最优的非线性迭代和有效的时间步进。关键的一点是，近井区域解的奇异性被明确地捕获，并使用适当的规范有效地估计。数值实验证明了我们估计的有效性和自适应算法的性能。

引用次数: 1

A Massively Parallel Reservoir Simulator on the GPU Architecture 基于GPU架构的大规模并行水库模拟器

Day 1 Tue, October 26, 2021

Pub Date : 2021-10-19 DOI: 10.2118/203918-ms

Usuf Middya, A. Manea, Alhubail Maitham Makki, Todd R. Ferguson, T. Byer, A. Dogru

Reservoir simulation computational costs have been continuously growing due to high-resolution reservoir characterization, increasing model complexity, and uncertainty analysis workflows. Reducing simulation costs by upscaling is often necessary for operational requirements. Fast evolving HPC technologies offer opportunities to reduce cost without compromising fidelity. This work presents a novel in-house massively parallel full-physics reservoir simulator running on the emerging GPU architecture. Almost all the simulation kernels have been designed and implemented to honor the GPU SIMD programming paradigm. These kernels include physical property calculations, phase equilibrium computations, Jacobian construction, linear and nonlinear solvers, and wells. Novel techniques are devised in various kernels to expose enough parallelism to ensure that the control and data-flow patterns are well suited for the GPU environment. Mixed-precision computation is also employed when appropriate (e.g., in derivative calculation) to reduce computational costs without compromising the solution accuracy. The GPU implementation of the simulator is tested and benchmarked using various reservoir models, ranging from the synthetic SPE10 Benchmark (Christie & Blunt, 2001) to several industrial-scale models. These real field models range in size from tens of millions of cells to more than billion cells with black-oil and multicomponent compositional fluid. The GPU simulator is benchmarked on the IBM AC922 massively parallel architecture having tens of NVidia Volta V100 GPUs. To compare performance with CPU architectures, an optimized CPU implementation of the simulator is benchmarked on the IBM AC922 CPUs and on a cluster consisting of thousands of Intel's Haswell-EP Xeon® CPU E5-2680 v3. Detailed analysis of several numerical experiments comparing the simulator performance on the GPU and the CPU architectures is presented. In almost all of the cases, the analysis shows that the use of hardware acceleration offers substantial benefits in terms of wall time and power consumption. This novel in-house full-physics, black-oil and compositional reservoir simulator employs several novel techniques in various simulation kernels to ensure full utilization of the GPU resources. Detailed analysis is presented to highlight the simulator performance in terms of runtime reduction, parallel scalability and power savings.

由于高分辨率油藏表征、不断增加的模型复杂性和不确定性分析工作流程，油藏模拟计算成本不断增长。通过升级来降低模拟成本通常是满足操作需求所必需的。快速发展的高性能计算技术提供了在不影响保真度的情况下降低成本的机会。这项工作提出了一个新颖的内部大规模并行全物理油藏模拟器运行在新兴的GPU架构。几乎所有的仿真内核都是为GPU SIMD编程范式而设计和实现的。这些核心包括物理性质计算、相平衡计算、雅可比矩阵构造、线性和非线性求解器以及井。在各种内核中设计了新的技术来暴露足够的并行性，以确保控制和数据流模式非常适合GPU环境。在适当的情况下(例如，在导数计算中)也采用混合精度计算来减少计算成本而不影响解的精度。模拟器的GPU实现使用各种储层模型进行测试和基准测试，范围从合成SPE10基准(Christie & Blunt, 2001)到几个工业规模模型。这些实际油田模型的大小范围从数千万到超过10亿个细胞，其中含有黑油和多组分组成流体。GPU模拟器在IBM AC922大规模并行架构上进行基准测试，该架构拥有数十个NVidia Volta V100 GPU。为了与CPU架构进行性能比较，模拟器的优化CPU实现在IBM AC922 CPU和由数千个英特尔Haswell-EP Xeon®CPU E5-2680 v3组成的集群上进行基准测试。并对仿真器在GPU和CPU架构下的性能进行了数值比较。在几乎所有的情况下，分析表明，使用硬件加速在运行时间和功耗方面提供了实质性的好处。这个新颖的内部全物理、黑油和成分油藏模拟器在各种模拟内核中采用了几种新颖的技术，以确保GPU资源的充分利用。详细分析了该仿真器在减少运行时间、并行可扩展性和节能方面的性能。

{"title":"A Massively Parallel Reservoir Simulator on the GPU Architecture","authors":"Usuf Middya, A. Manea, Alhubail Maitham Makki, Todd R. Ferguson, T. Byer, A. Dogru","doi":"10.2118/203918-ms","DOIUrl":"https://doi.org/10.2118/203918-ms","url":null,"abstract":"\u0000 Reservoir simulation computational costs have been continuously growing due to high-resolution reservoir characterization, increasing model complexity, and uncertainty analysis workflows. Reducing simulation costs by upscaling is often necessary for operational requirements. Fast evolving HPC technologies offer opportunities to reduce cost without compromising fidelity.\u0000 This work presents a novel in-house massively parallel full-physics reservoir simulator running on the emerging GPU architecture. Almost all the simulation kernels have been designed and implemented to honor the GPU SIMD programming paradigm. These kernels include physical property calculations, phase equilibrium computations, Jacobian construction, linear and nonlinear solvers, and wells. Novel techniques are devised in various kernels to expose enough parallelism to ensure that the control and data-flow patterns are well suited for the GPU environment. Mixed-precision computation is also employed when appropriate (e.g., in derivative calculation) to reduce computational costs without compromising the solution accuracy.\u0000 The GPU implementation of the simulator is tested and benchmarked using various reservoir models, ranging from the synthetic SPE10 Benchmark (Christie & Blunt, 2001) to several industrial-scale models. These real field models range in size from tens of millions of cells to more than billion cells with black-oil and multicomponent compositional fluid. The GPU simulator is benchmarked on the IBM AC922 massively parallel architecture having tens of NVidia Volta V100 GPUs. To compare performance with CPU architectures, an optimized CPU implementation of the simulator is benchmarked on the IBM AC922 CPUs and on a cluster consisting of thousands of Intel's Haswell-EP Xeon® CPU E5-2680 v3. Detailed analysis of several numerical experiments comparing the simulator performance on the GPU and the CPU architectures is presented. In almost all of the cases, the analysis shows that the use of hardware acceleration offers substantial benefits in terms of wall time and power consumption.\u0000 This novel in-house full-physics, black-oil and compositional reservoir simulator employs several novel techniques in various simulation kernels to ensure full utilization of the GPU resources. Detailed analysis is presented to highlight the simulator performance in terms of runtime reduction, parallel scalability and power savings.","PeriodicalId":11146,"journal":{"name":"Day 1 Tue, October 26, 2021","volume":"33 1","pages":""},"PeriodicalIF":0.0,"publicationDate":"2021-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"87979323","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1