Rigid transformations for stabilized lower dimensional space to support subsurface uncertainty quantification and interpretation

IF 2 3区地球科学 Q3 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Computational Geosciences Pub Date : 2024-03-08 DOI:10.1007/s10596-024-10278-x

Ademide O. Mabadeje, Michael J. Pyrcz

{"title":"Rigid transformations for stabilized lower dimensional space to support subsurface uncertainty quantification and interpretation","authors":"Ademide O. Mabadeje, Michael J. Pyrcz","doi":"10.1007/s10596-024-10278-x","DOIUrl":null,"url":null,"abstract":"Subsurface datasets commonly are big data, i.e., they meet big data criteria, such as large data volume, significant feature variety, high sampling velocity, and limited data veracity. Large data volume is enhanced by the large number of necessary features derived from the imposition of various features derived from physical, engineering, and geological inputs, constraints that may invoke the curse of dimensionality. Existing dimensionality reduction (DR) methods are either linear or nonlinear; however, for subsurface datasets, nonlinear dimensionality reduction (NDR) methods are most applicable due to data complexity. Metric-multidimensional scaling (MDS) is a suitable NDR method that retains the data's intrinsic structure and could quantify uncertainty space. However, like other NDR methods, MDS is limited by its inability to achieve a stabilized unique solution of the low dimensional space (LDS) invariant to Euclidean transformations and has no extension for inclusions of out-of-sample points (OOSP). To support subsurface inferential workflows, it is imperative to transform these datasets into meaningful, stable representations of reduced dimensionality that permit OOSP without model recalculation.We propose using rigid transformations to obtain a unique solution of stabilized Euclidean invariant representation for LDS. First, compute a dissimilarity matrix as the MDS input using a distance metric to obtain the LDS for \\(N\\)-samples and repeat for multiple realizations. Then, select the base case and perform a rigid transformation on further realizations to obtain rotation and translation matrices that enforce Euclidean transformation invariance under ensemble expectation. The expected stabilized solution identifies anchor positions using a convex hull algorithm compared to the \\(N+1\\) case from prior matrices to obtain a stabilized representation consisting of the OOSP. Next, the loss function and normalized stress are computed via distances between samples in the high-dimensional space and LDS to quantify and visualize distortion in a 2-D registration problem. To test our proposed workflow, a different sample size experiment is conducted for Euclidean and Manhattan distance metrics as the MDS dissimilarity matrix inputs for a synthetic dataset.The workflow is also demonstrated using wells from the Duvernay Formation and OOSP with different petrophysical properties typically found in unconventional reservoirs to track and understand its behavior in LDS. The results show that our method is effective for NDR methods to obtain unique, repeatable, stable representations of LDS invariant to Euclidean transformations. In addition, we propose a distortion-based metric, stress ratio (SR), that quantifies and visualizes the uncertainty space for samples in subsurface datasets, which is helpful for model updating and inferential analysis for OOSP. Therefore, we recommend the workflow's integration as an invariant transformation mitigation unit in LDS for unique solutions to ensure repeatability and rational comparison in NDR methods for subsurface energy resource engineering big data inferential workflows, e.g., analog data selection and sensitivity analysis.","PeriodicalId":10662,"journal":{"name":"Computational Geosciences","volume":"59 1","pages":""},"PeriodicalIF":2.0000,"publicationDate":"2024-03-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computational Geosciences","FirstCategoryId":"89","ListUrlMain":"https://doi.org/10.1007/s10596-024-10278-x","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}

引用次数: 0

Abstract

Subsurface datasets commonly are big data, i.e., they meet big data criteria, such as large data volume, significant feature variety, high sampling velocity, and limited data veracity. Large data volume is enhanced by the large number of necessary features derived from the imposition of various features derived from physical, engineering, and geological inputs, constraints that may invoke the curse of dimensionality. Existing dimensionality reduction (DR) methods are either linear or nonlinear; however, for subsurface datasets, nonlinear dimensionality reduction (NDR) methods are most applicable due to data complexity. Metric-multidimensional scaling (MDS) is a suitable NDR method that retains the data's intrinsic structure and could quantify uncertainty space. However, like other NDR methods, MDS is limited by its inability to achieve a stabilized unique solution of the low dimensional space (LDS) invariant to Euclidean transformations and has no extension for inclusions of out-of-sample points (OOSP). To support subsurface inferential workflows, it is imperative to transform these datasets into meaningful, stable representations of reduced dimensionality that permit OOSP without model recalculation.

We propose using rigid transformations to obtain a unique solution of stabilized Euclidean invariant representation for LDS. First, compute a dissimilarity matrix as the MDS input using a distance metric to obtain the LDS for \(N\)-samples and repeat for multiple realizations. Then, select the base case and perform a rigid transformation on further realizations to obtain rotation and translation matrices that enforce Euclidean transformation invariance under ensemble expectation. The expected stabilized solution identifies anchor positions using a convex hull algorithm compared to the \(N+1\) case from prior matrices to obtain a stabilized representation consisting of the OOSP. Next, the loss function and normalized stress are computed via distances between samples in the high-dimensional space and LDS to quantify and visualize distortion in a 2-D registration problem. To test our proposed workflow, a different sample size experiment is conducted for Euclidean and Manhattan distance metrics as the MDS dissimilarity matrix inputs for a synthetic dataset.

The workflow is also demonstrated using wells from the Duvernay Formation and OOSP with different petrophysical properties typically found in unconventional reservoirs to track and understand its behavior in LDS. The results show that our method is effective for NDR methods to obtain unique, repeatable, stable representations of LDS invariant to Euclidean transformations. In addition, we propose a distortion-based metric, stress ratio (SR), that quantifies and visualizes the uncertainty space for samples in subsurface datasets, which is helpful for model updating and inferential analysis for OOSP. Therefore, we recommend the workflow's integration as an invariant transformation mitigation unit in LDS for unique solutions to ensure repeatability and rational comparison in NDR methods for subsurface energy resource engineering big data inferential workflows, e.g., analog data selection and sensitivity analysis.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

稳定低维空间的刚性变换，支持地下不确定性量化和解释

地下数据集通常是大数据，即符合大数据标准，如数据量大、特征种类多、采样速度快、数据真实性有限。大数据量因大量必要特征而增强，这些特征来自于物理、工程和地质输入的各种特征，这些约束条件可能会引发维度诅咒。现有的降维（DR）方法既有线性的，也有非线性的；然而，对于地下数据集，由于数据的复杂性，非线性降维（NDR）方法最为适用。公制多维缩放（MDS）是一种合适的非线性降维方法，它保留了数据的内在结构，并能量化不确定性空间。然而，与其他 NDR 方法一样，MDS 也受到限制，因为它无法获得不受欧几里得变换影响的低维空间（LDS）的稳定唯一解，也无法扩展到包含样本外点（OOSP）。为了支持地下推断工作流程，必须将这些数据集转换为有意义的、稳定的降维表示，以便在不重新计算模型的情况下实现 OOSP。首先，使用距离度量计算一个不相似矩阵作为 MDS 输入，以获得 \(N\)-samples 的 LDS，并重复多次实现。然后，选择基本情况并对进一步的实现进行刚性变换，以获得在集合期望下执行欧几里得变换不变性的旋转和平移矩阵。预期稳定解使用凸壳算法确定锚点位置，并与先验矩阵的（N+1）情况进行比较，以获得由 OOSP 组成的稳定表示。接下来，通过高维空间样本间的距离和 LDS 计算损失函数和归一化应力，以量化和可视化二维配准问题中的失真。为了测试我们提出的工作流程，对合成数据集的欧几里得距离和曼哈顿距离指标作为 MDS 差异性矩阵输入进行了不同样本大小的实验。该工作流程还使用了非常规储层中具有不同岩石物理特性的 Duvernay Formation 和 OOSP 油井进行了演示，以跟踪和了解其在 LDS 中的行为。结果表明，我们的方法对于 NDR 方法来说是有效的，可以获得对欧几里得变换不变的 LDS 唯一、可重复、稳定的表示。此外，我们还提出了一种基于变形的度量--应力比（SR），该度量可量化和可视化地下数据集中样本的不确定性空间，有助于 OOSP 的模型更新和推理分析。因此，我们建议将该工作流程整合为 LDS 中的一个不变量变换缓解单元，用于独特的解决方案，以确保地下能源资源工程大数据推断工作流程（如模拟数据选择和灵敏度分析）中 NDR 方法的可重复性和合理比较。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Computational Geosciences 地学-地球科学综合

CiteScore

6.10

自引率

4.00%

发文量

审稿时长

6-12 weeks

期刊介绍： Computational Geosciences publishes high quality papers on mathematical modeling, simulation, numerical analysis, and other computational aspects of the geosciences. In particular the journal is focused on advanced numerical methods for the simulation of subsurface flow and transport, and associated aspects such as discretization, gridding, upscaling, optimization, data assimilation, uncertainty assessment, and high performance parallel and grid computing. Papers treating similar topics but with applications to other fields in the geosciences, such as geomechanics, geophysics, oceanography, or meteorology, will also be considered. The journal provides a platform for interaction and multidisciplinary collaboration among diverse scientific groups, from both academia and industry, which share an interest in developing mathematical models and efficient algorithms for solving them, such as mathematicians, engineers, chemists, physicists, and geoscientists.