Nadège Polette , Olivier Le Maître , Pierre Sochala , Alexandrine Gesret
{"title":"Change of measure for Bayesian field inversion with hierarchical hyperparameters sampling","authors":"Nadège Polette , Olivier Le Maître , Pierre Sochala , Alexandrine Gesret","doi":"10.1016/j.jcp.2025.113888","DOIUrl":null,"url":null,"abstract":"<div><div>This paper proposes an effective treatment of hyperparameters in the Bayesian inference of a scalar field from indirect observations. Obtaining the joint posterior distribution of the field and its hyperparameters is challenging. The infinite dimensionality of the field requires a finite parametrization that usually involves hyperparameters to reflect the limited prior knowledge. In the present work, we consider a Karhunen-Loève (KL) decomposition for the random field and hyperparameters to account for the lack of prior knowledge of its autocovariance function. The hyperparameters must be inferred. To efficiently sample jointly the KL coordinates of the field and the autocovariance hyperparameters, we introduce a change of measure to reformulate the joint posterior distribution into a hierarchical Bayesian form. The likelihood depends only on the field's coordinates in a fixed KL basis, with a prior conditioned on the hyperparameters. We exploit this structure to derive an efficient Markov Chain Monte Carlo (MCMC) sampling scheme based on an adapted Metropolis–Hasting algorithm. We rely on surrogate models (Polynomial Chaos expansions) of the forward model predictions to further accelerate the MCMC sampling. A first application to a transient diffusion problem shows that our method is consistent with other approaches based on a change of coordinates (Sraj et al., 2016, <span><span>[21]</span></span>). A second application to a seismic traveltime tomography highlights the importance of inferring the hyperparameters. A third application to a 2D anisotropic groundwater flow problem illustrates the method on a more complex geometry.</div></div>","PeriodicalId":352,"journal":{"name":"Journal of Computational Physics","volume":"529 ","pages":"Article 113888"},"PeriodicalIF":3.8000,"publicationDate":"2025-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Computational Physics","FirstCategoryId":"101","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0021999125001718","RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/25 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
This paper proposes an effective treatment of hyperparameters in the Bayesian inference of a scalar field from indirect observations. Obtaining the joint posterior distribution of the field and its hyperparameters is challenging. The infinite dimensionality of the field requires a finite parametrization that usually involves hyperparameters to reflect the limited prior knowledge. In the present work, we consider a Karhunen-Loève (KL) decomposition for the random field and hyperparameters to account for the lack of prior knowledge of its autocovariance function. The hyperparameters must be inferred. To efficiently sample jointly the KL coordinates of the field and the autocovariance hyperparameters, we introduce a change of measure to reformulate the joint posterior distribution into a hierarchical Bayesian form. The likelihood depends only on the field's coordinates in a fixed KL basis, with a prior conditioned on the hyperparameters. We exploit this structure to derive an efficient Markov Chain Monte Carlo (MCMC) sampling scheme based on an adapted Metropolis–Hasting algorithm. We rely on surrogate models (Polynomial Chaos expansions) of the forward model predictions to further accelerate the MCMC sampling. A first application to a transient diffusion problem shows that our method is consistent with other approaches based on a change of coordinates (Sraj et al., 2016, [21]). A second application to a seismic traveltime tomography highlights the importance of inferring the hyperparameters. A third application to a 2D anisotropic groundwater flow problem illustrates the method on a more complex geometry.
本文提出了一种对标量场间接观测贝叶斯推断中超参数的有效处理方法。获得关节场的后验分布及其超参数是具有挑战性的。该领域的无限维度需要有限的参数化,通常涉及超参数来反映有限的先验知识。在目前的工作中,我们考虑了随机场和超参数的karhunen - lo (KL)分解,以解释其自协方差函数缺乏先验知识。必须推断出超参数。为了有效地对场的KL坐标和自协方差超参数进行联合采样,我们引入了测度变化,将联合后验分布重新表述为层次贝叶斯形式。可能性仅取决于场的坐标在一个固定的KL基础上,与先验条件的超参数。我们利用这种结构推导出一种有效的基于Metropolis-Hasting算法的马尔可夫链蒙特卡罗(MCMC)采样方案。我们依靠前向模型预测的代理模型(多项式混沌展开)来进一步加速MCMC采样。对瞬态扩散问题的首次应用表明,我们的方法与基于坐标变化的其他方法是一致的(Sraj等,2016,[21])。地震走时层析成像的第二个应用突出了推断超参数的重要性。第三个应用是二维各向异性地下水流动问题,说明了该方法在更复杂的几何上的应用。
期刊介绍:
Journal of Computational Physics thoroughly treats the computational aspects of physical problems, presenting techniques for the numerical solution of mathematical equations arising in all areas of physics. The journal seeks to emphasize methods that cross disciplinary boundaries.
The Journal of Computational Physics also publishes short notes of 4 pages or less (including figures, tables, and references but excluding title pages). Letters to the Editor commenting on articles already published in this Journal will also be considered. Neither notes nor letters should have an abstract.