SIAM Journal on Scientific Computing最新文献_第3页

Adaptivity in Local Kernel Based Methods for Approximating the Action of Linear Operators 基于局部核的线性算子作用近似方法的自适应性

IF 3.1 2区数学 Q1 MATHEMATICS, APPLIED

SIAM Journal on Scientific Computing

Pub Date : 2024-08-19 DOI: 10.1137/23m1598052

Jonah A. Reeger

SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page A2683-A2708, August 2024.
Abstract. Building on the successes of local kernel methods for approximating the solutions to partial differential equations (PDEs) and the evaluation of definite integrals (quadrature/cubature), a local estimate of the error in such approximations is developed. This estimate is useful for determining locations in the solution domain where increased node density (equivalently, reduction in the spacing between nodes) can decrease the error in the solution. An adaptive procedure for adding nodes to the domain for both the approximation of derivatives and the approximate evaluation of definite integrals is described. This method efficiently computes the error estimate at a set of prescribed points and adds new nodes for approximation where the error is too large. Computational experiments demonstrate close agreement between the error estimate and actual absolute error in the approximation. Such methods are necessary or desirable when approximating solutions to PDEs (or in the case of quadrature/cubature), where the initial data and subsequent solution (or integrand) exhibit localized features that require significant refinement to resolve and where uniform increases in the density of nodes across the entire computational domain is not possible or too burdensome.

SIAM 科学计算期刊》，第 46 卷第 4 期，第 A2683-A2708 页，2024 年 8 月。摘要基于近似偏微分方程（PDE）解和定积分（正交/余量）求值的局部内核方法的成功经验，本文提出了对此类近似误差的局部估计。该估计值有助于确定求解域中增加节点密度（等同于减少节点间距）可减少求解误差的位置。本文介绍了一种自适应程序，用于在导数近似和定积分近似计算的域中添加节点。这种方法能有效计算一组规定点的误差估计值，并在误差过大的地方添加新节点进行近似。计算实验表明，误差估计值与近似值的实际绝对误差非常接近。在近似 PDEs 的解时（或在正交/余量的情况下），初始数据和随后的解（或积分）表现出局部特征，需要大量细化才能解决，而在整个计算域中均匀增加节点密度是不可能或过于繁琐时，这种方法是必要或可取的。

{"title":"Adaptivity in Local Kernel Based Methods for Approximating the Action of Linear Operators","authors":"Jonah A. Reeger","doi":"10.1137/23m1598052","DOIUrl":"https://doi.org/10.1137/23m1598052","url":null,"abstract":"SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page A2683-A2708, August 2024. Abstract. Building on the successes of local kernel methods for approximating the solutions to partial differential equations (PDEs) and the evaluation of definite integrals (quadrature/cubature), a local estimate of the error in such approximations is developed. This estimate is useful for determining locations in the solution domain where increased node density (equivalently, reduction in the spacing between nodes) can decrease the error in the solution. An adaptive procedure for adding nodes to the domain for both the approximation of derivatives and the approximate evaluation of definite integrals is described. This method efficiently computes the error estimate at a set of prescribed points and adds new nodes for approximation where the error is too large. Computational experiments demonstrate close agreement between the error estimate and actual absolute error in the approximation. Such methods are necessary or desirable when approximating solutions to PDEs (or in the case of quadrature/cubature), where the initial data and subsequent solution (or integrand) exhibit localized features that require significant refinement to resolve and where uniform increases in the density of nodes across the entire computational domain is not possible or too burdensome.","PeriodicalId":49526,"journal":{"name":"SIAM Journal on Scientific Computing","volume":"16 1","pages":""},"PeriodicalIF":3.1,"publicationDate":"2024-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142217607","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A Meshless Solver for Blood Flow Simulations in Elastic Vessels Using a Physics-Informed Neural Network 利用物理信息神经网络模拟弹性血管中血流的无网格求解器

IF 3.1 2区数学 Q1 MATHEMATICS, APPLIED

SIAM Journal on Scientific Computing

Pub Date : 2024-08-19 DOI: 10.1137/23m1622696

Han Zhang, Raymond H. Chan, Xue-Cheng Tai

SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page C479-C507, August 2024.
Abstract. Investigating blood flow in the cardiovascular system is crucial for assessing cardiovascular health. Computational approaches offer some noninvasive alternatives to measure blood flow dynamics. Numerical simulations based on traditional methods such as finite-element and other numerical discretizations have been extensively studied and have yielded excellent results. However, adapting these methods to real-life simulations remains a complex task. In this paper, we propose a method that offers flexibility and can efficiently handle real-life simulations. We suggest utilizing the physics-informed neural network to solve the Navier–Stokes equation in a deformable domain, specifically addressing the simulation of blood flow in elastic vessels. Our approach models blood flow using an incompressible, viscous Navier–Stokes equation in an arbitrary Lagrangian–Eulerian form. The mechanical model for the vessel wall structure is formulated by an equation of Newton’s second law of momentum and linear elasticity to the force exerted by the fluid flow. Our method is a mesh-free approach that eliminates the need for discretization and meshing of the computational domain. This makes it highly efficient in solving simulations involving complex geometries. Additionally, with the availability of well-developed open-source machine learning framework packages and parallel modules, our method can easily be accelerated through GPU computing and parallel computing. To evaluate our approach, we conducted experiments on regular cylinder vessels as well as vessels with plaque on their walls. We compared our results to a solution calculated by finite element methods using a dense grid and small time steps, which we considered as the ground truth solution. We report the relative error and the time consumed to solve the problem, highlighting the advantages of our method.

SIAM 科学计算期刊》，第 46 卷第 4 期，第 C479-C507 页，2024 年 8 月。摘要调查心血管系统中的血流对于评估心血管健康至关重要。计算方法为测量血流动态提供了一些非侵入性的替代方法。基于有限元和其他数值离散等传统方法的数值模拟已得到广泛研究，并取得了卓越成果。然而，将这些方法应用于实际模拟仍是一项复杂的任务。在本文中，我们提出了一种既灵活又能有效处理实际模拟的方法。我们建议利用物理信息神经网络来求解可变形域中的纳维-斯托克斯方程，特别是解决弹性血管中的血流模拟问题。我们的方法使用不可压缩的粘性纳维-斯托克斯方程，以任意拉格朗日-欧拉形式模拟血流。血管壁结构的力学模型是通过牛顿第二动量定律和线性弹性方程来计算流体流动所产生的力。我们的方法是一种无网格方法，无需对计算域进行离散化和网格划分。这使得它在解决涉及复杂几何形状的模拟时非常高效。此外，由于有了完善的开源机器学习框架包和并行模块，我们的方法可以很容易地通过 GPU 计算和并行计算来加速。为了评估我们的方法，我们在普通圆柱形容器和壁上有斑块的容器上进行了实验。我们将我们的结果与使用密集网格和小时间步长的有限元方法计算出的解决方案进行了比较，并将其视为基本真理解决方案。我们报告了相对误差和解决问题所消耗的时间，突出了我们方法的优势。

{"title":"A Meshless Solver for Blood Flow Simulations in Elastic Vessels Using a Physics-Informed Neural Network","authors":"Han Zhang, Raymond H. Chan, Xue-Cheng Tai","doi":"10.1137/23m1622696","DOIUrl":"https://doi.org/10.1137/23m1622696","url":null,"abstract":"SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page C479-C507, August 2024. Abstract. Investigating blood flow in the cardiovascular system is crucial for assessing cardiovascular health. Computational approaches offer some noninvasive alternatives to measure blood flow dynamics. Numerical simulations based on traditional methods such as finite-element and other numerical discretizations have been extensively studied and have yielded excellent results. However, adapting these methods to real-life simulations remains a complex task. In this paper, we propose a method that offers flexibility and can efficiently handle real-life simulations. We suggest utilizing the physics-informed neural network to solve the Navier–Stokes equation in a deformable domain, specifically addressing the simulation of blood flow in elastic vessels. Our approach models blood flow using an incompressible, viscous Navier–Stokes equation in an arbitrary Lagrangian–Eulerian form. The mechanical model for the vessel wall structure is formulated by an equation of Newton’s second law of momentum and linear elasticity to the force exerted by the fluid flow. Our method is a mesh-free approach that eliminates the need for discretization and meshing of the computational domain. This makes it highly efficient in solving simulations involving complex geometries. Additionally, with the availability of well-developed open-source machine learning framework packages and parallel modules, our method can easily be accelerated through GPU computing and parallel computing. To evaluate our approach, we conducted experiments on regular cylinder vessels as well as vessels with plaque on their walls. We compared our results to a solution calculated by finite element methods using a dense grid and small time steps, which we considered as the ground truth solution. We report the relative error and the time consumed to solve the problem, highlighting the advantages of our method.","PeriodicalId":49526,"journal":{"name":"SIAM Journal on Scientific Computing","volume":"6 1","pages":""},"PeriodicalIF":3.1,"publicationDate":"2024-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142217606","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

The Effect of Approximate Coarsest-Level Solves on the Convergence of Multigrid V-Cycle Methods 近似最粗求解对多网格 V 循环方法收敛性的影响

IF 3.1 2区数学 Q1 MATHEMATICS, APPLIED

SIAM Journal on Scientific Computing

Pub Date : 2024-08-14 DOI: 10.1137/23m1578255

Petr Vacek, Erin Carson, Kirk M. Soodhalter

SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page A2634-A2659, August 2024.
Abstract. The multigrid V-cycle method is a popular method for solving systems of linear equations. It computes an approximate solution by using smoothing on fine levels and solving a system of linear equations on the coarsest level. Solving on the coarsest level depends on the size and difficulty of the problem. If the size permits, it is typical to use a direct method based on LU or Cholesky decomposition. In settings with large coarsest-level problems, approximate solvers such as iterative Krylov subspace methods, or direct methods based on low-rank approximation, are often used. The accuracy of the coarsest-level solver is typically determined based on the experience of the users with the concrete problems and methods. In this paper, we present an approach to analyzing the effects of approximate coarsest-level solves on the convergence of the V-cycle method for symmetric positive definite problems. Using these results, we derive coarsest-level stopping criterion through which we may control the difference between the approximation computed by a V-cycle method with approximate coarsest-level solver and the approximation which would be computed if the coarsest-level problems were solved exactly. The coarsest-level stopping criterion may thus be set up such that the V-cycle method converges to a chosen finest-level accuracy in (nearly) the same number of V-cycle iterations as the V-cycle method with exact coarsest-level solver. We also utilize the theoretical results to discuss how the convergence of the V-cycle method may be affected by the choice of a tolerance in a coarsest-level stopping criterion based on the relative residual norm. Reproducibility of computational results. This paper has been awarded the “SIAM Reproducibility Badge: Code and data available” as a recognition that the authors have followed reproducibility principles valued by SISC and the scientific computing community. Code and data that allow readers to reproduce the results in this paper are available at https://doi.org/10.5281/zenodo.11178544.

SIAM 科学计算期刊》，第 46 卷第 4 期，第 A2634-A2659 页，2024 年 8 月。摘要多网格 V 循环法是求解线性方程组的一种常用方法。它通过在精细级上进行平滑处理并在最粗级上求解线性方程组来计算近似解。在最粗级上求解取决于问题的大小和难度。如果规模允许，通常采用基于 LU 或 Cholesky 分解的直接方法。在处理大型最粗级问题时，通常会使用近似求解器，如迭代克雷洛夫子空间方法或基于低阶近似的直接方法。最粗级求解器的精度通常根据用户对具体问题和方法的经验来确定。在本文中，我们提出了一种分析近似最粗级求解对对称正定问题 V 循环方法收敛性影响的方法。利用这些结果，我们得出了最粗级停止准则，通过该准则，我们可以控制使用近似最粗级求解器的 V 循环方法计算出的近似值与精确求解最粗级问题时计算出的近似值之间的差异。因此，可以设定最粗级停止准则，使 V 循环方法在（几乎）与使用精确最粗级求解器的 V 循环方法相同的 V 循环迭代次数内收敛到所选的最细级精度。我们还利用理论结果讨论了 V 循环方法的收敛性如何受到基于相对残差规范的最粗级停止准则中容限选择的影响。计算结果的可重复性。本文被授予 "SIAM 可重复性徽章"：代码和数据可用"，以表彰作者遵循了 SISC 和科学计算界重视的可重现性原则。读者可通过 https://doi.org/10.5281/zenodo.11178544 获取代码和数据，以重现本文中的结果。

{"title":"The Effect of Approximate Coarsest-Level Solves on the Convergence of Multigrid V-Cycle Methods","authors":"Petr Vacek, Erin Carson, Kirk M. Soodhalter","doi":"10.1137/23m1578255","DOIUrl":"https://doi.org/10.1137/23m1578255","url":null,"abstract":"SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page A2634-A2659, August 2024. Abstract. The multigrid V-cycle method is a popular method for solving systems of linear equations. It computes an approximate solution by using smoothing on fine levels and solving a system of linear equations on the coarsest level. Solving on the coarsest level depends on the size and difficulty of the problem. If the size permits, it is typical to use a direct method based on LU or Cholesky decomposition. In settings with large coarsest-level problems, approximate solvers such as iterative Krylov subspace methods, or direct methods based on low-rank approximation, are often used. The accuracy of the coarsest-level solver is typically determined based on the experience of the users with the concrete problems and methods. In this paper, we present an approach to analyzing the effects of approximate coarsest-level solves on the convergence of the V-cycle method for symmetric positive definite problems. Using these results, we derive coarsest-level stopping criterion through which we may control the difference between the approximation computed by a V-cycle method with approximate coarsest-level solver and the approximation which would be computed if the coarsest-level problems were solved exactly. The coarsest-level stopping criterion may thus be set up such that the V-cycle method converges to a chosen finest-level accuracy in (nearly) the same number of V-cycle iterations as the V-cycle method with exact coarsest-level solver. We also utilize the theoretical results to discuss how the convergence of the V-cycle method may be affected by the choice of a tolerance in a coarsest-level stopping criterion based on the relative residual norm. Reproducibility of computational results. This paper has been awarded the “SIAM Reproducibility Badge: Code and data available” as a recognition that the authors have followed reproducibility principles valued by SISC and the scientific computing community. Code and data that allow readers to reproduce the results in this paper are available at https://doi.org/10.5281/zenodo.11178544.","PeriodicalId":49526,"journal":{"name":"SIAM Journal on Scientific Computing","volume":"23 1","pages":""},"PeriodicalIF":3.1,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142217608","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Implicit Adaptive Mesh Refinement for Dispersive Tsunami Propagation 针对分散海啸传播的隐式自适应网格细化

IF 3.1 2区数学 Q1 MATHEMATICS, APPLIED

SIAM Journal on Scientific Computing

Pub Date : 2024-08-14 DOI: 10.1137/23m1585210

Marsha J. Berger, Randall J. LeVeque

SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page B554-B578, August 2024.
Abstract. We present an algorithm to solve the dispersive depth-averaged Serre–Green–Naghdi equations using patch-based adaptive mesh refinement. These equations require adding additional higher derivative terms to the nonlinear shallow water equations. This has been implemented as a new component of the open source GeoClaw software that is widely used for modeling tsunamis, storm surge, and related hazards, improving its accuracy on shorter wavelength phenomena. We use a formulation that requires solving an elliptic system of equations at each time step, making the method implicit. The adaptive algorithm allows different time steps on different refinement levels and solves the implicit equations level by level. Computational examples are presented to illustrate the stability and accuracy on a radially symmetric test case and two realistic tsunami modeling problems, including a hypothetical asteroid impact creating a short wavelength tsunami for which dispersive terms are necessary. Reproducibility of computational results. This paper has been awarded the “SIAM Reproducibility Badge: Code and data available” as a recognition that the authors have followed reproducibility principles valued by SISC and the scientific computing community. Code and data that allow readers to reproduce the results in this paper are available at https://github.com/rjleveque/ImplicitAMR-paper and in the supplementary materials (ImplicitAMR-paper.zip [174KB]).

SIAM 科学计算期刊》，第 46 卷第 4 期，第 B554-B578 页，2024 年 8 月。摘要我们提出了一种利用基于补丁的自适应网格细化求解色散深度平均 Serre-Green-Naghdi 方程的算法。这些方程需要在非线性浅水方程中添加额外的高导数项。这已作为开源 GeoClaw 软件的一个新组件实施，该软件被广泛用于海啸、风暴潮和相关灾害的建模，提高了其对较短波长现象的精度。我们使用的方法需要在每个时间步求解一个椭圆方程组，因此该方法是隐式的。自适应算法允许在不同细化级别上采用不同的时间步长，并逐级求解隐式方程。计算实例展示了一个径向对称测试案例和两个现实海啸建模问题的稳定性和准确性，包括一个假想的小行星撞击产生的短波长海啸，其中分散项是必要的。计算结果的可重复性。本文被授予 "SIAM 可重现徽章"：代码和数据可用"，以表彰作者遵循了 SISC 和科学计算界重视的可重现性原则。读者可以通过 https://github.com/rjleveque/ImplicitAMR-paper 和补充材料（ImplicitAMR-paper.zip [174KB]）中的代码和数据重现本文的结果。

{"title":"Implicit Adaptive Mesh Refinement for Dispersive Tsunami Propagation","authors":"Marsha J. Berger, Randall J. LeVeque","doi":"10.1137/23m1585210","DOIUrl":"https://doi.org/10.1137/23m1585210","url":null,"abstract":"SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page B554-B578, August 2024. Abstract. We present an algorithm to solve the dispersive depth-averaged Serre–Green–Naghdi equations using patch-based adaptive mesh refinement. These equations require adding additional higher derivative terms to the nonlinear shallow water equations. This has been implemented as a new component of the open source GeoClaw software that is widely used for modeling tsunamis, storm surge, and related hazards, improving its accuracy on shorter wavelength phenomena. We use a formulation that requires solving an elliptic system of equations at each time step, making the method implicit. The adaptive algorithm allows different time steps on different refinement levels and solves the implicit equations level by level. Computational examples are presented to illustrate the stability and accuracy on a radially symmetric test case and two realistic tsunami modeling problems, including a hypothetical asteroid impact creating a short wavelength tsunami for which dispersive terms are necessary. Reproducibility of computational results. This paper has been awarded the “SIAM Reproducibility Badge: Code and data available” as a recognition that the authors have followed reproducibility principles valued by SISC and the scientific computing community. Code and data that allow readers to reproduce the results in this paper are available at https://github.com/rjleveque/ImplicitAMR-paper and in the supplementary materials (ImplicitAMR-paper.zip [174KB]).","PeriodicalId":49526,"journal":{"name":"SIAM Journal on Scientific Computing","volume":"26 1","pages":""},"PeriodicalIF":3.1,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142217612","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A Preconditioned Krylov Subspace Method for Linear Inverse Problems with General-Form Tikhonov Regularization 用通用形式提霍诺夫正则化处理线性逆问题的预处理克雷洛夫子空间方法

IF 3.1 2区数学 Q1 MATHEMATICS, APPLIED

SIAM Journal on Scientific Computing

Pub Date : 2024-08-14 DOI: 10.1137/23m1593802

Haibo Li

SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page A2607-A2633, August 2024.
Abstract. Tikhonov regularization is a widely used technique in solving inverse problems that can enforce prior properties on the desired solution. In this paper, we propose a Krylov subspace based iterative method for solving linear inverse problems with general-form Tikhonov regularization term [math], where [math] is a positive semidefinite matrix. An iterative process called the preconditioned Golub–Kahan bidiagonalization (pGKB) is designed, which implicitly utilizes a proper preconditioner to generate a series of solution subspaces with desirable properties encoded by the regularizer [math]. Based on the pGKB process, we propose an iterative regularization algorithm via projecting the original problem onto small dimensional solution subspaces. We analyze the regularization properties of this algorithm, including the incorporation of prior properties of the desired solution into the solution subspace and the semiconvergence behavior of the regularized solution. To overcome instabilities caused by semiconvergence, we further propose two pGKB based hybrid regularization algorithms. All the proposed algorithms are tested on both small-scale and large-scale linear inverse problems. Numerical results demonstrate that these iterative algorithms exhibit excellent performance, outperforming other state-of-the-art algorithms in some cases.

SIAM 科学计算期刊》，第 46 卷第 4 期，第 A2607-A2633 页，2024 年 8 月。摘要Tikhonov 正则化是一种广泛应用于逆问题求解的技术，它可以对所求解强制执行先验特性。本文提出了一种基于 Krylov 子空间的迭代方法，用于求解带有一般形式 Tikhonov 正则化项 [math]（其中 [math] 为正半有限矩阵）的线性逆问题。我们设计了一种称为预处理 Golub-Kahan 二对角化（pGKB）的迭代过程，它隐含地利用适当的预处理来生成一系列具有正则化[math]所编码的理想特性的求解子空间。基于 pGKB 过程，我们提出了一种通过将原始问题投影到小维度解子空间的迭代正则化算法。我们分析了该算法的正则化特性，包括将所需解的先验特性纳入解子空间以及正则化解的半收敛行为。为了克服半收敛引起的不稳定性，我们进一步提出了两种基于 pGKB 的混合正则化算法。所有提出的算法都在小型和大型线性逆问题上进行了测试。数值结果表明，这些迭代算法表现出色，在某些情况下优于其他最先进的算法。

{"title":"A Preconditioned Krylov Subspace Method for Linear Inverse Problems with General-Form Tikhonov Regularization","authors":"Haibo Li","doi":"10.1137/23m1593802","DOIUrl":"https://doi.org/10.1137/23m1593802","url":null,"abstract":"SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page A2607-A2633, August 2024. Abstract. Tikhonov regularization is a widely used technique in solving inverse problems that can enforce prior properties on the desired solution. In this paper, we propose a Krylov subspace based iterative method for solving linear inverse problems with general-form Tikhonov regularization term [math], where [math] is a positive semidefinite matrix. An iterative process called the preconditioned Golub–Kahan bidiagonalization (pGKB) is designed, which implicitly utilizes a proper preconditioner to generate a series of solution subspaces with desirable properties encoded by the regularizer [math]. Based on the pGKB process, we propose an iterative regularization algorithm via projecting the original problem onto small dimensional solution subspaces. We analyze the regularization properties of this algorithm, including the incorporation of prior properties of the desired solution into the solution subspace and the semiconvergence behavior of the regularized solution. To overcome instabilities caused by semiconvergence, we further propose two pGKB based hybrid regularization algorithms. All the proposed algorithms are tested on both small-scale and large-scale linear inverse problems. Numerical results demonstrate that these iterative algorithms exhibit excellent performance, outperforming other state-of-the-art algorithms in some cases.","PeriodicalId":49526,"journal":{"name":"SIAM Journal on Scientific Computing","volume":"3 1","pages":""},"PeriodicalIF":3.1,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142227248","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A High-Order Fast Direct Solver for Surface PDEs 曲面 PDE 的高阶快速直接求解器

IF 3.1 2区数学 Q1 MATHEMATICS, APPLIED

SIAM Journal on Scientific Computing

Pub Date : 2024-08-13 DOI: 10.1137/22m1525259

Daniel Fortunato

SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page A2582-A2606, August 2024.
Abstract. We introduce a fast direct solver for variable-coefficient elliptic PDEs on surfaces based on the hierarchical Poincaré–Steklov method. The method takes as input an unstructured, high-order quadrilateral mesh of a surface and discretizes surface differential operators on each element using a high-order spectral collocation scheme. Elemental solution operators and Dirichlet-to-Neumann maps tangent to the surface are precomputed and merged in a pairwise fashion to yield a hierarchy of solution operators that may be applied in [math] operations for a mesh with [math] degrees of freedom. The resulting fast direct solver may be used to accelerate high-order implicit time-stepping schemes, as the precomputed operators can be reused for fast elliptic solves on surfaces. On a standard laptop, precomputation for a 12th-order surface mesh with over 1 million degrees of freedom takes 10 seconds, while subsequent solves take only 0.25 seconds. We apply the method to a range of problems on both smooth surfaces and surfaces with sharp corners and edges, including the static Laplace–Beltrami problem, the Hodge decomposition of a tangential vector field, and some time-dependent nonlinear reaction-diffusion systems. Reproducibility of computational results. This paper has been awarded the “SIAM Reproducibility Badge: code and data available”, as a recognition that the authors have followed reproducibility principles valued by SISC and the scientific computing community. Code and data that allow readers to reproduce the results in this paper are available at https://github.com/danfortunato/surface-hps-sisc.

SIAM 科学计算期刊》，第 46 卷第 4 期，第 A2582-A2606 页，2024 年 8 月。摘要。我们介绍了一种基于分层 Poincaré-Steklov 方法的曲面上变系数椭圆 PDE 快速直接求解器。该方法以表面的非结构化高阶四边形网格为输入，使用高阶谱配位方案对每个元素上的表面微分算子进行离散化。预先计算元素求解算子和与曲面相切的 Dirichlet 到 Neumann 映射，并以成对的方式进行合并，以生成求解算子的层次结构，可用于具有 [math] 自由度的网格的 [math] 操作。由此产生的快速直接求解器可用于加速高阶隐式时间步进方案，因为预计算算子可重复用于曲面上的快速椭圆求解。在标准笔记本电脑上，对超过 100 万自由度的 12 阶曲面网格进行预计算需要 10 秒，而后续求解只需 0.25 秒。我们将该方法应用于一系列光滑表面和尖角及边缘表面的问题，包括静态拉普拉斯-贝尔特拉米问题、切向矢量场的霍奇分解，以及一些随时间变化的非线性反应扩散系统。计算结果的可重复性。本文被授予 "SIAM 可重复性徽章：代码和数据可用"，以表彰作者遵循了 SISC 和科学计算界重视的可重复性原则。读者可通过 https://github.com/danfortunato/surface-hps-sisc 获取代码和数据，以重现本文中的结果。

{"title":"A High-Order Fast Direct Solver for Surface PDEs","authors":"Daniel Fortunato","doi":"10.1137/22m1525259","DOIUrl":"https://doi.org/10.1137/22m1525259","url":null,"abstract":"SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page A2582-A2606, August 2024. Abstract. We introduce a fast direct solver for variable-coefficient elliptic PDEs on surfaces based on the hierarchical Poincaré–Steklov method. The method takes as input an unstructured, high-order quadrilateral mesh of a surface and discretizes surface differential operators on each element using a high-order spectral collocation scheme. Elemental solution operators and Dirichlet-to-Neumann maps tangent to the surface are precomputed and merged in a pairwise fashion to yield a hierarchy of solution operators that may be applied in [math] operations for a mesh with [math] degrees of freedom. The resulting fast direct solver may be used to accelerate high-order implicit time-stepping schemes, as the precomputed operators can be reused for fast elliptic solves on surfaces. On a standard laptop, precomputation for a 12th-order surface mesh with over 1 million degrees of freedom takes 10 seconds, while subsequent solves take only 0.25 seconds. We apply the method to a range of problems on both smooth surfaces and surfaces with sharp corners and edges, including the static Laplace–Beltrami problem, the Hodge decomposition of a tangential vector field, and some time-dependent nonlinear reaction-diffusion systems. Reproducibility of computational results. This paper has been awarded the “SIAM Reproducibility Badge: code and data available”, as a recognition that the authors have followed reproducibility principles valued by SISC and the scientific computing community. Code and data that allow readers to reproduce the results in this paper are available at https://github.com/danfortunato/surface-hps-sisc.","PeriodicalId":49526,"journal":{"name":"SIAM Journal on Scientific Computing","volume":"23 1","pages":""},"PeriodicalIF":3.1,"publicationDate":"2024-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142217609","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Least-Squares Neural Network (LSNN) Method for Linear Advection-Reaction Equation: Discontinuity Interface 线性平流-反作用方程的最小二乘神经网络 (LSNN) 方法：不连续界面

IF 3.1 2区数学 Q1 MATHEMATICS, APPLIED

SIAM Journal on Scientific Computing

Pub Date : 2024-08-12 DOI: 10.1137/23m1568107

Zhiqiang Cai, Junpyo Choi, Min Liu

SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page C448-C478, August 2024.
Abstract. We studied the least-squares ReLU neural network (LSNN) method for solving a linear advection-reaction equation with discontinuous solution in [Z. Cai et al., J. Comput. Phys., 443 (2021), 110514]. The method is based on a least-squares formulation and uses a new class of approximating functions: ReLU neural network (NN) functions. A critical and additional component of the LSNN method, differing from other NN-based methods, is the introduction of a properly designed and physics preserved discrete differential operator. In this paper, we study the LSNN method for problems with discontinuity interfaces. First, we show that ReLU NN functions with depth [math] can approximate any [math]-dimensional step function on a discontinuity interface generated by a vector field as streamlines with any prescribed accuracy. By decomposing the solution into continuous and discontinuous parts, we prove theoretically that the discretization error of the LSNN method using ReLU NN functions with depth [math] is mainly determined by the continuous part of the solution provided that the solution jump is constant. Numerical results for both two- and three-dimensional test problems with various discontinuity interfaces show that the LSNN method with enough layers is accurate and does not exhibit the common Gibbs phenomena along discontinuity interfaces.

SIAM 科学计算期刊》，第 46 卷第 4 期，第 C448-C478 页，2024 年 8 月。摘要我们在[Z. Cai et al., J. Comput. Phys., 443 (2021), 110514]一文中研究了求解具有不连续解的线性平流反应方程的最小二乘 ReLU 神经网络（LSNN）方法。该方法基于最小二乘法，并使用了一类新的近似函数：ReLU 神经网络 (NN) 函数。与其他基于 NN 的方法不同，LSNN 方法的一个关键和额外的组成部分是引入了一个经过适当设计并保留了物理特性的离散微分算子。在本文中，我们研究了针对不连续界面问题的 LSNN 方法。首先，我们证明了深度为[math]的 ReLU NN 函数可以以任意规定的精度将矢量场产生的不连续界面上的任意[math]维阶跃函数近似为流线。通过将解分解为连续部分和不连续部分，我们从理论上证明了使用深度[数学]ReLU NN 函数的 LSNN 方法的离散化误差主要由解的连续部分决定，前提是解的跳跃是恒定的。对具有各种不连续界面的二维和三维测试问题的数值结果表明，具有足够多层次的 LSNN 方法是精确的，不会在不连续界面上出现常见的吉布斯现象。

{"title":"Least-Squares Neural Network (LSNN) Method for Linear Advection-Reaction Equation: Discontinuity Interface","authors":"Zhiqiang Cai, Junpyo Choi, Min Liu","doi":"10.1137/23m1568107","DOIUrl":"https://doi.org/10.1137/23m1568107","url":null,"abstract":"SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page C448-C478, August 2024. Abstract. We studied the least-squares ReLU neural network (LSNN) method for solving a linear advection-reaction equation with discontinuous solution in [Z. Cai et al., J. Comput. Phys., 443 (2021), 110514]. The method is based on a least-squares formulation and uses a new class of approximating functions: ReLU neural network (NN) functions. A critical and additional component of the LSNN method, differing from other NN-based methods, is the introduction of a properly designed and physics preserved discrete differential operator. In this paper, we study the LSNN method for problems with discontinuity interfaces. First, we show that ReLU NN functions with depth [math] can approximate any [math]-dimensional step function on a discontinuity interface generated by a vector field as streamlines with any prescribed accuracy. By decomposing the solution into continuous and discontinuous parts, we prove theoretically that the discretization error of the LSNN method using ReLU NN functions with depth [math] is mainly determined by the continuous part of the solution provided that the solution jump is constant. Numerical results for both two- and three-dimensional test problems with various discontinuity interfaces show that the LSNN method with enough layers is accurate and does not exhibit the common Gibbs phenomena along discontinuity interfaces.","PeriodicalId":49526,"journal":{"name":"SIAM Journal on Scientific Computing","volume":"95 1","pages":""},"PeriodicalIF":3.1,"publicationDate":"2024-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142217611","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A Levenberg–Marquardt Method for Nonsmooth Regularized Least Squares 非光滑正则化最小二乘法的 Levenberg-Marquardt 方法

IF 3.1 2区数学 Q1 MATHEMATICS, APPLIED

SIAM Journal on Scientific Computing

Pub Date : 2024-08-12 DOI: 10.1137/22m1538971

Aleksandr Y. Aravkin, Robert Baraldi, Dominique Orban

SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page A2557-A2581, August 2024.
Abstract. We develop a Levenberg–Marquardt method for minimizing the sum of a smooth nonlinear least-squares term [math] and a nonsmooth term [math]. Both [math] and [math] may be nonconvex. Steps are computed by minimizing the sum of a regularized linear least-squares model and a model of [math] using a first-order method such as the proximal gradient method. We establish global convergence to a first-order stationary point under the assumptions that [math] and its Jacobian are Lipschitz continuous and [math] is proper and lower semicontinuous. In the worst case, our method performs [math] iterations to bring a measure of stationarity below [math]. We also derive a trust-region variant that enjoys similar asymptotic worst-case iteration complexity as a special case of the trust-region algorithm of Aravkin, Baraldi, and Orban [SIAM J. Optim., 32 (2022), pp. 900–929]. We report numerical results on three examples: a group-lasso basis-pursuit denoise example, a nonlinear support vector machine, and parameter estimation in a neuroscience application. To implement those examples, we describe in detail how to evaluate proximal operators for separable [math] and for the group lasso with trust-region constraint. In all cases, the Levenberg–Marquardt methods perform fewer outer iterations than either a proximal gradient method with adaptive step length or a quasi-Newton trust-region method, neither of which exploit the least-squares structure of the problem. Our results also highlight the need for more sophisticated subproblem solvers than simple first-order methods.

SIAM 科学计算期刊》，第 46 卷第 4 期，第 A2557-A2581 页，2024 年 8 月。摘要。我们开发了一种 Levenberg-Marquardt 方法，用于最小化平滑非线性最小二乘项 [math] 和非平滑项 [math] 之和。[math]和[math]都可能是非凸的。计算步骤是通过使用一阶方法（如近似梯度法）最小化正则化线性最小二乘模型与 [math] 模型之和。在[math]及其雅各布连为 Lipschitz 连续、[math]为适当的低半连续的假设条件下，我们建立了对一阶静止点的全局收敛。在最坏的情况下，我们的方法会进行 [math] 次迭代，使静止度低于 [math]。我们还推导了一个信任区域变体，它与 Aravkin、Baraldi 和 Orban 的信任区域算法的一个特例[SIAM J. Optim.，32 (2022)，第 900-929 页]具有相似的渐进最坏情况迭代复杂度。我们报告了三个实例的数值结果：一个组-拉索基搜索去噪实例、一个非线性支持向量机和一个神经科学应用中的参数估计。为了实现这些示例，我们详细描述了如何评估可分离[数学]的近算子和带有信任区域约束的群拉索。在所有情况下，Levenberg-Marquardt 方法的外部迭代次数都少于具有自适应步长的近似梯度方法或准牛顿信任区域方法，而这两种方法都没有利用问题的最小二乘结构。我们的研究结果还突出表明，除了简单的一阶方法外，还需要更复杂的子问题求解器。

{"title":"A Levenberg–Marquardt Method for Nonsmooth Regularized Least Squares","authors":"Aleksandr Y. Aravkin, Robert Baraldi, Dominique Orban","doi":"10.1137/22m1538971","DOIUrl":"https://doi.org/10.1137/22m1538971","url":null,"abstract":"SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page A2557-A2581, August 2024. Abstract. We develop a Levenberg–Marquardt method for minimizing the sum of a smooth nonlinear least-squares term [math] and a nonsmooth term [math]. Both [math] and [math] may be nonconvex. Steps are computed by minimizing the sum of a regularized linear least-squares model and a model of [math] using a first-order method such as the proximal gradient method. We establish global convergence to a first-order stationary point under the assumptions that [math] and its Jacobian are Lipschitz continuous and [math] is proper and lower semicontinuous. In the worst case, our method performs [math] iterations to bring a measure of stationarity below [math]. We also derive a trust-region variant that enjoys similar asymptotic worst-case iteration complexity as a special case of the trust-region algorithm of Aravkin, Baraldi, and Orban [SIAM J. Optim., 32 (2022), pp. 900–929]. We report numerical results on three examples: a group-lasso basis-pursuit denoise example, a nonlinear support vector machine, and parameter estimation in a neuroscience application. To implement those examples, we describe in detail how to evaluate proximal operators for separable [math] and for the group lasso with trust-region constraint. In all cases, the Levenberg–Marquardt methods perform fewer outer iterations than either a proximal gradient method with adaptive step length or a quasi-Newton trust-region method, neither of which exploit the least-squares structure of the problem. Our results also highlight the need for more sophisticated subproblem solvers than simple first-order methods.","PeriodicalId":49526,"journal":{"name":"SIAM Journal on Scientific Computing","volume":"34 1","pages":""},"PeriodicalIF":3.1,"publicationDate":"2024-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142217628","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Cut Finite Element Discretizations of Cell-by-Cell EMI Electrophysiology Models 逐个细胞 EMI 电生理学模型的切割有限元离散化

IF 3.1 2区数学 Q1 MATHEMATICS, APPLIED

SIAM Journal on Scientific Computing

Pub Date : 2024-08-06 DOI: 10.1137/23m1580632

Nanna Berre, Marie E. Rognes, André Massing

SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page B527-B553, August 2024.
Abstract. The EMI (extracellular-membrane-intracellular) model describes electrical activity in excitable tissue, where the extracellular and intracellular spaces and cellular membrane are explicitly represented. The model couples a system of partial differential equations (PDEs) in the intracellular and extracellular spaces with a system of ordinary differential equations (ODEs) on the membrane. A key challenge for the EMI model is the generation of high-quality meshes conforming to the complex geometries of brain cells. To overcome this challenge, we propose a novel cut finite element method (CutFEM) where the membrane geometry can be represented independently of a structured and easy-to-generate background mesh for the remaining computational domain. Starting from a Godunov splitting scheme, the EMI model is split into separate PDE and ODE parts. The resulting PDE part is a nonstandard elliptic interface problem, for which we devise two different CutFEM formulations: one single-dimensional formulation with the intra/extracellular electrical potentials as unknowns, and a multi-dimensional formulation that also introduces the electrical current over the membrane as an additional unknown leading to a penalized saddle point problem. Both formulations are augmented by suitably designed ghost penalties to ensure stability and convergence properties that are insensitive to how the membrane surface mesh cuts the background mesh. For the ODE part, we introduce a new unfitted discretization to solve the membrane bound ODEs on a membrane interface that is not aligned with the background mesh. Finally, we perform extensive numerical experiments to demonstrate that CutFEM is a promising approach to efficiently simulate electrical activity in geometrically resolved brain cells. Reproducibility of computational results. This paper has been awarded the “SIAM Reproducibility Badge: Code and data available” as a recognition that the authors have followed reproducibility principles valued by SISC and the scientific computing community. Code and data that allow readers to reproduce the results in this paper are available at https://zenodo.org/record/8068506.

SIAM 科学计算期刊》，第 46 卷第 4 期，第 B527-B553 页，2024 年 8 月。摘要。EMI（细胞外-膜-细胞内）模型描述了可兴奋组织中的电活动，其中明确表示了细胞外和细胞内空间以及细胞膜。该模型将细胞内外空间的偏微分方程（PDE）系统与膜上的常微分方程（ODE）系统结合起来。EMI 模型面临的一个关键挑战是生成符合脑细胞复杂几何形状的高质量网格。为了克服这一挑战，我们提出了一种新颖的切割有限元方法（CutFEM），在这种方法中，膜的几何形状可以独立于结构化且易于生成的背景网格来表示其余计算域。从戈杜诺夫分割方案开始，EMI 模型被分割成独立的 PDE 和 ODE 部分。由此产生的 PDE 部分是一个非标准椭圆界面问题，我们为此设计了两种不同的 CutFEM 公式：一种是单维公式，将细胞内/外电势作为未知数；另一种是多维公式，将膜上的电流作为额外的未知数，从而导致一个受惩罚的鞍点问题。这两种计算方法都采用了适当设计的幽灵惩罚，以确保稳定性和收敛性，而这种稳定性和收敛性对膜表面网格如何切割背景网格并不敏感。对于 ODE 部分，我们引入了一种新的非拟合离散化方法，以求解与背景网格不一致的膜界面上的膜约束 ODE。最后，我们进行了大量数值实验，证明 CutFEM 是一种有效模拟几何解析脑细胞电活动的可行方法。计算结果的可重复性。本文被授予 "SIAM 可重复性徽章"：代码和数据可用"，以表彰作者遵循了 SISC 和科学计算界重视的可重现性原则。读者可通过 https://zenodo.org/record/8068506 获取代码和数据，以重现本文中的结果。

{"title":"Cut Finite Element Discretizations of Cell-by-Cell EMI Electrophysiology Models","authors":"Nanna Berre, Marie E. Rognes, André Massing","doi":"10.1137/23m1580632","DOIUrl":"https://doi.org/10.1137/23m1580632","url":null,"abstract":"SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page B527-B553, August 2024. Abstract. The EMI (extracellular-membrane-intracellular) model describes electrical activity in excitable tissue, where the extracellular and intracellular spaces and cellular membrane are explicitly represented. The model couples a system of partial differential equations (PDEs) in the intracellular and extracellular spaces with a system of ordinary differential equations (ODEs) on the membrane. A key challenge for the EMI model is the generation of high-quality meshes conforming to the complex geometries of brain cells. To overcome this challenge, we propose a novel cut finite element method (CutFEM) where the membrane geometry can be represented independently of a structured and easy-to-generate background mesh for the remaining computational domain. Starting from a Godunov splitting scheme, the EMI model is split into separate PDE and ODE parts. The resulting PDE part is a nonstandard elliptic interface problem, for which we devise two different CutFEM formulations: one single-dimensional formulation with the intra/extracellular electrical potentials as unknowns, and a multi-dimensional formulation that also introduces the electrical current over the membrane as an additional unknown leading to a penalized saddle point problem. Both formulations are augmented by suitably designed ghost penalties to ensure stability and convergence properties that are insensitive to how the membrane surface mesh cuts the background mesh. For the ODE part, we introduce a new unfitted discretization to solve the membrane bound ODEs on a membrane interface that is not aligned with the background mesh. Finally, we perform extensive numerical experiments to demonstrate that CutFEM is a promising approach to efficiently simulate electrical activity in geometrically resolved brain cells. Reproducibility of computational results. This paper has been awarded the “SIAM Reproducibility Badge: Code and data available” as a recognition that the authors have followed reproducibility principles valued by SISC and the scientific computing community. Code and data that allow readers to reproduce the results in this paper are available at https://zenodo.org/record/8068506.","PeriodicalId":49526,"journal":{"name":"SIAM Journal on Scientific Computing","volume":"10 1","pages":""},"PeriodicalIF":3.1,"publicationDate":"2024-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141942985","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Energetic Variational Neural Network Discretizations of Gradient Flows 梯度流的能量变异神经网络离散化

IF 3.1 2区数学 Q1 MATHEMATICS, APPLIED

SIAM Journal on Scientific Computing

Pub Date : 2024-08-05 DOI: 10.1137/22m1529427

Ziqing Hu, Chun Liu, Yiwei Wang, Zhiliang Xu

SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page A2528-A2556, August 2024.
Abstract. We present a structure-preserving Eulerian algorithm for solving [math]-gradient flows and a structure-preserving Lagrangian algorithm for solving generalized diffusions. Both algorithms employ neural networks as tools for spatial discretization. Unlike most existing methods that construct numerical discretizations based on the strong or weak form of the underlying PDE, the proposed schemes are constructed based on the energy-dissipation law directly. This guarantees the monotonic decay of the system’s free energy, which avoids unphysical states of solutions and is crucial for the long-term stability of numerical computations. To address challenges arising from nonlinear neural network discretization, we perform temporal discretizations on these variational systems before spatial discretizations. This approach is computationally memory-efficient when implementing neural network-based algorithms. The proposed neural network-based schemes are mesh-free, allowing us to solve gradient flows in high dimensions. Various numerical experiments are presented to demonstrate the accuracy and energy stability of the proposed numerical schemes.

SIAM 科学计算期刊》，第 46 卷第 4 期，第 A2528-A2556 页，2024 年 8 月。摘要。我们提出了一种求解[数学]梯度流的结构保留欧拉算法和一种求解广义扩散的结构保留拉格朗日算法。这两种算法都采用神经网络作为空间离散化工具。与大多数根据底层 PDE 的强或弱形式构建数值离散的现有方法不同，所提出的方案是直接根据能量消耗定律构建的。这保证了系统自由能的单调衰减，避免了解的非物理状态，对数值计算的长期稳定性至关重要。为了应对非线性神经网络离散化带来的挑战，我们在空间离散化之前对这些变分系统进行了时间离散化。在实施基于神经网络的算法时，这种方法具有计算记忆效率。所提出的基于神经网络的方案是无网格的，使我们能够解决高维度的梯度流问题。各种数值实验证明了所提数值方案的准确性和能量稳定性。

{"title":"Energetic Variational Neural Network Discretizations of Gradient Flows","authors":"Ziqing Hu, Chun Liu, Yiwei Wang, Zhiliang Xu","doi":"10.1137/22m1529427","DOIUrl":"https://doi.org/10.1137/22m1529427","url":null,"abstract":"SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page A2528-A2556, August 2024. Abstract. We present a structure-preserving Eulerian algorithm for solving [math]-gradient flows and a structure-preserving Lagrangian algorithm for solving generalized diffusions. Both algorithms employ neural networks as tools for spatial discretization. Unlike most existing methods that construct numerical discretizations based on the strong or weak form of the underlying PDE, the proposed schemes are constructed based on the energy-dissipation law directly. This guarantees the monotonic decay of the system’s free energy, which avoids unphysical states of solutions and is crucial for the long-term stability of numerical computations. To address challenges arising from nonlinear neural network discretization, we perform temporal discretizations on these variational systems before spatial discretizations. This approach is computationally memory-efficient when implementing neural network-based algorithms. The proposed neural network-based schemes are mesh-free, allowing us to solve gradient flows in high dimensions. Various numerical experiments are presented to demonstrate the accuracy and energy stability of the proposed numerical schemes.","PeriodicalId":49526,"journal":{"name":"SIAM Journal on Scientific Computing","volume":"11 1","pages":""},"PeriodicalIF":3.1,"publicationDate":"2024-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141942986","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0