首页 > 最新文献

SIAM Journal on Scientific Computing最新文献

英文 中文
Multilevel Parareal Algorithm with Averaging for Oscillatory Problems 针对振荡问题的多级 Parareal 平均算法
IF 3.1 2区 数学 Q1 MATHEMATICS, APPLIED Pub Date : 2024-08-20 DOI: 10.1137/23m1547123
Juliane Rosemeier, Terry Haut, Beth Wingate
SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page A2709-A2736, August 2024.
Abstract. The present study is an extension of the work done by Peddle, Haut, and Wingate [SIAM J. Sci. Comput., 41 (2019), pp. A3476–A3497] and Haut and Wingate [SIAM J. Sci. Comput., 36 (2014), pp. A693–A713], where a two-level Parareal method with mapping and averaging is examined. The method proposed in this paper is a multilevel Parareal method with arbitrarily many levels, which is not restricted to the two-level case. We give an asymptotic error estimate which reduces to the two-level estimate for the case when only two levels are considered. Introducing more than two levels has important consequences for the averaging procedure, as we choose separate averaging windows for each of the different levels, which is an additional new feature of the present study. The different averaging windows make the proposed method especially appropriate for nonlinear multiscale problems, because we can introduce a level for each intrinsic scale of the problem and adapt the averaging procedure such that we reproduce the behavior of the model on the particular scale resolved by the level. The method is applied to nonlinear differential equations. The nonlinearities can generate a range of frequencies in the problem. The computational cost of the new method is investigated and studied on several examples.
SIAM 科学计算期刊》,第 46 卷第 4 期,第 A2709-A2736 页,2024 年 8 月。 摘要本研究是对 Peddle、Haut 和 Wingate [SIAM J. Sci. Comput., 41 (2019), pp. A3476-A3497] 以及 Haut 和 Wingate [SIAM J. Sci. Comput., 36 (2014), pp.本文提出的方法是一种具有任意多层次的多层次 Parareal 方法,它并不局限于两层情况。我们给出了一个渐近误差估计值,在只考虑两级的情况下,该估计值与两级估计值相减。引入两个以上的水平对平均过程有重要影响,因为我们为每个不同的水平选择了不同的平均窗口,这是本研究的另一个新特点。不同的平均窗口使所提出的方法特别适用于非线性多尺度问题,因为我们可以为问题的每个内在尺度引入一个层次,并调整平均程序,从而重现模型在该层次所解析的特定尺度上的行为。该方法适用于非线性微分方程。非线性会在问题中产生一系列频率。新方法的计算成本在几个例子中进行了调查和研究。
{"title":"Multilevel Parareal Algorithm with Averaging for Oscillatory Problems","authors":"Juliane Rosemeier, Terry Haut, Beth Wingate","doi":"10.1137/23m1547123","DOIUrl":"https://doi.org/10.1137/23m1547123","url":null,"abstract":"SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page A2709-A2736, August 2024. <br/> Abstract. The present study is an extension of the work done by Peddle, Haut, and Wingate [SIAM J. Sci. Comput., 41 (2019), pp. A3476–A3497] and Haut and Wingate [SIAM J. Sci. Comput., 36 (2014), pp. A693–A713], where a two-level Parareal method with mapping and averaging is examined. The method proposed in this paper is a multilevel Parareal method with arbitrarily many levels, which is not restricted to the two-level case. We give an asymptotic error estimate which reduces to the two-level estimate for the case when only two levels are considered. Introducing more than two levels has important consequences for the averaging procedure, as we choose separate averaging windows for each of the different levels, which is an additional new feature of the present study. The different averaging windows make the proposed method especially appropriate for nonlinear multiscale problems, because we can introduce a level for each intrinsic scale of the problem and adapt the averaging procedure such that we reproduce the behavior of the model on the particular scale resolved by the level. The method is applied to nonlinear differential equations. The nonlinearities can generate a range of frequencies in the problem. The computational cost of the new method is investigated and studied on several examples.","PeriodicalId":49526,"journal":{"name":"SIAM Journal on Scientific Computing","volume":"22 1","pages":""},"PeriodicalIF":3.1,"publicationDate":"2024-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142217605","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Implicit-explicit Schemes for Incompressible Flow Problems with Variable Viscosity 粘性可变的不可压缩流动问题的隐式-显式方案
IF 3.1 2区 数学 Q1 MATHEMATICS, APPLIED Pub Date : 2024-08-19 DOI: 10.1137/23m1606526
Gabriel Barrenechea, Ernesto Castillo, Douglas Pacheco
SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page A2660-A2682, August 2024.
Abstract. This article investigates different implicit-explicit (IMEX) methods for incompressible flows with variable viscosity. The viscosity field may depend on space and time alone or, for example, on velocity gradients. Unlike most previous works on IMEX schemes, which focus on the convective term, we propose also treating parts of the diffusive term explicitly, which can reduce the coupling between the velocity components. We present different IMEX alternatives for the variable-viscosity Navier–Stokes system, analyzing their theoretical and algorithmic properties. Temporal stability is proven for all the methods presented, including monolithic and fractional-step variants. These results are unconditional except for one of the fractional-step discretizations, whose stability is shown for time-step sizes under an upper bound that depends solely on the problem data. The key finding of this work is a class of IMEX schemes whose steps decouple the velocity components and are fully linearized (even if the viscosity depends nonlinearly on the velocity) without requiring any CFL condition for stability. Moreover, in the presence of Neumann boundaries, some of our formulations lead naturally to conditions involving normal pseudotractions. This generalizes to the variable-viscosity case what happens for the standard Laplacian form with constant viscosity. Our analysis is supported by a series of numerical experiments.
SIAM 科学计算期刊》,第 46 卷第 4 期,第 A2660-A2682 页,2024 年 8 月。 摘要本文研究了粘性可变不可压缩流的不同隐式-显式(IMEX)方法。粘度场可能仅取决于空间和时间,也可能取决于速度梯度等。与以往大多数侧重于对流项的 IMEX 方案不同,我们建议同时显式处理部分扩散项,这样可以减少速度分量之间的耦合。我们针对变粘度 Navier-Stokes 系统提出了不同的 IMEX 替代方案,分析了它们的理论和算法特性。所介绍的所有方法,包括单步和分数步变体,都证明了时间稳定性。这些结果都是无条件的,只有一种分数步离散法例外,它的稳定性是在一个完全取决于问题数据的上限值下的时间步长。这项工作的关键发现是一类 IMEX 方案,其步骤与速度分量解耦并完全线性化(即使粘度非线性地依赖于速度),而不需要任何 CFL 稳定性条件。此外,在存在 Neumann 边界的情况下,我们的一些公式自然会引出涉及正常伪抽象的条件。这就把粘性恒定的标准拉普拉斯形式的情况推广到了粘性可变的情况。我们的分析得到了一系列数值实验的支持。
{"title":"Implicit-explicit Schemes for Incompressible Flow Problems with Variable Viscosity","authors":"Gabriel Barrenechea, Ernesto Castillo, Douglas Pacheco","doi":"10.1137/23m1606526","DOIUrl":"https://doi.org/10.1137/23m1606526","url":null,"abstract":"SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page A2660-A2682, August 2024. <br/> Abstract. This article investigates different implicit-explicit (IMEX) methods for incompressible flows with variable viscosity. The viscosity field may depend on space and time alone or, for example, on velocity gradients. Unlike most previous works on IMEX schemes, which focus on the convective term, we propose also treating parts of the diffusive term explicitly, which can reduce the coupling between the velocity components. We present different IMEX alternatives for the variable-viscosity Navier–Stokes system, analyzing their theoretical and algorithmic properties. Temporal stability is proven for all the methods presented, including monolithic and fractional-step variants. These results are unconditional except for one of the fractional-step discretizations, whose stability is shown for time-step sizes under an upper bound that depends solely on the problem data. The key finding of this work is a class of IMEX schemes whose steps decouple the velocity components and are fully linearized (even if the viscosity depends nonlinearly on the velocity) without requiring any CFL condition for stability. Moreover, in the presence of Neumann boundaries, some of our formulations lead naturally to conditions involving normal pseudotractions. This generalizes to the variable-viscosity case what happens for the standard Laplacian form with constant viscosity. Our analysis is supported by a series of numerical experiments.","PeriodicalId":49526,"journal":{"name":"SIAM Journal on Scientific Computing","volume":"81 1","pages":""},"PeriodicalIF":3.1,"publicationDate":"2024-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142217626","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adaptivity in Local Kernel Based Methods for Approximating the Action of Linear Operators 基于局部核的线性算子作用近似方法的自适应性
IF 3.1 2区 数学 Q1 MATHEMATICS, APPLIED Pub Date : 2024-08-19 DOI: 10.1137/23m1598052
Jonah A. Reeger
SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page A2683-A2708, August 2024.
Abstract. Building on the successes of local kernel methods for approximating the solutions to partial differential equations (PDEs) and the evaluation of definite integrals (quadrature/cubature), a local estimate of the error in such approximations is developed. This estimate is useful for determining locations in the solution domain where increased node density (equivalently, reduction in the spacing between nodes) can decrease the error in the solution. An adaptive procedure for adding nodes to the domain for both the approximation of derivatives and the approximate evaluation of definite integrals is described. This method efficiently computes the error estimate at a set of prescribed points and adds new nodes for approximation where the error is too large. Computational experiments demonstrate close agreement between the error estimate and actual absolute error in the approximation. Such methods are necessary or desirable when approximating solutions to PDEs (or in the case of quadrature/cubature), where the initial data and subsequent solution (or integrand) exhibit localized features that require significant refinement to resolve and where uniform increases in the density of nodes across the entire computational domain is not possible or too burdensome.
SIAM 科学计算期刊》,第 46 卷第 4 期,第 A2683-A2708 页,2024 年 8 月。 摘要基于近似偏微分方程(PDE)解和定积分(正交/余量)求值的局部内核方法的成功经验,本文提出了对此类近似误差的局部估计。该估计值有助于确定求解域中增加节点密度(等同于减少节点间距)可减少求解误差的位置。本文介绍了一种自适应程序,用于在导数近似和定积分近似计算的域中添加节点。这种方法能有效计算一组规定点的误差估计值,并在误差过大的地方添加新节点进行近似。计算实验表明,误差估计值与近似值的实际绝对误差非常接近。在近似 PDEs 的解时(或在正交/余量的情况下),初始数据和随后的解(或积分)表现出局部特征,需要大量细化才能解决,而在整个计算域中均匀增加节点密度是不可能或过于繁琐时,这种方法是必要或可取的。
{"title":"Adaptivity in Local Kernel Based Methods for Approximating the Action of Linear Operators","authors":"Jonah A. Reeger","doi":"10.1137/23m1598052","DOIUrl":"https://doi.org/10.1137/23m1598052","url":null,"abstract":"SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page A2683-A2708, August 2024. <br/> Abstract. Building on the successes of local kernel methods for approximating the solutions to partial differential equations (PDEs) and the evaluation of definite integrals (quadrature/cubature), a local estimate of the error in such approximations is developed. This estimate is useful for determining locations in the solution domain where increased node density (equivalently, reduction in the spacing between nodes) can decrease the error in the solution. An adaptive procedure for adding nodes to the domain for both the approximation of derivatives and the approximate evaluation of definite integrals is described. This method efficiently computes the error estimate at a set of prescribed points and adds new nodes for approximation where the error is too large. Computational experiments demonstrate close agreement between the error estimate and actual absolute error in the approximation. Such methods are necessary or desirable when approximating solutions to PDEs (or in the case of quadrature/cubature), where the initial data and subsequent solution (or integrand) exhibit localized features that require significant refinement to resolve and where uniform increases in the density of nodes across the entire computational domain is not possible or too burdensome.","PeriodicalId":49526,"journal":{"name":"SIAM Journal on Scientific Computing","volume":"16 1","pages":""},"PeriodicalIF":3.1,"publicationDate":"2024-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142217607","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Meshless Solver for Blood Flow Simulations in Elastic Vessels Using a Physics-Informed Neural Network 利用物理信息神经网络模拟弹性血管中血流的无网格求解器
IF 3.1 2区 数学 Q1 MATHEMATICS, APPLIED Pub Date : 2024-08-19 DOI: 10.1137/23m1622696
Han Zhang, Raymond H. Chan, Xue-Cheng Tai
SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page C479-C507, August 2024.
Abstract. Investigating blood flow in the cardiovascular system is crucial for assessing cardiovascular health. Computational approaches offer some noninvasive alternatives to measure blood flow dynamics. Numerical simulations based on traditional methods such as finite-element and other numerical discretizations have been extensively studied and have yielded excellent results. However, adapting these methods to real-life simulations remains a complex task. In this paper, we propose a method that offers flexibility and can efficiently handle real-life simulations. We suggest utilizing the physics-informed neural network to solve the Navier–Stokes equation in a deformable domain, specifically addressing the simulation of blood flow in elastic vessels. Our approach models blood flow using an incompressible, viscous Navier–Stokes equation in an arbitrary Lagrangian–Eulerian form. The mechanical model for the vessel wall structure is formulated by an equation of Newton’s second law of momentum and linear elasticity to the force exerted by the fluid flow. Our method is a mesh-free approach that eliminates the need for discretization and meshing of the computational domain. This makes it highly efficient in solving simulations involving complex geometries. Additionally, with the availability of well-developed open-source machine learning framework packages and parallel modules, our method can easily be accelerated through GPU computing and parallel computing. To evaluate our approach, we conducted experiments on regular cylinder vessels as well as vessels with plaque on their walls. We compared our results to a solution calculated by finite element methods using a dense grid and small time steps, which we considered as the ground truth solution. We report the relative error and the time consumed to solve the problem, highlighting the advantages of our method.
SIAM 科学计算期刊》,第 46 卷第 4 期,第 C479-C507 页,2024 年 8 月。 摘要调查心血管系统中的血流对于评估心血管健康至关重要。计算方法为测量血流动态提供了一些非侵入性的替代方法。基于有限元和其他数值离散等传统方法的数值模拟已得到广泛研究,并取得了卓越成果。然而,将这些方法应用于实际模拟仍是一项复杂的任务。在本文中,我们提出了一种既灵活又能有效处理实际模拟的方法。我们建议利用物理信息神经网络来求解可变形域中的纳维-斯托克斯方程,特别是解决弹性血管中的血流模拟问题。我们的方法使用不可压缩的粘性纳维-斯托克斯方程,以任意拉格朗日-欧拉形式模拟血流。血管壁结构的力学模型是通过牛顿第二动量定律和线性弹性方程来计算流体流动所产生的力。我们的方法是一种无网格方法,无需对计算域进行离散化和网格划分。这使得它在解决涉及复杂几何形状的模拟时非常高效。此外,由于有了完善的开源机器学习框架包和并行模块,我们的方法可以很容易地通过 GPU 计算和并行计算来加速。为了评估我们的方法,我们在普通圆柱形容器和壁上有斑块的容器上进行了实验。我们将我们的结果与使用密集网格和小时间步长的有限元方法计算出的解决方案进行了比较,并将其视为基本真理解决方案。我们报告了相对误差和解决问题所消耗的时间,突出了我们方法的优势。
{"title":"A Meshless Solver for Blood Flow Simulations in Elastic Vessels Using a Physics-Informed Neural Network","authors":"Han Zhang, Raymond H. Chan, Xue-Cheng Tai","doi":"10.1137/23m1622696","DOIUrl":"https://doi.org/10.1137/23m1622696","url":null,"abstract":"SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page C479-C507, August 2024. <br/> Abstract. Investigating blood flow in the cardiovascular system is crucial for assessing cardiovascular health. Computational approaches offer some noninvasive alternatives to measure blood flow dynamics. Numerical simulations based on traditional methods such as finite-element and other numerical discretizations have been extensively studied and have yielded excellent results. However, adapting these methods to real-life simulations remains a complex task. In this paper, we propose a method that offers flexibility and can efficiently handle real-life simulations. We suggest utilizing the physics-informed neural network to solve the Navier–Stokes equation in a deformable domain, specifically addressing the simulation of blood flow in elastic vessels. Our approach models blood flow using an incompressible, viscous Navier–Stokes equation in an arbitrary Lagrangian–Eulerian form. The mechanical model for the vessel wall structure is formulated by an equation of Newton’s second law of momentum and linear elasticity to the force exerted by the fluid flow. Our method is a mesh-free approach that eliminates the need for discretization and meshing of the computational domain. This makes it highly efficient in solving simulations involving complex geometries. Additionally, with the availability of well-developed open-source machine learning framework packages and parallel modules, our method can easily be accelerated through GPU computing and parallel computing. To evaluate our approach, we conducted experiments on regular cylinder vessels as well as vessels with plaque on their walls. We compared our results to a solution calculated by finite element methods using a dense grid and small time steps, which we considered as the ground truth solution. We report the relative error and the time consumed to solve the problem, highlighting the advantages of our method.","PeriodicalId":49526,"journal":{"name":"SIAM Journal on Scientific Computing","volume":"6 1","pages":""},"PeriodicalIF":3.1,"publicationDate":"2024-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142217606","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Effect of Approximate Coarsest-Level Solves on the Convergence of Multigrid V-Cycle Methods 近似最粗求解对多网格 V 循环方法收敛性的影响
IF 3.1 2区 数学 Q1 MATHEMATICS, APPLIED Pub Date : 2024-08-14 DOI: 10.1137/23m1578255
Petr Vacek, Erin Carson, Kirk M. Soodhalter
SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page A2634-A2659, August 2024.
Abstract. The multigrid V-cycle method is a popular method for solving systems of linear equations. It computes an approximate solution by using smoothing on fine levels and solving a system of linear equations on the coarsest level. Solving on the coarsest level depends on the size and difficulty of the problem. If the size permits, it is typical to use a direct method based on LU or Cholesky decomposition. In settings with large coarsest-level problems, approximate solvers such as iterative Krylov subspace methods, or direct methods based on low-rank approximation, are often used. The accuracy of the coarsest-level solver is typically determined based on the experience of the users with the concrete problems and methods. In this paper, we present an approach to analyzing the effects of approximate coarsest-level solves on the convergence of the V-cycle method for symmetric positive definite problems. Using these results, we derive coarsest-level stopping criterion through which we may control the difference between the approximation computed by a V-cycle method with approximate coarsest-level solver and the approximation which would be computed if the coarsest-level problems were solved exactly. The coarsest-level stopping criterion may thus be set up such that the V-cycle method converges to a chosen finest-level accuracy in (nearly) the same number of V-cycle iterations as the V-cycle method with exact coarsest-level solver. We also utilize the theoretical results to discuss how the convergence of the V-cycle method may be affected by the choice of a tolerance in a coarsest-level stopping criterion based on the relative residual norm. Reproducibility of computational results. This paper has been awarded the “SIAM Reproducibility Badge: Code and data available” as a recognition that the authors have followed reproducibility principles valued by SISC and the scientific computing community. Code and data that allow readers to reproduce the results in this paper are available at https://doi.org/10.5281/zenodo.11178544.
SIAM 科学计算期刊》,第 46 卷第 4 期,第 A2634-A2659 页,2024 年 8 月。 摘要多网格 V 循环法是求解线性方程组的一种常用方法。它通过在精细级上进行平滑处理并在最粗级上求解线性方程组来计算近似解。在最粗级上求解取决于问题的大小和难度。如果规模允许,通常采用基于 LU 或 Cholesky 分解的直接方法。在处理大型最粗级问题时,通常会使用近似求解器,如迭代克雷洛夫子空间方法或基于低阶近似的直接方法。最粗级求解器的精度通常根据用户对具体问题和方法的经验来确定。在本文中,我们提出了一种分析近似最粗级求解对对称正定问题 V 循环方法收敛性影响的方法。利用这些结果,我们得出了最粗级停止准则,通过该准则,我们可以控制使用近似最粗级求解器的 V 循环方法计算出的近似值与精确求解最粗级问题时计算出的近似值之间的差异。因此,可以设定最粗级停止准则,使 V 循环方法在(几乎)与使用精确最粗级求解器的 V 循环方法相同的 V 循环迭代次数内收敛到所选的最细级精度。我们还利用理论结果讨论了 V 循环方法的收敛性如何受到基于相对残差规范的最粗级停止准则中容限选择的影响。计算结果的可重复性。本文被授予 "SIAM 可重复性徽章":代码和数据可用",以表彰作者遵循了 SISC 和科学计算界重视的可重现性原则。读者可通过 https://doi.org/10.5281/zenodo.11178544 获取代码和数据,以重现本文中的结果。
{"title":"The Effect of Approximate Coarsest-Level Solves on the Convergence of Multigrid V-Cycle Methods","authors":"Petr Vacek, Erin Carson, Kirk M. Soodhalter","doi":"10.1137/23m1578255","DOIUrl":"https://doi.org/10.1137/23m1578255","url":null,"abstract":"SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page A2634-A2659, August 2024. <br/> Abstract. The multigrid V-cycle method is a popular method for solving systems of linear equations. It computes an approximate solution by using smoothing on fine levels and solving a system of linear equations on the coarsest level. Solving on the coarsest level depends on the size and difficulty of the problem. If the size permits, it is typical to use a direct method based on LU or Cholesky decomposition. In settings with large coarsest-level problems, approximate solvers such as iterative Krylov subspace methods, or direct methods based on low-rank approximation, are often used. The accuracy of the coarsest-level solver is typically determined based on the experience of the users with the concrete problems and methods. In this paper, we present an approach to analyzing the effects of approximate coarsest-level solves on the convergence of the V-cycle method for symmetric positive definite problems. Using these results, we derive coarsest-level stopping criterion through which we may control the difference between the approximation computed by a V-cycle method with approximate coarsest-level solver and the approximation which would be computed if the coarsest-level problems were solved exactly. The coarsest-level stopping criterion may thus be set up such that the V-cycle method converges to a chosen finest-level accuracy in (nearly) the same number of V-cycle iterations as the V-cycle method with exact coarsest-level solver. We also utilize the theoretical results to discuss how the convergence of the V-cycle method may be affected by the choice of a tolerance in a coarsest-level stopping criterion based on the relative residual norm. Reproducibility of computational results. This paper has been awarded the “SIAM Reproducibility Badge: Code and data available” as a recognition that the authors have followed reproducibility principles valued by SISC and the scientific computing community. Code and data that allow readers to reproduce the results in this paper are available at https://doi.org/10.5281/zenodo.11178544.","PeriodicalId":49526,"journal":{"name":"SIAM Journal on Scientific Computing","volume":"23 1","pages":""},"PeriodicalIF":3.1,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142217608","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Implicit Adaptive Mesh Refinement for Dispersive Tsunami Propagation 针对分散海啸传播的隐式自适应网格细化
IF 3.1 2区 数学 Q1 MATHEMATICS, APPLIED Pub Date : 2024-08-14 DOI: 10.1137/23m1585210
Marsha J. Berger, Randall J. LeVeque
SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page B554-B578, August 2024.
Abstract. We present an algorithm to solve the dispersive depth-averaged Serre–Green–Naghdi equations using patch-based adaptive mesh refinement. These equations require adding additional higher derivative terms to the nonlinear shallow water equations. This has been implemented as a new component of the open source GeoClaw software that is widely used for modeling tsunamis, storm surge, and related hazards, improving its accuracy on shorter wavelength phenomena. We use a formulation that requires solving an elliptic system of equations at each time step, making the method implicit. The adaptive algorithm allows different time steps on different refinement levels and solves the implicit equations level by level. Computational examples are presented to illustrate the stability and accuracy on a radially symmetric test case and two realistic tsunami modeling problems, including a hypothetical asteroid impact creating a short wavelength tsunami for which dispersive terms are necessary. Reproducibility of computational results. This paper has been awarded the “SIAM Reproducibility Badge: Code and data available” as a recognition that the authors have followed reproducibility principles valued by SISC and the scientific computing community. Code and data that allow readers to reproduce the results in this paper are available at https://github.com/rjleveque/ImplicitAMR-paper and in the supplementary materials (ImplicitAMR-paper.zip [174KB]).
SIAM 科学计算期刊》,第 46 卷第 4 期,第 B554-B578 页,2024 年 8 月。 摘要我们提出了一种利用基于补丁的自适应网格细化求解色散深度平均 Serre-Green-Naghdi 方程的算法。这些方程需要在非线性浅水方程中添加额外的高导数项。这已作为开源 GeoClaw 软件的一个新组件实施,该软件被广泛用于海啸、风暴潮和相关灾害的建模,提高了其对较短波长现象的精度。我们使用的方法需要在每个时间步求解一个椭圆方程组,因此该方法是隐式的。自适应算法允许在不同细化级别上采用不同的时间步长,并逐级求解隐式方程。计算实例展示了一个径向对称测试案例和两个现实海啸建模问题的稳定性和准确性,包括一个假想的小行星撞击产生的短波长海啸,其中分散项是必要的。计算结果的可重复性。本文被授予 "SIAM 可重现徽章":代码和数据可用",以表彰作者遵循了 SISC 和科学计算界重视的可重现性原则。读者可以通过 https://github.com/rjleveque/ImplicitAMR-paper 和补充材料(ImplicitAMR-paper.zip [174KB])中的代码和数据重现本文的结果。
{"title":"Implicit Adaptive Mesh Refinement for Dispersive Tsunami Propagation","authors":"Marsha J. Berger, Randall J. LeVeque","doi":"10.1137/23m1585210","DOIUrl":"https://doi.org/10.1137/23m1585210","url":null,"abstract":"SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page B554-B578, August 2024. <br/> Abstract. We present an algorithm to solve the dispersive depth-averaged Serre–Green–Naghdi equations using patch-based adaptive mesh refinement. These equations require adding additional higher derivative terms to the nonlinear shallow water equations. This has been implemented as a new component of the open source GeoClaw software that is widely used for modeling tsunamis, storm surge, and related hazards, improving its accuracy on shorter wavelength phenomena. We use a formulation that requires solving an elliptic system of equations at each time step, making the method implicit. The adaptive algorithm allows different time steps on different refinement levels and solves the implicit equations level by level. Computational examples are presented to illustrate the stability and accuracy on a radially symmetric test case and two realistic tsunami modeling problems, including a hypothetical asteroid impact creating a short wavelength tsunami for which dispersive terms are necessary. Reproducibility of computational results. This paper has been awarded the “SIAM Reproducibility Badge: Code and data available” as a recognition that the authors have followed reproducibility principles valued by SISC and the scientific computing community. Code and data that allow readers to reproduce the results in this paper are available at https://github.com/rjleveque/ImplicitAMR-paper and in the supplementary materials (ImplicitAMR-paper.zip [174KB]).","PeriodicalId":49526,"journal":{"name":"SIAM Journal on Scientific Computing","volume":"26 1","pages":""},"PeriodicalIF":3.1,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142217612","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Preconditioned Krylov Subspace Method for Linear Inverse Problems with General-Form Tikhonov Regularization 用通用形式提霍诺夫正则化处理线性逆问题的预处理克雷洛夫子空间方法
IF 3.1 2区 数学 Q1 MATHEMATICS, APPLIED Pub Date : 2024-08-14 DOI: 10.1137/23m1593802
Haibo Li
SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page A2607-A2633, August 2024.
Abstract. Tikhonov regularization is a widely used technique in solving inverse problems that can enforce prior properties on the desired solution. In this paper, we propose a Krylov subspace based iterative method for solving linear inverse problems with general-form Tikhonov regularization term [math], where [math] is a positive semidefinite matrix. An iterative process called the preconditioned Golub–Kahan bidiagonalization (pGKB) is designed, which implicitly utilizes a proper preconditioner to generate a series of solution subspaces with desirable properties encoded by the regularizer [math]. Based on the pGKB process, we propose an iterative regularization algorithm via projecting the original problem onto small dimensional solution subspaces. We analyze the regularization properties of this algorithm, including the incorporation of prior properties of the desired solution into the solution subspace and the semiconvergence behavior of the regularized solution. To overcome instabilities caused by semiconvergence, we further propose two pGKB based hybrid regularization algorithms. All the proposed algorithms are tested on both small-scale and large-scale linear inverse problems. Numerical results demonstrate that these iterative algorithms exhibit excellent performance, outperforming other state-of-the-art algorithms in some cases.
SIAM 科学计算期刊》,第 46 卷第 4 期,第 A2607-A2633 页,2024 年 8 月。 摘要Tikhonov 正则化是一种广泛应用于逆问题求解的技术,它可以对所求解强制执行先验特性。本文提出了一种基于 Krylov 子空间的迭代方法,用于求解带有一般形式 Tikhonov 正则化项 [math](其中 [math] 为正半有限矩阵)的线性逆问题。我们设计了一种称为预处理 Golub-Kahan 二对角化(pGKB)的迭代过程,它隐含地利用适当的预处理来生成一系列具有正则化[math]所编码的理想特性的求解子空间。基于 pGKB 过程,我们提出了一种通过将原始问题投影到小维度解子空间的迭代正则化算法。我们分析了该算法的正则化特性,包括将所需解的先验特性纳入解子空间以及正则化解的半收敛行为。为了克服半收敛引起的不稳定性,我们进一步提出了两种基于 pGKB 的混合正则化算法。所有提出的算法都在小型和大型线性逆问题上进行了测试。数值结果表明,这些迭代算法表现出色,在某些情况下优于其他最先进的算法。
{"title":"A Preconditioned Krylov Subspace Method for Linear Inverse Problems with General-Form Tikhonov Regularization","authors":"Haibo Li","doi":"10.1137/23m1593802","DOIUrl":"https://doi.org/10.1137/23m1593802","url":null,"abstract":"SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page A2607-A2633, August 2024. <br/> Abstract. Tikhonov regularization is a widely used technique in solving inverse problems that can enforce prior properties on the desired solution. In this paper, we propose a Krylov subspace based iterative method for solving linear inverse problems with general-form Tikhonov regularization term [math], where [math] is a positive semidefinite matrix. An iterative process called the preconditioned Golub–Kahan bidiagonalization (pGKB) is designed, which implicitly utilizes a proper preconditioner to generate a series of solution subspaces with desirable properties encoded by the regularizer [math]. Based on the pGKB process, we propose an iterative regularization algorithm via projecting the original problem onto small dimensional solution subspaces. We analyze the regularization properties of this algorithm, including the incorporation of prior properties of the desired solution into the solution subspace and the semiconvergence behavior of the regularized solution. To overcome instabilities caused by semiconvergence, we further propose two pGKB based hybrid regularization algorithms. All the proposed algorithms are tested on both small-scale and large-scale linear inverse problems. Numerical results demonstrate that these iterative algorithms exhibit excellent performance, outperforming other state-of-the-art algorithms in some cases.","PeriodicalId":49526,"journal":{"name":"SIAM Journal on Scientific Computing","volume":"3 1","pages":""},"PeriodicalIF":3.1,"publicationDate":"2024-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142227248","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A High-Order Fast Direct Solver for Surface PDEs 曲面 PDE 的高阶快速直接求解器
IF 3.1 2区 数学 Q1 MATHEMATICS, APPLIED Pub Date : 2024-08-13 DOI: 10.1137/22m1525259
Daniel Fortunato
SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page A2582-A2606, August 2024.
Abstract. We introduce a fast direct solver for variable-coefficient elliptic PDEs on surfaces based on the hierarchical Poincaré–Steklov method. The method takes as input an unstructured, high-order quadrilateral mesh of a surface and discretizes surface differential operators on each element using a high-order spectral collocation scheme. Elemental solution operators and Dirichlet-to-Neumann maps tangent to the surface are precomputed and merged in a pairwise fashion to yield a hierarchy of solution operators that may be applied in [math] operations for a mesh with [math] degrees of freedom. The resulting fast direct solver may be used to accelerate high-order implicit time-stepping schemes, as the precomputed operators can be reused for fast elliptic solves on surfaces. On a standard laptop, precomputation for a 12th-order surface mesh with over 1 million degrees of freedom takes 10 seconds, while subsequent solves take only 0.25 seconds. We apply the method to a range of problems on both smooth surfaces and surfaces with sharp corners and edges, including the static Laplace–Beltrami problem, the Hodge decomposition of a tangential vector field, and some time-dependent nonlinear reaction-diffusion systems. Reproducibility of computational results. This paper has been awarded the “SIAM Reproducibility Badge: code and data available”, as a recognition that the authors have followed reproducibility principles valued by SISC and the scientific computing community. Code and data that allow readers to reproduce the results in this paper are available at https://github.com/danfortunato/surface-hps-sisc.
SIAM 科学计算期刊》,第 46 卷第 4 期,第 A2582-A2606 页,2024 年 8 月。 摘要。我们介绍了一种基于分层 Poincaré-Steklov 方法的曲面上变系数椭圆 PDE 快速直接求解器。该方法以表面的非结构化高阶四边形网格为输入,使用高阶谱配位方案对每个元素上的表面微分算子进行离散化。预先计算元素求解算子和与曲面相切的 Dirichlet 到 Neumann 映射,并以成对的方式进行合并,以生成求解算子的层次结构,可用于具有 [math] 自由度的网格的 [math] 操作。由此产生的快速直接求解器可用于加速高阶隐式时间步进方案,因为预计算算子可重复用于曲面上的快速椭圆求解。在标准笔记本电脑上,对超过 100 万自由度的 12 阶曲面网格进行预计算需要 10 秒,而后续求解只需 0.25 秒。我们将该方法应用于一系列光滑表面和尖角及边缘表面的问题,包括静态拉普拉斯-贝尔特拉米问题、切向矢量场的霍奇分解,以及一些随时间变化的非线性反应扩散系统。计算结果的可重复性。本文被授予 "SIAM 可重复性徽章:代码和数据可用",以表彰作者遵循了 SISC 和科学计算界重视的可重复性原则。读者可通过 https://github.com/danfortunato/surface-hps-sisc 获取代码和数据,以重现本文中的结果。
{"title":"A High-Order Fast Direct Solver for Surface PDEs","authors":"Daniel Fortunato","doi":"10.1137/22m1525259","DOIUrl":"https://doi.org/10.1137/22m1525259","url":null,"abstract":"SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page A2582-A2606, August 2024. <br/> Abstract. We introduce a fast direct solver for variable-coefficient elliptic PDEs on surfaces based on the hierarchical Poincaré–Steklov method. The method takes as input an unstructured, high-order quadrilateral mesh of a surface and discretizes surface differential operators on each element using a high-order spectral collocation scheme. Elemental solution operators and Dirichlet-to-Neumann maps tangent to the surface are precomputed and merged in a pairwise fashion to yield a hierarchy of solution operators that may be applied in [math] operations for a mesh with [math] degrees of freedom. The resulting fast direct solver may be used to accelerate high-order implicit time-stepping schemes, as the precomputed operators can be reused for fast elliptic solves on surfaces. On a standard laptop, precomputation for a 12th-order surface mesh with over 1 million degrees of freedom takes 10 seconds, while subsequent solves take only 0.25 seconds. We apply the method to a range of problems on both smooth surfaces and surfaces with sharp corners and edges, including the static Laplace–Beltrami problem, the Hodge decomposition of a tangential vector field, and some time-dependent nonlinear reaction-diffusion systems. Reproducibility of computational results. This paper has been awarded the “SIAM Reproducibility Badge: code and data available”, as a recognition that the authors have followed reproducibility principles valued by SISC and the scientific computing community. Code and data that allow readers to reproduce the results in this paper are available at https://github.com/danfortunato/surface-hps-sisc.","PeriodicalId":49526,"journal":{"name":"SIAM Journal on Scientific Computing","volume":"23 1","pages":""},"PeriodicalIF":3.1,"publicationDate":"2024-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142217609","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Least-Squares Neural Network (LSNN) Method for Linear Advection-Reaction Equation: Discontinuity Interface 线性平流-反作用方程的最小二乘神经网络 (LSNN) 方法:不连续界面
IF 3.1 2区 数学 Q1 MATHEMATICS, APPLIED Pub Date : 2024-08-12 DOI: 10.1137/23m1568107
Zhiqiang Cai, Junpyo Choi, Min Liu
SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page C448-C478, August 2024.
Abstract. We studied the least-squares ReLU neural network (LSNN) method for solving a linear advection-reaction equation with discontinuous solution in [Z. Cai et al., J. Comput. Phys., 443 (2021), 110514]. The method is based on a least-squares formulation and uses a new class of approximating functions: ReLU neural network (NN) functions. A critical and additional component of the LSNN method, differing from other NN-based methods, is the introduction of a properly designed and physics preserved discrete differential operator. In this paper, we study the LSNN method for problems with discontinuity interfaces. First, we show that ReLU NN functions with depth [math] can approximate any [math]-dimensional step function on a discontinuity interface generated by a vector field as streamlines with any prescribed accuracy. By decomposing the solution into continuous and discontinuous parts, we prove theoretically that the discretization error of the LSNN method using ReLU NN functions with depth [math] is mainly determined by the continuous part of the solution provided that the solution jump is constant. Numerical results for both two- and three-dimensional test problems with various discontinuity interfaces show that the LSNN method with enough layers is accurate and does not exhibit the common Gibbs phenomena along discontinuity interfaces.
SIAM 科学计算期刊》,第 46 卷第 4 期,第 C448-C478 页,2024 年 8 月。 摘要我们在[Z. Cai et al., J. Comput. Phys., 443 (2021), 110514]一文中研究了求解具有不连续解的线性平流反应方程的最小二乘 ReLU 神经网络(LSNN)方法。该方法基于最小二乘法,并使用了一类新的近似函数:ReLU 神经网络 (NN) 函数。与其他基于 NN 的方法不同,LSNN 方法的一个关键和额外的组成部分是引入了一个经过适当设计并保留了物理特性的离散微分算子。在本文中,我们研究了针对不连续界面问题的 LSNN 方法。首先,我们证明了深度为[math]的 ReLU NN 函数可以以任意规定的精度将矢量场产生的不连续界面上的任意[math]维阶跃函数近似为流线。通过将解分解为连续部分和不连续部分,我们从理论上证明了使用深度[数学]ReLU NN 函数的 LSNN 方法的离散化误差主要由解的连续部分决定,前提是解的跳跃是恒定的。对具有各种不连续界面的二维和三维测试问题的数值结果表明,具有足够多层次的 LSNN 方法是精确的,不会在不连续界面上出现常见的吉布斯现象。
{"title":"Least-Squares Neural Network (LSNN) Method for Linear Advection-Reaction Equation: Discontinuity Interface","authors":"Zhiqiang Cai, Junpyo Choi, Min Liu","doi":"10.1137/23m1568107","DOIUrl":"https://doi.org/10.1137/23m1568107","url":null,"abstract":"SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page C448-C478, August 2024. <br/> Abstract. We studied the least-squares ReLU neural network (LSNN) method for solving a linear advection-reaction equation with discontinuous solution in [Z. Cai et al., J. Comput. Phys., 443 (2021), 110514]. The method is based on a least-squares formulation and uses a new class of approximating functions: ReLU neural network (NN) functions. A critical and additional component of the LSNN method, differing from other NN-based methods, is the introduction of a properly designed and physics preserved discrete differential operator. In this paper, we study the LSNN method for problems with discontinuity interfaces. First, we show that ReLU NN functions with depth [math] can approximate any [math]-dimensional step function on a discontinuity interface generated by a vector field as streamlines with any prescribed accuracy. By decomposing the solution into continuous and discontinuous parts, we prove theoretically that the discretization error of the LSNN method using ReLU NN functions with depth [math] is mainly determined by the continuous part of the solution provided that the solution jump is constant. Numerical results for both two- and three-dimensional test problems with various discontinuity interfaces show that the LSNN method with enough layers is accurate and does not exhibit the common Gibbs phenomena along discontinuity interfaces.","PeriodicalId":49526,"journal":{"name":"SIAM Journal on Scientific Computing","volume":"95 1","pages":""},"PeriodicalIF":3.1,"publicationDate":"2024-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142217611","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Levenberg–Marquardt Method for Nonsmooth Regularized Least Squares 非光滑正则化最小二乘法的 Levenberg-Marquardt 方法
IF 3.1 2区 数学 Q1 MATHEMATICS, APPLIED Pub Date : 2024-08-12 DOI: 10.1137/22m1538971
Aleksandr Y. Aravkin, Robert Baraldi, Dominique Orban
SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page A2557-A2581, August 2024.
Abstract. We develop a Levenberg–Marquardt method for minimizing the sum of a smooth nonlinear least-squares term [math] and a nonsmooth term [math]. Both [math] and [math] may be nonconvex. Steps are computed by minimizing the sum of a regularized linear least-squares model and a model of [math] using a first-order method such as the proximal gradient method. We establish global convergence to a first-order stationary point under the assumptions that [math] and its Jacobian are Lipschitz continuous and [math] is proper and lower semicontinuous. In the worst case, our method performs [math] iterations to bring a measure of stationarity below [math]. We also derive a trust-region variant that enjoys similar asymptotic worst-case iteration complexity as a special case of the trust-region algorithm of Aravkin, Baraldi, and Orban [SIAM J. Optim., 32 (2022), pp. 900–929]. We report numerical results on three examples: a group-lasso basis-pursuit denoise example, a nonlinear support vector machine, and parameter estimation in a neuroscience application. To implement those examples, we describe in detail how to evaluate proximal operators for separable [math] and for the group lasso with trust-region constraint. In all cases, the Levenberg–Marquardt methods perform fewer outer iterations than either a proximal gradient method with adaptive step length or a quasi-Newton trust-region method, neither of which exploit the least-squares structure of the problem. Our results also highlight the need for more sophisticated subproblem solvers than simple first-order methods.
SIAM 科学计算期刊》,第 46 卷第 4 期,第 A2557-A2581 页,2024 年 8 月。 摘要。我们开发了一种 Levenberg-Marquardt 方法,用于最小化平滑非线性最小二乘项 [math] 和非平滑项 [math] 之和。[math]和[math]都可能是非凸的。计算步骤是通过使用一阶方法(如近似梯度法)最小化正则化线性最小二乘模型与 [math] 模型之和。在[math]及其雅各布连为 Lipschitz 连续、[math]为适当的低半连续的假设条件下,我们建立了对一阶静止点的全局收敛。在最坏的情况下,我们的方法会进行 [math] 次迭代,使静止度低于 [math]。我们还推导了一个信任区域变体,它与 Aravkin、Baraldi 和 Orban 的信任区域算法的一个特例[SIAM J. Optim.,32 (2022),第 900-929 页]具有相似的渐进最坏情况迭代复杂度。我们报告了三个实例的数值结果:一个组-拉索基搜索去噪实例、一个非线性支持向量机和一个神经科学应用中的参数估计。为了实现这些示例,我们详细描述了如何评估可分离[数学]的近算子和带有信任区域约束的群拉索。在所有情况下,Levenberg-Marquardt 方法的外部迭代次数都少于具有自适应步长的近似梯度方法或准牛顿信任区域方法,而这两种方法都没有利用问题的最小二乘结构。我们的研究结果还突出表明,除了简单的一阶方法外,还需要更复杂的子问题求解器。
{"title":"A Levenberg–Marquardt Method for Nonsmooth Regularized Least Squares","authors":"Aleksandr Y. Aravkin, Robert Baraldi, Dominique Orban","doi":"10.1137/22m1538971","DOIUrl":"https://doi.org/10.1137/22m1538971","url":null,"abstract":"SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page A2557-A2581, August 2024. <br/> Abstract. We develop a Levenberg–Marquardt method for minimizing the sum of a smooth nonlinear least-squares term [math] and a nonsmooth term [math]. Both [math] and [math] may be nonconvex. Steps are computed by minimizing the sum of a regularized linear least-squares model and a model of [math] using a first-order method such as the proximal gradient method. We establish global convergence to a first-order stationary point under the assumptions that [math] and its Jacobian are Lipschitz continuous and [math] is proper and lower semicontinuous. In the worst case, our method performs [math] iterations to bring a measure of stationarity below [math]. We also derive a trust-region variant that enjoys similar asymptotic worst-case iteration complexity as a special case of the trust-region algorithm of Aravkin, Baraldi, and Orban [SIAM J. Optim., 32 (2022), pp. 900–929]. We report numerical results on three examples: a group-lasso basis-pursuit denoise example, a nonlinear support vector machine, and parameter estimation in a neuroscience application. To implement those examples, we describe in detail how to evaluate proximal operators for separable [math] and for the group lasso with trust-region constraint. In all cases, the Levenberg–Marquardt methods perform fewer outer iterations than either a proximal gradient method with adaptive step length or a quasi-Newton trust-region method, neither of which exploit the least-squares structure of the problem. Our results also highlight the need for more sophisticated subproblem solvers than simple first-order methods.","PeriodicalId":49526,"journal":{"name":"SIAM Journal on Scientific Computing","volume":"34 1","pages":""},"PeriodicalIF":3.1,"publicationDate":"2024-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142217628","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
SIAM Journal on Scientific Computing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1