首页 > 最新文献

Acta Numerica最新文献

英文 中文
Time parallelization for hyperbolic and parabolic problems 双曲型和抛物型问题的时间并行化
IF 14.2 1区 数学 Q1 MATHEMATICS Pub Date : 2025-07-01 DOI: 10.1017/s0962492924000072
Martin J. Gander, Shu-Lin Wu, Tao Zhou

Time parallelization, also known as PinT (parallel-in-time), is a new research direction for the development of algorithms used for solving very large-scale evolution problems on highly parallel computing architectures. Despite the fact that interesting theoretical work on PinT appeared as early as 1964, it was not until 2004, when processor clock speeds reached their physical limit, that research in PinT took off. A distinctive characteristic of parallelization in time is that information flow only goes forward in time, meaning that time evolution processes seem necessarily to be sequential. Nevertheless, many algorithms have been developed for PinT computations over the past two decades, and they are often grouped into four basic classes according to how the techniques work and are used: shooting-type methods; waveform relaxation methods based on domain decomposition; multigrid methods in space–time; and direct time parallel methods. However, over the past few years, it has been recognized that highly successful PinT algorithms for parabolic problems struggle when applied to hyperbolic problems. We will therefore focus on this important aspect, first by providing a summary of the fundamental differences between parabolic and hyperbolic problems for time parallelization. We then group PinT algorithms into two basic groups. The first group contains four effective PinT techniques for hyperbolic problems: Schwarz waveform relaxation (SWR) with its relation to tent pitching; parallel integral deferred correction; ParaExp; and ParaDiag. While the methods in the first group also work well for parabolic problems, we then present PinT methods specifically designed for parabolic problems in the second group: Parareal; the parallel full approximation scheme in space–time (PFASST); multigrid reduction in time (MGRiT); and space–time multigrid (STMG). We complement our analysis with numerical illustrations using four time-dependent PDEs: the heat equation; the advection–diffusion equation; Burgers’ equation; and the second-order wave equation.

时间并行化,也称为PinT (parallel-in- Time),是一种新的研究方向,用于在高度并行计算架构上解决非常大规模的进化问题。尽管早在1964年就出现了关于PinT的有趣理论工作,但直到2004年处理器时钟速度达到其物理极限时,对PinT的研究才开始起飞。时间上并行化的一个显著特征是信息流只在时间上向前移动,这意味着时间演化过程似乎必然是顺序的。尽管如此,在过去的二十年里,许多算法已经被开发出来用于品脱计算,它们通常根据技术的工作和使用方式分为四类:射击型方法;基于域分解的波形松弛方法;时空中的多重网格方法;和直接时间并行法。然而,在过去的几年里,人们已经认识到,对于抛物线问题非常成功的PinT算法在应用于双曲问题时遇到了困难。因此,我们将重点关注这一重要方面,首先概述时间并行化的抛物型和双曲型问题之间的基本区别。然后,我们将PinT算法分为两个基本组。第一组包含四种有效的双曲问题的品脱技术:Schwarz波形松弛(SWR)及其与帐篷俯仰度的关系;并行积分延迟校正;ParaExp;和ParaDiag。虽然第一组方法也适用于抛物线问题,但我们随后提出了专门为第二组抛物线问题设计的PinT方法:Parareal;平行时空全近似格式(PFASST);多网格时间缩减(MGRiT);时空多重网格(STMG)。我们用四个与时间相关的偏微分方程的数值插图来补充我们的分析:热方程;平流扩散方程;汉堡的方程;二阶波动方程。
{"title":"Time parallelization for hyperbolic and parabolic problems","authors":"Martin J. Gander, Shu-Lin Wu, Tao Zhou","doi":"10.1017/s0962492924000072","DOIUrl":"https://doi.org/10.1017/s0962492924000072","url":null,"abstract":"<p>Time parallelization, also known as PinT (parallel-in-time), is a new research direction for the development of algorithms used for solving very large-scale evolution problems on highly parallel computing architectures. Despite the fact that interesting theoretical work on PinT appeared as early as 1964, it was not until 2004, when processor clock speeds reached their physical limit, that research in PinT took off. A distinctive characteristic of parallelization in time is that information flow only goes forward in time, meaning that time evolution processes seem necessarily to be sequential. Nevertheless, many algorithms have been developed for PinT computations over the past two decades, and they are often grouped into four basic classes according to how the techniques work and are used: shooting-type methods; waveform relaxation methods based on domain decomposition; multigrid methods in space–time; and direct time parallel methods. However, over the past few years, it has been recognized that highly successful PinT algorithms for parabolic problems struggle when applied to hyperbolic problems. We will therefore focus on this important aspect, first by providing a summary of the fundamental differences between parabolic and hyperbolic problems for time parallelization. We then group PinT algorithms into two basic groups. The first group contains four effective PinT techniques for hyperbolic problems: Schwarz waveform relaxation (SWR) with its relation to tent pitching; parallel integral deferred correction; ParaExp; and ParaDiag. While the methods in the first group also work well for parabolic problems, we then present PinT methods specifically designed for parabolic problems in the second group: Parareal; the parallel full approximation scheme in space–time (PFASST); multigrid reduction in time (MGRiT); and space–time multigrid (STMG). We complement our analysis with numerical illustrations using four time-dependent PDEs: the heat equation; the advection–diffusion equation; Burgers’ equation; and the second-order wave equation.</p>","PeriodicalId":48863,"journal":{"name":"Acta Numerica","volume":"26 1","pages":""},"PeriodicalIF":14.2,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144520756","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Ensemble Kalman methods: A mean-field perspective 集合卡尔曼方法:平均场视角
IF 14.2 1区 数学 Q1 MATHEMATICS Pub Date : 2025-07-01 DOI: 10.1017/s0962492924000060
Edoardo Calvello, Sebastian Reich, Andrew M. Stuart

Ensemble Kalman methods, introduced in 1994 in the context of ocean state estimation, are now widely used for state estimation and parameter estimation (inverse problems) in many arenae. Their success stems from the fact that they take an underlying computational model as a black box to provide a systematic, derivative-free methodology for incorporating observations; furthermore the ensemble approach allows for sensitivities and uncertainties to be calculated. Analysis of the accuracy of ensemble Kalman methods, especially in terms of uncertainty quantification, is lagging behind empirical success; this paper provides a unifying mean-field-based framework for their analysis. Both state estimation and parameter estimation problems are considered, and formulations in both discrete and continuous time are employed. For state estimation problems, both the control and filtering approaches are considered; analogously for parameter estimation problems, the optimization and Bayesian perspectives are both studied. As well as providing an elegant framework, the mean-field perspective also allows for the derivation of a variety of methods used in practice. In addition it unifies a wide-ranging literature in the field and suggests open problems.

集合卡尔曼方法于1994年在海洋状态估计的背景下被引入,现在被广泛用于许多领域的状态估计和参数估计(逆问题)。他们的成功源于这样一个事实:他们把一个潜在的计算模型作为一个黑箱,为合并观察提供了一个系统的、无导数的方法;此外,集合方法允许计算灵敏度和不确定度。对集合卡尔曼方法的精度分析,特别是在不确定性量化方面,滞后于经验的成功;本文提供了一个统一的基于平均场的分析框架。同时考虑了状态估计和参数估计问题,并采用了离散时间和连续时间的计算公式。对于状态估计问题,考虑了控制和滤波两种方法;类似地,对于参数估计问题,优化和贝叶斯观点都进行了研究。除了提供一个优雅的框架外,平均场视角还允许推导出实践中使用的各种方法。此外,它统一了该领域广泛的文献,并提出了开放的问题。
{"title":"Ensemble Kalman methods: A mean-field perspective","authors":"Edoardo Calvello, Sebastian Reich, Andrew M. Stuart","doi":"10.1017/s0962492924000060","DOIUrl":"https://doi.org/10.1017/s0962492924000060","url":null,"abstract":"<p>Ensemble Kalman methods, introduced in 1994 in the context of ocean state estimation, are now widely used for state estimation and parameter estimation (inverse problems) in many arenae. Their success stems from the fact that they take an underlying computational model as a black box to provide a systematic, derivative-free methodology for incorporating observations; furthermore the ensemble approach allows for sensitivities and uncertainties to be calculated. Analysis of the accuracy of ensemble Kalman methods, especially in terms of uncertainty quantification, is lagging behind empirical success; this paper provides a unifying mean-field-based framework for their analysis. Both state estimation and parameter estimation problems are considered, and formulations in both discrete and continuous time are employed. For state estimation problems, both the control and filtering approaches are considered; analogously for parameter estimation problems, the optimization and Bayesian perspectives are both studied. As well as providing an elegant framework, the mean-field perspective also allows for the derivation of a variety of methods used in practice. In addition it unifies a wide-ranging literature in the field and suggests open problems.</p>","PeriodicalId":48863,"journal":{"name":"Acta Numerica","volume":"19 1","pages":""},"PeriodicalIF":14.2,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144521054","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sparse linear least-squares problems 稀疏线性最小二乘问题
IF 14.2 1区 数学 Q1 MATHEMATICS Pub Date : 2025-07-01 DOI: 10.1017/s0962492924000059
Jennifer Scott, Miroslav Tůma

Least-squares problems are a cornerstone of computational science and engineering. Over the years, the size of the problems that researchers and practitioners face has constantly increased, making it essential that sparsity is exploited in the solution process. The goal of this article is to present a broad review of key algorithms for solving large-scale linear least-squares problems. This includes sparse direct methods and algebraic preconditioners that are used in combination with iterative solvers. Where software is available, this is highlighted.

最小二乘问题是计算科学和工程的基石。多年来,研究人员和实践者面临的问题规模不断增加,使得在解决方案过程中利用稀疏性变得至关重要。本文的目的是对解决大规模线性最小二乘问题的关键算法进行综述。这包括与迭代求解器结合使用的稀疏直接方法和代数预条件。在有软件可用的地方,这是突出显示的。
{"title":"Sparse linear least-squares problems","authors":"Jennifer Scott, Miroslav Tůma","doi":"10.1017/s0962492924000059","DOIUrl":"https://doi.org/10.1017/s0962492924000059","url":null,"abstract":"<p>Least-squares problems are a cornerstone of computational science and engineering. Over the years, the size of the problems that researchers and practitioners face has constantly increased, making it essential that sparsity is exploited in the solution process. The goal of this article is to present a broad review of key algorithms for solving large-scale linear least-squares problems. This includes sparse direct methods and algebraic preconditioners that are used in combination with iterative solvers. Where software is available, this is highlighted.</p>","PeriodicalId":48863,"journal":{"name":"Acta Numerica","volume":"11 1","pages":""},"PeriodicalIF":14.2,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144520751","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cut finite element methods 切割有限元法
IF 14.2 1区 数学 Q1 MATHEMATICS Pub Date : 2025-07-01 DOI: 10.1017/s0962492925000017
Erik Burman, Peter Hansbo, Mats G. Larson, Sara Zahedi

Cut finite element methods (CutFEM) extend the standard finite element method to unfitted meshes, enabling the accurate resolution of domain boundaries and interfaces without requiring the mesh to conform to them. This approach preserves the key properties and accuracy of the standard method while addressing challenges posed by complex geometries and moving interfaces.

In recent years, CutFEM has gained significant attention for its ability to discretize partial differential equations in domains with intricate geometries. This paper provides a comprehensive review of the core concepts and key developments in CutFEM, beginning with its formulation for common model problems and the presentation of fundamental analytical results, including error estimates and condition number estimates for the resulting algebraic systems. Stabilization techniques for cut elements, which ensure numerical robustness, are also explored. Finally, extensions to methods involving Lagrange multipliers and applications to time-dependent problems are discussed.

切削有限元法(CutFEM)将标准有限元法扩展到非拟合网格,可以在不要求网格符合的情况下精确地求解域边界和界面。这种方法保留了标准方法的关键属性和准确性,同时解决了复杂几何形状和移动界面带来的挑战。近年来,CutFEM因其在复杂几何区域中离散偏微分方程的能力而受到广泛关注。本文对CutFEM的核心概念和关键发展进行了全面的回顾,从常见模型问题的表述和基本分析结果的介绍开始,包括对所得代数系统的误差估计和条件数估计。同时还探讨了保证数值鲁棒性的切割单元稳定化技术。最后,讨论了涉及拉格朗日乘子的方法的扩展及其在时变问题上的应用。
{"title":"Cut finite element methods","authors":"Erik Burman, Peter Hansbo, Mats G. Larson, Sara Zahedi","doi":"10.1017/s0962492925000017","DOIUrl":"https://doi.org/10.1017/s0962492925000017","url":null,"abstract":"<p>Cut finite element methods (CutFEM) extend the standard finite element method to unfitted meshes, enabling the accurate resolution of domain boundaries and interfaces without requiring the mesh to conform to them. This approach preserves the key properties and accuracy of the standard method while addressing challenges posed by complex geometries and moving interfaces.</p><p>In recent years, CutFEM has gained significant attention for its ability to discretize partial differential equations in domains with intricate geometries. This paper provides a comprehensive review of the core concepts and key developments in CutFEM, beginning with its formulation for common model problems and the presentation of fundamental analytical results, including error estimates and condition number estimates for the resulting algebraic systems. Stabilization techniques for cut elements, which ensure numerical robustness, are also explored. Finally, extensions to methods involving Lagrange multipliers and applications to time-dependent problems are discussed.</p>","PeriodicalId":48863,"journal":{"name":"Acta Numerica","volume":"1 1","pages":""},"PeriodicalIF":14.2,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144520754","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Distributionally robust optimization 分布鲁棒优化
IF 14.2 1区 数学 Q1 MATHEMATICS Pub Date : 2025-07-01 DOI: 10.1017/s0962492924000084
Daniel Kuhn, Soroosh Shafiee, Wolfram Wiesemann

Distributionally robust optimization (DRO) studies decision problems under uncertainty where the probability distribution governing the uncertain problem parameters is itself uncertain. A key component of any DRO model is its ambiguity set, that is, a family of probability distributions consistent with any available structural or statistical information. DRO seeks decisions that perform best under the worst distribution in the ambiguity set. This worst case criterion is supported by findings in psychology and neuroscience, which indicate that many decision-makers have a low tolerance for distributional ambiguity. DRO is rooted in statistics, operations research and control theory, and recent research has uncovered its deep connections to regularization techniques and adversarial training in machine learning. This survey presents the key findings of the field in a unified and self-contained manner.

分布鲁棒优化(DRO)研究不确定条件下的决策问题,其中控制不确定问题参数的概率分布本身是不确定的。任何DRO模型的一个关键组成部分是它的模糊集,即与任何可用的结构或统计信息一致的概率分布族。DRO寻求在模糊集的最差分布下表现最好的决策。心理学和神经科学的研究结果支持这种最坏情况的标准,表明许多决策者对分配模糊性的容忍度很低。DRO根植于统计学、运筹学和控制理论,最近的研究发现了它与机器学习中的正则化技术和对抗性训练的深刻联系。这项调查以统一和独立的方式呈现了该领域的主要发现。
{"title":"Distributionally robust optimization","authors":"Daniel Kuhn, Soroosh Shafiee, Wolfram Wiesemann","doi":"10.1017/s0962492924000084","DOIUrl":"https://doi.org/10.1017/s0962492924000084","url":null,"abstract":"<p>Distributionally robust optimization (DRO) studies decision problems under uncertainty where the probability distribution governing the uncertain problem parameters is itself uncertain. A key component of any DRO model is its ambiguity set, that is, a family of probability distributions consistent with any available structural or statistical information. DRO seeks decisions that perform best under the worst distribution in the ambiguity set. This worst case criterion is supported by findings in psychology and neuroscience, which indicate that many decision-makers have a low tolerance for distributional ambiguity. DRO is rooted in statistics, operations research and control theory, and recent research has uncovered its deep connections to regularization techniques and adversarial training in machine learning. This survey presents the key findings of the field in a unified and self-contained manner.</p>","PeriodicalId":48863,"journal":{"name":"Acta Numerica","volume":"19 1","pages":""},"PeriodicalIF":14.2,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144520825","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The discontinuous Petrov–Galerkin method 不连续Petrov-Galerkin方法
IF 14.2 1区 数学 Q1 MATHEMATICS Pub Date : 2025-07-01 DOI: 10.1017/s0962492924000102
Leszek Demkowicz, Jay Gopalakrishnan

The discontinuous Petrov–Galerkin (DPG) method is a Petrov–Galerkin finite element method with test functions designed for obtaining stability. These test functions are computable locally, element by element, and are motivated by optimal test functions which attain the supremum in an inf-sup condition. A profound consequence of the use of nearly optimal test functions is that the DPG method can inherit the stability of the (undiscretized) variational formulation, be it coercive or not. This paper combines a presentation of the fundamentals of the DPG ideas with a review of the ongoing research on theory and applications of the DPG methodology. The scope of the presented theory is restricted to linear problems on Hilbert spaces, but pointers to extensions are provided. Multiple viewpoints to the basic theory are provided. They show that the DPG method is equivalent to a method which minimizes a residual in a dual norm, as well as to a mixed method where one solution component is an approximate error representation function. Being a residual minimization method, the DPG method yields Hermitian positive definite stiffness matrix systems even for non-self-adjoint boundary value problems. Having a built-in error representation, the method has the out-of-the-box feature that it can immediately be used in automatic adaptive algorithms. Contrary to standard Galerkin methods, which are uninformed about test and trial norms, the DPG method must be equipped with a concrete test norm which enters the computations. Of particular interest are variational formulations in which one can tailor the norm to obtain robust stability. Key techniques to rigorously prove convergence of DPG schemes, including construction of Fortin operators, which in the DPG case can be done element by element, are discussed in detail. Pointers to open frontiers are presented.

不连续Petrov-Galerkin (DPG)法是一种Petrov-Galerkin有限元法,其测试函数是为获得稳定性而设计的。这些测试函数是局部可计算的,逐元素计算,并由最优测试函数驱动,该测试函数在不稳定条件下达到极值。使用近最优测试函数的一个深刻的结果是,DPG方法可以继承(未离散的)变分公式的稳定性,无论是否强制。本文将介绍DPG的基本思想,并对DPG方法论的理论和应用进行综述。提出的理论的范围仅限于Hilbert空间上的线性问题,但提供了扩展的指针。对基本理论提出了多种观点。他们表明DPG方法等价于最小化对偶范数残差的方法,也等价于一个解分量是近似误差表示函数的混合方法。DPG方法是一种残差最小化方法,即使对于非自伴随边值问题也能得到厄米正定刚度矩阵系统。由于具有内置的错误表示,该方法具有开箱即用的特性,可以立即用于自动自适应算法。与标准伽辽金法不知道试验和试验规范不同,DPG法必须有一个具体的试验规范进入计算。特别令人感兴趣的是变分公式,其中可以调整规范以获得鲁棒稳定性。详细讨论了严格证明DPG方案收敛性的关键技术,包括Fortin算子的构造,在DPG情况下,Fortin算子的构造可以逐元进行。给出了开放边界的指针。
{"title":"The discontinuous Petrov–Galerkin method","authors":"Leszek Demkowicz, Jay Gopalakrishnan","doi":"10.1017/s0962492924000102","DOIUrl":"https://doi.org/10.1017/s0962492924000102","url":null,"abstract":"<p>The discontinuous Petrov–Galerkin (DPG) method is a Petrov–Galerkin finite element method with test functions designed for obtaining stability. These test functions are computable locally, element by element, and are motivated by optimal test functions which attain the supremum in an inf-sup condition. A profound consequence of the use of nearly optimal test functions is that the DPG method can inherit the stability of the (undiscretized) variational formulation, be it coercive or not. This paper combines a presentation of the fundamentals of the DPG ideas with a review of the ongoing research on theory and applications of the DPG methodology. The scope of the presented theory is restricted to linear problems on Hilbert spaces, but pointers to extensions are provided. Multiple viewpoints to the basic theory are provided. They show that the DPG method is equivalent to a method which minimizes a residual in a dual norm, as well as to a mixed method where one solution component is an approximate error representation function. Being a residual minimization method, the DPG method yields Hermitian positive definite stiffness matrix systems even for non-self-adjoint boundary value problems. Having a built-in error representation, the method has the out-of-the-box feature that it can immediately be used in automatic adaptive algorithms. Contrary to standard Galerkin methods, which are uninformed about test and trial norms, the DPG method must be equipped with a concrete test norm which enters the computations. Of particular interest are variational formulations in which one can tailor the norm to obtain robust stability. Key techniques to rigorously prove convergence of DPG schemes, including construction of Fortin operators, which in the DPG case can be done element by element, are discussed in detail. Pointers to open frontiers are presented.</p>","PeriodicalId":48863,"journal":{"name":"Acta Numerica","volume":"633 1","pages":""},"PeriodicalIF":14.2,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144520755","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Acceleration methods for fixed-point iterations 定点迭代的加速方法
IF 14.2 1区 数学 Q1 MATHEMATICS Pub Date : 2025-07-01 DOI: 10.1017/s0962492924000096
Yousef Saad

A pervasive approach in scientific computing is to express the solution to a given problem as the limit of a sequence of vectors or other mathematical objects. In many situations these sequences are generated by slowly converging iterative procedures, and this led practitioners to seek faster alternatives to reach the limit. ‘Acceleration techniques’ comprise a broad array of methods specifically designed with this goal in mind. They started as a means of improving the convergence of general scalar sequences by various forms of ‘extrapolation to the limit’, i.e. by extrapolating the most recent iterates to the limit via linear combinations. Extrapolation methods of this type, the best-known of which is Aitken’s delta-squared process, require only the sequence of vectors as input.

However, limiting methods to use only the iterates is too restrictive. Accelerating sequences generated by fixed-point iterations by utilizing both the iterates and the fixed-point mapping itself has proved highly successful across various areas of physics. A notable example of these fixed-point accelerators (FP-accelerators) is a method developed by Donald Anderson in 1965 and now widely known as Anderson acceleration (AA). Furthermore, quasi-Newton and inexact Newton methods can also be placed in this category since they can be invoked to find limits of fixed-point iteration sequences by employing exactly the same ingredients as those of the FP-accelerators.

This paper presents an overview of these methods – with an emphasis on those, such as AA, that are geared toward accelerating fixed-point iterations. We will navigate through existing variants of accelerators, their implementations and their applications, to unravel the close connections between them. These connections were often not recognized by the originators of certain methods, who sometimes stumbled on slight variations of already established ideas. Furthermore, even though new accelerators were invented in different corners of science, the underlying principles behind them are strikingly similar or identical.

The plan of this article will approximately follow the historical trajectory of extrapolation and acceleration methods, beginning with a brief description of extrapolation ideas, followed by the special case of linear systems, the application to self-consistent field (SCF) iterations, and a detailed view of Anderson acceleration. The last part of the paper is concerned with more recent developments, including theoretical aspects, and a few thoughts on accelerating machine learning algorithms.

在科学计算中,一种普遍的方法是将给定问题的解表示为向量序列或其他数学对象的极限。在许多情况下,这些序列是由缓慢收敛的迭代过程生成的,这导致从业者寻求更快的替代方案来达到极限。“加速技术”包括一系列专门为实现这一目标而设计的方法。它们最初是作为一种改进一般标量序列收敛性的手段,通过各种形式的“外推到极限”,即通过线性组合外推最近的迭代到极限。这种类型的外推方法,其中最著名的是艾特肯的δ平方过程,只需要向量序列作为输入。但是,将方法限制为只使用迭代就太严格了。通过利用迭代和定点映射本身来加速由定点迭代生成的序列,在物理的各个领域都被证明是非常成功的。这些定点加速器(FP-accelerators)的一个显著例子是唐纳德·安德森(Donald Anderson)在1965年开发的一种方法,现在被广泛称为安德森加速(AA)。此外,准牛顿法和不精确牛顿法也可以归为这一类,因为它们可以通过使用与fp加速器完全相同的成分来调用,以找到定点迭代序列的极限。本文给出了这些方法的概述——重点是那些用于加速定点迭代的方法,比如AA。我们将浏览加速器的现有变体,它们的实现和应用,以揭示它们之间的密切联系。这些联系往往没有被某些方法的创始者认识到,他们有时会偶然发现已经确立的思想的细微变化。此外,尽管新的加速器是在不同的科学领域发明的,但它们背后的基本原理却惊人地相似或相同。本文的计划将大致遵循外推和加速方法的历史轨迹,首先简要描述外推思想,然后是线性系统的特殊情况,在自洽场(SCF)迭代中的应用,以及对安德森加速的详细看法。论文的最后一部分关注的是最近的发展,包括理论方面,以及一些关于加速机器学习算法的想法。
{"title":"Acceleration methods for fixed-point iterations","authors":"Yousef Saad","doi":"10.1017/s0962492924000096","DOIUrl":"https://doi.org/10.1017/s0962492924000096","url":null,"abstract":"<p>A pervasive approach in scientific computing is to express the solution to a given problem as the limit of a sequence of vectors or other mathematical objects. In many situations these sequences are generated by slowly converging iterative procedures, and this led practitioners to seek faster alternatives to reach the limit. ‘Acceleration techniques’ comprise a broad array of methods specifically designed with this goal in mind. They started as a means of improving the convergence of general scalar sequences by various forms of ‘extrapolation to the limit’, i.e. by extrapolating the most recent iterates to the limit via linear combinations. Extrapolation methods of this type, the best-known of which is Aitken’s delta-squared process, require only the sequence of vectors as input.</p><p>However, limiting methods to use only the iterates is too restrictive. Accelerating sequences generated by fixed-point iterations by utilizing both the iterates and the fixed-point mapping itself has proved highly successful across various areas of physics. A notable example of these fixed-point accelerators (FP-accelerators) is a method developed by Donald Anderson in 1965 and now widely known as Anderson acceleration (AA). Furthermore, quasi-Newton and inexact Newton methods can also be placed in this category since they can be invoked to find limits of fixed-point iteration sequences by employing exactly the same ingredients as those of the FP-accelerators.</p><p>This paper presents an overview of these methods – with an emphasis on those, such as AA, that are geared toward accelerating fixed-point iterations. We will navigate through existing variants of accelerators, their implementations and their applications, to unravel the close connections between them. These connections were often not recognized by the originators of certain methods, who sometimes stumbled on slight variations of already established ideas. Furthermore, even though new accelerators were invented in different corners of science, the underlying principles behind them are strikingly similar or identical.</p><p>The plan of this article will approximately follow the historical trajectory of extrapolation and acceleration methods, beginning with a brief description of extrapolation ideas, followed by the special case of linear systems, the application to self-consistent field (SCF) iterations, and a detailed view of Anderson acceleration. The last part of the paper is concerned with more recent developments, including theoretical aspects, and a few thoughts on accelerating machine learning algorithms.</p>","PeriodicalId":48863,"journal":{"name":"Acta Numerica","volume":"7 1","pages":""},"PeriodicalIF":14.2,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144521057","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimization problems governed by systems of PDEs with uncertainties 不确定偏微分方程系统的优化问题
IF 14.2 1区 数学 Q1 MATHEMATICS Pub Date : 2025-07-01 DOI: 10.1017/s0962492925000029
Matthias Heinkenschloss, Drew P. Kouri

This paper reviews current theoretical and numerical approaches to optimization problems governed by partial differential equations (PDEs) that depend on random variables or random fields. Such problems arise in many engineering, science, economics and societal decision-making tasks. This paper focuses on problems in which the governing PDEs are parametrized by the random variables/fields, and the decisions are made at the beginning and are not revised once uncertainty is revealed. Examples of such problems are presented to motivate the topic of this paper, and to illustrate the impact of different ways to model uncertainty in the formulations of the optimization problem and their impact on the solution. A linear–quadratic elliptic optimal control problem is used to provide a detailed discussion of the set-up for the risk-neutral optimization problem formulation, study the existence and characterization of its solution, and survey numerical methods for computing it. Different ways to model uncertainty in the PDE-constrained optimization problem are surveyed in an abstract setting, including risk measures, distributionally robust optimization formulations, probabilistic functions and chance constraints, and stochastic orders. Furthermore, approximation-based optimization approaches and stochastic methods for the solution of the large-scale PDE-constrained optimization problems under uncertainty are described. Some possible future research directions are outlined.

本文综述了目前研究依赖于随机变量或随机场的偏微分方程(PDEs)优化问题的理论和数值方法。这些问题出现在许多工程、科学、经济和社会决策任务中。本文主要研究了控制偏微分方程被随机变量/域参数化,并且在开始时做出决策,一旦发现不确定性就不修改决策的问题。这些问题的例子是为了激发本文的主题,并说明不同的方法来模拟不确定性的影响,在优化问题的公式及其对解决方案的影响。利用线性二次型椭圆型最优控制问题,详细讨论了风险中立优化问题表述的建立条件,研究了其解的存在性和性质,并探讨了计算该问题的数值方法。在一个抽象的环境下,研究了pde约束优化问题中不确定性建模的不同方法,包括风险度量、分布鲁棒优化公式、概率函数和机会约束以及随机顺序。在此基础上,给出了求解不确定条件下大规模pde约束优化问题的近似优化方法和随机方法。展望了未来可能的研究方向。
{"title":"Optimization problems governed by systems of PDEs with uncertainties","authors":"Matthias Heinkenschloss, Drew P. Kouri","doi":"10.1017/s0962492925000029","DOIUrl":"https://doi.org/10.1017/s0962492925000029","url":null,"abstract":"<p>This paper reviews current theoretical and numerical approaches to optimization problems governed by partial differential equations (PDEs) that depend on random variables or random fields. Such problems arise in many engineering, science, economics and societal decision-making tasks. This paper focuses on problems in which the governing PDEs are parametrized by the random variables/fields, and the decisions are made at the beginning and are not revised once uncertainty is revealed. Examples of such problems are presented to motivate the topic of this paper, and to illustrate the impact of different ways to model uncertainty in the formulations of the optimization problem and their impact on the solution. A linear–quadratic elliptic optimal control problem is used to provide a detailed discussion of the set-up for the risk-neutral optimization problem formulation, study the existence and characterization of its solution, and survey numerical methods for computing it. Different ways to model uncertainty in the PDE-constrained optimization problem are surveyed in an abstract setting, including risk measures, distributionally robust optimization formulations, probabilistic functions and chance constraints, and stochastic orders. Furthermore, approximation-based optimization approaches and stochastic methods for the solution of the large-scale PDE-constrained optimization problems under uncertainty are described. Some possible future research directions are outlined.</p>","PeriodicalId":48863,"journal":{"name":"Acta Numerica","volume":"26 1","pages":""},"PeriodicalIF":14.2,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144520758","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Splitting methods for differential equations 微分方程的分割方法
IF 14.2 1区 数学 Q1 MATHEMATICS Pub Date : 2024-09-04 DOI: 10.1017/s0962492923000077
Sergio Blanes, Fernando Casas, Ander Murua

This overview is devoted to splitting methods, a class of numerical integrators intended for differential equations that can be subdivided into different problems easier to solve than the original system. Closely connected with this class of integrators are composition methods, in which one or several low-order schemes are composed to construct higher-order numerical approximations to the exact solution. We analyse in detail the order conditions that have to be satisfied by these classes of methods to achieve a given order, and provide some insight about their qualitative properties in connection with geometric numerical integration and the treatment of highly oscillatory problems. Since splitting methods have received considerable attention in the realm of partial differential equations, we also cover this subject in the present survey, with special attention to parabolic equations and their problems. An exhaustive list of methods of different orders is collected and tested on simple examples. Finally, some applications of splitting methods in different areas, ranging from celestial mechanics to statistics, are also provided.

本综述主要介绍拆分方法,这是一类数值积分方法,适用于可细分为比原始系统更易求解的不同问题的微分方程。与这一类积分器密切相关的是组合方法,其中一个或多个低阶方案组成高阶数值近似精确解。我们详细分析了这几类方法为达到给定阶数而必须满足的阶数条件,并结合几何数值积分和高振荡问题的处理,对它们的定性特性提出了一些见解。由于分裂方法在偏微分方程领域受到了广泛关注,我们在本研究中也涉及这一主题,并特别关注抛物方程及其问题。我们收集了不同阶数的详尽方法列表,并在简单示例中进行了测试。最后,我们还介绍了分裂方法在从天体力学到统计学等不同领域的一些应用。
{"title":"Splitting methods for differential equations","authors":"Sergio Blanes, Fernando Casas, Ander Murua","doi":"10.1017/s0962492923000077","DOIUrl":"https://doi.org/10.1017/s0962492923000077","url":null,"abstract":"<p>This overview is devoted to splitting methods, a class of numerical integrators intended for differential equations that can be subdivided into different problems easier to solve than the original system. Closely connected with this class of integrators are composition methods, in which one or several low-order schemes are composed to construct higher-order numerical approximations to the exact solution. We analyse in detail the order conditions that have to be satisfied by these classes of methods to achieve a given order, and provide some insight about their qualitative properties in connection with geometric numerical integration and the treatment of highly oscillatory problems. Since splitting methods have received considerable attention in the realm of partial differential equations, we also cover this subject in the present survey, with special attention to parabolic equations and their problems. An exhaustive list of methods of different orders is collected and tested on simple examples. Finally, some applications of splitting methods in different areas, ranging from celestial mechanics to statistics, are also provided.</p>","PeriodicalId":48863,"journal":{"name":"Acta Numerica","volume":"31 1","pages":""},"PeriodicalIF":14.2,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142130832","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The geometry of monotone operator splitting methods 单调算子拆分法的几何原理
IF 14.2 1区 数学 Q1 MATHEMATICS Pub Date : 2024-09-04 DOI: 10.1017/s0962492923000065
Patrick L. Combettes

We propose a geometric framework to describe and analyse a wide array of operator splitting methods for solving monotone inclusion problems. The initial inclusion problem, which typically involves several operators combined through monotonicity-preserving operations, is seldom solvable in its original form. We embed it in an auxiliary space, where it is associated with a surrogate monotone inclusion problem with a more tractable structure and which allows for easy recovery of solutions to the initial problem. The surrogate problem is solved by successive projections onto half-spaces containing its solution set. The outer approximation half-spaces are constructed by using the individual operators present in the model separately. This geometric framework is shown to encompass traditional methods as well as state-of-the-art asynchronous block-iterative algorithms, and its flexible structure provides a pattern to design new ones.

我们提出了一个几何框架,用于描述和分析解决单调包含问题的各种算子拆分方法。初始包含问题通常涉及通过单调性保留操作组合的多个算子,很少能以其原始形式求解。我们将其嵌入一个辅助空间,在这个辅助空间中,它与一个具有更易处理结构的代理单调包含问题相关联,并允许轻松恢复初始问题的解。代问题通过连续投影到包含其解集的半空间来求解。外部近似半空间是通过分别使用模型中的各个算子来构建的。这一几何框架既包括传统方法,也包括最先进的异步分块迭代算法,其灵活的结构为设计新算法提供了模式。
{"title":"The geometry of monotone operator splitting methods","authors":"Patrick L. Combettes","doi":"10.1017/s0962492923000065","DOIUrl":"https://doi.org/10.1017/s0962492923000065","url":null,"abstract":"<p>We propose a geometric framework to describe and analyse a wide array of operator splitting methods for solving monotone inclusion problems. The initial inclusion problem, which typically involves several operators combined through monotonicity-preserving operations, is seldom solvable in its original form. We embed it in an auxiliary space, where it is associated with a surrogate monotone inclusion problem with a more tractable structure and which allows for easy recovery of solutions to the initial problem. The surrogate problem is solved by successive projections onto half-spaces containing its solution set. The outer approximation half-spaces are constructed by using the individual operators present in the model separately. This geometric framework is shown to encompass traditional methods as well as state-of-the-art asynchronous block-iterative algorithms, and its flexible structure provides a pattern to design new ones.</p>","PeriodicalId":48863,"journal":{"name":"Acta Numerica","volume":"25 1","pages":""},"PeriodicalIF":14.2,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142130834","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Acta Numerica
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1