SIAM Journal on Scientific Computing最新文献_第2页

Generalizing Lloyd’s Algorithm for Graph Clustering 图形聚类的劳埃德算法推广

IF 3.1 2区数学 Q1 MATHEMATICS, APPLIED

SIAM Journal on Scientific Computing

Pub Date : 2024-09-03 DOI: 10.1137/23m1556800

Tareq Zaman, Nicolas Nytko, Ali Taghibakhshi, Scott MacLachlan, Luke Olson, Matthew West

SIAM Journal on Scientific Computing, Volume 46, Issue 5, Page A2819-A2847, October 2024.
Abstract. Clustering is a commonplace problem in many areas of data science, with applications in biology and bioinformatics, understanding chemical structure, image segmentation, building recommender systems, and many more fields. While there are many different clustering variants (based on given distance or graph structure, probability distributions, or data density), we consider here the problem of clustering nodes in a graph, motivated by the problem of aggregating discrete degrees of freedom in multigrid and domain decomposition methods for solving sparse linear systems. Specifically, we consider the challenge of forming balanced clusters in the graph of a sparse matrix for use in algebraic multigrid, although the algorithm has general applicability. Based on an extension of the Bellman–Ford algorithm, we generalize Lloyd’s algorithm for partitioning subsets of [math] to balance the number of nodes in each cluster; this is accompanied by a rebalancing algorithm that reduces the overall energy in the system. The algorithm provides control over the number of clusters and leads to “well centered” partitions of the graph. Theoretical results are provided to establish linear complexity and numerical results in the context of algebraic multigrid highlight the benefits of improved clustering. Reproducibility of computational results. This paper has been awarded the “SIAM Reproducibility Badge: Code and data available” as a recognition that the authors have followed reproducibility principles valued by SISC and the scientific computing community. Code and data that allow readers to reproduce the results in this paper are available at https://github.com/lukeolson/paper-lloyd-data and in the supplementary materials (paper-lloyd-data.zip [88.1MB]).

SIAM 科学计算期刊》，第 46 卷第 5 期，第 A2819-A2847 页，2024 年 10 月。摘要聚类是数据科学许多领域的常见问题，应用于生物学和生物信息学、理解化学结构、图像分割、构建推荐系统等许多领域。虽然有许多不同的聚类变体（基于给定的距离或图结构、概率分布或数据密度），但我们在此考虑的是图中节点的聚类问题，其动机来自多网格和域分解方法中离散自由度的聚合问题，用于求解稀疏线性系统。具体来说，我们考虑的难题是如何在稀疏矩阵图中形成平衡的簇，以用于代数多网格，尽管该算法具有普遍适用性。基于 Bellman-Ford 算法的扩展，我们对 [math] 的 Lloyd 子集划分算法进行了推广，以平衡每个簇中的节点数量；同时还采用了一种再平衡算法，以降低系统的总体能量。该算法可控制簇的数量，并导致图的 "居中 "分区。理论结果确定了线性复杂性，代数多网格的数值结果突出了改进聚类的好处。计算结果的可重复性。本文被授予 "SIAM 可重现徽章"：代码和数据可用"，以表彰作者遵循了 SISC 和科学计算界重视的可重现性原则。读者可以通过 https://github.com/lukeolson/paper-lloyd-data 和补充材料（paper-lloyd-data.zip [88.1MB]）中的代码和数据重现本文的结果。

{"title":"Generalizing Lloyd’s Algorithm for Graph Clustering","authors":"Tareq Zaman, Nicolas Nytko, Ali Taghibakhshi, Scott MacLachlan, Luke Olson, Matthew West","doi":"10.1137/23m1556800","DOIUrl":"https://doi.org/10.1137/23m1556800","url":null,"abstract":"SIAM Journal on Scientific Computing, Volume 46, Issue 5, Page A2819-A2847, October 2024. Abstract. Clustering is a commonplace problem in many areas of data science, with applications in biology and bioinformatics, understanding chemical structure, image segmentation, building recommender systems, and many more fields. While there are many different clustering variants (based on given distance or graph structure, probability distributions, or data density), we consider here the problem of clustering nodes in a graph, motivated by the problem of aggregating discrete degrees of freedom in multigrid and domain decomposition methods for solving sparse linear systems. Specifically, we consider the challenge of forming balanced clusters in the graph of a sparse matrix for use in algebraic multigrid, although the algorithm has general applicability. Based on an extension of the Bellman–Ford algorithm, we generalize Lloyd’s algorithm for partitioning subsets of [math] to balance the number of nodes in each cluster; this is accompanied by a rebalancing algorithm that reduces the overall energy in the system. The algorithm provides control over the number of clusters and leads to “well centered” partitions of the graph. Theoretical results are provided to establish linear complexity and numerical results in the context of algebraic multigrid highlight the benefits of improved clustering. Reproducibility of computational results. This paper has been awarded the “SIAM Reproducibility Badge: Code and data available” as a recognition that the authors have followed reproducibility principles valued by SISC and the scientific computing community. Code and data that allow readers to reproduce the results in this paper are available at https://github.com/lukeolson/paper-lloyd-data and in the supplementary materials (paper-lloyd-data.zip [88.1MB]).","PeriodicalId":49526,"journal":{"name":"SIAM Journal on Scientific Computing","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142217598","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A Neural Network Approach for Stochastic Optimal Control 随机优化控制的神经网络方法

IF 3.1 2区数学 Q1 MATHEMATICS, APPLIED

SIAM Journal on Scientific Computing

Pub Date : 2024-09-03 DOI: 10.1137/23m155832x

Xingjian Li, Deepanshu Verma, Lars Ruthotto

SIAM Journal on Scientific Computing, Volume 46, Issue 5, Page C535-C556, October 2024.
Abstract. We present a neural network approach for approximating the value function of high-dimensional stochastic control problems. Our training process simultaneously updates our value function estimate and identifies the part of the state space likely to be visited by optimal trajectories. Our approach leverages insights from optimal control theory and the fundamental relation between semilinear parabolic partial differential equations and forward-backward stochastic differential equations. To focus the sampling on relevant states during neural network training, we use the stochastic Pontryagin maximum principle (PMP) to obtain the optimal controls for the current value function estimate. By design, our approach coincides with the method of characteristics for the nonviscous Hamilton–Jacobi–Bellman equation arising in deterministic control problems. Our training loss consists of a weighted sum of the objective functional of the control problem and penalty terms that enforce the HJB equations along the sampled trajectories. Importantly, training is unsupervised in that it does not require solutions of the control problem. Our numerical experiments highlight our scheme’s ability to identify the relevant parts of the state space and produce meaningful value estimates. Using a two-dimensional model problem, we demonstrate the importance of the stochastic PMP to inform the sampling and compare it to a finite element approach. With a nonlinear control affine quadcopter example, we illustrate that our approach can handle complicated dynamics. For a 100-dimensional benchmark problem, we demonstrate that our approach improves accuracy and time-to-solution, and, via a modification, we show the wider applicability of our scheme. Reproducibility of computational results.This paper has been awarded the “SIAM Reproducibility Badge: Code and data available” as recognition that the authors have followed reproducibility principles valued by SISC and the scientific computing community. Code and data that allow readers to reproduce the results in this paper are available at https://github.com/EmoryMLIP/NeuralSOC and in the supplementary material (NeuralSOC-main.zip [ 29.9MB]).

SIAM 科学计算期刊》，第 46 卷第 5 期，第 C535-C556 页，2024 年 10 月。摘要我们提出了一种近似高维随机控制问题价值函数的神经网络方法。我们的训练过程可同时更新我们的价值函数估计值，并确定最优轨迹可能访问的状态空间部分。我们的方法充分利用了最优控制理论以及半线性抛物线偏微分方程和前向后向随机微分方程之间的基本关系。为了在神经网络训练期间将采样重点放在相关状态上，我们使用随机庞特里亚金最大原则（PMP）来获得当前价值函数估计的最优控制。通过设计，我们的方法与确定性控制问题中出现的非粘性汉密尔顿-雅各比-贝尔曼方程的特征方法不谋而合。我们的训练损失由控制问题目标函数的加权和以及沿采样轨迹强制执行 HJB 方程的惩罚项组成。重要的是，训练是无监督的，因为它不需要控制问题的解决方案。我们的数值实验突出表明，我们的方案能够识别状态空间的相关部分，并产生有意义的值估计。通过一个二维模型问题，我们证明了随机 PMP 对采样的重要性，并将其与有限元方法进行了比较。通过一个非线性控制仿真四旋翼飞行器的例子，我们说明了我们的方法可以处理复杂的动力学问题。对于一个 100 维的基准问题，我们证明了我们的方法提高了准确性并缩短了求解时间，而且通过修改，我们展示了我们方案更广泛的适用性。计算结果的可重复性：本文被授予 "SIAM 可重复性徽章"：代码和数据可用"，以表彰作者遵循了 SISC 和科学计算界所珍视的可重现性原则。读者可以通过 https://github.com/EmoryMLIP/NeuralSOC 和补充材料（NeuralSOC-main.zip [ 29.9MB]）中的代码和数据重现本文的结果。

{"title":"A Neural Network Approach for Stochastic Optimal Control","authors":"Xingjian Li, Deepanshu Verma, Lars Ruthotto","doi":"10.1137/23m155832x","DOIUrl":"https://doi.org/10.1137/23m155832x","url":null,"abstract":"SIAM Journal on Scientific Computing, Volume 46, Issue 5, Page C535-C556, October 2024. Abstract. We present a neural network approach for approximating the value function of high-dimensional stochastic control problems. Our training process simultaneously updates our value function estimate and identifies the part of the state space likely to be visited by optimal trajectories. Our approach leverages insights from optimal control theory and the fundamental relation between semilinear parabolic partial differential equations and forward-backward stochastic differential equations. To focus the sampling on relevant states during neural network training, we use the stochastic Pontryagin maximum principle (PMP) to obtain the optimal controls for the current value function estimate. By design, our approach coincides with the method of characteristics for the nonviscous Hamilton–Jacobi–Bellman equation arising in deterministic control problems. Our training loss consists of a weighted sum of the objective functional of the control problem and penalty terms that enforce the HJB equations along the sampled trajectories. Importantly, training is unsupervised in that it does not require solutions of the control problem. Our numerical experiments highlight our scheme’s ability to identify the relevant parts of the state space and produce meaningful value estimates. Using a two-dimensional model problem, we demonstrate the importance of the stochastic PMP to inform the sampling and compare it to a finite element approach. With a nonlinear control affine quadcopter example, we illustrate that our approach can handle complicated dynamics. For a 100-dimensional benchmark problem, we demonstrate that our approach improves accuracy and time-to-solution, and, via a modification, we show the wider applicability of our scheme. Reproducibility of computational results.This paper has been awarded the “SIAM Reproducibility Badge: Code and data available” as recognition that the authors have followed reproducibility principles valued by SISC and the scientific computing community. Code and data that allow readers to reproduce the results in this paper are available at https://github.com/EmoryMLIP/NeuralSOC and in the supplementary material (NeuralSOC-main.zip [ 29.9MB]).","PeriodicalId":49526,"journal":{"name":"SIAM Journal on Scientific Computing","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142217597","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Bounds on Nonlinear Errors for Variance Computation with Stochastic Rounding 随机舍入方差计算的非线性误差限值

IF 3.1 2区数学 Q1 MATHEMATICS, APPLIED

SIAM Journal on Scientific Computing

Pub Date : 2024-09-03 DOI: 10.1137/23m1563001

E-M. El Arar, D. Sohier, P. de Oliveira Castro, E. Petit

SIAM Journal on Scientific Computing, Volume 46, Issue 5, Page B579-B599, October 2024.
Abstract. The main objective of this work is to investigate nonlinear errors and pairwise summation using stochastic rounding (SR) in variance computation algorithms. We estimate the forward error of computations under SR through two methods: the first is based on a bound of the variance and the Bienaymé–Chebyshev inequality, while the second is based on martingales and the Azuma–Hoeffding inequality. The study shows that for pairwise summation, using SR results in a probabilistic bound of the forward error proportional to [math] rather than the deterministic bound in [math] when using the default rounding mode. We examine two algorithms that compute the variance, one called “textbook” and the other “two-pass,” which both exhibit nonlinear errors. Using the two methods mentioned above, we show that the forward errors of these algorithms have probabilistic bounds under SR in [math] instead of [math] for the deterministic bounds. We show that this advantage holds using pairwise summation for both textbook and two-pass, with probabilistic bounds of the forward error proportional to [math]. Reproducibility of computational results. This paper has been awarded the “SIAM Reproducibility Badge: Code and data available” as recognition that the authors have followed reproducibility principles valued by SISC and the scientific computing community. Code and data that allow the reader to reproduce the results in this paper are available at https://github.com/verificarlo/sr-non-linear-bounds and in the supplementary material (sr-non-linear-bounds-main.zip [8.62KB]).

SIAM 科学计算期刊》，第 46 卷第 5 期，第 B579-B599 页，2024 年 10 月。摘要这项工作的主要目的是研究方差计算算法中使用随机舍入（SR）的非线性误差和成对求和。我们通过两种方法估算了 SR 条件下计算的前向误差：第一种方法基于方差约束和 Bienaymé-Chebyshev 不等式，第二种方法基于马氏不等式和 Azuma-Hoeffding 不等式。研究表明，对于成对求和，在使用默认舍入模式时，使用 SR 可以得到与[math]成比例的前向误差概率约束，而不是[math]中的确定性约束。我们研究了两种计算方差的算法，一种称为 "教科书式"，另一种称为 "双通式"，这两种算法都表现出非线性误差。利用上述两种方法，我们证明了这些算法的前向误差在[math]的 SR 下具有概率边界，而不是[math]的确定性边界。我们证明了这一优势在教科书算法和双程算法中使用成对求和时都是成立的，前向误差的概率边界与[math]成正比。计算结果的可重复性。本文被授予 "SIAM 可重现徽章"：代码和数据可用"，以表彰作者遵循了 SISC 和科学计算界重视的可重现性原则。读者可以通过 https://github.com/verificarlo/sr-non-linear-bounds 和补充材料（sr-non-linear-bounds-main.zip [8.62KB]）中的代码和数据重现本文的结果。

{"title":"Bounds on Nonlinear Errors for Variance Computation with Stochastic Rounding","authors":"E-M. El Arar, D. Sohier, P. de Oliveira Castro, E. Petit","doi":"10.1137/23m1563001","DOIUrl":"https://doi.org/10.1137/23m1563001","url":null,"abstract":"SIAM Journal on Scientific Computing, Volume 46, Issue 5, Page B579-B599, October 2024. Abstract. The main objective of this work is to investigate nonlinear errors and pairwise summation using stochastic rounding (SR) in variance computation algorithms. We estimate the forward error of computations under SR through two methods: the first is based on a bound of the variance and the Bienaymé–Chebyshev inequality, while the second is based on martingales and the Azuma–Hoeffding inequality. The study shows that for pairwise summation, using SR results in a probabilistic bound of the forward error proportional to [math] rather than the deterministic bound in [math] when using the default rounding mode. We examine two algorithms that compute the variance, one called “textbook” and the other “two-pass,” which both exhibit nonlinear errors. Using the two methods mentioned above, we show that the forward errors of these algorithms have probabilistic bounds under SR in [math] instead of [math] for the deterministic bounds. We show that this advantage holds using pairwise summation for both textbook and two-pass, with probabilistic bounds of the forward error proportional to [math]. Reproducibility of computational results. This paper has been awarded the “SIAM Reproducibility Badge: Code and data available” as recognition that the authors have followed reproducibility principles valued by SISC and the scientific computing community. Code and data that allow the reader to reproduce the results in this paper are available at https://github.com/verificarlo/sr-non-linear-bounds and in the supplementary material (sr-non-linear-bounds-main.zip [8.62KB]).","PeriodicalId":49526,"journal":{"name":"SIAM Journal on Scientific Computing","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142217603","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Efficient GMRES+AMG on GPUs: Composite Smoothers And Mixed [math]-Cycles GPU 上的高效 GMRES+AMG：复合平滑器和混合[数学]循环

IF 3.1 2区数学 Q1 MATHEMATICS, APPLIED

SIAM Journal on Scientific Computing

Pub Date : 2024-09-03 DOI: 10.1137/23m1578632

Stephen Thomas, Allison H. Baker

SIAM Journal on Scientific Computing, Ahead of Print.
Abstract. In this study, we introduce algorithms optimized for GPU architectures, aimed at efficiently solving large sparse linear systems, a central challenge in Navier–Stokes pressure projection problems. Our approach includes an adaptation of the GMRES algorithm, drawing inspiration from the merged vector operations first proposed by Bielich et al. [Parallel Comput., 112 (2022), 102940]. This adaptation increases computational intensity on GPU platforms through optimized vector update strategies. The algorithm incorporates modified and classical Gram–Schmidt methods with an algebraic multigrid (AMG) preconditioner, each tailored for GPU performance. A key innovation in our work is the development of a Gram–Schmidt projector [math] employing a rank-1 perturbation of the identity matrix. Designed to maximize the high memory bandwidth utilization of the AMD MI-250X GPU, this approach includes a strategy for treating the unit diagonal that minimizes memory reads, leading to a 25% increase in computational efficiency. The application of perturbation theory further ensures that orthogonality loss is limited to [math], where [math] is the number of iterations. Additionally, we introduce a mixed AMG [math]-cycle strategy combining ILU(0) and [math]-Jacobi smoothers, which achieves a 30–50% reduction in GPU compute times compared to conventional methods, while maintaining low backward error. This strategy, alongside our novel treatment of the diagonal in triangular matrices, marks a substantial increase in AMG efficicency for GPU systems. We believe that these contributions represent a significant advance in optimizing GMRES+AMG algorithms for GPU computations. The empirical results demonstrate notable speed increments and maintain rigorous backward error bounds, underscoring the potential of our methods to substantially increase computational efficiency in large-scale scientific applications.

SIAM 科学计算期刊》，提前印刷。摘要在本研究中，我们介绍了针对 GPU 架构进行优化的算法，旨在高效求解大型稀疏线性系统，这是 Navier-Stokes 压力投影问题的核心挑战。我们的方法包括对 GMRES 算法的改编，从 Bielich 等人首次提出的合并矢量运算中汲取灵感[《并行计算》，112 (2022)，102940]。这种调整通过优化向量更新策略提高了 GPU 平台的计算强度。该算法结合了修正的经典格兰-施密特方法和代数多网格（AMG）预处理器，每种方法都是为 GPU 性能量身定制的。我们工作中的一项关键创新是开发了一种格拉姆-施密特投影器[math]，采用了秩-1扰动特征矩阵。这种方法旨在最大限度地利用 AMD MI-250X GPU 的高内存带宽，其中包括一种处理单元对角线的策略，可最大限度地减少内存读取，从而将计算效率提高 25%。扰动理论的应用进一步确保了正交损失仅限于 [math]，其中 [math] 是迭代次数。此外，我们还引入了一种混合 AMG [math]循环策略，它结合了 ILU(0) 和 [math]-Jacobi 平滑器，与传统方法相比，GPU 计算时间减少了 30-50%，同时保持了较低的后向误差。这一策略以及我们对三角形矩阵对角线的新颖处理，标志着 GPU 系统 AMG 效率的大幅提升。我们相信，这些贡献代表了为 GPU 计算优化 GMRES+AMG 算法的重大进展。实证结果显示了显著的速度提升，并保持了严格的后向误差边界，突出了我们的方法在大规模科学应用中大幅提高计算效率的潜力。

{"title":"Efficient GMRES+AMG on GPUs: Composite Smoothers And Mixed [math]-Cycles","authors":"Stephen Thomas, Allison H. Baker","doi":"10.1137/23m1578632","DOIUrl":"https://doi.org/10.1137/23m1578632","url":null,"abstract":"SIAM Journal on Scientific Computing, Ahead of Print. Abstract. In this study, we introduce algorithms optimized for GPU architectures, aimed at efficiently solving large sparse linear systems, a central challenge in Navier–Stokes pressure projection problems. Our approach includes an adaptation of the GMRES algorithm, drawing inspiration from the merged vector operations first proposed by Bielich et al. [Parallel Comput., 112 (2022), 102940]. This adaptation increases computational intensity on GPU platforms through optimized vector update strategies. The algorithm incorporates modified and classical Gram–Schmidt methods with an algebraic multigrid (AMG) preconditioner, each tailored for GPU performance. A key innovation in our work is the development of a Gram–Schmidt projector [math] employing a rank-1 perturbation of the identity matrix. Designed to maximize the high memory bandwidth utilization of the AMD MI-250X GPU, this approach includes a strategy for treating the unit diagonal that minimizes memory reads, leading to a 25% increase in computational efficiency. The application of perturbation theory further ensures that orthogonality loss is limited to [math], where [math] is the number of iterations. Additionally, we introduce a mixed AMG [math]-cycle strategy combining ILU(0) and [math]-Jacobi smoothers, which achieves a 30–50% reduction in GPU compute times compared to conventional methods, while maintaining low backward error. This strategy, alongside our novel treatment of the diagonal in triangular matrices, marks a substantial increase in AMG efficicency for GPU systems. We believe that these contributions represent a significant advance in optimizing GMRES+AMG algorithms for GPU computations. The empirical results demonstrate notable speed increments and maintain rigorous backward error bounds, underscoring the potential of our methods to substantially increase computational efficiency in large-scale scientific applications.","PeriodicalId":49526,"journal":{"name":"SIAM Journal on Scientific Computing","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142217596","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Inverse Source Problem of the Biharmonic Equation from Multifrequency Phaseless Data 从多频无相数据看比谐波方程的反源问题

IF 3.1 2区数学 Q1 MATHEMATICS, APPLIED

SIAM Journal on Scientific Computing

Pub Date : 2024-09-03 DOI: 10.1137/24m162889x

Yan Chang, Yukun Guo, Yue Zhao

SIAM Journal on Scientific Computing, Volume 46, Issue 5, Page A2799-A2818, October 2024.
Abstract. This work deals with an inverse source problem for the biharmonic wave equation. A two-stage numerical method is proposed to identify the unknown source from the multifrequency phaseless data. In the first stage, we introduce some artificial auxiliary point sources to the inverse source system and establish a phase retrieval formula. Theoretically, we point out that the phase can be uniquely determined and estimate the stability of this phase retrieval approach. Once the phase information is retrieved, the Fourier method is adopted to reconstruct the source function from the phased multifrequency data. The proposed method is easy to implement and there is no forward solver involved in the reconstruction. Numerical experiments are conducted to verify the performance of the proposed method.

SIAM 科学计算期刊》，第 46 卷第 5 期，第 A2799-A2818 页，2024 年 10 月。摘要本研究涉及双谐波方程的反源问题。提出了一种两阶段数值方法，以从多频无相数据中识别未知源。在第一阶段，我们在反源系统中引入了一些人工辅助点源，并建立了一个相位检索公式。我们从理论上指出相位可以唯一确定，并估计了这种相位检索方法的稳定性。一旦检索到相位信息，就可以采用傅立叶方法从相位多频数据中重建源函数。所提出的方法易于实施，重建过程中不涉及前向求解器。为验证所提方法的性能，我们进行了数值实验。

引用次数: 0

A Pseudoreversible Normalizing Flow for Stochastic Dynamical Systems with Various Initial Distributions 具有各种初始分布的随机动力系统的伪逆转归一化流程

IF 3.1 2区数学 Q1 MATHEMATICS, APPLIED

SIAM Journal on Scientific Computing

Pub Date : 2024-08-21 DOI: 10.1137/23m1585635

Minglei Yang, Pengjun Wang, Diego del-Castillo-Negrete, Yanzhao Cao, Guannan Zhang

SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page C508-C533, August 2024.
Abstract. We present a pseudoreversible normalizing flow method for efficiently generating samples of the state of a stochastic differential equation (SDE) with various initial distributions. The primary objective is to construct an accurate and efficient sampler that can be used as a surrogate model for computationally expensive numerical integration of SDEs, such as those employed in particle simulation. After training, the normalizing flow model can directly generate samples of the SDE’s final state without simulating trajectories. The existing normalizing flow model for SDEs depends on the initial distribution, meaning the model needs to be retrained when the initial distribution changes. The main novelty of our normalizing flow model is that it can learn the conditional distribution of the state, i.e., the distribution of the final state conditional on any initial state, such that the model only needs to be trained once and the trained model can be used to handle various initial distributions. This feature can provide a significant computational saving in studies of how the final state varies with the initial distribution. Additionally, we propose to use a pseudoreversible network architecture to define the normalizing flow model, which has sufficient expressive power and training efficiency for a variety of SDEs in science and engineering, e.g., in particle physics. We provide a rigorous convergence analysis of the pseudoreversible normalizing flow model to the target probability density function in the Kullback–Leibler divergence metric. Numerical experiments are provided to demonstrate the effectiveness of the proposed normalizing flow model. Reproducibility of computational results. This paper has been awarded the “SIAM Reproducibility Badge: Code and data available” as a recognition that the authors have followed reproducibility principles valued by SISC and the scientific computing community. Code and data that allow readers to reproduce the results in this paper are available at https://github.com/mlmathphy/PRNF and in the supplementary materials (PRNF-main.zip [27.4MB]).

SIAM 科学计算期刊》，第 46 卷第 4 期，第 C508-C533 页，2024 年 8 月。摘要。我们提出了一种伪可逆归一化流方法，用于高效生成具有各种初始分布的随机微分方程（SDE）的状态样本。其主要目的是构建一种精确高效的采样器，可用作计算成本高昂的 SDE 数值积分的代用模型，如粒子模拟中使用的代用模型。经过训练后，归一化流模型可以直接生成 SDE 最终状态的样本，而无需模拟轨迹。现有的 SDE 归一化流模型依赖于初始分布，这意味着当初始分布发生变化时，模型需要重新训练。我们的归一化流模型的主要新颖之处在于它可以学习状态的条件分布，即以任意初始状态为条件的最终状态分布，因此模型只需训练一次，而且训练后的模型可用于处理各种初始分布。在研究最终状态如何随初始分布变化时，这一特点可以大大节省计算量。此外，我们还建议使用伪可逆网络架构来定义归一化流模型，它具有足够的表达能力和训练效率，适用于科学和工程领域的各种 SDE，例如粒子物理学。我们用 Kullback-Leibler 发散度量对伪可逆归一化流模型到目标概率密度函数进行了严格的收敛分析。我们还提供了数值实验来证明所提出的归一化流动模型的有效性。计算结果的可重复性。本文被授予 "SIAM 可重复性徽章"：代码和数据可用"，以表彰作者遵循了 SISC 和科学计算界重视的可重现性原则。读者可以通过 https://github.com/mlmathphy/PRNF 和补充材料（PRNF-main.zip [27.4MB]）中的代码和数据重现本文的结果。

{"title":"A Pseudoreversible Normalizing Flow for Stochastic Dynamical Systems with Various Initial Distributions","authors":"Minglei Yang, Pengjun Wang, Diego del-Castillo-Negrete, Yanzhao Cao, Guannan Zhang","doi":"10.1137/23m1585635","DOIUrl":"https://doi.org/10.1137/23m1585635","url":null,"abstract":"SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page C508-C533, August 2024. Abstract. We present a pseudoreversible normalizing flow method for efficiently generating samples of the state of a stochastic differential equation (SDE) with various initial distributions. The primary objective is to construct an accurate and efficient sampler that can be used as a surrogate model for computationally expensive numerical integration of SDEs, such as those employed in particle simulation. After training, the normalizing flow model can directly generate samples of the SDE’s final state without simulating trajectories. The existing normalizing flow model for SDEs depends on the initial distribution, meaning the model needs to be retrained when the initial distribution changes. The main novelty of our normalizing flow model is that it can learn the conditional distribution of the state, i.e., the distribution of the final state conditional on any initial state, such that the model only needs to be trained once and the trained model can be used to handle various initial distributions. This feature can provide a significant computational saving in studies of how the final state varies with the initial distribution. Additionally, we propose to use a pseudoreversible network architecture to define the normalizing flow model, which has sufficient expressive power and training efficiency for a variety of SDEs in science and engineering, e.g., in particle physics. We provide a rigorous convergence analysis of the pseudoreversible normalizing flow model to the target probability density function in the Kullback–Leibler divergence metric. Numerical experiments are provided to demonstrate the effectiveness of the proposed normalizing flow model. Reproducibility of computational results. This paper has been awarded the “SIAM Reproducibility Badge: Code and data available” as a recognition that the authors have followed reproducibility principles valued by SISC and the scientific computing community. Code and data that allow readers to reproduce the results in this paper are available at https://github.com/mlmathphy/PRNF and in the supplementary materials (PRNF-main.zip [27.4MB]).","PeriodicalId":49526,"journal":{"name":"SIAM Journal on Scientific Computing","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142227246","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A Sketch-and-Select Arnoldi Process 草图-选择-阿诺德工艺

IF 3.1 2区数学 Q1 MATHEMATICS, APPLIED

SIAM Journal on Scientific Computing

Pub Date : 2024-08-21 DOI: 10.1137/23m1588007

Stefan Güttel, Igor Simunec

SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page A2774-A2797, August 2024.
Abstract. A sketch-and-select Arnoldi process to generate a well-conditioned basis of a Krylov space at low cost is proposed. At each iteration the procedure utilizes randomized sketching to select a limited number of previously computed basis vectors to project out of the current basis vector. The computational cost grows linearly with the dimension of the Krylov space. The subset selection problem for the projection step is approximately solved with a number of heuristic algorithms and greedy methods used in statistical learning and compressive sensing. Reproducibility of computational results. This paper has been awarded the “SIAM Reproducibility Badge: Code and data available” as a recognition that the authors have followed reproducibility principles valued by SISC and the scientific computing community. Code and data that allow readers to reproduce the results in this paper are available at https://github.com/simunec/sketch-select-arnoldi and in the supplementary materials (sketch-select-arnoldi-main.zip [2.21MB]).

SIAM 科学计算期刊》，第 46 卷第 4 期，第 A2774-A2797 页，2024 年 8 月。摘要。本文提出了一种 "草图-选择阿诺德过程"（sketch-and-select Arnoldi process），以低成本生成条件良好的克雷洛夫空间基。在每次迭代时，该过程利用随机草图选择有限数量的先前计算的基向量，以投影出当前的基向量。计算成本与克雷洛夫空间的维度呈线性增长。投影步骤的子集选择问题可以通过统计学习和压缩传感中使用的一些启发式算法和贪婪方法近似解决。计算结果的可重复性。本文被授予 "SIAM 可重现徽章"：代码和数据可用"，以表彰作者遵循了 SISC 和科学计算界重视的可重现性原则。读者可以通过 https://github.com/simunec/sketch-select-arnoldi 和补充材料（sketch-select-arnoldi-main.zip [2.21MB]）中的代码和数据重现本文的结果。

引用次数: 0

A Fast Iterative PDE-Based Algorithm for Feedback Controls of Nonsmooth Mean-Field Control Problems 基于 PDE 的非光滑平均场控制问题反馈控制快速迭代算法

IF 3.1 2区数学 Q1 MATHEMATICS, APPLIED

SIAM Journal on Scientific Computing

Pub Date : 2024-08-20 DOI: 10.1137/21m1441158

Christoph Reisinger, Wolfgang Stockinger, Yufei Zhang

SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page A2737-A2773, August 2024.
Abstract. We propose a PDE-based accelerated gradient algorithm for optimal feedback controls of McKean–Vlasov dynamics that involve mean-field interactions both in the state and action. The method exploits a forward-backward splitting approach and iteratively refines the approximate controls based on the gradients of smooth costs, the proximal maps of nonsmooth costs, and dynamically updated momentum parameters. At each step, the state dynamics is approximated via a particle system, and the required gradient is evaluated through a coupled system of nonlocal linear PDEs. The latter is solved by finite difference approximation or neural network-based residual approximation, depending on the state dimension. We present exhaustive numerical experiments for low- and high-dimensional mean-field control problems, including sparse stabilization of stochastic Cucker–Smale models, which reveal that our algorithm captures important structures of the optimal feedback control and achieves a robust performance with respect to parameter perturbation.

SIAM 科学计算期刊》，第 46 卷第 4 期，第 A2737-A2773 页，2024 年 8 月。摘要我们提出了一种基于 PDE 的加速梯度算法，用于麦金-弗拉索夫（McKean-Vlasov）动力学的最优反馈控制。该方法利用前向-后向分裂方法，根据平滑代价的梯度、非平滑代价的近似图和动态更新的动量参数迭代改进近似控制。每一步都通过粒子系统对状态动态进行近似，并通过非局部线性 PDE 耦合系统评估所需梯度。后者根据状态维度，通过有限差分近似或基于神经网络的残差近似来求解。我们针对低维和高维均场控制问题（包括随机 Cucker-Smale 模型的稀疏稳定）进行了详尽的数值实验，结果表明我们的算法捕捉到了最优反馈控制的重要结构，并在参数扰动方面实现了稳健的性能。

{"title":"A Fast Iterative PDE-Based Algorithm for Feedback Controls of Nonsmooth Mean-Field Control Problems","authors":"Christoph Reisinger, Wolfgang Stockinger, Yufei Zhang","doi":"10.1137/21m1441158","DOIUrl":"https://doi.org/10.1137/21m1441158","url":null,"abstract":"SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page A2737-A2773, August 2024. Abstract. We propose a PDE-based accelerated gradient algorithm for optimal feedback controls of McKean–Vlasov dynamics that involve mean-field interactions both in the state and action. The method exploits a forward-backward splitting approach and iteratively refines the approximate controls based on the gradients of smooth costs, the proximal maps of nonsmooth costs, and dynamically updated momentum parameters. At each step, the state dynamics is approximated via a particle system, and the required gradient is evaluated through a coupled system of nonlocal linear PDEs. The latter is solved by finite difference approximation or neural network-based residual approximation, depending on the state dimension. We present exhaustive numerical experiments for low- and high-dimensional mean-field control problems, including sparse stabilization of stochastic Cucker–Smale models, which reveal that our algorithm captures important structures of the optimal feedback control and achieves a robust performance with respect to parameter perturbation.","PeriodicalId":49526,"journal":{"name":"SIAM Journal on Scientific Computing","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142217604","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Multilevel Parareal Algorithm with Averaging for Oscillatory Problems 针对振荡问题的多级 Parareal 平均算法

IF 3.1 2区数学 Q1 MATHEMATICS, APPLIED

SIAM Journal on Scientific Computing

Pub Date : 2024-08-20 DOI: 10.1137/23m1547123

Juliane Rosemeier, Terry Haut, Beth Wingate

SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page A2709-A2736, August 2024.
Abstract. The present study is an extension of the work done by Peddle, Haut, and Wingate [SIAM J. Sci. Comput., 41 (2019), pp. A3476–A3497] and Haut and Wingate [SIAM J. Sci. Comput., 36 (2014), pp. A693–A713], where a two-level Parareal method with mapping and averaging is examined. The method proposed in this paper is a multilevel Parareal method with arbitrarily many levels, which is not restricted to the two-level case. We give an asymptotic error estimate which reduces to the two-level estimate for the case when only two levels are considered. Introducing more than two levels has important consequences for the averaging procedure, as we choose separate averaging windows for each of the different levels, which is an additional new feature of the present study. The different averaging windows make the proposed method especially appropriate for nonlinear multiscale problems, because we can introduce a level for each intrinsic scale of the problem and adapt the averaging procedure such that we reproduce the behavior of the model on the particular scale resolved by the level. The method is applied to nonlinear differential equations. The nonlinearities can generate a range of frequencies in the problem. The computational cost of the new method is investigated and studied on several examples.

SIAM 科学计算期刊》，第 46 卷第 4 期，第 A2709-A2736 页，2024 年 8 月。摘要本研究是对 Peddle、Haut 和 Wingate [SIAM J. Sci. Comput., 41 (2019), pp. A3476-A3497] 以及 Haut 和 Wingate [SIAM J. Sci. Comput., 36 (2014), pp.本文提出的方法是一种具有任意多层次的多层次 Parareal 方法，它并不局限于两层情况。我们给出了一个渐近误差估计值，在只考虑两级的情况下，该估计值与两级估计值相减。引入两个以上的水平对平均过程有重要影响，因为我们为每个不同的水平选择了不同的平均窗口，这是本研究的另一个新特点。不同的平均窗口使所提出的方法特别适用于非线性多尺度问题，因为我们可以为问题的每个内在尺度引入一个层次，并调整平均程序，从而重现模型在该层次所解析的特定尺度上的行为。该方法适用于非线性微分方程。非线性会在问题中产生一系列频率。新方法的计算成本在几个例子中进行了调查和研究。

{"title":"Multilevel Parareal Algorithm with Averaging for Oscillatory Problems","authors":"Juliane Rosemeier, Terry Haut, Beth Wingate","doi":"10.1137/23m1547123","DOIUrl":"https://doi.org/10.1137/23m1547123","url":null,"abstract":"SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page A2709-A2736, August 2024. Abstract. The present study is an extension of the work done by Peddle, Haut, and Wingate [SIAM J. Sci. Comput., 41 (2019), pp. A3476–A3497] and Haut and Wingate [SIAM J. Sci. Comput., 36 (2014), pp. A693–A713], where a two-level Parareal method with mapping and averaging is examined. The method proposed in this paper is a multilevel Parareal method with arbitrarily many levels, which is not restricted to the two-level case. We give an asymptotic error estimate which reduces to the two-level estimate for the case when only two levels are considered. Introducing more than two levels has important consequences for the averaging procedure, as we choose separate averaging windows for each of the different levels, which is an additional new feature of the present study. The different averaging windows make the proposed method especially appropriate for nonlinear multiscale problems, because we can introduce a level for each intrinsic scale of the problem and adapt the averaging procedure such that we reproduce the behavior of the model on the particular scale resolved by the level. The method is applied to nonlinear differential equations. The nonlinearities can generate a range of frequencies in the problem. The computational cost of the new method is investigated and studied on several examples.","PeriodicalId":49526,"journal":{"name":"SIAM Journal on Scientific Computing","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142217605","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Implicit-explicit Schemes for Incompressible Flow Problems with Variable Viscosity 粘性可变的不可压缩流动问题的隐式-显式方案

IF 3.1 2区数学 Q1 MATHEMATICS, APPLIED

SIAM Journal on Scientific Computing

Pub Date : 2024-08-19 DOI: 10.1137/23m1606526

Gabriel Barrenechea, Ernesto Castillo, Douglas Pacheco

SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page A2660-A2682, August 2024.
Abstract. This article investigates different implicit-explicit (IMEX) methods for incompressible flows with variable viscosity. The viscosity field may depend on space and time alone or, for example, on velocity gradients. Unlike most previous works on IMEX schemes, which focus on the convective term, we propose also treating parts of the diffusive term explicitly, which can reduce the coupling between the velocity components. We present different IMEX alternatives for the variable-viscosity Navier–Stokes system, analyzing their theoretical and algorithmic properties. Temporal stability is proven for all the methods presented, including monolithic and fractional-step variants. These results are unconditional except for one of the fractional-step discretizations, whose stability is shown for time-step sizes under an upper bound that depends solely on the problem data. The key finding of this work is a class of IMEX schemes whose steps decouple the velocity components and are fully linearized (even if the viscosity depends nonlinearly on the velocity) without requiring any CFL condition for stability. Moreover, in the presence of Neumann boundaries, some of our formulations lead naturally to conditions involving normal pseudotractions. This generalizes to the variable-viscosity case what happens for the standard Laplacian form with constant viscosity. Our analysis is supported by a series of numerical experiments.

SIAM 科学计算期刊》，第 46 卷第 4 期，第 A2660-A2682 页，2024 年 8 月。摘要本文研究了粘性可变不可压缩流的不同隐式-显式（IMEX）方法。粘度场可能仅取决于空间和时间，也可能取决于速度梯度等。与以往大多数侧重于对流项的 IMEX 方案不同，我们建议同时显式处理部分扩散项，这样可以减少速度分量之间的耦合。我们针对变粘度 Navier-Stokes 系统提出了不同的 IMEX 替代方案，分析了它们的理论和算法特性。所介绍的所有方法，包括单步和分数步变体，都证明了时间稳定性。这些结果都是无条件的，只有一种分数步离散法例外，它的稳定性是在一个完全取决于问题数据的上限值下的时间步长。这项工作的关键发现是一类 IMEX 方案，其步骤与速度分量解耦并完全线性化（即使粘度非线性地依赖于速度），而不需要任何 CFL 稳定性条件。此外，在存在 Neumann 边界的情况下，我们的一些公式自然会引出涉及正常伪抽象的条件。这就把粘性恒定的标准拉普拉斯形式的情况推广到了粘性可变的情况。我们的分析得到了一系列数值实验的支持。

{"title":"Implicit-explicit Schemes for Incompressible Flow Problems with Variable Viscosity","authors":"Gabriel Barrenechea, Ernesto Castillo, Douglas Pacheco","doi":"10.1137/23m1606526","DOIUrl":"https://doi.org/10.1137/23m1606526","url":null,"abstract":"SIAM Journal on Scientific Computing, Volume 46, Issue 4, Page A2660-A2682, August 2024. Abstract. This article investigates different implicit-explicit (IMEX) methods for incompressible flows with variable viscosity. The viscosity field may depend on space and time alone or, for example, on velocity gradients. Unlike most previous works on IMEX schemes, which focus on the convective term, we propose also treating parts of the diffusive term explicitly, which can reduce the coupling between the velocity components. We present different IMEX alternatives for the variable-viscosity Navier–Stokes system, analyzing their theoretical and algorithmic properties. Temporal stability is proven for all the methods presented, including monolithic and fractional-step variants. These results are unconditional except for one of the fractional-step discretizations, whose stability is shown for time-step sizes under an upper bound that depends solely on the problem data. The key finding of this work is a class of IMEX schemes whose steps decouple the velocity components and are fully linearized (even if the viscosity depends nonlinearly on the velocity) without requiring any CFL condition for stability. Moreover, in the presence of Neumann boundaries, some of our formulations lead naturally to conditions involving normal pseudotractions. This generalizes to the variable-viscosity case what happens for the standard Laplacian form with constant viscosity. Our analysis is supported by a series of numerical experiments.","PeriodicalId":49526,"journal":{"name":"SIAM Journal on Scientific Computing","volume":null,"pages":null},"PeriodicalIF":3.1,"publicationDate":"2024-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142217626","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0