Foundations of Computational Mathematics最新文献

英文中文

Representations of the Symmetric Group are Decomposable in Polynomial Time 对称群的表示在多项式时间内是可分解的

IF 3 1区数学 Q2 COMPUTER SCIENCE, THEORY & METHODS

Foundations of Computational Mathematics

Pub Date : 2025-02-10 DOI: 10.1007/s10208-025-09697-8

Sheehan Olver

We introduce an algorithm to decompose matrix representations of the symmetric group over the reals into irreducible representations, which as a by-product also computes the multiplicities of the irreducible representations. The algorithm applied to a d-dimensional representation of (S_n) is shown to have a complexity of ({mathcal {O}}(n^2 d^3)) operations for determining which irreducible representations are present and their corresponding multiplicities and a further ({mathcal {O}}(n d^4)) operations to fully decompose representations with non-trivial multiplicities. These complexity bounds are pessimistic and in a practical implementation using floating point arithmetic and exploiting sparsity we observe better complexity. We demonstrate this algorithm on the problem of computing multiplicities of two tensor products of irreducible representations (the Kronecker coefficients problem) as well as higher order tensor products. For hook and hook-like irreducible representations the algorithm has polynomial complexity as n increases. We also demonstrate an application to constructing a basis of homogeneous polynomials so that applying a permutation of variables induces an irreducible representation.

介绍了一种将实数上对称群的矩阵表示分解为不可约表示的算法，该算法还计算了不可约表示的多重性。应用于(S_n)的d维表示的算法显示，用于确定存在哪些不可约表示及其相应的多重性的操作的复杂性为({mathcal {O}}(n^2 d^3))，以及用于完全分解具有非平凡多重性的表示的进一步({mathcal {O}}(n d^4))操作。这些复杂度界限是悲观的，在使用浮点运算和利用稀疏性的实际实现中，我们观察到更好的复杂度。我们在计算不可约表示的两个张量积的多重性问题（Kronecker系数问题）以及高阶张量积的问题上证明了该算法。对于钩子和类钩子不可约表示，算法复杂度随n的增加呈多项式。我们还演示了构造齐次多项式基的一个应用，以便应用变量的置换诱导出不可约表示。

{"title":"Representations of the Symmetric Group are Decomposable in Polynomial Time","authors":"Sheehan Olver","doi":"10.1007/s10208-025-09697-8","DOIUrl":"https://doi.org/10.1007/s10208-025-09697-8","url":null,"abstract":"We introduce an algorithm to decompose matrix representations of the symmetric group over the reals into irreducible representations, which as a by-product also computes the multiplicities of the irreducible representations. The algorithm applied to a d-dimensional representation of (S_n) is shown to have a complexity of ({mathcal {O}}(n^2 d^3)) operations for determining which irreducible representations are present and their corresponding multiplicities and a further ({mathcal {O}}(n d^4)) operations to fully decompose representations with non-trivial multiplicities. These complexity bounds are pessimistic and in a practical implementation using floating point arithmetic and exploiting sparsity we observe better complexity. We demonstrate this algorithm on the problem of computing multiplicities of two tensor products of irreducible representations (the Kronecker coefficients problem) as well as higher order tensor products. For hook and hook-like irreducible representations the algorithm has polynomial complexity as n increases. We also demonstrate an application to constructing a basis of homogeneous polynomials so that applying a permutation of variables induces an irreducible representation.","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"62 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2025-02-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143385014","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Safely Learning Dynamical Systems 安全学习动态系统

IF 3 1区数学 Q2 COMPUTER SCIENCE, THEORY & METHODS

Foundations of Computational Mathematics

Pub Date : 2025-02-04 DOI: 10.1007/s10208-025-09689-8

Amir Ali Ahmadi, Abraar Chaudhry, Vikas Sindhwani, Stephen Tu

A fundamental challenge in learning an unknown dynamical system is to reduce model uncertainty by making measurements while maintaining safety. In this work, we formulate a mathematical definition of what it means to safely learn a dynamical system by sequentially deciding where to initialize the next trajectory. In our framework, the state of the system is required to stay within a safety region for a horizon of T time steps under the action of all dynamical systems that (i) belong to a given initial uncertainty set, and (ii) are consistent with the information gathered so far. For our first set of results, we consider the setting of safely learning a linear dynamical system involving n states. For the case (T=1), we present a linear programming-based algorithm that either safely recovers the true dynamics from at most n trajectories, or certifies that safe learning is impossible. For (T=2), we give a semidefinite representation of the set of safe initial conditions and show that (lceil n/2 rceil ) trajectories generically suffice for safe learning. For (T = infty ), we provide semidefinite representable inner approximations of the set of safe initial conditions and show that one trajectory generically suffices for safe learning. Finally, we extend a number of our results to the cases where the initial uncertainty set contains sparse, low-rank, or permutation matrices, or when the dynamical system involves a control input. Our second set of results concerns the problem of safely learning a general class of nonlinear dynamical systems. For the case (T=1), we give a second-order cone programming based representation of the set of safe initial conditions. For (T=infty ), we provide semidefinite representable inner approximations to the set of safe initial conditions. We show how one can safely collect trajectories and fit a polynomial model of the nonlinear dynamics that is consistent with the initial uncertainty set and best agrees with the observations. We also present extensions of some of our results to the cases where the measurements are noisy or the dynamical system involves disturbances.

学习未知动力系统的一个基本挑战是在保持安全的情况下通过测量来减少模型的不确定性。在这项工作中，我们通过顺序决定初始化下一个轨迹的位置来制定安全学习动力系统的数学定义。在我们的框架中，在所有动力系统的作用下(i)属于给定的初始不确定性集，（ii）与迄今为止收集的信息一致的情况下，系统的状态需要在T个时间步长的视界内保持在安全区域内。对于我们的第一组结果，我们考虑安全学习涉及n个状态的线性动力系统的设置。对于(T=1)，我们提出了一种基于线性规划的算法，该算法可以安全地从最多n个轨迹中恢复真实动态，或者证明安全学习是不可能的。对于(T=2)，我们给出了一组安全初始条件的半确定表示，并表明(lceil n/2 rceil )轨迹一般足以满足安全学习。对于(T = infty )，我们提供了一组安全初始条件的半确定可表示的内部近似，并表明一个轨迹一般足以满足安全学习。最后，我们将我们的一些结果扩展到初始不确定性集包含稀疏，低秩或排列矩阵的情况下，或者当动力系统涉及控制输入时。我们的第二组结果涉及安全学习一类一般非线性动力系统的问题。对于(T=1)情况，我们给出了安全初始条件集合的二阶锥规划表示。对于(T=infty )，我们提供了安全初始条件集合的半定可表示内逼近。我们展示了如何安全地收集轨迹并拟合非线性动力学的多项式模型，该模型与初始不确定性集一致，并与观测结果最一致。我们还将我们的一些结果扩展到测量有噪声或动力系统包含扰动的情况。

{"title":"Safely Learning Dynamical Systems","authors":"Amir Ali Ahmadi, Abraar Chaudhry, Vikas Sindhwani, Stephen Tu","doi":"10.1007/s10208-025-09689-8","DOIUrl":"https://doi.org/10.1007/s10208-025-09689-8","url":null,"abstract":"A fundamental challenge in learning an unknown dynamical system is to reduce model uncertainty by making measurements while maintaining safety. In this work, we formulate a mathematical definition of what it means to safely learn a dynamical system by sequentially deciding where to initialize the next trajectory. In our framework, the state of the system is required to stay within a safety region for a horizon of T time steps under the action of all dynamical systems that (i) belong to a given initial uncertainty set, and (ii) are consistent with the information gathered so far. For our first set of results, we consider the setting of safely learning a linear dynamical system involving n states. For the case (T=1), we present a linear programming-based algorithm that either safely recovers the true dynamics from at most n trajectories, or certifies that safe learning is impossible. For (T=2), we give a semidefinite representation of the set of safe initial conditions and show that (lceil n/2 rceil ) trajectories generically suffice for safe learning. For (T = infty ), we provide semidefinite representable inner approximations of the set of safe initial conditions and show that one trajectory generically suffices for safe learning. Finally, we extend a number of our results to the cases where the initial uncertainty set contains sparse, low-rank, or permutation matrices, or when the dynamical system involves a control input. Our second set of results concerns the problem of safely learning a general class of nonlinear dynamical systems. For the case (T=1), we give a second-order cone programming based representation of the set of safe initial conditions. For (T=infty ), we provide semidefinite representable inner approximations to the set of safe initial conditions. We show how one can safely collect trajectories and fit a polynomial model of the nonlinear dynamics that is consistent with the initial uncertainty set and best agrees with the observations. We also present extensions of some of our results to the cases where the measurements are noisy or the dynamical system involves disturbances.","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"28 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2025-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143125227","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Stabilizing Decomposition of Multiparameter Persistence Modules 多参数持久模块的稳定分解

IF 3 1区数学 Q2 COMPUTER SCIENCE, THEORY & METHODS

Foundations of Computational Mathematics

Pub Date : 2025-01-27 DOI: 10.1007/s10208-025-09695-w

Håvard Bakke Bjerkevik

While decomposition of one-parameter persistence modules behaves nicely, as demonstrated by the algebraic stability theorem, decomposition of multiparameter modules is known to be unstable in a certain precise sense. Until now, it has not been clear that there is any way to get around this and build a meaningful stability theory for multiparameter module decomposition. We introduce new tools, in particular (epsilon )-refinements and (epsilon )-erosion neighborhoods, to start building such a theory. We then define the (epsilon )-pruning of a module, which is a new invariant acting like a “refined barcode” that shows great promise to extract features from a module by approximately decomposing it. Our main theorem can be interpreted as a generalization of the algebraic stability theorem to multiparameter modules up to a factor of 2r, where r is the maximal pointwise dimension of one of the modules. Furthermore, we show that the factor 2r is close to optimal. Finally, we discuss the possibility of strengthening the stability theorem for modules that decompose into pointwise low-dimensional summands, and pose a conjecture phrased purely in terms of basic linear algebra and graph theory that seems to capture the difficulty of doing this. We also show that this conjecture is relevant for other areas of multipersistence, like the computational complexity of approximating the interleaving distance, and recent applications of relative homological algebra to multipersistence.

单参数持久模块的分解具有良好的代数稳定性定理，而多参数模块的分解在一定精确意义上是不稳定的。到目前为止，还不清楚是否有任何方法可以绕过这个问题，并为多参数模块分解建立一个有意义的稳定性理论。我们引入了新的工具，特别是(epsilon ) -精化和(epsilon ) -侵蚀邻域，开始建立这样一个理论。然后我们定义一个模块的(epsilon ) -剪枝，这是一个新的不变量，它的作用就像一个“精致的条形码”，它很有希望通过近似分解从模块中提取特征。我们的主要定理可以被解释为代数稳定性定理在多参数模上的推广，直到2r的因子，其中r是其中一个模的最大点向维。进一步，我们证明因子2r接近最优。最后，我们讨论了加强分解为点向低维和的模块的稳定性定理的可能性，并提出了一个纯粹用基本线性代数和图论来表达的猜想，似乎抓住了这样做的困难。我们还表明，这个猜想与多持久性的其他领域有关，比如近似交错距离的计算复杂性，以及相对同态代数在多持久性中的最新应用。

{"title":"Stabilizing Decomposition of Multiparameter Persistence Modules","authors":"Håvard Bakke Bjerkevik","doi":"10.1007/s10208-025-09695-w","DOIUrl":"https://doi.org/10.1007/s10208-025-09695-w","url":null,"abstract":"While decomposition of one-parameter persistence modules behaves nicely, as demonstrated by the algebraic stability theorem, decomposition of multiparameter modules is known to be unstable in a certain precise sense. Until now, it has not been clear that there is any way to get around this and build a meaningful stability theory for multiparameter module decomposition. We introduce new tools, in particular (epsilon )-refinements and (epsilon )-erosion neighborhoods, to start building such a theory. We then define the (epsilon )-pruning of a module, which is a new invariant acting like a “refined barcode” that shows great promise to extract features from a module by approximately decomposing it. Our main theorem can be interpreted as a generalization of the algebraic stability theorem to multiparameter modules up to a factor of 2r, where r is the maximal pointwise dimension of one of the modules. Furthermore, we show that the factor 2r is close to optimal. Finally, we discuss the possibility of strengthening the stability theorem for modules that decompose into pointwise low-dimensional summands, and pose a conjecture phrased purely in terms of basic linear algebra and graph theory that seems to capture the difficulty of doing this. We also show that this conjecture is relevant for other areas of multipersistence, like the computational complexity of approximating the interleaving distance, and recent applications of relative homological algebra to multipersistence.","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"48 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2025-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143049647","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Optimal Regularization for a Data Source 数据源的最佳正则化

IF 3 1区数学 Q2 COMPUTER SCIENCE, THEORY & METHODS

Foundations of Computational Mathematics

Pub Date : 2025-01-27 DOI: 10.1007/s10208-025-09693-y

Oscar Leong, Eliza O’ Reilly, Yong Sheng Soh, Venkat Chandrasekaran

In optimization-based approaches to inverse problems and to statistical estimation, it is common to augment criteria that enforce data fidelity with a regularizer that promotes desired structural properties in the solution. The choice of a suitable regularizer is typically driven by a combination of prior domain information and computational considerations. Convex regularizers are attractive computationally but they are limited in the types of structure they can promote. On the other hand, nonconvex regularizers are more flexible in the forms of structure they can promote and they have showcased strong empirical performance in some applications, but they come with the computational challenge of solving the associated optimization problems. In this paper, we seek a systematic understanding of the power and the limitations of convex regularization by investigating the following questions: Given a distribution, what is the optimal regularizer for data drawn from the distribution? What properties of a data source govern whether the optimal regularizer is convex? We address these questions for the class of regularizers specified by functionals that are continuous, positively homogeneous, and positive away from the origin. We say that a regularizer is optimal for a data distribution if the Gibbs density with energy given by the regularizer maximizes the population likelihood (or equivalently, minimizes cross-entropy loss) over all regularizer-induced Gibbs densities. As the regularizers we consider are in one-to-one correspondence with star bodies, we leverage dual Brunn-Minkowski theory to show that a radial function derived from a data distribution is akin to a “computational sufficient statistic” as it is the key quantity for identifying optimal regularizers and for assessing the amenability of a data source to convex regularization. Using tools such as (Gamma )-convergence from variational analysis, we show that our results are robust in the sense that the optimal regularizers for a sample drawn from a distribution converge to their population counterparts as the sample size grows large. Finally, we give generalization guarantees for various families of star bodies that recover previous results for polyhedral regularizers (i.e., dictionary learning) and lead to new ones for a variety of classes of star bodies.

在反问题和统计估计的基于优化的方法中，通常会使用正则化器来增强标准，以增强数据保真度，从而提高解决方案中所需的结构属性。一个合适的正则化器的选择通常是由先验领域信息和计算考虑的组合驱动的。凸正则化在计算上很有吸引力，但它们在可以提升的结构类型上受到限制。另一方面，非凸正则化在结构形式上更灵活，它们可以促进，并且在一些应用中展示了强大的经验性能，但是它们带来了解决相关优化问题的计算挑战。在本文中，我们通过研究以下问题来寻求对凸正则化的能力和局限性的系统理解：给定一个分布，从该分布中提取的数据的最佳正则化器是什么？数据源的哪些属性决定了最优正则化器是否为凸？我们针对连续的、正齐次的、离原点正的泛函所指定的一类正则子来解决这些问题。我们说正则化器对于数据分布是最优的，如果由正则化器给出能量的吉布斯密度在所有正则化器诱导的吉布斯密度上使总体可能性最大化（或等效地，使交叉熵损失最小化）。由于我们考虑的正则化器与恒星体是一对一对应的，我们利用双重布伦-闵可夫斯基理论来表明，从数据分布中导出的径向函数类似于“计算充分统计”，因为它是识别最佳正则化器和评估数据源对凸正则化的适应性的关键量。使用变分分析的(Gamma ) -收敛等工具，我们证明了我们的结果是鲁棒的，因为从分布中提取的样本的最佳正则化器随着样本量的增加而收敛到与其对应的总体。最后，我们给出了各种星体族的泛化保证，这些保证恢复了多面体正则化器（即字典学习）的先前结果，并导致了各种星体类的新结果。

{"title":"Optimal Regularization for a Data Source","authors":"Oscar Leong, Eliza O’ Reilly, Yong Sheng Soh, Venkat Chandrasekaran","doi":"10.1007/s10208-025-09693-y","DOIUrl":"https://doi.org/10.1007/s10208-025-09693-y","url":null,"abstract":"In optimization-based approaches to inverse problems and to statistical estimation, it is common to augment criteria that enforce data fidelity with a regularizer that promotes desired structural properties in the solution. The choice of a suitable regularizer is typically driven by a combination of prior domain information and computational considerations. Convex regularizers are attractive computationally but they are limited in the types of structure they can promote. On the other hand, nonconvex regularizers are more flexible in the forms of structure they can promote and they have showcased strong empirical performance in some applications, but they come with the computational challenge of solving the associated optimization problems. In this paper, we seek a systematic understanding of the power and the limitations of convex regularization by investigating the following questions: Given a distribution, what is the optimal regularizer for data drawn from the distribution? What properties of a data source govern whether the optimal regularizer is convex? We address these questions for the class of regularizers specified by functionals that are continuous, positively homogeneous, and positive away from the origin. We say that a regularizer is optimal for a data distribution if the Gibbs density with energy given by the regularizer maximizes the population likelihood (or equivalently, minimizes cross-entropy loss) over all regularizer-induced Gibbs densities. As the regularizers we consider are in one-to-one correspondence with star bodies, we leverage dual Brunn-Minkowski theory to show that a radial function derived from a data distribution is akin to a “computational sufficient statistic” as it is the key quantity for identifying optimal regularizers and for assessing the amenability of a data source to convex regularization. Using tools such as (Gamma )-convergence from variational analysis, we show that our results are robust in the sense that the optimal regularizers for a sample drawn from a distribution converge to their population counterparts as the sample size grows large. Finally, we give generalization guarantees for various families of star bodies that recover previous results for polyhedral regularizers (i.e., dictionary learning) and lead to new ones for a variety of classes of star bodies. ","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"48 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2025-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143049667","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Sharp Bounds for Max-sliced Wasserstein Distances 最大切片Wasserstein距离的尖锐界

IF 3 1区数学 Q2 COMPUTER SCIENCE, THEORY & METHODS

Foundations of Computational Mathematics

Pub Date : 2025-01-22 DOI: 10.1007/s10208-025-09690-1

March T. Boedihardjo

We obtain essentially matching upper and lower bounds for the expected max-sliced 1-Wasserstein distance between a probability measure on a separable Hilbert space and its empirical distribution from n samples. By proving a Banach space version of this result, we also obtain an upper bound, that is sharp up to a log factor, for the expected max-sliced 2-Wasserstein distance between a symmetric probability measure (mu ) on a Euclidean space and its symmetrized empirical distribution in terms of the operator norm of the covariance matrix of (mu ) and the diameter of the support of (mu ).

我们从n个样本中获得可分离Hilbert空间上的概率测度与其经验分布之间的期望最大切片1-Wasserstein距离的基本匹配上界和下界。通过证明这一结果的Banach空间版本，我们也得到了欧几里德空间上对称概率测度(mu )与其对称经验分布（以协方差矩阵(mu )的算子范数和(mu )的支撑直径表示）之间的期望最大切2-Wasserstein距离的上界，该上界精确到一个对数因子。

引用次数: 0

Active Manifolds, Stratifications, and Convergence to Local Minima in Nonsmooth Optimization 非光滑优化中的主动流形、分层和局部最小收敛

IF 3 1区数学 Q2 COMPUTER SCIENCE, THEORY & METHODS

Foundations of Computational Mathematics

Pub Date : 2025-01-22 DOI: 10.1007/s10208-025-09691-0

Damek Davis, Dmitriy Drusvyatskiy, Liwei Jiang

In this work, we develop new regularity conditions in nonsmooth analysis that parallel the stratification conditions of Whitney, Kuo, and Verdier. They quantify how subgradients interact with a certain “active manifold” that captures the nonsmooth activity of the function. Based on these new conditions, we show that several subgradient-based methods converge only to local minimizers when applied to generic Lipschitz and subdifferentially regular functions that are definable in an o-minimal structure. At a high level, our argument is appealingly transparent: we interpret the nonsmooth dynamics as an approximate Riemannian gradient method on the active manifold. As a by-product, we extend the stochastic processes techniques of Pemantle.

在这项工作中，我们在非光滑分析中开发了新的规则条件，与Whitney， Kuo和Verdier的分层条件平行。他们量化了子梯度如何与捕获函数的非平滑活动的某个“活动流形”相互作用。基于这些新的条件，我们证明了几种基于次梯度的方法，当应用于可在0 -极小结构中定义的一般Lipschitz函数和次微分正则函数时，只收敛于局部极小值。在高水平上，我们的论证是透明的：我们将非光滑动力学解释为活动流形上的近似黎曼梯度方法。作为一个副产品，我们扩展了Pemantle的随机过程技术。

引用次数: 0

Optimal Convergence Rates for the Spectrum of the Graph Laplacian on Poisson Point Clouds 泊松点云上图拉普拉斯谱的最优收敛速率

IF 3 1区数学 Q2 COMPUTER SCIENCE, THEORY & METHODS

Foundations of Computational Mathematics

Pub Date : 2025-01-22 DOI: 10.1007/s10208-025-09696-9

Scott Armstrong, Raghavendra Venkatraman

We prove optimal convergence rates for eigenvalues and eigenvectors of the graph Laplacian on Poisson point clouds. Our results are valid down to the critical percolation threshold, yielding error estimates for relatively sparse graphs.

证明了泊松点云上拉普拉斯图的特征值和特征向量的最优收敛速率。我们的结果是有效的，直到临界渗透阈值，产生相对稀疏图的误差估计。

引用次数: 0

Accuracy Controlled Schemes for the Eigenvalue Problem of the Radiative Transfer Equation 辐射传递方程特征值问题的精度控制格式

IF 3 1区数学 Q2 COMPUTER SCIENCE, THEORY & METHODS

Foundations of Computational Mathematics

Pub Date : 2025-01-21 DOI: 10.1007/s10208-025-09694-x

Wolfgang Dahmen, Olga Mula

The criticality problem in nuclear engineering asks for the principal eigenpair of a Boltzmann operator describing neutron transport in a reactor core. Being able to reliably design, and control such reactors requires assessing these quantities within quantifiable accuracy tolerances. In this paper, we propose a paradigm that deviates from the common practice of approximately solving the corresponding spectral problem with a fixed, presumably sufficiently fine discretization. Instead, the present approach is based on first contriving iterative schemes, formulated in function space, that are shown to converge at a quantitative rate without assuming any a priori excess regularity properties, and that exploit only properties of the optical parameters in the underlying radiative transfer model. We develop the analytical and numerical tools for approximately realizing each iteration step within judiciously chosen accuracy tolerances, verified by a posteriori estimates, so as to still warrant quantifiable convergence to the exact eigenpair. This is carried out in full first for a Newton scheme. Since this is only locally convergent we analyze in addition the convergence of a power iteration in function space to produce sufficiently accurate initial guesses. Here we have to deal with intrinsic difficulties posed by compact but unsymmetric operators preventing standard arguments used in the finite dimensional case. Our main point is that we can avoid any condition on an initial guess to be already in a small neighborhood of the exact solution. We close with a discussion of remaining intrinsic obstructions to a certifiable numerical implementation, mainly related to not knowing the gap between the principal eigenvalue and the next smaller one in modulus.

核工程中的临界问题要求描述反应堆堆芯中中子输运的玻尔兹曼算子的主特征对。为了能够可靠地设计和控制这些反应器，需要在可量化的精度公差范围内评估这些数量。在本文中，我们提出了一种范式，它偏离了用固定的、可能足够精细的离散化近似解决相应光谱问题的常见做法。相反，目前的方法是基于首先设计的迭代方案，在函数空间中表述，显示出以定量速率收敛，而不假设任何先验的超额正则性，并且仅利用底层辐射传递模型中的光学参数的性质。我们开发了解析和数值工具，以在明智选择的精度公差范围内近似实现每个迭代步骤，并通过后验估计进行验证，以便仍然保证可量化收敛到精确的特征对。这首先在牛顿格式中完全实现。由于这只是局部收敛的，我们还分析了幂迭代在函数空间中的收敛性，以产生足够准确的初始猜测。在这里，我们必须处理紧但不对称的操作符所带来的内在困难，这些操作符阻止在有限维情况下使用标准参数。我们的主要观点是，我们可以避免初始猜测已经处于精确解的小邻域的任何条件。我们最后讨论了可证明的数值实现的剩余固有障碍，主要与不知道主特征值与下一个较小的模之间的差距有关。

{"title":"Accuracy Controlled Schemes for the Eigenvalue Problem of the Radiative Transfer Equation","authors":"Wolfgang Dahmen, Olga Mula","doi":"10.1007/s10208-025-09694-x","DOIUrl":"https://doi.org/10.1007/s10208-025-09694-x","url":null,"abstract":"The criticality problem in nuclear engineering asks for the principal eigenpair of a Boltzmann operator describing neutron transport in a reactor core. Being able to reliably design, and control such reactors requires assessing these quantities within quantifiable accuracy tolerances. In this paper, we propose a paradigm that deviates from the common practice of approximately solving the corresponding spectral problem with a fixed, presumably sufficiently fine discretization. Instead, the present approach is based on first contriving iterative schemes, formulated in function space, that are shown to converge at a quantitative rate without assuming any a priori excess regularity properties, and that exploit only properties of the optical parameters in the underlying radiative transfer model. We develop the analytical and numerical tools for approximately realizing each iteration step within judiciously chosen accuracy tolerances, verified by a posteriori estimates, so as to still warrant quantifiable convergence to the exact eigenpair. This is carried out in full first for a Newton scheme. Since this is only locally convergent we analyze in addition the convergence of a power iteration in function space to produce sufficiently accurate initial guesses. Here we have to deal with intrinsic difficulties posed by compact but unsymmetric operators preventing standard arguments used in the finite dimensional case. Our main point is that we can avoid any condition on an initial guess to be already in a small neighborhood of the exact solution. We close with a discussion of remaining intrinsic obstructions to a certifiable numerical implementation, mainly related to not knowing the gap between the principal eigenvalue and the next smaller one in modulus. ","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"20 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2025-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142991448","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Conley Index for Multivalued Maps on Finite Topological Spaces 有限拓扑空间上多值映射的Conley索引

IF 3 1区数学 Q2 COMPUTER SCIENCE, THEORY & METHODS

Foundations of Computational Mathematics

Pub Date : 2024-12-09 DOI: 10.1007/s10208-024-09685-4

Jonathan Barmak, Marian Mrozek, Thomas Wanner

We develop Conley’s theory for multivalued maps on finite topological spaces. More precisely, for discrete-time dynamical systems generated by the iteration of a multivalued map which satisfies appropriate regularity conditions, we establish the notions of isolated invariant sets and index pairs, and use them to introduce a well-defined Conley index. In addition, we verify some of its fundamental properties such as the Ważewski property and continuation.

我们发展了有限拓扑空间上多值映射的Conley理论。更准确地说，对于由满足适当正则性条件的多值映射迭代生成的离散动力系统，我们建立了孤立不变集和指标对的概念，并利用它们引入了定义良好的Conley指标。此外，我们还验证了它的一些基本性质，如Ważewski性质和延拓性。

引用次数: 0

Generalized Pseudospectral Shattering and Inverse-Free Matrix Pencil Diagonalization 广义伪谱破碎与无逆矩阵铅笔对角化

IF 3 1区数学 Q2 COMPUTER SCIENCE, THEORY & METHODS

Foundations of Computational Mathematics

Pub Date : 2024-12-09 DOI: 10.1007/s10208-024-09682-7

James Demmel, Ioana Dumitriu, Ryan Schneider

We present a randomized, inverse-free algorithm for producing an approximate diagonalization of any (n times n) matrix pencil (A, B). The bulk of the algorithm rests on a randomized divide-and-conquer eigensolver for the generalized eigenvalue problem originally proposed by Ballard, Demmel and Dumitriu (Technical Report 2010). We demonstrate that this divide-and-conquer approach can be formulated to succeed with high probability provided the input pencil is sufficiently well-behaved, which is accomplished by generalizing the recent pseudospectral shattering work of Banks, Garza-Vargas, Kulkarni and Srivastava (Foundations of Computational Mathematics 2023). In particular, we show that perturbing and scaling (A, B) regularizes its pseudospectra, allowing divide-and-conquer to run over a simple random grid and in turn producing an accurate diagonalization of (A, B) in the backward error sense. The main result of the paper states the existence of a randomized algorithm that with high probability (and in exact arithmetic) produces invertible S, T and diagonal D such that (||A - SDT^{-1}||_2 le varepsilon ) and (||B - ST^{-1}||_2 le varepsilon ) in at most (O left( log ^2 left( frac{n}{varepsilon } right) T_{text {MM}}(n) right) ) operations, where (T_{text {MM}}(n)) is the asymptotic complexity of matrix multiplication. This not only provides a new set of guarantees for highly parallel generalized eigenvalue solvers but also establishes nearly matrix multiplication time as an upper bound on the complexity of inverse-free, exact-arithmetic matrix pencil diagonalization.

我们提出了一种随机的、无逆的算法，用于产生任何(n times n)矩阵笔（a， B）的近似对角化。该算法的大部分依赖于一个随机分治的特征求解器，用于解决最初由Ballard， Demmel和Dumitriu（技术报告2010）提出的广义特征值问题。我们证明，如果输入笔的行为足够好，这种分而征服的方法可以以高概率制定成功，这是通过推广Banks， Garza-Vargas， Kulkarni和Srivastava（《计算数学基础》2023）最近的伪谱粉碎工作来完成的。特别是，我们表明，扰动和缩放（A， B）使其伪谱正则化，允许分治法在一个简单的随机网格上运行，并反过来在向后误差意义上产生（A， B）的精确对角化。本文的主要结果表明存在一种随机算法，该算法以高概率（并以精确算法）产生可逆的S， T和对角D，使得(||A - SDT^{-1}||_2 le varepsilon )和(||B - ST^{-1}||_2 le varepsilon )最多在(O left( log ^2 left( frac{n}{varepsilon } right) T_{text {MM}}(n) right) )次运算中，其中(T_{text {MM}}(n))是矩阵乘法的渐近复杂度。这不仅为高度并行的广义特征值解提供了一组新的保证，而且建立了近似矩阵乘法时间作为无逆精确算术矩阵铅笔对角化复杂度的上界。

{"title":"Generalized Pseudospectral Shattering and Inverse-Free Matrix Pencil Diagonalization","authors":"James Demmel, Ioana Dumitriu, Ryan Schneider","doi":"10.1007/s10208-024-09682-7","DOIUrl":"https://doi.org/10.1007/s10208-024-09682-7","url":null,"abstract":"We present a randomized, inverse-free algorithm for producing an approximate diagonalization of any (n times n) matrix pencil (A, B). The bulk of the algorithm rests on a randomized divide-and-conquer eigensolver for the generalized eigenvalue problem originally proposed by Ballard, Demmel and Dumitriu (Technical Report 2010). We demonstrate that this divide-and-conquer approach can be formulated to succeed with high probability provided the input pencil is sufficiently well-behaved, which is accomplished by generalizing the recent pseudospectral shattering work of Banks, Garza-Vargas, Kulkarni and Srivastava (Foundations of Computational Mathematics 2023). In particular, we show that perturbing and scaling (A, B) regularizes its pseudospectra, allowing divide-and-conquer to run over a simple random grid and in turn producing an accurate diagonalization of (A, B) in the backward error sense. The main result of the paper states the existence of a randomized algorithm that with high probability (and in exact arithmetic) produces invertible S, T and diagonal D such that (||A - SDT^{-1}||_2 le varepsilon ) and (||B - ST^{-1}||_2 le varepsilon ) in at most (O left( log ^2 left( frac{n}{varepsilon } right) T_{text {MM}}(n) right) ) operations, where (T_{text {MM}}(n)) is the asymptotic complexity of matrix multiplication. This not only provides a new set of guarantees for highly parallel generalized eigenvalue solvers but also establishes nearly matrix multiplication time as an upper bound on the complexity of inverse-free, exact-arithmetic matrix pencil diagonalization.","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"47 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2024-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142796989","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Foundations of Computational Mathematics

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀