Pub Date : 2024-09-16DOI: 10.1007/s10208-024-09678-3
Shreya Arya, Arnab Auddy, Ranthony A. Clark, Sunhyuk Lim, Facundo Mémoli, Daniel Packer
The Gromov–Wasserstein distance—a generalization of the usual Wasserstein distance—permits comparing probability measures defined on possibly different metric spaces. Recently, this notion of distance has found several applications in Data Science and in Machine Learning. With the goal of aiding both the interpretability of dissimilarity measures computed through the Gromov–Wasserstein distance and the assessment of the approximation quality of computational techniques designed to estimate the Gromov–Wasserstein distance, we determine the precise value of a certain variant of the Gromov–Wasserstein distance between unit spheres of different dimensions. Indeed, we consider a two-parameter family ({d_{{{text {GW}}}p,q}}_{p,q=1}^{infty }) of Gromov–Wasserstein distances between metric measure spaces. By exploiting a suitable interaction between specific values of the parameters p and q and the metric of the underlying spaces, we are able to determine the exact value of the distance (d_{{{text {GW}}}4,2}) between all pairs of unit spheres of different dimensions endowed with their Euclidean distance and their uniform measure.
格罗莫夫-瓦瑟斯坦距离--通常的瓦瑟斯坦距离的广义化--允许比较定义在可能不同的度量空间上的概率度量。最近,这一距离概念在数据科学和机器学习中得到了广泛应用。为了帮助解释通过格罗莫夫-瓦瑟斯坦距离计算出的不相似度量,并评估旨在估算格罗莫夫-瓦瑟斯坦距离的计算技术的近似质量,我们确定了不同维度的单位球之间格罗莫夫-瓦瑟斯坦距离的某个变体的精确值。事实上,我们考虑的是度量空间之间的格罗莫夫-瓦瑟斯坦距离的双参数族(({d_{{text {GW}}}p,q}}_{p,q=1}^{infty } )。通过利用参数 p 和 q 的特定值与底层空间度量之间的相互作用,我们能够确定所有不同维度的单位球之间的距离 (d_{{text{GW}}4,2})的精确值。
{"title":"The Gromov–Wasserstein Distance Between Spheres","authors":"Shreya Arya, Arnab Auddy, Ranthony A. Clark, Sunhyuk Lim, Facundo Mémoli, Daniel Packer","doi":"10.1007/s10208-024-09678-3","DOIUrl":"https://doi.org/10.1007/s10208-024-09678-3","url":null,"abstract":"<p>The Gromov–Wasserstein distance—a generalization of the usual Wasserstein distance—permits comparing probability measures defined on possibly different metric spaces. Recently, this notion of distance has found several applications in Data Science and in Machine Learning. With the goal of aiding both the interpretability of dissimilarity measures computed through the Gromov–Wasserstein distance and the assessment of the approximation quality of computational techniques designed to estimate the Gromov–Wasserstein distance, we determine the precise value of a certain variant of the Gromov–Wasserstein distance between unit spheres of different dimensions. Indeed, we consider a two-parameter family <span>({d_{{{text {GW}}}p,q}}_{p,q=1}^{infty })</span> of Gromov–Wasserstein distances between metric measure spaces. By exploiting a suitable interaction between specific values of the parameters <i>p</i> and <i>q</i> and the metric of the underlying spaces, we are able to determine the exact value of the distance <span>(d_{{{text {GW}}}4,2})</span> between all pairs of unit spheres of different dimensions endowed with their Euclidean distance and their uniform measure.</p>","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"13 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142235260","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-09-16DOI: 10.1007/s10208-024-09677-4
T. Lelièvre, R. Santet, G. Stoltz
Hamiltonian Monte Carlo (HMC) is a Markov chain Monte Carlo method that allows to sample high dimensional probability measures. It relies on the integration of the Hamiltonian dynamics to propose a move which is then accepted or rejected thanks to a Metropolis procedure. Unbiased sampling is guaranteed by the preservation by the numerical integrators of two key properties of the Hamiltonian dynamics: volume-preservation and reversibility up to momentum reversal. For separable Hamiltonian functions, some standard explicit numerical schemes, such as the Störmer–Verlet integrator, satisfy these properties. However, for numerical or physical reasons, one may consider a Hamiltonian function which is nonseparable, in which case the standard numerical schemes which preserve the volume and satisfy reversibility up to momentum reversal are implicit. When implemented in practice, such implicit schemes may admit many solutions or none, especially when the timestep is too large. We show here how to enforce the numerical reversibility, and thus unbiasedness, of HMC schemes in this context by introducing a reversibility check. In addition, for some specific forms of the Hamiltonian function, we discuss the consistency of these HMC schemes with some Langevin dynamics, and show in particular that our algorithm yields an efficient discretization of the metropolized overdamped Langevin dynamics with position-dependent diffusion coefficients. Numerical results illustrate the relevance of the reversibility check on simple problems.
{"title":"Unbiasing Hamiltonian Monte Carlo Algorithms for a General Hamiltonian Function","authors":"T. Lelièvre, R. Santet, G. Stoltz","doi":"10.1007/s10208-024-09677-4","DOIUrl":"https://doi.org/10.1007/s10208-024-09677-4","url":null,"abstract":"<p>Hamiltonian Monte Carlo (HMC) is a Markov chain Monte Carlo method that allows to sample high dimensional probability measures. It relies on the integration of the Hamiltonian dynamics to propose a move which is then accepted or rejected thanks to a Metropolis procedure. Unbiased sampling is guaranteed by the preservation by the numerical integrators of two key properties of the Hamiltonian dynamics: volume-preservation and reversibility up to momentum reversal. For separable Hamiltonian functions, some standard explicit numerical schemes, such as the Störmer–Verlet integrator, satisfy these properties. However, for numerical or physical reasons, one may consider a Hamiltonian function which is nonseparable, in which case the standard numerical schemes which preserve the volume and satisfy reversibility up to momentum reversal are implicit. When implemented in practice, such implicit schemes may admit many solutions or none, especially when the timestep is too large. We show here how to enforce the numerical reversibility, and thus unbiasedness, of HMC schemes in this context by introducing a reversibility check. In addition, for some specific forms of the Hamiltonian function, we discuss the consistency of these HMC schemes with some Langevin dynamics, and show in particular that our algorithm yields an efficient discretization of the metropolized overdamped Langevin dynamics with position-dependent diffusion coefficients. Numerical results illustrate the relevance of the reversibility check on simple problems.</p>","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"185 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142235261","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-09-04DOI: 10.1007/s10208-024-09672-9
Magnus Bakke Botnan, Steffen Oppermann, Steve Oudot
In this paper, we introduce the signed barcode, a new visual representation of the global structure of the rank invariant of a multi-parameter persistence module or, more generally, of a poset representation. Like its unsigned counterpart in one-parameter persistence, the signed barcode decomposes the rank invariant as a ({mathbb {Z}})-linear combination of rank invariants of indicator modules supported on segments in the poset. We develop the theory behind these decompositions, both for the usual rank invariant and for its generalizations, showing under what conditions they exist and are unique. We also show that, like its unsigned counterpart, the signed barcode reflects in part the algebraic structure of the module: specifically, it derives from the terms in the minimal rank-exact resolution of the module, i.e., its minimal projective resolution relative to the class of short exact sequences on which the rank invariant is additive. To complete the picture, we show some experimental results that illustrate the contribution of the signed barcode in the exploration of multi-parameter persistence modules.
{"title":"Signed Barcodes for Multi-parameter Persistence via Rank Decompositions and Rank-Exact Resolutions","authors":"Magnus Bakke Botnan, Steffen Oppermann, Steve Oudot","doi":"10.1007/s10208-024-09672-9","DOIUrl":"https://doi.org/10.1007/s10208-024-09672-9","url":null,"abstract":"<p>In this paper, we introduce the signed barcode, a new visual representation of the global structure of the rank invariant of a multi-parameter persistence module or, more generally, of a poset representation. Like its unsigned counterpart in one-parameter persistence, the signed barcode decomposes the rank invariant as a <span>({mathbb {Z}})</span>-linear combination of rank invariants of indicator modules supported on segments in the poset. We develop the theory behind these decompositions, both for the usual rank invariant and for its generalizations, showing under what conditions they exist and are unique. We also show that, like its unsigned counterpart, the signed barcode reflects in part the algebraic structure of the module: specifically, it derives from the terms in the minimal rank-exact resolution of the module, i.e., its minimal projective resolution relative to the class of short exact sequences on which the rank invariant is additive. To complete the picture, we show some experimental results that illustrate the contribution of the signed barcode in the exploration of multi-parameter persistence modules.\u0000</p>","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"6 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142138385","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-26DOI: 10.1007/s10208-024-09675-6
Olaf Parczyk, Sebastian Pokutta, Christoph Spiegel, Tibor Szabó
We study two related problems concerning the number of homogeneous subsets of given size in graphs that go back to questions of Erdős. Most notably, we improve the upper bounds on the Ramsey multiplicity of (K_4) and (K_5) and settle the minimum number of independent sets of size 4 in graphs with clique number at most 4. Motivated by the elusiveness of the symmetric Ramsey multiplicity problem, we also introduce an off-diagonal variant and obtain tight results when counting monochromatic (K_4) or (K_5) in only one of the colors and triangles in the other. The extremal constructions for each problem turn out to be blow-ups of a graph of constant size and were found through search heuristics. They are complemented by lower bounds established using flag algebras, resulting in a fully computer-assisted approach. For some of our theorems we can also derive that the extremal construction is stable in a very strong sense. More broadly, these problems lead us to the study of the region of possible pairs of clique and independent set densities that can be realized as the limit of some sequence of graphs.
{"title":"New Ramsey Multiplicity Bounds and Search Heuristics","authors":"Olaf Parczyk, Sebastian Pokutta, Christoph Spiegel, Tibor Szabó","doi":"10.1007/s10208-024-09675-6","DOIUrl":"https://doi.org/10.1007/s10208-024-09675-6","url":null,"abstract":"<p>We study two related problems concerning the number of homogeneous subsets of given size in graphs that go back to questions of Erdős. Most notably, we improve the upper bounds on the Ramsey multiplicity of <span>(K_4)</span> and <span>(K_5)</span> and settle the minimum number of independent sets of size 4 in graphs with clique number at most 4. Motivated by the elusiveness of the symmetric Ramsey multiplicity problem, we also introduce an off-diagonal variant and obtain tight results when counting monochromatic <span>(K_4)</span> or <span>(K_5)</span> in only one of the colors and triangles in the other. The extremal constructions for each problem turn out to be blow-ups of a graph of constant size and were found through search heuristics. They are complemented by lower bounds established using flag algebras, resulting in a fully computer-assisted approach. For some of our theorems we can also derive that the extremal construction is stable in a very strong sense. More broadly, these problems lead us to the study of the region of possible pairs of clique and independent set densities that can be realized as the limit of some sequence of graphs.</p>","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"2 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2024-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142084945","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-23DOI: 10.1007/s10208-024-09679-2
Thomas Chaplin, Heather A. Harrington, Ulrike Tillmann
Weighted digraphs are used to model a variety of natural systems and can exhibit interesting structure across a range of scales. In order to understand and compare these systems, we require stable, interpretable, multiscale descriptors. To this end, we propose grounded persistent path homology (GrPPH)—a new, functorial, topological descriptor that describes the structure of an edge-weighted digraph via a persistence barcode. We show there is a choice of circuit basis for the graph which yields geometrically interpretable representatives for the features in the barcode. Moreover, we show the barcode is stable, in bottleneck distance, to both numerical and structural perturbations.
{"title":"Grounded Persistent Path Homology: A Stable, Topological Descriptor for Weighted Digraphs","authors":"Thomas Chaplin, Heather A. Harrington, Ulrike Tillmann","doi":"10.1007/s10208-024-09679-2","DOIUrl":"https://doi.org/10.1007/s10208-024-09679-2","url":null,"abstract":"<p>Weighted digraphs are used to model a variety of natural systems and can exhibit interesting structure across a range of scales. In order to understand and compare these systems, we require stable, interpretable, multiscale descriptors. To this end, we propose grounded persistent path homology (<span>GrPPH</span>)—a new, functorial, topological descriptor that describes the structure of an edge-weighted digraph via a persistence barcode. We show there is a choice of circuit basis for the graph which yields geometrically interpretable representatives for the features in the barcode. Moreover, we show the barcode is stable, in bottleneck distance, to both numerical and structural perturbations.</p>","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"88 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2024-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142045624","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-22DOI: 10.1007/s10208-024-09664-9
Raphaël Berthier, Andrea Montanari, Kangjie Zhou
Gradient-based learning in multi-layer neural networks displays a number of striking features. In particular, the decrease rate of empirical risk is non-monotone even after averaging over large batches. Long plateaus in which one observes barely any progress alternate with intervals of rapid decrease. These successive phases of learning often take place on very different time scales. Finally, models learnt in an early phase are typically ‘simpler’ or ‘easier to learn’ although in a way that is difficult to formalize. Although theoretical explanations of these phenomena have been put forward, each of them captures at best certain specific regimes. In this paper, we study the gradient flow dynamics of a wide two-layer neural network in high-dimension, when data are distributed according to a single-index model (i.e., the target function depends on a one-dimensional projection of the covariates). Based on a mixture of new rigorous results, non-rigorous mathematical derivations, and numerical simulations, we propose a scenario for the learning dynamics in this setting. In particular, the proposed evolution exhibits separation of timescales and intermittency. These behaviors arise naturally because the population gradient flow can be recast as a singularly perturbed dynamical system.
{"title":"Learning Time-Scales in Two-Layers Neural Networks","authors":"Raphaël Berthier, Andrea Montanari, Kangjie Zhou","doi":"10.1007/s10208-024-09664-9","DOIUrl":"https://doi.org/10.1007/s10208-024-09664-9","url":null,"abstract":"<p>Gradient-based learning in multi-layer neural networks displays a number of striking features. In particular, the decrease rate of empirical risk is non-monotone even after averaging over large batches. Long plateaus in which one observes barely any progress alternate with intervals of rapid decrease. These successive phases of learning often take place on very different time scales. Finally, models learnt in an early phase are typically ‘simpler’ or ‘easier to learn’ although in a way that is difficult to formalize. Although theoretical explanations of these phenomena have been put forward, each of them captures at best certain specific regimes. In this paper, we study the gradient flow dynamics of a wide two-layer neural network in high-dimension, when data are distributed according to a single-index model (i.e., the target function depends on a one-dimensional projection of the covariates). Based on a mixture of new rigorous results, non-rigorous mathematical derivations, and numerical simulations, we propose a scenario for the learning dynamics in this setting. In particular, the proposed evolution exhibits separation of timescales and intermittency. These behaviors arise naturally because the population gradient flow can be recast as a singularly perturbed dynamical system.</p>","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"17 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142042699","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-16DOI: 10.1007/s10208-024-09668-5
Adrien Laurent, Hans Munthe-Kaas
The exotic aromatic Butcher series were originally introduced for the calculation of order conditions for the high order numerical integration of ergodic stochastic differential equations in (mathbb {R} ^d) and on manifolds. We prove in this paper that exotic aromatic B-series satisfy a universal geometric property, namely that they are characterised by locality and equivariance with respect to orthogonal changes of coordinates. This characterisation confirms that exotic aromatic B-series are a fundamental geometric object that naturally generalises aromatic B-series and B-series, as they share similar equivariance properties. In addition, we provide a classification of the main subsets of the exotic aromatic B-series, in particular the exotic B-series, using different equivariance properties. Along the analysis, we present a generalised definition of exotic aromatic trees, dual vector fields, and we explore the impact of degeneracies on the classification.
奇异芳香布彻数列最初是为了计算 (mathbb {R} ^d)和流形上的遍历随机微分方程的高阶数值积分的阶次条件而引入的。我们在本文中证明了奇异芳香 B 系列满足一个普遍的几何性质,即它们具有关于坐标正交变化的局部性和等差性。这一特性证实了奇异芳香 B 系列是一个基本几何对象,它自然地概括了芳香 B 系列和 B 系列,因为它们具有相似的等差数列特性。此外,我们还利用不同的等差数列性质,对奇异芳香 B 系列,特别是奇异 B 系列的主要子集进行了分类。在分析过程中,我们提出了外来芳香树、对偶向量场的广义定义,并探讨了退化对分类的影响。
{"title":"The Universal Equivariance Properties of Exotic Aromatic B-Series","authors":"Adrien Laurent, Hans Munthe-Kaas","doi":"10.1007/s10208-024-09668-5","DOIUrl":"https://doi.org/10.1007/s10208-024-09668-5","url":null,"abstract":"<p>The exotic aromatic Butcher series were originally introduced for the calculation of order conditions for the high order numerical integration of ergodic stochastic differential equations in <span>(mathbb {R} ^d)</span> and on manifolds. We prove in this paper that exotic aromatic B-series satisfy a universal geometric property, namely that they are characterised by locality and equivariance with respect to orthogonal changes of coordinates. This characterisation confirms that exotic aromatic B-series are a fundamental geometric object that naturally generalises aromatic B-series and B-series, as they share similar equivariance properties. In addition, we provide a classification of the main subsets of the exotic aromatic B-series, in particular the exotic B-series, using different equivariance properties. Along the analysis, we present a generalised definition of exotic aromatic trees, dual vector fields, and we explore the impact of degeneracies on the classification.</p>","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"31 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2024-08-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141994403","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
We introduce a new class of numerical schemes which allow for low-regularity approximations to the expectation ( mathbb {E}(|u_{k}(t, v^{eta })|^2)), where (u_k) denotes the k-th Fourier coefficient of the solution u of the dispersive equation and ( v^{eta }(x) ) the associated random initial data. This quantity plays an important role in physics, in particular in the study of wave turbulence where one needs to adopt a statistical approach in order to obtain deep insight into the generic long-time behaviour of solutions to dispersive equations. Our new class of schemes is based on Wick’s theorem and Feynman diagrams together with a resonance-based discretisation (Bruned and Schratz in Forum Math Pi 10:E2, 2022) set in a more general context: we introduce a novel combinatorial structure called paired decorated forests which are two decorated trees whose decorations on the leaves come in pair. The character of the scheme draws its inspiration from the treatment of singular stochastic partial differential equations via regularity structures. In contrast to classical approaches, we do not discretise the PDE itself, but rather its expectation. This allows us to heavily exploit the optimal resonance structure and underlying gain in regularity on the finite dimensional (discrete) level.
我们引入了一类新的数值方案,它允许对期望值( mathbb {E}(|u_{k}(t, v^{eta })|^2)) 进行低规则性近似,其中(u_k) 表示分散方程解 u 的第 k 个傅里叶系数,( v^{eta }(x) ) 表示相关的随机初始数据。这个量在物理学中发挥着重要作用,尤其是在波湍流研究中,人们需要采用统计方法来深入了解分散方程解的一般长期行为。我们的新方案基于威克定理和费曼图,以及基于共振的离散化(Bruned 和 Schratz 在 Forum Math Pi 10:E2, 2022 上发表),其背景更为宽泛:我们引入了一种新颖的组合结构,称为配对装饰林,即两棵叶子上的装饰成对的装饰树。该方案的特点源自通过正则结构处理奇异随机偏微分方程。与经典方法不同的是,我们并不对 PDE 本身进行离散化,而是对其期望进行离散化。这样,我们就能在有限维(离散)水平上大量利用最优共振结构和正则性的潜在增益。
{"title":"Approximations of Dispersive PDEs in the Presence of Low-Regularity Randomness","authors":"Yvonne Alama Bronsard, Yvain Bruned, Katharina Schratz","doi":"10.1007/s10208-023-09625-8","DOIUrl":"https://doi.org/10.1007/s10208-023-09625-8","url":null,"abstract":"<p>We introduce a new class of numerical schemes which allow for low-regularity approximations to the expectation <span>( mathbb {E}(|u_{k}(t, v^{eta })|^2))</span>, where <span>(u_k)</span> denotes the <i>k</i>-th Fourier coefficient of the solution <i>u</i> of the dispersive equation and <span>( v^{eta }(x) )</span> the associated random initial data. This quantity plays an important role in physics, in particular in the study of wave turbulence where one needs to adopt a statistical approach in order to obtain deep insight into the <i>generic</i> long-time behaviour of solutions to dispersive equations. Our new class of schemes is based on Wick’s theorem and Feynman diagrams together with a resonance-based discretisation (Bruned and Schratz in Forum Math Pi 10:E2, 2022) set in a more general context: we introduce a novel combinatorial structure called paired decorated forests which are two decorated trees whose decorations on the leaves come in pair. The character of the scheme draws its inspiration from the treatment of singular stochastic partial differential equations via regularity structures. In contrast to classical approaches, we do not discretise the PDE itself, but rather its expectation. This allows us to heavily exploit the optimal resonance structure and underlying gain in regularity on the finite dimensional (discrete) level.</p>","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"16 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2024-08-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141992016","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-13DOI: 10.1007/s10208-024-09658-7
Jess Banks, Jorge Garza-Vargas, Nikhil Srivastava
Rapid convergence of the shifted QR algorithm on symmetric matrices was shown more than 50 years ago. Since then, despite significant interest and its practical relevance, an understanding of the dynamics and convergence properties of the shifted QR algorithm on nonsymmetric matrices has remained elusive. We introduce a new family of shifting strategies for the Hessenberg shifted QR algorithm. We prove that when the input is a diagonalizable Hessenberg matrix H of bounded eigenvector condition number(kappa _V(H))—defined as the minimum condition number of V over all diagonalizations (VDV^{-1}) of H—then the shifted QR algorithm with a certain strategy from our family is guaranteed to converge rapidly to a Hessenberg matrix with a zero subdiagonal entry, in exact arithmetic. Our convergence result is nonasymptotic, showing that the geometric mean of certain subdiagonal entries of H decays by a fixed constant in every QR iteration. The arithmetic cost of implementing each iteration of our strategy scales roughly logarithmically in the eigenvector condition number (kappa _V(H)), which is a measure of the nonnormality of H. The key ideas in the design and analysis of our strategy are: (1) we are able to precisely characterize when a certain shifting strategy based on Ritz values stagnates. We use this information to design certain “exceptional shifts” which are guaranteed to escape stagnation whenever it occurs. (2) We use higher degree shifts (of degree roughly (log kappa _V(H))) to dampen transient effects due to nonnormality, allowing us to treat nonnormal matrices in a manner similar to normal matrices.
移位 QR 算法在对称矩阵上的快速收敛性早在 50 多年前就已显示出来。从那时起,尽管人们对这一算法产生了浓厚的兴趣并认为它具有实际意义,但对非对称矩阵上的移位 QR 算法的动态和收敛特性的理解却始终难以捉摸。我们为海森堡移动 QR 算法引入了一个新的移动策略系列。我们证明,当输入是有界特征向量条件数 (kappa_V(H))的可对角化的海森堡矩阵 H 时--定义为 H 的所有对角化 (VDV^{-1})上 V 的最小条件数--那么使用我们族中的某种策略的移位 QR 算法就能保证以精确算术迅速收敛到子对角线项为零的海森堡矩阵。我们的收敛结果是非渐近的,表明 H 的某些对角线子项的几何平均数在每次 QR 迭代中都会以固定常数递减。实现我们策略的每次迭代的算术成本大致按特征向量条件数 (kappa _V(H)) 的对数缩放,这是 H 的非正态性的度量:(1)我们能够精确地描述某种基于里兹值的移动策略何时停滞。我们利用这一信息来设计某些 "特殊的移位",以保证在停滞发生时摆脱停滞。(2) 我们使用较高程度的移位(大致为 (log kappa _V(H)) 的程度)来抑制非正态性引起的瞬态效应,使我们能够以类似于正态矩阵的方式处理非正态性矩阵。
{"title":"Global Convergence of Hessenberg Shifted QR I: Exact Arithmetic","authors":"Jess Banks, Jorge Garza-Vargas, Nikhil Srivastava","doi":"10.1007/s10208-024-09658-7","DOIUrl":"https://doi.org/10.1007/s10208-024-09658-7","url":null,"abstract":"<p>Rapid convergence of the shifted QR algorithm on symmetric matrices was shown more than 50 years ago. Since then, despite significant interest and its practical relevance, an understanding of the dynamics and convergence properties of the shifted QR algorithm on nonsymmetric matrices has remained elusive. We introduce a new family of shifting strategies for the Hessenberg shifted QR algorithm. We prove that when the input is a diagonalizable Hessenberg matrix <i>H</i> of bounded <i>eigenvector condition number</i> <span>(kappa _V(H))</span>—defined as the minimum condition number of <i>V</i> over all diagonalizations <span>(VDV^{-1})</span> of <i>H</i>—then the shifted QR algorithm with a certain strategy from our family is guaranteed to converge rapidly to a Hessenberg matrix with a zero subdiagonal entry, in exact arithmetic. Our convergence result is nonasymptotic, showing that the geometric mean of certain subdiagonal entries of <i>H</i> decays by a fixed constant in every <i>QR</i> iteration. The arithmetic cost of implementing each iteration of our strategy scales roughly logarithmically in the eigenvector condition number <span>(kappa _V(H))</span>, which is a measure of the nonnormality of <i>H</i>. The key ideas in the design and analysis of our strategy are: (1) we are able to precisely characterize when a certain shifting strategy based on Ritz values stagnates. We use this information to design certain “exceptional shifts” which are guaranteed to escape stagnation whenever it occurs. (2) We use higher degree shifts (of degree roughly <span>(log kappa _V(H))</span>) to dampen transient effects due to nonnormality, allowing us to treat nonnormal matrices in a manner similar to normal matrices.</p>","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"47 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2024-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141974002","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-08-06DOI: 10.1007/s10208-024-09666-7
Eduard Feireisl, Mária Lukáčová-Medvid’ová, Bangwei She, Yuhuan Yuan
We consider the Navier–Stokes–Fourier system governing the motion of a general compressible, heat conducting, Newtonian fluid driven by random initial/boundary data. Convergence of the stochastic collocation and Monte Carlo numerical methods is shown under the hypothesis that approximate solutions are bounded in probability. Abstract results are illustrated by numerical experiments for the Rayleigh–Bénard convection problem.
{"title":"Convergence of Numerical Methods for the Navier–Stokes–Fourier System Driven by Uncertain Initial/Boundary Data","authors":"Eduard Feireisl, Mária Lukáčová-Medvid’ová, Bangwei She, Yuhuan Yuan","doi":"10.1007/s10208-024-09666-7","DOIUrl":"https://doi.org/10.1007/s10208-024-09666-7","url":null,"abstract":"<p>We consider the Navier–Stokes–Fourier system governing the motion of a general compressible, heat conducting, Newtonian fluid driven by random initial/boundary data. Convergence of the stochastic collocation and Monte Carlo numerical methods is shown under the hypothesis that approximate solutions are bounded in probability. Abstract results are illustrated by numerical experiments for the Rayleigh–Bénard convection problem.\u0000</p>","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"156 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2024-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141899473","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}