首页 > 最新文献

Foundations of Computational Mathematics最新文献

英文 中文
Learning Time-Scales in Two-Layers Neural Networks 两层神经网络的学习时标
IF 3 1区 数学 Q2 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2024-08-22 DOI: 10.1007/s10208-024-09664-9
Raphaël Berthier, Andrea Montanari, Kangjie Zhou

Gradient-based learning in multi-layer neural networks displays a number of striking features. In particular, the decrease rate of empirical risk is non-monotone even after averaging over large batches. Long plateaus in which one observes barely any progress alternate with intervals of rapid decrease. These successive phases of learning often take place on very different time scales. Finally, models learnt in an early phase are typically ‘simpler’ or ‘easier to learn’ although in a way that is difficult to formalize. Although theoretical explanations of these phenomena have been put forward, each of them captures at best certain specific regimes. In this paper, we study the gradient flow dynamics of a wide two-layer neural network in high-dimension, when data are distributed according to a single-index model (i.e., the target function depends on a one-dimensional projection of the covariates). Based on a mixture of new rigorous results, non-rigorous mathematical derivations, and numerical simulations, we propose a scenario for the learning dynamics in this setting. In particular, the proposed evolution exhibits separation of timescales and intermittency. These behaviors arise naturally because the population gradient flow can be recast as a singularly perturbed dynamical system.

多层神经网络中基于梯度的学习显示出许多显著特点。特别是,经验风险的下降率即使在对大量批次进行平均后也是非单调的。在漫长的高原期与快速下降期交替出现,在高原期几乎看不到任何进展。这些连续的学习阶段往往发生在非常不同的时间尺度上。最后,在早期阶段学习的模型通常 "更简单 "或 "更容易学习",尽管这种学习方式很难正规化。尽管对这些现象已经提出了理论解释,但每种解释最多只能捕捉到某些特定的机制。在本文中,我们研究了当数据按照单指数模型分布时(即目标函数取决于协变量的一维投影),宽双层神经网络在高维条件下的梯度流动态。基于新的严格结果、非严格数学推导和数值模拟,我们提出了在这种情况下的学习动力学方案。特别是,所提出的进化表现出时标分离和间歇性。由于种群梯度流可以重塑为奇异扰动动态系统,这些行为自然而然地产生了。
{"title":"Learning Time-Scales in Two-Layers Neural Networks","authors":"Raphaël Berthier, Andrea Montanari, Kangjie Zhou","doi":"10.1007/s10208-024-09664-9","DOIUrl":"https://doi.org/10.1007/s10208-024-09664-9","url":null,"abstract":"<p>Gradient-based learning in multi-layer neural networks displays a number of striking features. In particular, the decrease rate of empirical risk is non-monotone even after averaging over large batches. Long plateaus in which one observes barely any progress alternate with intervals of rapid decrease. These successive phases of learning often take place on very different time scales. Finally, models learnt in an early phase are typically ‘simpler’ or ‘easier to learn’ although in a way that is difficult to formalize. Although theoretical explanations of these phenomena have been put forward, each of them captures at best certain specific regimes. In this paper, we study the gradient flow dynamics of a wide two-layer neural network in high-dimension, when data are distributed according to a single-index model (i.e., the target function depends on a one-dimensional projection of the covariates). Based on a mixture of new rigorous results, non-rigorous mathematical derivations, and numerical simulations, we propose a scenario for the learning dynamics in this setting. In particular, the proposed evolution exhibits separation of timescales and intermittency. These behaviors arise naturally because the population gradient flow can be recast as a singularly perturbed dynamical system.</p>","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"17 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142042699","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Universal Equivariance Properties of Exotic Aromatic B-Series 奇异芳香 B 系列的通用等差数列特性
IF 3 1区 数学 Q2 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2024-08-16 DOI: 10.1007/s10208-024-09668-5
Adrien Laurent, Hans Munthe-Kaas

The exotic aromatic Butcher series were originally introduced for the calculation of order conditions for the high order numerical integration of ergodic stochastic differential equations in (mathbb {R} ^d) and on manifolds. We prove in this paper that exotic aromatic B-series satisfy a universal geometric property, namely that they are characterised by locality and equivariance with respect to orthogonal changes of coordinates. This characterisation confirms that exotic aromatic B-series are a fundamental geometric object that naturally generalises aromatic B-series and B-series, as they share similar equivariance properties. In addition, we provide a classification of the main subsets of the exotic aromatic B-series, in particular the exotic B-series, using different equivariance properties. Along the analysis, we present a generalised definition of exotic aromatic trees, dual vector fields, and we explore the impact of degeneracies on the classification.

奇异芳香布彻数列最初是为了计算 (mathbb {R} ^d)和流形上的遍历随机微分方程的高阶数值积分的阶次条件而引入的。我们在本文中证明了奇异芳香 B 系列满足一个普遍的几何性质,即它们具有关于坐标正交变化的局部性和等差性。这一特性证实了奇异芳香 B 系列是一个基本几何对象,它自然地概括了芳香 B 系列和 B 系列,因为它们具有相似的等差数列特性。此外,我们还利用不同的等差数列性质,对奇异芳香 B 系列,特别是奇异 B 系列的主要子集进行了分类。在分析过程中,我们提出了外来芳香树、对偶向量场的广义定义,并探讨了退化对分类的影响。
{"title":"The Universal Equivariance Properties of Exotic Aromatic B-Series","authors":"Adrien Laurent, Hans Munthe-Kaas","doi":"10.1007/s10208-024-09668-5","DOIUrl":"https://doi.org/10.1007/s10208-024-09668-5","url":null,"abstract":"<p>The exotic aromatic Butcher series were originally introduced for the calculation of order conditions for the high order numerical integration of ergodic stochastic differential equations in <span>(mathbb {R} ^d)</span> and on manifolds. We prove in this paper that exotic aromatic B-series satisfy a universal geometric property, namely that they are characterised by locality and equivariance with respect to orthogonal changes of coordinates. This characterisation confirms that exotic aromatic B-series are a fundamental geometric object that naturally generalises aromatic B-series and B-series, as they share similar equivariance properties. In addition, we provide a classification of the main subsets of the exotic aromatic B-series, in particular the exotic B-series, using different equivariance properties. Along the analysis, we present a generalised definition of exotic aromatic trees, dual vector fields, and we explore the impact of degeneracies on the classification.</p>","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"31 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2024-08-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141994403","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Approximations of Dispersive PDEs in the Presence of Low-Regularity Randomness 存在低随机性的分散性多变量方程的近似值
IF 3 1区 数学 Q2 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2024-08-15 DOI: 10.1007/s10208-023-09625-8
Yvonne Alama Bronsard, Yvain Bruned, Katharina Schratz

We introduce a new class of numerical schemes which allow for low-regularity approximations to the expectation ( mathbb {E}(|u_{k}(t, v^{eta })|^2)), where (u_k) denotes the k-th Fourier coefficient of the solution u of the dispersive equation and ( v^{eta }(x) ) the associated random initial data. This quantity plays an important role in physics, in particular in the study of wave turbulence where one needs to adopt a statistical approach in order to obtain deep insight into the generic long-time behaviour of solutions to dispersive equations. Our new class of schemes is based on Wick’s theorem and Feynman diagrams together with a resonance-based discretisation (Bruned and Schratz in Forum Math Pi 10:E2, 2022) set in a more general context: we introduce a novel combinatorial structure called paired decorated forests which are two decorated trees whose decorations on the leaves come in pair. The character of the scheme draws its inspiration from the treatment of singular stochastic partial differential equations via regularity structures. In contrast to classical approaches, we do not discretise the PDE itself, but rather its expectation. This allows us to heavily exploit the optimal resonance structure and underlying gain in regularity on the finite dimensional (discrete) level.

我们引入了一类新的数值方案,它允许对期望值( mathbb {E}(|u_{k}(t, v^{eta })|^2)) 进行低规则性近似,其中(u_k) 表示分散方程解 u 的第 k 个傅里叶系数,( v^{eta }(x) ) 表示相关的随机初始数据。这个量在物理学中发挥着重要作用,尤其是在波湍流研究中,人们需要采用统计方法来深入了解分散方程解的一般长期行为。我们的新方案基于威克定理和费曼图,以及基于共振的离散化(Bruned 和 Schratz 在 Forum Math Pi 10:E2, 2022 上发表),其背景更为宽泛:我们引入了一种新颖的组合结构,称为配对装饰林,即两棵叶子上的装饰成对的装饰树。该方案的特点源自通过正则结构处理奇异随机偏微分方程。与经典方法不同的是,我们并不对 PDE 本身进行离散化,而是对其期望进行离散化。这样,我们就能在有限维(离散)水平上大量利用最优共振结构和正则性的潜在增益。
{"title":"Approximations of Dispersive PDEs in the Presence of Low-Regularity Randomness","authors":"Yvonne Alama Bronsard, Yvain Bruned, Katharina Schratz","doi":"10.1007/s10208-023-09625-8","DOIUrl":"https://doi.org/10.1007/s10208-023-09625-8","url":null,"abstract":"<p>We introduce a new class of numerical schemes which allow for low-regularity approximations to the expectation <span>( mathbb {E}(|u_{k}(t, v^{eta })|^2))</span>, where <span>(u_k)</span> denotes the <i>k</i>-th Fourier coefficient of the solution <i>u</i> of the dispersive equation and <span>( v^{eta }(x) )</span> the associated random initial data. This quantity plays an important role in physics, in particular in the study of wave turbulence where one needs to adopt a statistical approach in order to obtain deep insight into the <i>generic</i> long-time behaviour of solutions to dispersive equations. Our new class of schemes is based on Wick’s theorem and Feynman diagrams together with a resonance-based discretisation (Bruned and Schratz in Forum Math Pi 10:E2, 2022) set in a more general context: we introduce a novel combinatorial structure called paired decorated forests which are two decorated trees whose decorations on the leaves come in pair. The character of the scheme draws its inspiration from the treatment of singular stochastic partial differential equations via regularity structures. In contrast to classical approaches, we do not discretise the PDE itself, but rather its expectation. This allows us to heavily exploit the optimal resonance structure and underlying gain in regularity on the finite dimensional (discrete) level.</p>","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"16 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2024-08-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141992016","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Global Convergence of Hessenberg Shifted QR I: Exact Arithmetic 海森堡偏移 QR 的全局收敛 I:精确算术
IF 3 1区 数学 Q2 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2024-08-13 DOI: 10.1007/s10208-024-09658-7
Jess Banks, Jorge Garza-Vargas, Nikhil Srivastava

Rapid convergence of the shifted QR algorithm on symmetric matrices was shown more than 50 years ago. Since then, despite significant interest and its practical relevance, an understanding of the dynamics and convergence properties of the shifted QR algorithm on nonsymmetric matrices has remained elusive. We introduce a new family of shifting strategies for the Hessenberg shifted QR algorithm. We prove that when the input is a diagonalizable Hessenberg matrix H of bounded eigenvector condition number (kappa _V(H))—defined as the minimum condition number of V over all diagonalizations (VDV^{-1}) of H—then the shifted QR algorithm with a certain strategy from our family is guaranteed to converge rapidly to a Hessenberg matrix with a zero subdiagonal entry, in exact arithmetic. Our convergence result is nonasymptotic, showing that the geometric mean of certain subdiagonal entries of H decays by a fixed constant in every QR iteration. The arithmetic cost of implementing each iteration of our strategy scales roughly logarithmically in the eigenvector condition number (kappa _V(H)), which is a measure of the nonnormality of H. The key ideas in the design and analysis of our strategy are: (1) we are able to precisely characterize when a certain shifting strategy based on Ritz values stagnates. We use this information to design certain “exceptional shifts” which are guaranteed to escape stagnation whenever it occurs. (2) We use higher degree shifts (of degree roughly (log kappa _V(H))) to dampen transient effects due to nonnormality, allowing us to treat nonnormal matrices in a manner similar to normal matrices.

移位 QR 算法在对称矩阵上的快速收敛性早在 50 多年前就已显示出来。从那时起,尽管人们对这一算法产生了浓厚的兴趣并认为它具有实际意义,但对非对称矩阵上的移位 QR 算法的动态和收敛特性的理解却始终难以捉摸。我们为海森堡移动 QR 算法引入了一个新的移动策略系列。我们证明,当输入是有界特征向量条件数 (kappa_V(H))的可对角化的海森堡矩阵 H 时--定义为 H 的所有对角化 (VDV^{-1})上 V 的最小条件数--那么使用我们族中的某种策略的移位 QR 算法就能保证以精确算术迅速收敛到子对角线项为零的海森堡矩阵。我们的收敛结果是非渐近的,表明 H 的某些对角线子项的几何平均数在每次 QR 迭代中都会以固定常数递减。实现我们策略的每次迭代的算术成本大致按特征向量条件数 (kappa _V(H)) 的对数缩放,这是 H 的非正态性的度量:(1)我们能够精确地描述某种基于里兹值的移动策略何时停滞。我们利用这一信息来设计某些 "特殊的移位",以保证在停滞发生时摆脱停滞。(2) 我们使用较高程度的移位(大致为 (log kappa _V(H)) 的程度)来抑制非正态性引起的瞬态效应,使我们能够以类似于正态矩阵的方式处理非正态性矩阵。
{"title":"Global Convergence of Hessenberg Shifted QR I: Exact Arithmetic","authors":"Jess Banks, Jorge Garza-Vargas, Nikhil Srivastava","doi":"10.1007/s10208-024-09658-7","DOIUrl":"https://doi.org/10.1007/s10208-024-09658-7","url":null,"abstract":"<p>Rapid convergence of the shifted QR algorithm on symmetric matrices was shown more than 50 years ago. Since then, despite significant interest and its practical relevance, an understanding of the dynamics and convergence properties of the shifted QR algorithm on nonsymmetric matrices has remained elusive. We introduce a new family of shifting strategies for the Hessenberg shifted QR algorithm. We prove that when the input is a diagonalizable Hessenberg matrix <i>H</i> of bounded <i>eigenvector condition number</i> <span>(kappa _V(H))</span>—defined as the minimum condition number of <i>V</i> over all diagonalizations <span>(VDV^{-1})</span> of <i>H</i>—then the shifted QR algorithm with a certain strategy from our family is guaranteed to converge rapidly to a Hessenberg matrix with a zero subdiagonal entry, in exact arithmetic. Our convergence result is nonasymptotic, showing that the geometric mean of certain subdiagonal entries of <i>H</i> decays by a fixed constant in every <i>QR</i> iteration. The arithmetic cost of implementing each iteration of our strategy scales roughly logarithmically in the eigenvector condition number <span>(kappa _V(H))</span>, which is a measure of the nonnormality of <i>H</i>. The key ideas in the design and analysis of our strategy are: (1) we are able to precisely characterize when a certain shifting strategy based on Ritz values stagnates. We use this information to design certain “exceptional shifts” which are guaranteed to escape stagnation whenever it occurs. (2) We use higher degree shifts (of degree roughly <span>(log kappa _V(H))</span>) to dampen transient effects due to nonnormality, allowing us to treat nonnormal matrices in a manner similar to normal matrices.</p>","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"47 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2024-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141974002","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Convergence of Numerical Methods for the Navier–Stokes–Fourier System Driven by Uncertain Initial/Boundary Data 不确定初始/边界数据驱动的纳维-斯托克斯-傅里叶系统数值方法的收敛性
IF 3 1区 数学 Q2 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2024-08-06 DOI: 10.1007/s10208-024-09666-7
Eduard Feireisl, Mária Lukáčová-Medvid’ová, Bangwei She, Yuhuan Yuan

We consider the Navier–Stokes–Fourier system governing the motion of a general compressible, heat conducting, Newtonian fluid driven by random initial/boundary data. Convergence of the stochastic collocation and Monte Carlo numerical methods is shown under the hypothesis that approximate solutions are bounded in probability. Abstract results are illustrated by numerical experiments for the Rayleigh–Bénard convection problem.

我们考虑了支配由随机初始/边界数据驱动的一般可压缩、导热、牛顿流体运动的纳维-斯托克斯-傅里叶系统。在近似解在概率上是有界的假设条件下,证明了随机配位和蒙特卡罗数值方法的收敛性。雷利-贝纳德对流问题的数值实验说明了抽象结果。
{"title":"Convergence of Numerical Methods for the Navier–Stokes–Fourier System Driven by Uncertain Initial/Boundary Data","authors":"Eduard Feireisl, Mária Lukáčová-Medvid’ová, Bangwei She, Yuhuan Yuan","doi":"10.1007/s10208-024-09666-7","DOIUrl":"https://doi.org/10.1007/s10208-024-09666-7","url":null,"abstract":"<p>We consider the Navier–Stokes–Fourier system governing the motion of a general compressible, heat conducting, Newtonian fluid driven by random initial/boundary data. Convergence of the stochastic collocation and Monte Carlo numerical methods is shown under the hypothesis that approximate solutions are bounded in probability. Abstract results are illustrated by numerical experiments for the Rayleigh–Bénard convection problem.\u0000</p>","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"156 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2024-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141899473","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Polynomial and Rational Measure Modifications of Orthogonal Polynomials via Infinite-Dimensional Banded Matrix Factorizations 通过无限维带状矩阵因式分解对正交多项式进行多项式和有理测度修正
IF 3 1区 数学 Q2 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2024-08-05 DOI: 10.1007/s10208-024-09671-w
Timon S. Gutleb, Sheehan Olver, Richard Mikaël Slevinsky

We describe fast algorithms for approximating the connection coefficients between a family of orthogonal polynomials and another family with a polynomially or rationally modified measure. The connection coefficients are computed via infinite-dimensional banded matrix factorizations and may be used to compute the modified Jacobi matrices all in linear complexity with respect to the truncation degree. A family of orthogonal polynomials with modified classical weights is constructed that support banded differentiation matrices, enabling sparse spectral methods with modified classical orthogonal polynomials. We present several applications and numerical experiments using an open source implementation which make direct use of these results.

我们描述了近似正交多项式族与另一个具有多项式或合理修正度量的族之间的连接系数的快速算法。连接系数通过无穷维带状矩阵因式分解计算,并可用于计算修正雅可比矩阵,其复杂度与截断度呈线性关系。我们构建了一个具有修正经典权重的正交多项式族,它支持带状微分矩阵,从而实现了使用修正经典正交多项式的稀疏谱方法。我们介绍了直接利用这些结果的几个应用和使用开源实现的数值实验。
{"title":"Polynomial and Rational Measure Modifications of Orthogonal Polynomials via Infinite-Dimensional Banded Matrix Factorizations","authors":"Timon S. Gutleb, Sheehan Olver, Richard Mikaël Slevinsky","doi":"10.1007/s10208-024-09671-w","DOIUrl":"https://doi.org/10.1007/s10208-024-09671-w","url":null,"abstract":"<p>We describe fast algorithms for approximating the connection coefficients between a family of orthogonal polynomials and another family with a polynomially or rationally modified measure. The connection coefficients are computed via infinite-dimensional banded matrix factorizations and may be used to compute the modified Jacobi matrices all in linear complexity with respect to the truncation degree. A family of orthogonal polynomials with modified classical weights is constructed that support banded differentiation matrices, enabling sparse spectral methods with modified classical orthogonal polynomials. We present several applications and numerical experiments using an open source implementation which make direct use of these results.</p>","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"23 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2024-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141895228","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Stable Liftings of Polynomial Traces on Tetrahedra 多项式轨迹在四面体上的稳定提升
IF 3 1区 数学 Q2 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2024-07-29 DOI: 10.1007/s10208-024-09670-x
Charles Parker, Endre Süli

On the reference tetrahedron (K), we construct, for each (k in {mathbb {N}}_0), a right inverse for the trace operator (u mapsto (u, partial _{textbf{n}} u, ldots , partial _{textbf{n}}^k u)|_{partial K}). The operator is stable as a mapping from the trace space of (W^{s, p}(K)) to (W^{s, p}(K)) for all (p in (1, infty )) and (s in (k+1/p, infty )). Moreover, if the data is the trace of a polynomial of degree (N in {mathbb {N}}_0), then the resulting lifting is a polynomial of degree N. One consequence of the analysis is a novel characterization for the range of the trace operator.

在参考四面体 (K)上,我们为每个 (k in {mathbb {N}}_0) 构造了迹算子 (u mapsto (u, partial _{textbf{n}} u, ldots , partial _{textbf{n}}^k u)|_{partial K}) 的右逆。对于所有的 (pin (1, infty )) 和 (sin (k+1/p, infty )) 来说,这个算子作为从 (W^{s, p}(K))的迹空间到 (W^{s, p}(K))的映射是稳定的。此外,如果数据是度数为 (N in {mathbb {N}}_0) 的多项式的迹,那么得到的提升就是度数为 N 的多项式。
{"title":"Stable Liftings of Polynomial Traces on Tetrahedra","authors":"Charles Parker, Endre Süli","doi":"10.1007/s10208-024-09670-x","DOIUrl":"https://doi.org/10.1007/s10208-024-09670-x","url":null,"abstract":"<p>On the reference tetrahedron <span>(K)</span>, we construct, for each <span>(k in {mathbb {N}}_0)</span>, a right inverse for the trace operator <span>(u mapsto (u, partial _{textbf{n}} u, ldots , partial _{textbf{n}}^k u)|_{partial K})</span>. The operator is stable as a mapping from the trace space of <span>(W^{s, p}(K))</span> to <span>(W^{s, p}(K))</span> for all <span>(p in (1, infty ))</span> and <span>(s in (k+1/p, infty ))</span>. Moreover, if the data is the trace of a polynomial of degree <span>(N in {mathbb {N}}_0)</span>, then the resulting lifting is a polynomial of degree <i>N</i>. One consequence of the analysis is a novel characterization for the range of the trace operator.\u0000</p>","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"19 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2024-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141836767","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Analysis of Langevin Monte Carlo from Poincaré to Log-Sobolev 从 Poincaré 到 Log-Sobolev 的 Langevin 蒙特卡洛分析
IF 3 1区 数学 Q2 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2024-07-26 DOI: 10.1007/s10208-024-09667-6
Sinho Chewi, Murat A. Erdogdu, Mufan Li, Ruoqi Shen, Matthew S. Zhang

Classically, the continuous-time Langevin diffusion converges exponentially fast to its stationary distribution (pi ) under the sole assumption that (pi ) satisfies a Poincaré inequality. Using this fact to provide guarantees for the discrete-time Langevin Monte Carlo (LMC) algorithm, however, is considerably more challenging due to the need for working with chi-squared or Rényi divergences, and prior works have largely focused on strongly log-concave targets. In this work, we provide the first convergence guarantees for LMC assuming that (pi ) satisfies either a Latała–Oleszkiewicz or modified log-Sobolev inequality, which interpolates between the Poincaré and log-Sobolev settings. Unlike prior works, our results allow for weak smoothness and do not require convexity or dissipativity conditions.

从经典上讲,在 (pi ) 满足Poincaré不等式的唯一假设下,连续时间朗之文扩散以指数级速度收敛到其静态分布 (pi )。然而,利用这一事实为离散时间朗之文蒙特卡洛(LMC)算法提供保证要困难得多,因为需要处理秩方或雷尼发散,而且之前的工作主要集中在强对数凹目标上。在这项工作中,我们首次为 LMC 提供了收敛性保证,假设 (pi ) 满足拉塔瓦-奥列兹凯维奇不等式或修正的 log-Sobolev 不等式,它们在 Poincaré 和 log-Sobolev 设置之间进行插值。与之前的研究不同,我们的结果允许弱平稳性,并且不需要凸性或消散性条件。
{"title":"Analysis of Langevin Monte Carlo from Poincaré to Log-Sobolev","authors":"Sinho Chewi, Murat A. Erdogdu, Mufan Li, Ruoqi Shen, Matthew S. Zhang","doi":"10.1007/s10208-024-09667-6","DOIUrl":"https://doi.org/10.1007/s10208-024-09667-6","url":null,"abstract":"<p>Classically, the continuous-time Langevin diffusion converges exponentially fast to its stationary distribution <span>(pi )</span> under the sole assumption that <span>(pi )</span> satisfies a Poincaré inequality. Using this fact to provide guarantees for the discrete-time Langevin Monte Carlo (LMC) algorithm, however, is considerably more challenging due to the need for working with chi-squared or Rényi divergences, and prior works have largely focused on strongly log-concave targets. In this work, we provide the first convergence guarantees for LMC assuming that <span>(pi )</span> satisfies either a Latała–Oleszkiewicz or modified log-Sobolev inequality, which interpolates between the Poincaré and log-Sobolev settings. Unlike prior works, our results allow for weak smoothness and do not require convexity or dissipativity conditions.\u0000</p>","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"57 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2024-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141768486","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Resonances as a Computational Tool 作为计算工具的共振
IF 3 1区 数学 Q2 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2024-07-26 DOI: 10.1007/s10208-024-09665-8
Frédéric Rousset, Katharina Schratz

A large toolbox of numerical schemes for dispersive equations has been established, based on different discretization techniques such as discretizing the variation-of-constants formula (e.g., exponential integrators) or splitting the full equation into a series of simpler subproblems (e.g., splitting methods). In many situations these classical schemes allow a precise and efficient approximation. This, however, drastically changes whenever non-smooth phenomena enter the scene such as for problems at low regularity and high oscillations. Classical schemes fail to capture the oscillatory nature of the solution, and this may lead to severe instabilities and loss of convergence. In this article we review a new class of resonance-based schemes. The key idea in the construction of the new schemes is to tackle and deeply embed the underlying nonlinear structure of resonances into the numerical discretization. As in the continuous case, these terms are central to structure preservation and offer the new schemes strong properties at low regularity.

基于不同的离散化技术,如将常数变异公式离散化(如指数积分器)或将完整方程拆分为一系列更简单的子问题(如拆分方法),已经建立了一个庞大的分散方程数值方案工具箱。在许多情况下,这些经典方案可以实现精确而高效的近似。然而,一旦出现非光滑现象,例如低规律性和高振荡问题,情况就会发生巨大变化。经典方案无法捕捉解的振荡性质,这可能会导致严重的不稳定性和收敛性损失。在本文中,我们回顾了一类基于共振的新方案。构建新方案的关键思路是解决共振的基本非线性结构,并将其深入嵌入数值离散化中。正如在连续情况下一样,这些项是结构保持的核心,并为新方案提供了低正则性的强大特性。
{"title":"Resonances as a Computational Tool","authors":"Frédéric Rousset, Katharina Schratz","doi":"10.1007/s10208-024-09665-8","DOIUrl":"https://doi.org/10.1007/s10208-024-09665-8","url":null,"abstract":"<p>A large toolbox of numerical schemes for dispersive equations has been established, based on different discretization techniques such as discretizing the variation-of-constants formula (e.g., exponential integrators) or splitting the full equation into a series of simpler subproblems (e.g., splitting methods). In many situations these classical schemes allow a precise and efficient approximation. This, however, drastically changes whenever non-smooth phenomena enter the scene such as for problems at low regularity and high oscillations. Classical schemes fail to capture the oscillatory nature of the solution, and this may lead to severe instabilities and loss of convergence. In this article we review a new class of resonance-based schemes. The key idea in the construction of the new schemes is to tackle and deeply embed the underlying nonlinear structure of resonances into the numerical discretization. As in the continuous case, these terms are central to structure preservation and offer the new schemes strong properties at low regularity.</p>","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"32 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2024-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141768455","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Polynomial Time Iterative Algorithm for Matching Gaussian Matrices with Non-vanishing Correlation 匹配非相关性高斯矩阵的多项式时间迭代算法
IF 3 1区 数学 Q2 COMPUTER SCIENCE, THEORY & METHODS Pub Date : 2024-07-22 DOI: 10.1007/s10208-024-09662-x
Jian Ding, Zhangsong Li

Motivated by the problem of matching vertices in two correlated Erdős-Rényi graphs, we study the problem of matching two correlated Gaussian Wigner matrices. We propose an iterative matching algorithm, which succeeds in polynomial time as long as the correlation between the two Gaussian matrices does not vanish. Our result is the first polynomial time algorithm that solves a graph matching type of problem when the correlation is an arbitrarily small constant.

受两个相关厄尔多斯-雷尼图中顶点匹配问题的启发,我们研究了两个相关高斯维格纳矩阵的匹配问题。我们提出了一种迭代匹配算法,只要两个高斯矩阵之间的相关性不消失,该算法就能在多项式时间内取得成功。我们的成果是第一个在相关性为任意小常数时解决图匹配类型问题的多项式时间算法。
{"title":"A Polynomial Time Iterative Algorithm for Matching Gaussian Matrices with Non-vanishing Correlation","authors":"Jian Ding, Zhangsong Li","doi":"10.1007/s10208-024-09662-x","DOIUrl":"https://doi.org/10.1007/s10208-024-09662-x","url":null,"abstract":"<p>Motivated by the problem of matching vertices in two correlated Erdős-Rényi graphs, we study the problem of matching two correlated Gaussian Wigner matrices. We propose an iterative matching algorithm, which succeeds in polynomial time as long as the correlation between the two Gaussian matrices does not vanish. Our result is the first polynomial time algorithm that solves a graph matching type of problem when the correlation is an arbitrarily small constant.\u0000</p>","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"60 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2024-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141737076","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Foundations of Computational Mathematics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1