Pub Date : 2025-04-24DOI: 10.1007/s10208-025-09710-0
Shaoshi Chen, Lixin Du, Hanqian Fang
Symbolic summation as an active research topic of symbolic computation provides efficient algorithmic tools for evaluating and simplifying different types of sums arising from mathematics, computer science, physics and other areas. Most of existing algorithms in symbolic summation are mainly applicable to the problem with univariate inputs. A long-term project in symbolic computation is to develop theories, algorithms and software for the symbolic summation of multivariate functions. This paper will give complete solutions to two challenging problems in symbolic summation of multivariate rational functions, namely the rational summability problem and the existence problem of telescopers for multivariate rational functions. Our approach is based on the structure of Sato’s isotropy groups of polynomials, which enables us to reduce the problems to testing the shift equivalence of polynomials. Our results provide a complete solution to the discrete analogue of Picard’s problem on differential forms and can be used to detect the applicability of the Wilf-Zeilberger method to multivariate rational functions.
{"title":"Symbolic Summation of Multivariate Rational Functions","authors":"Shaoshi Chen, Lixin Du, Hanqian Fang","doi":"10.1007/s10208-025-09710-0","DOIUrl":"https://doi.org/10.1007/s10208-025-09710-0","url":null,"abstract":"<p>Symbolic summation as an active research topic of symbolic computation provides efficient algorithmic tools for evaluating and simplifying different types of sums arising from mathematics, computer science, physics and other areas. Most of existing algorithms in symbolic summation are mainly applicable to the problem with univariate inputs. A long-term project in symbolic computation is to develop theories, algorithms and software for the symbolic summation of multivariate functions. This paper will give complete solutions to two challenging problems in symbolic summation of multivariate rational functions, namely the rational summability problem and the existence problem of telescopers for multivariate rational functions. Our approach is based on the structure of Sato’s isotropy groups of polynomials, which enables us to reduce the problems to testing the shift equivalence of polynomials. Our results provide a complete solution to the discrete analogue of Picard’s problem on differential forms and can be used to detect the applicability of the Wilf-Zeilberger method to multivariate rational functions.</p>","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"15 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2025-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143872731","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-04-24DOI: 10.1007/s10208-025-09707-9
Shuyang Ling
The orthogonal group synchronization problem, which focuses on recovering orthogonal group elements from their corrupted pairwise measurements, encompasses examples such as high-dimensional Kuramoto model on general signed networks, (mathbb {Z}_2)-synchronization, community detection under stochastic block models, and orthogonal Procrustes problem. The semidefinite relaxation (SDR) has proven its power in solving this problem; however, its expensive computational costs impede its widespread practical applications. We consider the Burer–Monteiro factorization approach to the orthogonal group synchronization, an effective and scalable low-rank factorization to solve large scale SDPs. Despite the significant empirical successes of this factorization approach, it is still a challenging task to understand when the nonconvex optimization landscape is benign, i.e., the optimization landscape possesses only one local minimizer, which is also global. In this work, we demonstrate that if the rank of the factorization exceeds twice the condition number of the “Laplacian" (certificate matrix) at the global minimizer, the optimization landscape is absent of spurious local minima. Our main theorem is purely algebraic and versatile, and it seamlessly applies to all the aforementioned examples: the nonconvex landscape remains benign under almost identical condition that enables the success of the SDR. Additionally, we illustrate that the Burer–Monteiro factorization is robust to “monotone adversaries", mirroring the resilience of the SDR. In other words, introducing “favorable" adversaries into the data will not result in the emergence of new spurious local minimizers.
{"title":"Local Geometry Determines Global Landscape in Low-Rank Factorization for Synchronization","authors":"Shuyang Ling","doi":"10.1007/s10208-025-09707-9","DOIUrl":"https://doi.org/10.1007/s10208-025-09707-9","url":null,"abstract":"<p>The orthogonal group synchronization problem, which focuses on recovering orthogonal group elements from their corrupted pairwise measurements, encompasses examples such as high-dimensional Kuramoto model on general signed networks, <span>(mathbb {Z}_2)</span>-synchronization, community detection under stochastic block models, and orthogonal Procrustes problem. The semidefinite relaxation (SDR) has proven its power in solving this problem; however, its expensive computational costs impede its widespread practical applications. We consider the Burer–Monteiro factorization approach to the orthogonal group synchronization, an effective and scalable low-rank factorization to solve large scale SDPs. Despite the significant empirical successes of this factorization approach, it is still a challenging task to understand when the nonconvex optimization landscape is benign, i.e., the optimization landscape possesses only one local minimizer, which is also global. In this work, we demonstrate that if the rank of the factorization exceeds twice the condition number of the “Laplacian\" (certificate matrix) at the global minimizer, the optimization landscape is absent of spurious local minima. Our main theorem is purely algebraic and versatile, and it seamlessly applies to all the aforementioned examples: the nonconvex landscape remains benign under almost identical condition that enables the success of the SDR. Additionally, we illustrate that the Burer–Monteiro factorization is robust to “monotone adversaries\", mirroring the resilience of the SDR. In other words, introducing “favorable\" adversaries into the data will not result in the emergence of new spurious local minimizers.</p>","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"2 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2025-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143872732","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-03-24DOI: 10.1007/s10208-025-09698-7
Lars Diening, Lukas Gehring, Johannes Storn
We introduce a simple initialization of the Maubach bisection routine for adaptive mesh refinement which applies to any conforming initial triangulation and terminates in linear time with respect to the number of initial vertices. We show that Maubach’s routine with this initialization always terminates and generates meshes that preserve shape regularity and satisfy the closure estimate needed for optimal convergence of adaptive schemes. Our ansatz allows for the intrinsic use of existing implementations.
{"title":"Adaptive Mesh Refinement for Arbitrary Initial Triangulations","authors":"Lars Diening, Lukas Gehring, Johannes Storn","doi":"10.1007/s10208-025-09698-7","DOIUrl":"https://doi.org/10.1007/s10208-025-09698-7","url":null,"abstract":"<p>We introduce a simple initialization of the Maubach bisection routine for adaptive mesh refinement which applies to any conforming initial triangulation and terminates in linear time with respect to the number of initial vertices. We show that Maubach’s routine with this initialization always terminates and generates meshes that preserve shape regularity and satisfy the closure estimate needed for optimal convergence of adaptive schemes. Our ansatz allows for the intrinsic use of existing implementations.</p>","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"21 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2025-03-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143695241","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-03-17DOI: 10.1007/s10208-025-09700-2
Richard Falk, Ragnar Winther
The bubble transform is a procedure to decompose differential forms, which are piecewise smooth with respect to a given triangulation of the domain, into a sum of local bubbles. In this paper, an improved version of a construction in the setting of the de Rham complex previously proposed by the authors is presented. The major improvement in the decomposition is that unlike the previous results, in which the individual bubbles were rational functions with the property that groups of local bubbles summed up to preserve piecewise smoothness, the new decomposition is strictly space-preserving in the sense that each local bubble preserves piecewise smoothness. An important property of the transform is that the construction only depends on the given triangulation of the domain and is independent of any finite element space. On the other hand, all the standard piecewise polynomial spaces are invariant under the transform. Other key properties of the transform are that it commutes with the exterior derivative, is bounded in (L^2), and satisfies the stable decomposition property.
{"title":"Local Space-Preserving Decompositions for the Bubble Transform","authors":"Richard Falk, Ragnar Winther","doi":"10.1007/s10208-025-09700-2","DOIUrl":"https://doi.org/10.1007/s10208-025-09700-2","url":null,"abstract":"<p>The bubble transform is a procedure to decompose differential forms, which are piecewise smooth with respect to a given triangulation of the domain, into a sum of local bubbles. In this paper, an improved version of a construction in the setting of the de Rham complex previously proposed by the authors is presented. The major improvement in the decomposition is that unlike the previous results, in which the individual bubbles were rational functions with the property that groups of local bubbles summed up to preserve piecewise smoothness, the new decomposition is strictly space-preserving in the sense that each local bubble preserves piecewise smoothness. An important property of the transform is that the construction only depends on the given triangulation of the domain and is independent of any finite element space. On the other hand, all the standard piecewise polynomial spaces are invariant under the transform. Other key properties of the transform are that it commutes with the exterior derivative, is bounded in <span>(L^2)</span>, and satisfies the <i>stable decomposition property</i>.</p>","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"61 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2025-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143641098","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-03-13DOI: 10.1007/s10208-025-09699-6
Robert Cardona, Eva Miranda, Daniel Peralta-Salas
In 1991, Moore (Nonlinearity 4:199–230, 1991) raised a question about whether hydrodynamics is capable of performing computations. Similarly, in 2016, Tao (J Am Math Soc 29(3):601–674, 2016) asked whether a mechanical system, including a fluid flow, can simulate a universal Turing machine. In this expository article, we review the construction in Cardona et al. (Proc Natl Acad Sci 118(19):e2026818118, 2021) of a “Fluid computer” in dimension 3 that combines techniques in symbolic dynamics with the connection between steady Euler flows and contact geometry unveiled by Etnyre and Ghrist. In addition, we argue that the metric that renders the vector field Beltrami cannot be critical in the Chern-Hamilton sense (Chern and Hamilton in On Riemannian metrics adapted to three-dimensional contact manifolds, Springer, Berlin, 1985). We also sketch the completely different construction for the Euclidean metric in (mathbb {R}^3) as given in Cardona et al. (J Math Pures Appl 169:50–81, 2023). These results reveal the existence of undecidable fluid particle paths. We conclude the article with a list of open problems.
1991年,Moore (Nonlinearity 4:199 - 230,1991)提出了一个关于流体力学是否能够进行计算的问题。同样,在2016年,Tao (J Am Math Soc 29(3): 601-674, 2016)提出了一个机械系统,包括流体流动,是否可以模拟通用图灵机。在这篇阐述性的文章中,我们回顾了Cardona等人(Proc Natl Acad Sci 118(19): e2026818118,2021)构建的三维“流体计算机”,该计算机将符号动力学技术与Etnyre和Ghrist揭示的稳定欧拉流和接触几何之间的联系结合起来。此外,我们认为呈现向量场Beltrami的度规在Chern-Hamilton意义上不可能是临界的(Chern和Hamilton在On riemanian metrics适应三维接触流形,施普林格,Berlin, 1985)。我们还概述了Cardona等人在(mathbb {R}^3)中给出的欧几里得度量的完全不同的结构(J Math Pures Appl 169:50 - 81,2023)。这些结果揭示了不确定流体粒子路径的存在。最后,我们列出了一些尚未解决的问题。
{"title":"Towards a Fluid Computer","authors":"Robert Cardona, Eva Miranda, Daniel Peralta-Salas","doi":"10.1007/s10208-025-09699-6","DOIUrl":"https://doi.org/10.1007/s10208-025-09699-6","url":null,"abstract":"<p>In 1991, Moore (Nonlinearity 4:199–230, 1991) raised a question about whether hydrodynamics is capable of performing computations. Similarly, in 2016, Tao (J Am Math Soc 29(3):601–674, 2016) asked whether a mechanical system, including a fluid flow, can simulate a universal Turing machine. In this expository article, we review the construction in Cardona et al. (Proc Natl Acad Sci 118(19):e2026818118, 2021) of a “Fluid computer” in dimension 3 that combines techniques in symbolic dynamics with the connection between steady Euler flows and contact geometry unveiled by Etnyre and Ghrist. In addition, we argue that the metric that renders the vector field Beltrami cannot be critical in the Chern-Hamilton sense (Chern and Hamilton in On Riemannian metrics adapted to three-dimensional contact manifolds, Springer, Berlin, 1985). We also sketch the completely different construction for the Euclidean metric in <span>(mathbb {R}^3)</span> as given in Cardona et al. (J Math Pures Appl 169:50–81, 2023). These results reveal the existence of undecidable fluid particle paths. We conclude the article with a list of open problems.\u0000</p>","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"16 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2025-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143618496","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-03-06DOI: 10.1007/s10208-025-09705-x
Riccardo Bonalli, Alessandro Rudi
We propose a novel non-parametric learning paradigm for the identification of drift and diffusion coefficients of multi-dimensional non-linear stochastic differential equations, which relies upon discrete-time observations of the state. The key idea essentially consists of fitting a RKHS-based approximation of the corresponding Fokker–Planck equation to such observations, yielding theoretical estimates of non-asymptotic learning rates which, unlike previous works, become increasingly tighter when the regularity of the unknown drift and diffusion coefficients becomes higher. Our method being kernel-based, offline pre-processing may be profitably leveraged to enable efficient numerical implementation, offering excellent balance between precision and computational complexity.
{"title":"Non-parametric Learning of Stochastic Differential Equations with Non-asymptotic Fast Rates of Convergence","authors":"Riccardo Bonalli, Alessandro Rudi","doi":"10.1007/s10208-025-09705-x","DOIUrl":"https://doi.org/10.1007/s10208-025-09705-x","url":null,"abstract":"<p>We propose a novel non-parametric learning paradigm for the identification of drift and diffusion coefficients of multi-dimensional non-linear stochastic differential equations, which relies upon discrete-time observations of the state. The key idea essentially consists of fitting a RKHS-based approximation of the corresponding Fokker–Planck equation to such observations, yielding theoretical estimates of non-asymptotic learning rates which, unlike previous works, become increasingly tighter when the regularity of the unknown drift and diffusion coefficients becomes higher. Our method being kernel-based, offline pre-processing may be profitably leveraged to enable efficient numerical implementation, offering excellent balance between precision and computational complexity.</p>","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"11 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2025-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143569769","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-03-04DOI: 10.1007/s10208-025-09703-z
Igor Klep, Victor Magron, Jurij Volčič
This paper introduces and develops the algebraic framework of moment polynomials, which are polynomial expressions in commuting variables and their formal mixed moments. Their positivity and optimization over probability measures supported on semialgebraic sets and subject to moment polynomial constraints is investigated. On the one hand, a positive solution to Hilbert’s 17th problem for pseudo-moments is given. On the other hand, moment polynomials positive on actual measures are shown to be sums of squares and formal moments of squares up to arbitrarily small perturbation of their coefficients. When only measures supported on a bounded semialgebraic set are considered, a stronger algebraic certificate for moment polynomial positivity is derived. This result gives rise to a converging hierarchy of semidefinite programs for moment polynomial optimization. Finally, as an application, two open nonlinear Bell inequalities from quantum physics are settled.
{"title":"Sums of Squares Certificates for Polynomial Moment Inequalities","authors":"Igor Klep, Victor Magron, Jurij Volčič","doi":"10.1007/s10208-025-09703-z","DOIUrl":"https://doi.org/10.1007/s10208-025-09703-z","url":null,"abstract":"<p>This paper introduces and develops the algebraic framework of moment polynomials, which are polynomial expressions in commuting variables and their formal mixed moments. Their positivity and optimization over probability measures supported on semialgebraic sets and subject to moment polynomial constraints is investigated. On the one hand, a positive solution to Hilbert’s 17th problem for pseudo-moments is given. On the other hand, moment polynomials positive on actual measures are shown to be sums of squares and formal moments of squares up to arbitrarily small perturbation of their coefficients. When only measures supported on a bounded semialgebraic set are considered, a stronger algebraic certificate for moment polynomial positivity is derived. This result gives rise to a converging hierarchy of semidefinite programs for moment polynomial optimization. Finally, as an application, two open nonlinear Bell inequalities from quantum physics are settled.</p>","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"35 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2025-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143546173","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-03-04DOI: 10.1007/s10208-025-09702-0
Buyang Li, Yifei Wu
This article is concerned with the construction and analysis of new time discretizations for the KdV equation on a torus for low-regularity solutions below (H^1). New harmonic analysis tools, including averaging approximations to the exponential phase functions and trilinear estimates of the KdV operator, are established for the construction and analysis of time discretizations with higher convergence orders under low-regularity conditions. In addition, new perturbation techniques are introduced to establish stability estimates of time discretizations under low-regularity conditions without using filters when the energy techniques fail. The proposed method is proved to be convergent with order (gamma ) (up to a logarithmic factor) in (L^2) under the regularity condition (uin C([0,T];H^gamma )) for (gamma in (0,1]).
本文关注环上 KdV 方程对于低于 (H^1)的低规则解的新时间离散的构造和分析。本文建立了新的谐波分析工具,包括指数相位函数的平均近似和 KdV 算子的三线性估计,用于构建和分析低规则性条件下具有更高收敛阶数的时间离散。此外,还引入了新的扰动技术,当能量技术失效时,无需使用滤波器即可建立低规则性条件下时间离散的稳定性估计。在 (gamma in C([0,T];H^gamma )) 为 (gamma in (0,1])的规则性条件下,所提出的方法被证明在 (L^2) 中以 (gamma )阶收敛(达到对数因子)。
{"title":"An Unfiltered Low-Regularity Integrator for the KdV Equation with Solutions Below $$mathbf{H^1}$$","authors":"Buyang Li, Yifei Wu","doi":"10.1007/s10208-025-09702-0","DOIUrl":"https://doi.org/10.1007/s10208-025-09702-0","url":null,"abstract":"<p>This article is concerned with the construction and analysis of new time discretizations for the KdV equation on a torus for low-regularity solutions below <span>(H^1)</span>. New harmonic analysis tools, including averaging approximations to the exponential phase functions and trilinear estimates of the KdV operator, are established for the construction and analysis of time discretizations with higher convergence orders under low-regularity conditions. In addition, new perturbation techniques are introduced to establish stability estimates of time discretizations under low-regularity conditions without using filters when the energy techniques fail. The proposed method is proved to be convergent with order <span>(gamma )</span> (up to a logarithmic factor) in <span>(L^2)</span> under the regularity condition <span>(uin C([0,T];H^gamma ))</span> for <span>(gamma in (0,1])</span>.</p>","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"59 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2025-03-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143546174","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-02-18DOI: 10.1007/s10208-024-09673-8
Ben Adcock, Matthew J. Colbrook, Maksym Neyra-Nesterenko
Sharpness is an almost generic assumption in continuous optimization that bounds the distance from minima by objective function suboptimality. It facilitates the acceleration of first-order methods through restarts. However, sharpness involves problem-specific constants that are typically unknown, and restart schemes typically reduce convergence rates. Moreover, these schemes are challenging to apply in the presence of noise or with approximate model classes (e.g., in compressive imaging or learning problems), and they generally assume that the first-order method used produces feasible iterates. We consider the assumption of approximate sharpness, a generalization of sharpness that incorporates an unknown constant perturbation to the objective function error. This constant offers greater robustness (e.g., with respect to noise or relaxation of model classes) for finding approximate minimizers. By employing a new type of search over the unknown constants, we design a restart scheme that applies to general first-order methods and does not require the first-order method to produce feasible iterates. Our scheme maintains the same convergence rate as when the constants are known. The convergence rates we achieve for various first-order methods match the optimal rates or improve on previously established rates for a wide range of problems. We showcase our restart scheme in several examples and highlight potential future applications and developments of our framework and theory.
{"title":"Restarts Subject to Approximate Sharpness: A Parameter-Free and Optimal Scheme For First-Order Methods","authors":"Ben Adcock, Matthew J. Colbrook, Maksym Neyra-Nesterenko","doi":"10.1007/s10208-024-09673-8","DOIUrl":"https://doi.org/10.1007/s10208-024-09673-8","url":null,"abstract":"<p>Sharpness is an almost generic assumption in continuous optimization that bounds the distance from minima by objective function suboptimality. It facilitates the acceleration of first-order methods through <i>restarts</i>. However, sharpness involves problem-specific constants that are typically unknown, and restart schemes typically reduce convergence rates. Moreover, these schemes are challenging to apply in the presence of noise or with approximate model classes (e.g., in compressive imaging or learning problems), and they generally assume that the first-order method used produces feasible iterates. We consider the assumption of <i>approximate sharpness</i>, a generalization of sharpness that incorporates an unknown constant perturbation to the objective function error. This constant offers greater robustness (e.g., with respect to noise or relaxation of model classes) for finding approximate minimizers. By employing a new type of search over the unknown constants, we design a restart scheme that applies to general first-order methods and does not require the first-order method to produce feasible iterates. Our scheme maintains the same convergence rate as when the constants are known. The convergence rates we achieve for various first-order methods match the optimal rates or improve on previously established rates for a wide range of problems. We showcase our restart scheme in several examples and highlight potential future applications and developments of our framework and theory.</p>","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"13 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2025-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143443283","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-02-14DOI: 10.1007/s10208-025-09692-z
Tommi Muller, Vidit Nanda, Anna Seigal
We count singular vector tuples of a system of tensors assigned to the edges of a directed hypergraph. To do so, we study the generalisation of quivers to directed hypergraphs. Assigning vector spaces to the nodes of a hypergraph and multilinear maps to its hyperedges gives a hyperquiver representation. Hyperquiver representations generalise quiver representations (where all hyperedges are edges) and tensors (where there is only one multilinear map). The singular vectors of a hyperquiver representation are a compatible assignment of vectors to the nodes. We compute the dimension and degree of the variety of singular vectors of a sufficiently generic hyperquiver representation. Our formula specialises to known results that count the singular vectors and eigenvectors of a generic tensor. Lastly, we study a hypergraph generalisation of the inverse tensor eigenvalue problem and solve it algorithmically.
{"title":"Multilinear Hyperquiver Representations","authors":"Tommi Muller, Vidit Nanda, Anna Seigal","doi":"10.1007/s10208-025-09692-z","DOIUrl":"https://doi.org/10.1007/s10208-025-09692-z","url":null,"abstract":"<p>We count singular vector tuples of a system of tensors assigned to the edges of a directed hypergraph. To do so, we study the generalisation of quivers to directed hypergraphs. Assigning vector spaces to the nodes of a hypergraph and multilinear maps to its hyperedges gives a hyperquiver representation. Hyperquiver representations generalise quiver representations (where all hyperedges are edges) and tensors (where there is only one multilinear map). The singular vectors of a hyperquiver representation are a compatible assignment of vectors to the nodes. We compute the dimension and degree of the variety of singular vectors of a sufficiently generic hyperquiver representation. Our formula specialises to known results that count the singular vectors and eigenvectors of a generic tensor. Lastly, we study a hypergraph generalisation of the inverse tensor eigenvalue problem and solve it algorithmically.</p>","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"9 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2025-02-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143417487","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}