Pub Date : 2025-08-12DOI: 10.1007/s10208-025-09727-5
Rémi Carles
We consider the nonlinear Schrödinger equation with a potential, also known as Gross-Pitaevskii equation. By introducing a suitable spectral localization, we prove low regularity error estimates for the time discretization corresponding to an adapted Lie-Trotter splitting scheme. The proof is based on tools from spectral theory and pseudodifferential calculus in order to obtain various estimates on the spectral localization, including discrete Strichartz estimates which support the nonlinear analysis.
{"title":"Time Splitting and Error Estimates for Nonlinear Schrödinger Equations with a Potential","authors":"Rémi Carles","doi":"10.1007/s10208-025-09727-5","DOIUrl":"https://doi.org/10.1007/s10208-025-09727-5","url":null,"abstract":"<p>We consider the nonlinear Schrödinger equation with a potential, also known as Gross-Pitaevskii equation. By introducing a suitable spectral localization, we prove low regularity error estimates for the time discretization corresponding to an adapted Lie-Trotter splitting scheme. The proof is based on tools from spectral theory and pseudodifferential calculus in order to obtain various estimates on the spectral localization, including discrete Strichartz estimates which support the nonlinear analysis.</p>","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"38 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2025-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144924522","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-08-11DOI: 10.1007/s10208-025-09729-3
Bekzhan Kerimkulov, James-Michael Leahy, David Siska, Lukasz Szpruch, Yufei Zhang
We study the global convergence of a Fisher–Rao policy gradient flow for infinite-horizon entropy-regularised Markov decision processes with Polish state and action spaces. The flow is a continuous-time analogue of a policy mirror descent method. We establish the global well-posedness of the gradient flow and demonstrate its exponential convergence to the optimal policy. Moreover, we prove the flow is stable with respect to gradient evaluation, offering insights into the performance of a natural policy gradient flow with log-linear policy parameterisation. To overcome challenges stemming from the lack of the convexity of the objective function and the discontinuity arising from the entropy regulariser, we leverage the performance difference lemma and the duality relationship between the gradient and mirror descent flows. Our analysis provides a theoretical foundation for developing various discrete policy gradient algorithms.
{"title":"A Fisher–Rao Gradient Flow for Entropy-Regularised Markov Decision Processes in Polish Spaces","authors":"Bekzhan Kerimkulov, James-Michael Leahy, David Siska, Lukasz Szpruch, Yufei Zhang","doi":"10.1007/s10208-025-09729-3","DOIUrl":"https://doi.org/10.1007/s10208-025-09729-3","url":null,"abstract":"<p>We study the global convergence of a Fisher–Rao policy gradient flow for infinite-horizon entropy-regularised Markov decision processes with Polish state and action spaces. The flow is a continuous-time analogue of a policy mirror descent method. We establish the global well-posedness of the gradient flow and demonstrate its exponential convergence to the optimal policy. Moreover, we prove the flow is stable with respect to gradient evaluation, offering insights into the performance of a natural policy gradient flow with log-linear policy parameterisation. To overcome challenges stemming from the lack of the convexity of the objective function and the discontinuity arising from the entropy regulariser, we leverage the performance difference lemma and the duality relationship between the gradient and mirror descent flows. Our analysis provides a theoretical foundation for developing various discrete policy gradient algorithms.</p>","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"56 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2025-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144924524","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-08-07DOI: 10.1007/s10208-025-09725-7
Niels Lubbes, Mehdi Makhul, Josef Schicho, Audie Warren
For a given graph whose edges are labeled with general real numbers, we consider the set of functions from the vertex set into the Euclidean plane such that the distance between the images of neighbouring vertices is equal to the corresponding edge label. This set of functions can be expressed as the zero set of quadratic polynomials and our main result characterizes the number of complex irreducible components of this zero set in terms of combinatorial properties of the graph. In case the complex components are three-dimensional, then the graph is minimally rigid and the component number is a well-known invariant from rigidity theory. If the components are four-dimensional, then they correspond to one-dimensional coupler curves of flexible planar mechanisms. As an application, we characterize the degree of irreducible components of such coupler curves combinatorially.
{"title":"Irreducible Components of Sets of Points in the Plane that Satisfy Distance Conditions","authors":"Niels Lubbes, Mehdi Makhul, Josef Schicho, Audie Warren","doi":"10.1007/s10208-025-09725-7","DOIUrl":"https://doi.org/10.1007/s10208-025-09725-7","url":null,"abstract":"<p>For a given graph whose edges are labeled with general real numbers, we consider the set of functions from the vertex set into the Euclidean plane such that the distance between the images of neighbouring vertices is equal to the corresponding edge label. This set of functions can be expressed as the zero set of quadratic polynomials and our main result characterizes the number of complex irreducible components of this zero set in terms of combinatorial properties of the graph. In case the complex components are three-dimensional, then the graph is minimally rigid and the component number is a well-known invariant from rigidity theory. If the components are four-dimensional, then they correspond to one-dimensional coupler curves of flexible planar mechanisms. As an application, we characterize the degree of irreducible components of such coupler curves combinatorially.</p>","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"27 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2025-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144924570","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-07-14DOI: 10.1007/s10208-025-09720-y
Ari Stern, Enrico Zampa
We consider the application of finite element exterior calculus (FEEC) methods to a class of canonical Hamiltonian PDE systems involving differential forms. Solutions to these systems satisfy a local multisymplectic conservation law, which generalizes the more familiar symplectic conservation law for Hamiltonian systems of ODEs, and which is connected with physically-important reciprocity phenomena, such as Lorentz reciprocity in electromagnetics. We characterize hybrid FEEC methods whose numerical traces satisfy a version of the multisymplectic conservation law, and we apply this characterization to several specific classes of FEEC methods, including conforming Arnold–Falk–Winther-type methods and various hybridizable discontinuous Galerkin (HDG) methods. Interestingly, the HDG-type and other nonconforming methods are shown, in general, to be multisymplectic in a stronger sense than the conforming FEEC methods. This substantially generalizes previous work of McLachlan and Stern [Found. Comput. Math., 20 (2020), pp. 35–69] on the more restricted class of canonical Hamiltonian PDEs in the de Donder–Weyl “grad-div” form.
{"title":"Multisymplecticity in Finite Element Exterior Calculus","authors":"Ari Stern, Enrico Zampa","doi":"10.1007/s10208-025-09720-y","DOIUrl":"https://doi.org/10.1007/s10208-025-09720-y","url":null,"abstract":"<p>We consider the application of finite element exterior calculus (FEEC) methods to a class of canonical Hamiltonian PDE systems involving differential forms. Solutions to these systems satisfy a local <i>multisymplectic conservation law</i>, which generalizes the more familiar symplectic conservation law for Hamiltonian systems of ODEs, and which is connected with physically-important reciprocity phenomena, such as Lorentz reciprocity in electromagnetics. We characterize hybrid FEEC methods whose numerical traces satisfy a version of the multisymplectic conservation law, and we apply this characterization to several specific classes of FEEC methods, including conforming Arnold–Falk–Winther-type methods and various hybridizable discontinuous Galerkin (HDG) methods. Interestingly, the HDG-type and other nonconforming methods are shown, in general, to be multisymplectic in a stronger sense than the conforming FEEC methods. This substantially generalizes previous work of McLachlan and Stern [Found. Comput. Math., 20 (2020), pp. 35–69] on the more restricted class of canonical Hamiltonian PDEs in the de Donder–Weyl “grad-div” form.</p>","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"1 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2025-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144924528","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-07-14DOI: 10.1007/s10208-025-09719-5
Jian-Feng Cai, Zhiqiang Xu, Zili Xu
<p>This paper delves into the spectral norm aspect of the Generalized Column and Row Subset Selection (GCRSS) problem. Given a target matrix <span><span>textbf{A}in mathbb {R}^{ntimes d}</span><script type="math/tex">textbf{A}in mathbb {R}^{ntimes d}</script></span>, the objective of GCRSS is to select a column submatrix <span><span>textbf{B}_{:,S}in mathbb {R}^{ntimes k}</span><script type="math/tex">textbf{B}_{:,S}in mathbb {R}^{ntimes k}</script></span> from the source matrix <span><span>textbf{B}in mathbb {R}^{ntimes d_B}</span><script type="math/tex">textbf{B}in mathbb {R}^{ntimes d_B}</script></span> and a row submatrix <span><span>textbf{C}_{R,:}in mathbb {R}^{rtimes d}</span><script type="math/tex">textbf{C}_{R,:}in mathbb {R}^{rtimes d}</script></span> from the source matrix <span><span>textbf{C}in mathbb {R}^{n_Ctimes d}</span><script type="math/tex">textbf{C}in mathbb {R}^{n_Ctimes d}</script></span>, such that the residual matrix <span><span>(textbf{I}_n-textbf{B}_{:,S}textbf{B}_{:,S}^{dagger })textbf{A}(textbf{I}_d-textbf{C}_{R,:}^{dagger } textbf{C}_{R,:})</span><script type="math/tex">(textbf{I}_n-textbf{B}_{:,S}textbf{B}_{:,S}^{dagger })textbf{A}(textbf{I}_d-textbf{C}_{R,:}^{dagger } textbf{C}_{R,:})</script></span> has a small spectral norm. By employing the method of interlacing polynomials, we show that the smallest possible spectral norm of a residual matrix can be bounded by the largest root of a related expected characteristic polynomial. A deterministic polynomial time algorithm is provided for the spectral norm case of the GCRSS problem. We next apply our results to two specific GCRSS scenarios, one where <span><span>r=0</span><script type="math/tex">r=0</script></span>, simplifying the problem to the Generalized Column Subset Selection (GCSS) problem, and the other where <span><span>textbf{B}=textbf{C}=textbf{I}_d</span><script type="math/tex">textbf{B}=textbf{C}=textbf{I}_d</script></span>, reducing the problem to the submatrix selection problem. In the GCSS scenario, we connect the expected characteristic polynomials to the convolution of multi-affine polynomials, leading to the derivation of the first provable reconstruction bound on the spectral norm of a residual matrix. In the submatrix selection scenario, we show that for any sufficiently small <span><span>varepsilon >0</span><script type="math/tex">varepsilon >0</script></span> and any square matrix <span><span>textbf{A}in mathbb {R}^{dtimes d}</span><script type="math/tex">textbf{A}in mathbb {R}^{dtimes d}</script></span>, there exist two subsets <span><span>Ssubset [d]</span><script type="math/tex">Ssubset [d]</script></span> and <span><span>Rsubset [d]</span><script type="math/tex">Rsubset [d]</script></span> of sizes <span><span>O(dcdot varepsilon ^2)</span><script type="math/tex">O(dcdot varepsilon ^2)</script></span> such that <span><span>Vert textbf{A}_{S,R}Vert _2le varepsilon cdot Vert textbf{A}Vert _2</span><script type="math/tex">Vert textbf{A}_{S,R}
{"title":"Interlacing Polynomial Method for Matrix Approximation via Generalized Column and Row Selection","authors":"Jian-Feng Cai, Zhiqiang Xu, Zili Xu","doi":"10.1007/s10208-025-09719-5","DOIUrl":"https://doi.org/10.1007/s10208-025-09719-5","url":null,"abstract":"<p>This paper delves into the spectral norm aspect of the Generalized Column and Row Subset Selection (GCRSS) problem. Given a target matrix <span><span>textbf{A}in mathbb {R}^{ntimes d}</span><script type=\"math/tex\">textbf{A}in mathbb {R}^{ntimes d}</script></span>, the objective of GCRSS is to select a column submatrix <span><span>textbf{B}_{:,S}in mathbb {R}^{ntimes k}</span><script type=\"math/tex\">textbf{B}_{:,S}in mathbb {R}^{ntimes k}</script></span> from the source matrix <span><span>textbf{B}in mathbb {R}^{ntimes d_B}</span><script type=\"math/tex\">textbf{B}in mathbb {R}^{ntimes d_B}</script></span> and a row submatrix <span><span>textbf{C}_{R,:}in mathbb {R}^{rtimes d}</span><script type=\"math/tex\">textbf{C}_{R,:}in mathbb {R}^{rtimes d}</script></span> from the source matrix <span><span>textbf{C}in mathbb {R}^{n_Ctimes d}</span><script type=\"math/tex\">textbf{C}in mathbb {R}^{n_Ctimes d}</script></span>, such that the residual matrix <span><span>(textbf{I}_n-textbf{B}_{:,S}textbf{B}_{:,S}^{dagger })textbf{A}(textbf{I}_d-textbf{C}_{R,:}^{dagger } textbf{C}_{R,:})</span><script type=\"math/tex\">(textbf{I}_n-textbf{B}_{:,S}textbf{B}_{:,S}^{dagger })textbf{A}(textbf{I}_d-textbf{C}_{R,:}^{dagger } textbf{C}_{R,:})</script></span> has a small spectral norm. By employing the method of interlacing polynomials, we show that the smallest possible spectral norm of a residual matrix can be bounded by the largest root of a related expected characteristic polynomial. A deterministic polynomial time algorithm is provided for the spectral norm case of the GCRSS problem. We next apply our results to two specific GCRSS scenarios, one where <span><span>r=0</span><script type=\"math/tex\">r=0</script></span>, simplifying the problem to the Generalized Column Subset Selection (GCSS) problem, and the other where <span><span>textbf{B}=textbf{C}=textbf{I}_d</span><script type=\"math/tex\">textbf{B}=textbf{C}=textbf{I}_d</script></span>, reducing the problem to the submatrix selection problem. In the GCSS scenario, we connect the expected characteristic polynomials to the convolution of multi-affine polynomials, leading to the derivation of the first provable reconstruction bound on the spectral norm of a residual matrix. In the submatrix selection scenario, we show that for any sufficiently small <span><span>varepsilon >0</span><script type=\"math/tex\">varepsilon >0</script></span> and any square matrix <span><span>textbf{A}in mathbb {R}^{dtimes d}</span><script type=\"math/tex\">textbf{A}in mathbb {R}^{dtimes d}</script></span>, there exist two subsets <span><span>Ssubset [d]</span><script type=\"math/tex\">Ssubset [d]</script></span> and <span><span>Rsubset [d]</span><script type=\"math/tex\">Rsubset [d]</script></span> of sizes <span><span>O(dcdot varepsilon ^2)</span><script type=\"math/tex\">O(dcdot varepsilon ^2)</script></span> such that <span><span>Vert textbf{A}_{S,R}Vert _2le varepsilon cdot Vert textbf{A}Vert _2</span><script type=\"math/tex\">Vert textbf{A}_{S,R}","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"18 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2025-07-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144924529","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-07-08DOI: 10.1007/s10208-025-09722-w
Zhengxin Zhang, Ziv Goldfeld, Kristjan Greenewald, Youssef Mroueh, Bharath K. Sriperumbudur
The Wasserstein space of probability measures is known for its intricate Riemannian structure, which underpins the Wasserstein geometry and enables gradient flow algorithms. However, the Wasserstein geometry may not be suitable for certain tasks or data modalities. Motivated by scenarios where the global structure of the data needs to be preserved, this work initiates the study of gradient flows and Riemannian structure in the Gromov-Wasserstein (GW) geometry, which is particularly suited for such purposes. We focus on the inner product GW (IGW) distance between distributions on mathbb {R}^d, which preserves the angles within the data and serves as a convenient initial setting due to its analytic tractability. Given a functional textsf{F}:mathcal {P}_2(mathbb {R}^d)rightarrow mathbb {R} to optimize and an initial distribution rho _0in mathcal {P}_2(mathbb {R}^d), we present an implicit IGW minimizing movement scheme that generates a sequence of distributions {rho _i}_{i=0}^n, which are close in IGW and aligned in the 2-Wasserstein sense. Taking the time step to zero, we prove that the (piecewise constant interpolation of the) discrete solution converges to an IGW generalized minimizing movement (GMM) (rho _t)_t that follows the continuity equation with a velocity field v_tin L^2(rho _t;mathbb {R}^d), specified by a global transformation of the Wasserstein gradient of textsf{F} (viz., the gradient of its first variation). The transformation is given by a mobility operator that modifies the Wasserstein gradient to encode not only local information, but also global structure, as expected for the IGW gradient flow. Our gradient flow analysis leads us to identify the Riemannian structure that gives rise to the intrinsic IGW geometry, using which we establish a Benamou-Brenier-like formula for IGW. We conclude with a formal derivation, akin to the Otto calculus, of the IGW gradient as the inverse mobility acting on the Wasserstein gradient. Numerical experiments demonstrating the global nature of IGW interpolations are provided to complement the theory.
{"title":"Gradient Flows and Riemannian Structure in the Gromov-Wasserstein Geometry","authors":"Zhengxin Zhang, Ziv Goldfeld, Kristjan Greenewald, Youssef Mroueh, Bharath K. Sriperumbudur","doi":"10.1007/s10208-025-09722-w","DOIUrl":"https://doi.org/10.1007/s10208-025-09722-w","url":null,"abstract":"<p>The Wasserstein space of probability measures is known for its intricate Riemannian structure, which underpins the Wasserstein geometry and enables gradient flow algorithms. However, the Wasserstein geometry may not be suitable for certain tasks or data modalities. Motivated by scenarios where the global structure of the data needs to be preserved, this work initiates the study of gradient flows and Riemannian structure in the Gromov-Wasserstein (GW) geometry, which is particularly suited for such purposes. We focus on the inner product GW (IGW) distance between distributions on <span><span>mathbb {R}^d</span><script type=\"math/tex\">mathbb {R}^d</script></span>, which preserves the angles within the data and serves as a convenient initial setting due to its analytic tractability. Given a functional <span><span>textsf{F}:mathcal {P}_2(mathbb {R}^d)rightarrow mathbb {R}</span><script type=\"math/tex\">textsf{F}:mathcal {P}_2(mathbb {R}^d)rightarrow mathbb {R}</script></span> to optimize and an initial distribution <span><span>rho _0in mathcal {P}_2(mathbb {R}^d)</span><script type=\"math/tex\">rho _0in mathcal {P}_2(mathbb {R}^d)</script></span>, we present an implicit IGW minimizing movement scheme that generates a sequence of distributions <span><span>{rho _i}_{i=0}^n</span><script type=\"math/tex\">{rho _i}_{i=0}^n</script></span>, which are close in IGW and aligned in the 2-Wasserstein sense. Taking the time step to zero, we prove that the (piecewise constant interpolation of the) discrete solution converges to an IGW generalized minimizing movement (GMM) <span><span>(rho _t)_t</span><script type=\"math/tex\">(rho _t)_t</script></span> that follows the continuity equation with a velocity field <span><span>v_tin L^2(rho _t;mathbb {R}^d)</span><script type=\"math/tex\">v_tin L^2(rho _t;mathbb {R}^d)</script></span>, specified by a global transformation of the Wasserstein gradient of <span><span>textsf{F}</span><script type=\"math/tex\">textsf{F}</script></span> (viz., the gradient of its first variation). The transformation is given by a mobility operator that modifies the Wasserstein gradient to encode not only local information, but also global structure, as expected for the IGW gradient flow. Our gradient flow analysis leads us to identify the Riemannian structure that gives rise to the intrinsic IGW geometry, using which we establish a Benamou-Brenier-like formula for IGW. We conclude with a formal derivation, akin to the Otto calculus, of the IGW gradient as the inverse mobility acting on the Wasserstein gradient. Numerical experiments demonstrating the global nature of IGW interpolations are provided to complement the theory.</p>","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"13 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2025-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144924530","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-06-25DOI: 10.1007/s10208-025-09712-y
Mihai I. Florea, Yurii E. Nesterov
First order methods endowed with global convergence guarantees operate using global lower bounds on the objective. The tightening of the bounds leads to an increase in theoretical guarantees and in observed practical performance. In this work, we define a global lower bound for smooth objectives that is optimal with respect to the collected oracle information. Our bound can be readily employed by the Gradient Method with Memory to improve its performance. Further using the machinery underlying the optimal bounds, we introduce a modified version of the estimate sequence that we use to construct an Optimized Gradient Method with Memory possessing the best known convergence guarantees for its class of algorithms up to the proportionality constant. We additionally equip the method with an adaptive convergence guarantee adjustment procedure that is an effective replacement for line-search. Simulation results on synthetic but otherwise difficult smooth problems validate the theoretical properties of the bound and of the proposed methods.
{"title":"An Optimal Lower Bound for Smooth Convex Functions","authors":"Mihai I. Florea, Yurii E. Nesterov","doi":"10.1007/s10208-025-09712-y","DOIUrl":"https://doi.org/10.1007/s10208-025-09712-y","url":null,"abstract":"<p>First order methods endowed with global convergence guarantees operate using global lower bounds on the objective. The tightening of the bounds leads to an increase in theoretical guarantees and in observed practical performance. In this work, we define a global lower bound for smooth objectives that is optimal with respect to the collected oracle information. Our bound can be readily employed by the Gradient Method with Memory to improve its performance. Further using the machinery underlying the optimal bounds, we introduce a modified version of the estimate sequence that we use to construct an Optimized Gradient Method with Memory possessing the best known convergence guarantees for its class of algorithms up to the proportionality constant. We additionally equip the method with an adaptive convergence guarantee adjustment procedure that is an effective replacement for line-search. Simulation results on synthetic but otherwise difficult smooth problems validate the theoretical properties of the bound and of the proposed methods.\u0000</p>","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"13 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2025-06-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144924555","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-06-02DOI: 10.1007/s10208-025-09704-y
Raymond van Bommel, Jordan Docking, Vladimir Dokchitser, Reynald Lercier, Elisa Lorenzo García
We give a conjectural characterisation of the stable reduction of plane quartics over local fields in terms of their Cayley octads. This results in p-adic criteria that efficiently give the stable reduction type amongst the 42 possible types, and whether the reduction is hyperelliptic or not. These criteria are in the vein of the machinery of “cluster pictures” for hyperelliptic curves. We also construct explicit families of quartic curves that realise all possible stable types, against which we test these criteria. We give numerical examples that illustrate how to use these criteria in practice.
{"title":"Reduction of Plane Quartics and Cayley Octads","authors":"Raymond van Bommel, Jordan Docking, Vladimir Dokchitser, Reynald Lercier, Elisa Lorenzo García","doi":"10.1007/s10208-025-09704-y","DOIUrl":"https://doi.org/10.1007/s10208-025-09704-y","url":null,"abstract":"<p>We give a conjectural characterisation of the stable reduction of plane quartics over local fields in terms of their Cayley octads. This results in <i>p</i>-adic criteria that efficiently give the stable reduction type amongst the 42 possible types, and whether the reduction is hyperelliptic or not. These criteria are in the vein of the machinery of “cluster pictures” for hyperelliptic curves. We also construct explicit families of quartic curves that realise all possible stable types, against which we test these criteria. We give numerical examples that illustrate how to use these criteria in practice.\u0000</p>","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"27 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2025-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144924560","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-04-29DOI: 10.1007/s10208-025-09711-z
Chupeng Ma
Multiscale partial differential equations (PDEs), featuring heterogeneous coefficients oscillating across possibly non-separated scales, pose computational challenges for standard numerical techniques. Over the past two decades, a range of specialized methods has emerged that enables the efficient solution of such problems. Two prominent approaches are numerical multiscale methods with problem-adapted coarse approximation spaces, and structured inverse methods that exploit a low-rank property of the associated Green’s functions to obtain approximate matrix factorizations. This work presents an abstract framework for the design, implementation, and analysis of the multiscale spectral generalized finite element method (MS-GFEM), a particular numerical multiscale method originally proposed in Babuska and Lipton (Multiscale Model Simul 9:373–406, 2011). MS-GFEM is a partition of unity method employing optimal local approximation spaces constructed from local spectral problems. We establish a general local approximation theory demonstrating exponential convergence with respect to the number of local degrees of freedom under certain assumptions, with explicit dependence on key problem parameters. Our framework applies to a broad class of multiscale PDEs with (L^{infty })-coefficients in both continuous and discrete, finite element settings, including highly indefinite problems and higher-order problems. Notably, we prove a local convergence rate of (O(e^{-cn^{1/d}})) for MS-GFEM for all these problems, improving upon the (O(e^{-cn^{1/(d+1)}})) rate shown by Babuska and Lipton. Moreover, based on the abstract local approximation theory for MS-GFEM, we establish a unified framework for showing low-rank approximations to multiscale PDEs. This framework applies to the aforementioned problems, proving that the associated Green’s functions admit an (O(|log epsilon |^{d}))-term separable approximation on well-separated domains with error (epsilon >0). Our analysis improves and generalizes the result in Bebendorf and Hackbusch (Numerische Mathematik 95:1–28, 2003) where an (O(|log epsilon |^{d+1}))-term separable approximation was proved for Poisson-type problems. It provides a rigorous theoretical foundation for diverse structured inverse methods, and also clarifies the intimate connection between approximation mechanisms in such methods and MS-GFEM.
多尺度偏微分方程(PDEs)具有非均匀系数在可能的非分离尺度上振荡的特点,对标准数值技术提出了计算挑战。在过去的二十年中,出现了一系列能够有效解决这类问题的专门方法。两种突出的方法是具有自适应问题的粗近似空间的数值多尺度方法,以及利用相关格林函数的低秩性质来获得近似矩阵分解的结构化逆方法。本文为多尺度谱广义有限元法(MS-GFEM)的设计、实现和分析提供了一个抽象框架,MS-GFEM是一种特殊的数值多尺度方法,最初由Babuska和Lipton提出(multiscale Model Simul 9:373-406, 2011)。MS-GFEM是一种利用局部谱问题构造的最优局部逼近空间的单位划分方法。我们建立了一个广义的局部逼近理论,证明了在一定的假设条件下,局部自由度的数目是指数收敛的,并且与关键问题参数有显式的依赖关系。我们的框架适用于连续和离散、有限元设置中具有(L^{infty }) -系数的广泛类别的多尺度偏微分方程,包括高度不确定问题和高阶问题。值得注意的是,我们证明了MS-GFEM对所有这些问题的局部收敛率为(O(e^{-cn^{1/d}})),比Babuska和Lipton给出的(O(e^{-cn^{1/(d+1)}}))收敛率有所提高。此外,基于MS-GFEM的抽象局部逼近理论,建立了多尺度偏微分方程低秩逼近的统一框架。该框架适用于上述问题,证明了相关的格林函数在分离良好的域上承认(O(|log epsilon |^{d}))项可分离近似,误差为(epsilon >0)。我们的分析改进并推广了Bebendorf和Hackbusch (Numerische Mathematik 95:1 - 28,2003)的结果,其中证明了泊松型问题的(O(|log epsilon |^{d+1}))项可分离近似。为各种结构化逆方法提供了严谨的理论基础,也阐明了这些方法中的逼近机制与MS-GFEM之间的密切联系。
{"title":"A Unified Framework for Multiscale Spectral Generalized FEMs and Low-Rank Approximations to Multiscale PDEs","authors":"Chupeng Ma","doi":"10.1007/s10208-025-09711-z","DOIUrl":"https://doi.org/10.1007/s10208-025-09711-z","url":null,"abstract":"<p>Multiscale partial differential equations (PDEs), featuring heterogeneous coefficients oscillating across possibly non-separated scales, pose computational challenges for standard numerical techniques. Over the past two decades, a range of specialized methods has emerged that enables the efficient solution of such problems. Two prominent approaches are numerical multiscale methods with problem-adapted coarse approximation spaces, and structured inverse methods that exploit a low-rank property of the associated Green’s functions to obtain approximate matrix factorizations. This work presents an abstract framework for the design, implementation, and analysis of the multiscale spectral generalized finite element method (MS-GFEM), a particular numerical multiscale method originally proposed in Babuska and Lipton (Multiscale Model Simul 9:373–406, 2011). MS-GFEM is a partition of unity method employing optimal local approximation spaces constructed from local spectral problems. We establish a general local approximation theory demonstrating exponential convergence with respect to the number of local degrees of freedom under certain assumptions, with explicit dependence on key problem parameters. Our framework applies to a broad class of multiscale PDEs with <span>(L^{infty })</span>-coefficients in both continuous and discrete, finite element settings, including highly indefinite problems and higher-order problems. Notably, we prove a local convergence rate of <span>(O(e^{-cn^{1/d}}))</span> for MS-GFEM for all these problems, improving upon the <span>(O(e^{-cn^{1/(d+1)}}))</span> rate shown by Babuska and Lipton. Moreover, based on the abstract local approximation theory for MS-GFEM, we establish a unified framework for showing low-rank approximations to multiscale PDEs. This framework applies to the aforementioned problems, proving that the associated Green’s functions admit an <span>(O(|log epsilon |^{d}))</span>-term separable approximation on well-separated domains with error <span>(epsilon >0)</span>. Our analysis improves and generalizes the result in Bebendorf and Hackbusch (Numerische Mathematik 95:1–28, 2003) where an <span>(O(|log epsilon |^{d+1}))</span>-term separable approximation was proved for Poisson-type problems. It provides a rigorous theoretical foundation for diverse structured inverse methods, and also clarifies the intimate connection between approximation mechanisms in such methods and MS-GFEM.</p>","PeriodicalId":55151,"journal":{"name":"Foundations of Computational Mathematics","volume":"45 1","pages":""},"PeriodicalIF":3.0,"publicationDate":"2025-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143889834","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}