Pub Date : 2024-06-13DOI: 10.1016/j.jco.2024.101883
A. Arman , A.E. Litvak
We improve some upper bounds for minimal dispersion on the cube and torus. Our new ingredient is an improvement of a probabilistic lemma used to obtain upper bounds for dispersion in several previous works. Our new lemma combines a random and non-random choice of points in the cube. This leads to better upper bounds for the minimal dispersion.
{"title":"Minimal dispersion on the cube and the torus","authors":"A. Arman , A.E. Litvak","doi":"10.1016/j.jco.2024.101883","DOIUrl":"10.1016/j.jco.2024.101883","url":null,"abstract":"<div><p>We improve some upper bounds for minimal dispersion on the cube and torus. Our new ingredient is an improvement of a probabilistic lemma used to obtain upper bounds for dispersion in several previous works. Our new lemma combines a random and non-random choice of points in the cube. This leads to better upper bounds for the minimal dispersion.</p></div>","PeriodicalId":50227,"journal":{"name":"Journal of Complexity","volume":"85 ","pages":"Article 101883"},"PeriodicalIF":1.7,"publicationDate":"2024-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0885064X24000608/pdfft?md5=8545345a37c7c8d8bd458b82060fc777&pid=1-s2.0-S0885064X24000608-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141408812","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-06-11DOI: 10.1016/j.jco.2024.101871
Xiangyong Tan , Ling Peng , Heng Lian , Xiaohui Liu
In this paper, we consider the adaptive Huber trace regression model with matrix covariates. A non-convex penalty function is employed to account for the low-rank structure of the unknown parameter. Under some mild conditions, we establish an upper bound for the statistical rate of convergence of the regularized matrix estimator. Theoretically, we can deal with heavy-tailed distributions with bounded -th moment for any . Furthermore, we derive the effect of the adaptive parameter on the final estimator. Some simulations, as well as a real data example, are designed to show the finite sample performance of the proposed method.
{"title":"Adaptive Huber trace regression with low-rank matrix parameter via nonconvex regularization","authors":"Xiangyong Tan , Ling Peng , Heng Lian , Xiaohui Liu","doi":"10.1016/j.jco.2024.101871","DOIUrl":"10.1016/j.jco.2024.101871","url":null,"abstract":"<div><p>In this paper, we consider the adaptive Huber trace regression model with matrix covariates. A non-convex penalty function is employed to account for the low-rank structure of the unknown parameter. Under some mild conditions, we establish an upper bound for the statistical rate of convergence of the regularized matrix estimator. Theoretically, we can deal with heavy-tailed distributions with bounded <span><math><mo>(</mo><mn>1</mn><mo>+</mo><mi>δ</mi><mo>)</mo></math></span>-th moment for any <span><math><mi>δ</mi><mo>></mo><mn>0</mn></math></span>. Furthermore, we derive the effect of the adaptive parameter on the final estimator. Some simulations, as well as a real data example, are designed to show the finite sample performance of the proposed method.</p></div>","PeriodicalId":50227,"journal":{"name":"Journal of Complexity","volume":"85 ","pages":"Article 101871"},"PeriodicalIF":1.8,"publicationDate":"2024-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141405892","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-06-11DOI: 10.1016/j.jco.2024.101873
Tim Johnston , Iosif Lytras , Sotirios Sabanis
In this article we consider sampling from log concave distributions in Hamiltonian setting, without assuming that the objective gradient is globally Lipschitz. We propose two algorithms based on monotone polygonal (tamed) Euler schemes, to sample from a target measure, and provide non-asymptotic 2-Wasserstein distance bounds between the law of the process of each algorithm and the target measure. Finally, we apply these results to bound the excess risk optimization error of the associated optimization problem.
{"title":"Kinetic Langevin MCMC sampling without gradient Lipschitz continuity - the strongly convex case","authors":"Tim Johnston , Iosif Lytras , Sotirios Sabanis","doi":"10.1016/j.jco.2024.101873","DOIUrl":"https://doi.org/10.1016/j.jco.2024.101873","url":null,"abstract":"<div><p>In this article we consider sampling from log concave distributions in Hamiltonian setting, without assuming that the objective gradient is globally Lipschitz. We propose two algorithms based on monotone polygonal (tamed) Euler schemes, to sample from a target measure, and provide non-asymptotic 2-Wasserstein distance bounds between the law of the process of each algorithm and the target measure. Finally, we apply these results to bound the excess risk optimization error of the associated optimization problem.</p></div>","PeriodicalId":50227,"journal":{"name":"Journal of Complexity","volume":"85 ","pages":"Article 101873"},"PeriodicalIF":1.7,"publicationDate":"2024-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0885064X24000505/pdfft?md5=a3d2ab8e2d24a32d60460bf5751fc280&pid=1-s2.0-S0885064X24000505-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141423327","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-06-11DOI: 10.1016/j.jco.2024.101872
Stefan Heinrich
Recently the adaption problem of Information-Based Complexity (IBC) for linear problems in the randomized setting was solved in Heinrich (2024) [8]. Several papers treating further aspects of this problem followed. However, all examples obtained so far were vector-valued. In this paper we settle the scalar-valued case. We study the complexity of mean computation in finite dimensional sequence spaces with mixed norms. We determine the n-th minimal errors in the randomized adaptive and non-adaptive settings. It turns out that among the problems considered there are examples where adaptive and non-adaptive n-th minimal errors deviate by a power of n. The gap can be (up to log factors) of the order . We also show how to turn such results into infinite dimensional examples with suitable deviation for all n simultaneously.
最近,海因里希(2024)[8] 解决了随机设置中线性问题的基于信息的复杂性(IBC)适应问题。随后,又有多篇论文对这一问题的其他方面进行了探讨。然而,迄今为止获得的所有示例都是矢量值。本文解决的是标量值问题。我们研究了具有混合 LpN 规范的有限维序列空间中均值计算的复杂性。我们确定了随机自适应和非自适应设置中的 n 次最小误差。结果发现,在所考虑的问题中,有自适应和非自适应 n 次最小误差偏差为 n 的幂的例子。这种差距可以是 n1/4 的数量级(最多对数因子)。我们还展示了如何同时将这些结果转化为对所有 n 都有适当偏差的无限维示例。
{"title":"Randomized complexity of mean computation and the adaption problem","authors":"Stefan Heinrich","doi":"10.1016/j.jco.2024.101872","DOIUrl":"https://doi.org/10.1016/j.jco.2024.101872","url":null,"abstract":"<div><p>Recently the adaption problem of Information-Based Complexity (IBC) for linear problems in the randomized setting was solved in Heinrich (2024) <span>[8]</span>. Several papers treating further aspects of this problem followed. However, all examples obtained so far were vector-valued. In this paper we settle the scalar-valued case. We study the complexity of mean computation in finite dimensional sequence spaces with mixed <span><math><msubsup><mrow><mi>L</mi></mrow><mrow><mi>p</mi></mrow><mrow><mi>N</mi></mrow></msubsup></math></span> norms. We determine the <em>n</em>-th minimal errors in the randomized adaptive and non-adaptive settings. It turns out that among the problems considered there are examples where adaptive and non-adaptive <em>n</em>-th minimal errors deviate by a power of <em>n</em>. The gap can be (up to log factors) of the order <span><math><msup><mrow><mi>n</mi></mrow><mrow><mn>1</mn><mo>/</mo><mn>4</mn></mrow></msup></math></span>. We also show how to turn such results into infinite dimensional examples with suitable deviation for all <em>n</em> simultaneously.</p></div>","PeriodicalId":50227,"journal":{"name":"Journal of Complexity","volume":"85 ","pages":"Article 101872"},"PeriodicalIF":1.7,"publicationDate":"2024-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141428808","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-06-03DOI: 10.1016/j.jco.2024.101870
Thomas Müller-Gronbach , Larisa Yaroslavtseva
We survey recent developments in the field of complexity of pathwise approximation in p-th mean of the solution of a stochastic differential equation at the final time based on finitely many evaluations of the driving Brownian motion. First, we briefly review the case of equations with globally Lipschitz continuous coefficients, for which an error rate of at least 1/2 in terms of the number of evaluations of the driving Brownian motion is always guaranteed by using the equidistant Euler-Maruyama scheme. Then we illustrate that giving up the global Lipschitz continuity of the coefficients may lead to a non-polynomial decay of the error for the Euler-Maruyama scheme or even to an arbitrary slow decay of the smallest possible error that can be achieved on the basis of finitely many evaluations of the driving Brownian motion. Finally, we turn to recent positive results for equations with a drift coefficient that is not globally Lipschitz continuous. Here we focus on scalar equations with a Lipschitz continuous diffusion coefficient and a drift coefficient that satisfies piecewise smoothness assumptions or has fractional Sobolev regularity and we present corresponding complexity results.
{"title":"On the complexity of strong approximation of stochastic differential equations with a non-Lipschitz drift coefficient","authors":"Thomas Müller-Gronbach , Larisa Yaroslavtseva","doi":"10.1016/j.jco.2024.101870","DOIUrl":"https://doi.org/10.1016/j.jco.2024.101870","url":null,"abstract":"<div><p>We survey recent developments in the field of complexity of pathwise approximation in <em>p</em>-th mean of the solution of a stochastic differential equation at the final time based on finitely many evaluations of the driving Brownian motion. First, we briefly review the case of equations with globally Lipschitz continuous coefficients, for which an error rate of at least 1/2 in terms of the number of evaluations of the driving Brownian motion is always guaranteed by using the equidistant Euler-Maruyama scheme. Then we illustrate that giving up the global Lipschitz continuity of the coefficients may lead to a non-polynomial decay of the error for the Euler-Maruyama scheme or even to an arbitrary slow decay of the smallest possible error that can be achieved on the basis of finitely many evaluations of the driving Brownian motion. Finally, we turn to recent positive results for equations with a drift coefficient that is not globally Lipschitz continuous. Here we focus on scalar equations with a Lipschitz continuous diffusion coefficient and a drift coefficient that satisfies piecewise smoothness assumptions or has fractional Sobolev regularity and we present corresponding complexity results.</p></div>","PeriodicalId":50227,"journal":{"name":"Journal of Complexity","volume":"85 ","pages":"Article 101870"},"PeriodicalIF":1.7,"publicationDate":"2024-06-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0885064X24000475/pdfft?md5=1abf95a86603ccdc1b342109b28265f5&pid=1-s2.0-S0885064X24000475-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141313488","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-31DOI: 10.1016/j.jco.2024.101868
Stefan Hain, Karsten Urban
We present a well-posed ultra-weak space-time variational formulation for the time-dependent version of the linear Schrödinger equation with an instationary Hamiltonian. We prove optimal inf-sup stability and introduce a space-time Petrov-Galerkin discretization with optimal discrete inf-sup stability.
We show norm-preservation of the ultra-weak formulation. The inf-sup optimal Petrov-Galerkin discretization is shown to be asymptotically norm-preserving, where the deviation is shown to be in the order of the discretization. In addition, we introduce a Galerkin discretization, which has suboptimal inf-sup stability but exact norm-preservation.
Numerical experiments underline the performance of the ultra-weak space-time variational formulation, especially for non-smooth initial data.
{"title":"An ultra-weak space-time variational formulation for the Schrödinger equation","authors":"Stefan Hain, Karsten Urban","doi":"10.1016/j.jco.2024.101868","DOIUrl":"https://doi.org/10.1016/j.jco.2024.101868","url":null,"abstract":"<div><p>We present a well-posed ultra-weak space-time variational formulation for the time-dependent version of the linear Schrödinger equation with an instationary Hamiltonian. We prove optimal inf-sup stability and introduce a space-time Petrov-Galerkin discretization with optimal discrete inf-sup stability.</p><p>We show norm-preservation of the ultra-weak formulation. The inf-sup optimal Petrov-Galerkin discretization is shown to be asymptotically norm-preserving, where the deviation is shown to be in the order of the discretization. In addition, we introduce a Galerkin discretization, which has suboptimal inf-sup stability but exact norm-preservation.</p><p>Numerical experiments underline the performance of the ultra-weak space-time variational formulation, especially for non-smooth initial data.</p></div>","PeriodicalId":50227,"journal":{"name":"Journal of Complexity","volume":"85 ","pages":"Article 101868"},"PeriodicalIF":1.7,"publicationDate":"2024-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S0885064X24000451/pdfft?md5=1ddbc4d09e904e3ef47e872092642e90&pid=1-s2.0-S0885064X24000451-main.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141290844","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-31DOI: 10.1016/j.jco.2024.101869
Peter Kritzer
We give an overview of certain aspects of tractability analysis of multivariate problems. This paper is not intended to give a complete account of the subject, but provides an insight into how the theory works for particular types of problems. We mainly focus on linear problems on Hilbert spaces, and mostly allow arbitrary linear information. In such cases, tractability analysis is closely linked to an analysis of the singular values of the operator under consideration. We also highlight the more recent developments regarding exponential and generalized tractability. The theoretical results are illustrated by several examples throughout the article.
{"title":"Selected aspects of tractability analysis","authors":"Peter Kritzer","doi":"10.1016/j.jco.2024.101869","DOIUrl":"https://doi.org/10.1016/j.jco.2024.101869","url":null,"abstract":"<div><p>We give an overview of certain aspects of tractability analysis of multivariate problems. This paper is not intended to give a complete account of the subject, but provides an insight into how the theory works for particular types of problems. We mainly focus on linear problems on Hilbert spaces, and mostly allow arbitrary linear information. In such cases, tractability analysis is closely linked to an analysis of the singular values of the operator under consideration. We also highlight the more recent developments regarding exponential and generalized tractability. The theoretical results are illustrated by several examples throughout the article.</p></div>","PeriodicalId":50227,"journal":{"name":"Journal of Complexity","volume":"84 ","pages":"Article 101869"},"PeriodicalIF":1.7,"publicationDate":"2024-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141250103","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-13DOI: 10.1016/j.jco.2024.101867
Peter Mathé , Bernd Hofmann
We introduce a notion of tractability for ill-posed operator equations in Hilbert space. For such operator equations the asymptotics of the best possible rate of reconstruction in terms of the underlying noise level is known in many cases. However, the relevant question is, which level of discretization, again driven by the noise level, is required in order to achieve this best possible accuracy. The proposed concept adapts the one from Information-based Complexity. Several examples indicate the relevance of this concept in the light of the curse of dimensionality.
{"title":"Tractability of linear ill-posed problems in Hilbert space","authors":"Peter Mathé , Bernd Hofmann","doi":"10.1016/j.jco.2024.101867","DOIUrl":"10.1016/j.jco.2024.101867","url":null,"abstract":"<div><p>We introduce a notion of tractability for ill-posed operator equations in Hilbert space. For such operator equations the asymptotics of the best possible rate of reconstruction in terms of the underlying noise level is known in many cases. However, the relevant question is, which level of discretization, again driven by the noise level, is required in order to achieve this best possible accuracy. The proposed concept adapts the one from Information-based Complexity. Several examples indicate the relevance of this concept in the light of the curse of dimensionality.</p></div>","PeriodicalId":50227,"journal":{"name":"Journal of Complexity","volume":"84 ","pages":"Article 101867"},"PeriodicalIF":1.7,"publicationDate":"2024-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141061797","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-05-06DOI: 10.1016/j.jco.2024.101866
Artūras Dubickas
<div><p>In this paper, we consider the problem of finding how close two sums of <em>m</em>th roots can be to each other. For integers <span><math><mi>m</mi><mo>≥</mo><mn>2</mn></math></span>, <span><math><mi>k</mi><mo>≥</mo><mn>1</mn></math></span> and <span><math><mn>0</mn><mo>≤</mo><mi>s</mi><mo>≤</mo><mi>k</mi></math></span>, let <span><math><msub><mrow><mi>e</mi></mrow><mrow><mi>m</mi></mrow></msub><mo>(</mo><mi>s</mi><mo>,</mo><mi>k</mi><mo>)</mo><mo>></mo><mn>0</mn></math></span> and <span><math><msub><mrow><mi>E</mi></mrow><mrow><mi>m</mi></mrow></msub><mo>(</mo><mi>s</mi><mo>,</mo><mi>k</mi><mo>)</mo><mo>></mo><mn>0</mn></math></span> be the largest exponents such that for infinitely many integers <em>N</em> there exist <em>k</em> positive integers <span><math><msub><mrow><mi>a</mi></mrow><mrow><mn>1</mn></mrow></msub><mo>,</mo><mo>…</mo><mo>,</mo><msub><mrow><mi>a</mi></mrow><mrow><mi>k</mi></mrow></msub><mo>≤</mo><mi>N</mi></math></span> for which two sums of their <em>m</em>th roots <span><math><msubsup><mrow><mo>∑</mo></mrow><mrow><mi>j</mi><mo>=</mo><mn>1</mn></mrow><mrow><mi>s</mi></mrow></msubsup><mroot><mrow><msub><mrow><mi>a</mi></mrow><mrow><mi>j</mi></mrow></msub></mrow><mrow><mi>m</mi></mrow></mroot></math></span> and <span><math><msubsup><mrow><mo>∑</mo></mrow><mrow><mi>j</mi><mo>=</mo><mi>s</mi><mo>+</mo><mn>1</mn></mrow><mrow><mi>k</mi></mrow></msubsup><mroot><mrow><msub><mrow><mi>a</mi></mrow><mrow><mi>j</mi></mrow></msub></mrow><mrow><mi>m</mi></mrow></mroot></math></span> are distinct but not further than <span><math><msup><mrow><mi>N</mi></mrow><mrow><mo>−</mo><msub><mrow><mi>e</mi></mrow><mrow><mi>m</mi></mrow></msub><mo>(</mo><mi>s</mi><mo>,</mo><mi>k</mi><mo>)</mo></mrow></msup></math></span> from each other, or they are distinct modulo 1 but not further than <span><math><msup><mrow><mi>N</mi></mrow><mrow><mo>−</mo><msub><mrow><mi>E</mi></mrow><mrow><mi>m</mi></mrow></msub><mo>(</mo><mi>s</mi><mo>,</mo><mi>k</mi><mo>)</mo></mrow></msup></math></span> from each other modulo 1. Some upper bounds on <span><math><msub><mrow><mi>e</mi></mrow><mrow><mi>m</mi></mrow></msub><mo>(</mo><mi>s</mi><mo>,</mo><mi>k</mi><mo>)</mo></math></span> and <span><math><msub><mrow><mi>E</mi></mrow><mrow><mi>m</mi></mrow></msub><mo>(</mo><mi>s</mi><mo>,</mo><mi>k</mi><mo>)</mo></math></span> can be derived by a Liouville-type argument, while lower bounds are usually difficult to obtain. We prove that <span><math><msub><mrow><mi>e</mi></mrow><mrow><mi>m</mi></mrow></msub><mo>(</mo><mi>s</mi><mo>,</mo><mi>k</mi><mo>)</mo><mo>≥</mo><mi>min</mi><mo></mo><mo>(</mo><mn>2</mn><mi>s</mi><mo>,</mo><mi>k</mi><mo>−</mo><mn>1</mn><mo>,</mo><mn>2</mn><mi>k</mi><mo>−</mo><mn>2</mn><mi>s</mi><mo>)</mo><mo>−</mo><mn>1</mn><mo>/</mo><mi>m</mi></math></span> for <span><math><mn>1</mn><mo>≤</mo><mi>s</mi><mo><</mo><mi>k</mi></math></span> and that <span><math><msub><mrow><mi>E</mi></mrow><mrow><mi>m</mi></mrow></msub><mo>(</mo><mi>s</mi><mo>,</mo><mi
{"title":"Approximate equality for two sums of roots","authors":"Artūras Dubickas","doi":"10.1016/j.jco.2024.101866","DOIUrl":"https://doi.org/10.1016/j.jco.2024.101866","url":null,"abstract":"<div><p>In this paper, we consider the problem of finding how close two sums of <em>m</em>th roots can be to each other. For integers <span><math><mi>m</mi><mo>≥</mo><mn>2</mn></math></span>, <span><math><mi>k</mi><mo>≥</mo><mn>1</mn></math></span> and <span><math><mn>0</mn><mo>≤</mo><mi>s</mi><mo>≤</mo><mi>k</mi></math></span>, let <span><math><msub><mrow><mi>e</mi></mrow><mrow><mi>m</mi></mrow></msub><mo>(</mo><mi>s</mi><mo>,</mo><mi>k</mi><mo>)</mo><mo>></mo><mn>0</mn></math></span> and <span><math><msub><mrow><mi>E</mi></mrow><mrow><mi>m</mi></mrow></msub><mo>(</mo><mi>s</mi><mo>,</mo><mi>k</mi><mo>)</mo><mo>></mo><mn>0</mn></math></span> be the largest exponents such that for infinitely many integers <em>N</em> there exist <em>k</em> positive integers <span><math><msub><mrow><mi>a</mi></mrow><mrow><mn>1</mn></mrow></msub><mo>,</mo><mo>…</mo><mo>,</mo><msub><mrow><mi>a</mi></mrow><mrow><mi>k</mi></mrow></msub><mo>≤</mo><mi>N</mi></math></span> for which two sums of their <em>m</em>th roots <span><math><msubsup><mrow><mo>∑</mo></mrow><mrow><mi>j</mi><mo>=</mo><mn>1</mn></mrow><mrow><mi>s</mi></mrow></msubsup><mroot><mrow><msub><mrow><mi>a</mi></mrow><mrow><mi>j</mi></mrow></msub></mrow><mrow><mi>m</mi></mrow></mroot></math></span> and <span><math><msubsup><mrow><mo>∑</mo></mrow><mrow><mi>j</mi><mo>=</mo><mi>s</mi><mo>+</mo><mn>1</mn></mrow><mrow><mi>k</mi></mrow></msubsup><mroot><mrow><msub><mrow><mi>a</mi></mrow><mrow><mi>j</mi></mrow></msub></mrow><mrow><mi>m</mi></mrow></mroot></math></span> are distinct but not further than <span><math><msup><mrow><mi>N</mi></mrow><mrow><mo>−</mo><msub><mrow><mi>e</mi></mrow><mrow><mi>m</mi></mrow></msub><mo>(</mo><mi>s</mi><mo>,</mo><mi>k</mi><mo>)</mo></mrow></msup></math></span> from each other, or they are distinct modulo 1 but not further than <span><math><msup><mrow><mi>N</mi></mrow><mrow><mo>−</mo><msub><mrow><mi>E</mi></mrow><mrow><mi>m</mi></mrow></msub><mo>(</mo><mi>s</mi><mo>,</mo><mi>k</mi><mo>)</mo></mrow></msup></math></span> from each other modulo 1. Some upper bounds on <span><math><msub><mrow><mi>e</mi></mrow><mrow><mi>m</mi></mrow></msub><mo>(</mo><mi>s</mi><mo>,</mo><mi>k</mi><mo>)</mo></math></span> and <span><math><msub><mrow><mi>E</mi></mrow><mrow><mi>m</mi></mrow></msub><mo>(</mo><mi>s</mi><mo>,</mo><mi>k</mi><mo>)</mo></math></span> can be derived by a Liouville-type argument, while lower bounds are usually difficult to obtain. We prove that <span><math><msub><mrow><mi>e</mi></mrow><mrow><mi>m</mi></mrow></msub><mo>(</mo><mi>s</mi><mo>,</mo><mi>k</mi><mo>)</mo><mo>≥</mo><mi>min</mi><mo></mo><mo>(</mo><mn>2</mn><mi>s</mi><mo>,</mo><mi>k</mi><mo>−</mo><mn>1</mn><mo>,</mo><mn>2</mn><mi>k</mi><mo>−</mo><mn>2</mn><mi>s</mi><mo>)</mo><mo>−</mo><mn>1</mn><mo>/</mo><mi>m</mi></math></span> for <span><math><mn>1</mn><mo>≤</mo><mi>s</mi><mo><</mo><mi>k</mi></math></span> and that <span><math><msub><mrow><mi>E</mi></mrow><mrow><mi>m</mi></mrow></msub><mo>(</mo><mi>s</mi><mo>,</mo><mi","PeriodicalId":50227,"journal":{"name":"Journal of Complexity","volume":"84 ","pages":"Article 101866"},"PeriodicalIF":1.7,"publicationDate":"2024-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140905544","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-04-30DOI: 10.1016/j.jco.2024.101858
Cheng Wang , Jun Fan
Functional data analysis offers a set of statistical methods concerned with extracting insights from intrinsically infinite-dimensional data and has attracted considerable amount of attentions in the past few decades. In this paper, we study robust functional linear regression model with a scalar response and a functional predictor in the framework of reproducing kernel Hilbert spaces. A gradient descent algorithm with early stopping is introduced to solve the corresponding empirical risk minimization problem associated with robust loss functions. By appropriately selecting the early stopping rule and the scaling parameter of the robust losses, the convergence of the proposed algorithm is established when the response variable is bounded or satisfies a moment condition. Explicit learning rates with respect to both estimation and prediction error are provided in terms of regularity of the regression function and eigenvalue decay rate of the integral operator induced by the reproducing kernel and covariance function.
{"title":"On the convergence of gradient descent for robust functional linear regression","authors":"Cheng Wang , Jun Fan","doi":"10.1016/j.jco.2024.101858","DOIUrl":"https://doi.org/10.1016/j.jco.2024.101858","url":null,"abstract":"<div><p>Functional data analysis offers a set of statistical methods concerned with extracting insights from intrinsically infinite-dimensional data and has attracted considerable amount of attentions in the past few decades. In this paper, we study robust functional linear regression model with a scalar response and a functional predictor in the framework of reproducing kernel Hilbert spaces. A gradient descent algorithm with early stopping is introduced to solve the corresponding empirical risk minimization problem associated with robust loss functions. By appropriately selecting the early stopping rule and the scaling parameter of the robust losses, the convergence of the proposed algorithm is established when the response variable is bounded or satisfies a moment condition. Explicit learning rates with respect to both estimation and prediction error are provided in terms of regularity of the regression function and eigenvalue decay rate of the integral operator induced by the reproducing kernel and covariance function.</p></div>","PeriodicalId":50227,"journal":{"name":"Journal of Complexity","volume":"84 ","pages":"Article 101858"},"PeriodicalIF":1.7,"publicationDate":"2024-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140823062","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}