Convergence guarantees for forward gradient descent in the linear regression model

Journal of Statistical Planning and Inference (IF 0.8; JCR Q3, Statistics & Probability; Region 4, Mathematics). Pub Date: 2024-04-06. DOI: 10.1016/j.jspi.2024.106174

Thijs Bos, Johannes Schmidt-Hieber

Volume 233, Article 106174. Article page: https://www.sciencedirect.com/science/article/pii/S0378375824000314


Abstract

Renewed interest in the relationship between artificial and biological neural networks motivates the study of gradient-free methods. Considering the linear regression model with random design, we theoretically analyze in this work the biologically motivated (weight-perturbed) forward gradient scheme that is based on random linear combination of the gradient. If $d$ denotes the number of parameters and $k$ the number of samples, we prove that the mean squared error of this method converges for $k \gtrsim d^{2} \log(d)$ with rate $d^{2} \log(d) / k$. Compared to the dimension dependence $d$ for stochastic gradient descent, an additional factor $d \log(d)$ occurs.
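To make the scheme in the abstract concrete, the following is a minimal sketch of weight-perturbed forward gradient descent for linear regression with random design. It is an illustration under stated assumptions, not the authors' implementation: the function name `forward_gradient_descent`, the zero initialization, the standard Gaussian design, the noise level, and the constant step size of order $1/d^{2}$ are all choices made here for the example and are not taken from the paper. Each step replaces the gradient $g$ of the single-sample squared loss by $(g^{\top}\xi)\,\xi$ with $\xi \sim N(0, I_d)$, which is a random linear combination of the gradient entries and equals $g$ in expectation.

```python
import numpy as np


def forward_gradient_descent(X, y, step_size, rng):
    """One pass of forward gradient descent over the samples (X[i], y[i]).

    Per-sample loss: L_i(theta) = 0.5 * (y[i] - X[i] @ theta) ** 2.
    Its gradient g is never formed; only the directional derivative
    g @ xi along a random direction xi ~ N(0, I_d) is used.
    """
    k, d = X.shape
    theta = np.zeros(d)  # assumption: start at zero
    for i in range(k):
        xi = rng.standard_normal(d)             # random perturbation direction
        residual = y[i] - X[i] @ theta          # prediction error on sample i
        directional = -residual * (X[i] @ xi)   # equals g @ xi for the squared loss
        theta = theta - step_size * directional * xi
    return theta


# Illustrative setup (all values are assumptions for this example).
rng = np.random.default_rng(0)
d, k, noise_std = 5, 5000, 0.1
theta_star = np.ones(d)                          # hypothetical true parameter
X = rng.standard_normal((k, d))                  # random design
y = X @ theta_star + noise_std * rng.standard_normal(k)

step_size = 0.5 / (d + 2) ** 2                   # constant step of order 1/d^2
theta_hat = forward_gradient_descent(X, y, step_size, rng)
print("squared error:", np.sum((theta_hat - theta_star) ** 2))
```

The step size of order $1/d^{2}$, compared with order $1/d$ for plain stochastic gradient descent under the same design, reflects the extra variance introduced by the random direction $\xi$. It is a heuristic choice for this example and should not be read as the learning-rate schedule analyzed in the paper.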




