分散联合学习的网络梯度下降算法

IF 2.9 2区数学 Q1 ECONOMICS Journal of Business & Economic Statistics Pub Date : 2022-05-06 DOI:10.1080/07350015.2022.2074426

Shuyuan Wu, Danyang Huang, Hansheng Wang

{"title":"分散联合学习的网络梯度下降算法","authors":"Shuyuan Wu, Danyang Huang, Hansheng Wang","doi":"10.1080/07350015.2022.2074426","DOIUrl":null,"url":null,"abstract":"Abstract We study a fully decentralized federated learning algorithm, which is a novel gradient descent algorithm executed on a communication-based network. For convenience, we refer to it as a network gradient descent (NGD) method. In the NGD method, only statistics (e.g., parameter estimates) need to be communicated, minimizing the risk of privacy. Meanwhile, different clients communicate with each other directly according to a carefully designed network structure without a central master. This greatly enhances the reliability of the entire algorithm. Those nice properties inspire us to carefully study the NGD method both theoretically and numerically. Theoretically, we start with a classical linear regression model. We find that both the learning rate and the network structure play significant roles in determining the NGD estimator’s statistical efficiency. The resulting NGD estimator can be statistically as efficient as the global estimator, if the learning rate is sufficiently small and the network structure is weakly balanced, even if the data are distributed heterogeneously. Those interesting findings are then extended to general models and loss functions. Extensive numerical studies are presented to corroborate our theoretical findings. Classical deep learning models are also presented for illustration purpose.","PeriodicalId":50247,"journal":{"name":"Journal of Business & Economic Statistics","volume":"41 1","pages":"806 - 818"},"PeriodicalIF":2.9000,"publicationDate":"2022-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Network Gradient Descent Algorithm for Decentralized Federated Learning\",\"authors\":\"Shuyuan Wu, Danyang Huang, Hansheng Wang\",\"doi\":\"10.1080/07350015.2022.2074426\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Abstract We study a fully decentralized federated learning algorithm, which is a novel gradient descent algorithm executed on a communication-based network. For convenience, we refer to it as a network gradient descent (NGD) method. In the NGD method, only statistics (e.g., parameter estimates) need to be communicated, minimizing the risk of privacy. Meanwhile, different clients communicate with each other directly according to a carefully designed network structure without a central master. This greatly enhances the reliability of the entire algorithm. Those nice properties inspire us to carefully study the NGD method both theoretically and numerically. Theoretically, we start with a classical linear regression model. We find that both the learning rate and the network structure play significant roles in determining the NGD estimator’s statistical efficiency. The resulting NGD estimator can be statistically as efficient as the global estimator, if the learning rate is sufficiently small and the network structure is weakly balanced, even if the data are distributed heterogeneously. Those interesting findings are then extended to general models and loss functions. Extensive numerical studies are presented to corroborate our theoretical findings. Classical deep learning models are also presented for illustration purpose.\",\"PeriodicalId\":50247,\"journal\":{\"name\":\"Journal of Business & Economic Statistics\",\"volume\":\"41 1\",\"pages\":\"806 - 818\"},\"PeriodicalIF\":2.9000,\"publicationDate\":\"2022-05-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Business & Economic Statistics\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://doi.org/10.1080/07350015.2022.2074426\",\"RegionNum\":2,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ECONOMICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Business & Economic Statistics","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1080/07350015.2022.2074426","RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECONOMICS","Score":null,"Total":0}

引用次数: 3

摘要

摘要我们研究了一种完全分散的联邦学习算法，这是一种在基于通信的网络上执行的新型梯度下降算法。为了方便起见，我们将其称为网络梯度下降（NGD）方法。在NGD方法中，只需要传达统计数据（例如参数估计），从而将隐私风险降至最低。同时，不同的客户端根据精心设计的网络结构直接相互通信，而无需中央主机。这大大提高了整个算法的可靠性。这些良好的性质激励我们从理论和数值上仔细研究NGD方法。从理论上讲，我们从一个经典的线性回归模型开始。我们发现，学习率和网络结构在决定NGD估计器的统计效率方面都起着重要作用。如果学习率足够小并且网络结构弱平衡，即使数据分布不均匀，所得到的NGD估计器在统计上也可以与全局估计器一样有效。然后将这些有趣的发现推广到一般模型和损失函数中。大量的数值研究证实了我们的理论发现。为了便于说明，还提出了经典的深度学习模型。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Network Gradient Descent Algorithm for Decentralized Federated Learning

Abstract We study a fully decentralized federated learning algorithm, which is a novel gradient descent algorithm executed on a communication-based network. For convenience, we refer to it as a network gradient descent (NGD) method. In the NGD method, only statistics (e.g., parameter estimates) need to be communicated, minimizing the risk of privacy. Meanwhile, different clients communicate with each other directly according to a carefully designed network structure without a central master. This greatly enhances the reliability of the entire algorithm. Those nice properties inspire us to carefully study the NGD method both theoretically and numerically. Theoretically, we start with a classical linear regression model. We find that both the learning rate and the network structure play significant roles in determining the NGD estimator’s statistical efficiency. The resulting NGD estimator can be statistically as efficient as the global estimator, if the learning rate is sufficiently small and the network structure is weakly balanced, even if the data are distributed heterogeneously. Those interesting findings are then extended to general models and loss functions. Extensive numerical studies are presented to corroborate our theoretical findings. Classical deep learning models are also presented for illustration purpose.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Journal of Business & Economic Statistics 数学-统计学与概率论

CiteScore

5.00

自引率

6.70%

发文量

审稿时长

>12 weeks

期刊介绍： The Journal of Business and Economic Statistics (JBES) publishes a range of articles, primarily applied statistical analyses of microeconomic, macroeconomic, forecasting, business, and finance related topics. More general papers in statistics, econometrics, computation, simulation, or graphics are also appropriate if they are immediately applicable to the journal''s general topics of interest. Articles published in JBES contain significant results, high-quality methodological content, excellent exposition, and usually include a substantive empirical application.

期刊最新文献

A Ridge-Regularized Jackknifed Anderson-Rubin Test. Efficient and Robust Estimation of the Generalized LATE Model Modeling and Forecasting Macroeconomic Downside Risk* Causal inference under outcome-based sampling with monotonicity assumptions Functional-Coefficient Quantile Regression for Panel Data with Latent Group Structure