{"title":"在异构客户端异步联合学习中实现线性加速","authors":"Xiaolu Wang;Zijian Li;Shi Jin;Jun Zhang","doi":"10.1109/TMC.2024.3461852","DOIUrl":null,"url":null,"abstract":"Federated learning (FL) is an emerging distributed training paradigm that aims to learn a common global model without exchanging or transferring the data that are stored locally at different clients. The Federated Averaging (FedAvg)-based algorithms have gained substantial popularity in FL to reduce the communication overhead, where each client conducts multiple localized iterations before communicating with a central server. In this paper, we focus on FL where the clients have diverse computation and/or communication capabilities. Under this circumstance, FedAvg can be less efficient since it requires all clients that participate in the global aggregation in a round to initiate iterations from the \n<italic>latest</i>\n global model, and thus the synchronization among fast clients and \n<italic>straggler clients</i>\n can severely slow down the overall training process. To address this issue, we propose an efficient asynchronous federated learning (AFL) framework called \n<italic>Delayed Federated Averaging (DeFedAvg)</i>\n. In DeFedAvg, the clients are allowed to perform local training with different stale global models at their own paces. Theoretical analyses demonstrate that DeFedAvg achieves asymptotic convergence rates that are on par with the results of FedAvg for solving nonconvex problems. More importantly, DeFedAvg is the first AFL algorithm that provably achieves the desirable \n<italic>linear speedup</i>\n property, which indicates its high scalability. Additionally, we carry out extensive numerical experiments using real datasets to validate the efficiency and scalability of our approach when training deep neural networks.","PeriodicalId":50389,"journal":{"name":"IEEE Transactions on Mobile Computing","volume":"24 1","pages":"435-448"},"PeriodicalIF":7.7000,"publicationDate":"2024-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Achieving Linear Speedup in Asynchronous Federated Learning With Heterogeneous Clients\",\"authors\":\"Xiaolu Wang;Zijian Li;Shi Jin;Jun Zhang\",\"doi\":\"10.1109/TMC.2024.3461852\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Federated learning (FL) is an emerging distributed training paradigm that aims to learn a common global model without exchanging or transferring the data that are stored locally at different clients. The Federated Averaging (FedAvg)-based algorithms have gained substantial popularity in FL to reduce the communication overhead, where each client conducts multiple localized iterations before communicating with a central server. In this paper, we focus on FL where the clients have diverse computation and/or communication capabilities. Under this circumstance, FedAvg can be less efficient since it requires all clients that participate in the global aggregation in a round to initiate iterations from the \\n<italic>latest</i>\\n global model, and thus the synchronization among fast clients and \\n<italic>straggler clients</i>\\n can severely slow down the overall training process. To address this issue, we propose an efficient asynchronous federated learning (AFL) framework called \\n<italic>Delayed Federated Averaging (DeFedAvg)</i>\\n. In DeFedAvg, the clients are allowed to perform local training with different stale global models at their own paces. 
Theoretical analyses demonstrate that DeFedAvg achieves asymptotic convergence rates that are on par with the results of FedAvg for solving nonconvex problems. More importantly, DeFedAvg is the first AFL algorithm that provably achieves the desirable \\n<italic>linear speedup</i>\\n property, which indicates its high scalability. Additionally, we carry out extensive numerical experiments using real datasets to validate the efficiency and scalability of our approach when training deep neural networks.\",\"PeriodicalId\":50389,\"journal\":{\"name\":\"IEEE Transactions on Mobile Computing\",\"volume\":\"24 1\",\"pages\":\"435-448\"},\"PeriodicalIF\":7.7000,\"publicationDate\":\"2024-09-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Mobile Computing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10681663/\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Mobile Computing","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10681663/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
Achieving Linear Speedup in Asynchronous Federated Learning With Heterogeneous Clients
Federated learning (FL) is an emerging distributed training paradigm that aims to learn a common global model without exchanging or transferring the data stored locally at different clients. Federated Averaging (FedAvg)-based algorithms have gained substantial popularity in FL for reducing communication overhead: each client performs multiple local iterations before communicating with a central server. In this paper, we focus on FL in which the clients have diverse computation and/or communication capabilities. In this setting, FedAvg can be inefficient, since it requires every client participating in a round's global aggregation to initiate its local iterations from the latest global model; the resulting synchronization between fast clients and straggler clients can severely slow down the overall training process.
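To make the synchronization barrier concrete, here is a minimal synchronous-FedAvg sketch in Python on a toy least-squares objective. The function names, the quadratic loss, and all hyperparameters are illustrative assumptions rather than the paper's experimental setup; the point is only that the server cannot average until the slowest client has finished.

```python
import numpy as np

def local_sgd(model, data, lr=0.01, local_steps=5):
    """Run a few SGD steps on one client's local data (toy least-squares loss)."""
    X, y = data
    w = model.copy()
    for _ in range(local_steps):
        grad = X.T @ (X @ w - y) / len(y)  # gradient of (1/2n)||Xw - y||^2
        w -= lr * grad
    return w

def fedavg_round(global_model, client_datasets):
    """One synchronous FedAvg round: every client starts from the *latest*
    global model, and the server must wait for the slowest client (the
    straggler) before it can average -- that wait is the bottleneck."""
    local_models = [local_sgd(global_model, d) for d in client_datasets]
    return np.mean(local_models, axis=0)

# Toy usage: 8 clients, a 10-dimensional model, 10 communication rounds.
rng = np.random.default_rng(0)
clients = [(rng.standard_normal((50, 10)), rng.standard_normal(50)) for _ in range(8)]
w = np.zeros(10)
for _ in range(10):
    w = fedavg_round(w, clients)
```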
To address this issue, we propose an efficient asynchronous federated learning (AFL) framework called Delayed Federated Averaging (DeFedAvg). In DeFedAvg, clients are allowed to perform local training with different stale global models, each at its own pace. Theoretical analyses demonstrate that DeFedAvg achieves asymptotic convergence rates on par with those of FedAvg for solving nonconvex problems. More importantly, DeFedAvg is the first AFL algorithm that provably achieves the desirable linear speedup property, which indicates its high scalability. Additionally, we carry out extensive numerical experiments on real datasets to validate the efficiency and scalability of our approach when training deep neural networks.
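The abstract does not spell out DeFedAvg's exact update rule, so the following is a speculative, event-driven toy simulation of the delayed-averaging idea under assumed delay distributions (it reuses local_sgd from the sketch above). It illustrates only the mechanism the abstract describes: clients train from possibly stale global snapshots and never block on one another. Here "linear speedup" means, roughly, that the iteration complexity needed to reach a target accuracy improves proportionally with the number of participating clients.

```python
from collections import deque
import numpy as np

def defedavg_sim(client_datasets, rounds=20, max_delay=3, seed=0):
    """Toy simulation of delayed averaging (assumed mechanics, not the
    paper's algorithm): each client starts local training from a possibly
    *stale* snapshot of the global model, and its finished update only
    reaches the server a few rounds later, so fast clients never wait."""
    rng = np.random.default_rng(seed)
    dim = client_datasets[0][0].shape[1]
    snapshots = {0: np.zeros(dim)}   # global model kept per round
    in_flight = deque()              # (arrival_round, local_model) pairs
    for t in range(rounds):
        for data in client_datasets:
            # client pulls a snapshot that may be up to max_delay rounds old
            staleness = int(rng.integers(0, max_delay + 1))
            local_model = local_sgd(snapshots[max(0, t - staleness)], data)
            # its update arrives at the server after a random delay
            arrival = t + int(rng.integers(1, max_delay + 1))
            in_flight.append((arrival, local_model))
        # the server averages whichever delayed updates have arrived by round t
        arrived = [m for (a, m) in in_flight if a <= t]
        in_flight = deque((a, m) for (a, m) in in_flight if a > t)
        snapshots[t + 1] = np.mean(arrived, axis=0) if arrived else snapshots[t]
    return snapshots[rounds]
```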
About the journal:
IEEE Transactions on Mobile Computing addresses key technical issues related to various aspects of mobile computing. This includes (a) architectures, (b) support services, (c) algorithm/protocol design and analysis, (d) mobile environments, (e) mobile communication systems, (f) applications, and (g) emerging technologies. Topics of interest span a wide range, covering aspects like mobile networks and hosts, mobility management, multimedia, operating system support, power management, online and mobile environments, security, scalability, reliability, and emerging technologies such as wearable computers, body area networks, and wireless sensor networks. The journal serves as a comprehensive platform for advancements in mobile computing research.