QSFL: Two-Level Communication-Efficient Federated Learning on Mobile Edge Devices

IF 5.8 2区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS IEEE Transactions on Services Computing Pub Date : 2024-09-05 DOI:10.1109/TSC.2024.3455098

Liping Yi;Gang Wang;Xiaofei Wang;Xiaoguang Liu

{"title":"QSFL: Two-Level Communication-Efficient Federated Learning on Mobile Edge Devices","authors":"Liping Yi;Gang Wang;Xiaofei Wang;Xiaoguang Liu","doi":"10.1109/TSC.2024.3455098","DOIUrl":null,"url":null,"abstract":"In cross-device horizontal federated learning (FL), the communication cost of transmitting complete models between edge devices and a central server is a significant bottleneck, due to expensive, unreliable, and low-bandwidth wireless connections. As a solution, we propose a novel \n<underline>FL\n framework named \n<italic>QSFL\n, towards \n<italic>optimizing FL uplink (client-to-server) communication at both client and model levels\n. At the client level, we design a \n<italic>Qualification Judgment (QJ)\n algorithm to sample high-qualification clients to upload models. At the model level, we design a \n<italic>Sparse Cyclic Sliding Segmentation (SCSS)\n algorithm to further compress the local model transmitted from the client to the server in the uplink communication. We prove that QSFL can converge over wall-to-wall time, and develop an optimal hyperparameter searching algorithm based on theoretical analysis to enable QSFL to make the best trade-off between model accuracy and communication cost. Experimental results show that QSFL achieves state-of-the-art compression ratios with marginal model accuracy degradation. Since mobile edge devices as FL clients often have heterogeneous system resources, such as communication bandwidth, we propose two novel \n<italic>dynamic segmentation strategies with varied counts or sizes\n based on QSFL to enhance the robustness of QSFL to FL system heterogeneity. For some mobile edge devices joining as FL clients with both limited uplink and downlink communication bandwidths, they can not pull up the global model from the server. To tackle it, we propose a novel \n<italic>symmetric downlink compression\n scheme on top of QSFL to further reduce the downlink (server-to-client) communication costs, hence enabling a bidirectional communication-efficient FL. Theory analysis and experiments demonstrate that QSFL with dynamic segmentation or symmetric downlink compression still keeps convergence and takes a better trade-off between model accuracy and communication efficiency than without them.","PeriodicalId":13255,"journal":{"name":"IEEE Transactions on Services Computing","volume":"17 6","pages":"4166-4182"},"PeriodicalIF":5.8000,"publicationDate":"2024-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Services Computing","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10666282/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

In cross-device horizontal federated learning (FL), the communication cost of transmitting complete models between edge devices and a central server is a significant bottleneck, due to expensive, unreliable, and low-bandwidth wireless connections. As a solution, we propose a novel FL framework named QSFL , towards optimizing FL uplink (client-to-server) communication at both client and model levels . At the client level, we design a Qualification Judgment (QJ) algorithm to sample high-qualification clients to upload models. At the model level, we design a Sparse Cyclic Sliding Segmentation (SCSS) algorithm to further compress the local model transmitted from the client to the server in the uplink communication. We prove that QSFL can converge over wall-to-wall time, and develop an optimal hyperparameter searching algorithm based on theoretical analysis to enable QSFL to make the best trade-off between model accuracy and communication cost. Experimental results show that QSFL achieves state-of-the-art compression ratios with marginal model accuracy degradation. Since mobile edge devices as FL clients often have heterogeneous system resources, such as communication bandwidth, we propose two novel dynamic segmentation strategies with varied counts or sizes based on QSFL to enhance the robustness of QSFL to FL system heterogeneity. For some mobile edge devices joining as FL clients with both limited uplink and downlink communication bandwidths, they can not pull up the global model from the server. To tackle it, we propose a novel symmetric downlink compression scheme on top of QSFL to further reduce the downlink (server-to-client) communication costs, hence enabling a bidirectional communication-efficient FL. Theory analysis and experiments demonstrate that QSFL with dynamic segmentation or symmetric downlink compression still keeps convergence and takes a better trade-off between model accuracy and communication efficiency than without them.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

QSFL：移动边缘设备上的两级通信效率联盟学习

在跨设备水平联邦学习（FL）中，由于昂贵、不可靠和低带宽的无线连接，在边缘设备和中心服务器之间传输完整模型的通信成本是一个重要的瓶颈。作为解决方案，我们提出了一个名为QSFL的新型FL框架，以优化客户端和模型级别的FL上行链路（客户端到服务器）通信。在客户端层面，我们设计了一个资格判断（QJ）算法，对高资格的客户端进行抽样来上传模型。在模型层面，我们设计了一种稀疏循环滑动分割（SCSS）算法，在上行通信中进一步压缩从客户端传输到服务器端的本地模型。我们证明了QSFL可以在墙到墙时间内收敛，并在理论分析的基础上开发了一种最优超参数搜索算法，使QSFL能够在模型精度和通信成本之间取得最佳平衡。实验结果表明，在边际模型精度下降的情况下，QSFL获得了最先进的压缩比。由于作为FL客户端的移动边缘设备通常具有异构系统资源（如通信带宽），我们提出了两种基于QSFL的不同计数或大小的动态分割策略，以增强QSFL对FL系统异构的鲁棒性。对于一些作为FL客户端加入的移动边缘设备，由于上行和下行通信带宽都有限，它们无法从服务器拉出全局模型。为了解决这个问题，我们在QSFL的基础上提出了一种新的对称下行链路压缩方案，以进一步降低下行链路（服务器到客户端）的通信成本，从而实现双向通信高效的QSFL。理论分析和实验表明，采用动态分割或对称下行链路压缩的QSFL仍然保持收敛性，并且在模型精度和通信效率之间取得了更好的平衡。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE Transactions on Services Computing COMPUTER SCIENCE, INFORMATION SYSTEMS-COMPUTER SCIENCE, SOFTWARE ENGINEERING

CiteScore

11.50

自引率

6.20%

发文量

278

审稿时长

>12 weeks

期刊介绍： IEEE Transactions on Services Computing encompasses the computing and software aspects of the science and technology of services innovation research and development. It places emphasis on algorithmic, mathematical, statistical, and computational methods central to services computing. Topics covered include Service Oriented Architecture, Web Services, Business Process Integration, Solution Performance Management, and Services Operations and Management. The transactions address mathematical foundations, security, privacy, agreement, contract, discovery, negotiation, collaboration, and quality of service for web services. It also covers areas like composite web service creation, business and scientific applications, standards, utility models, business process modeling, integration, collaboration, and more in the realm of Services Computing.