{"title":"QSFL: Two-Level Communication-Efficient Federated Learning on Mobile Edge Devices","authors":"Liping Yi;Gang Wang;Xiaofei Wang;Xiaoguang Liu","doi":"10.1109/TSC.2024.3455098","DOIUrl":null,"url":null,"abstract":"In cross-device horizontal federated learning (FL), the communication cost of transmitting complete models between edge devices and a central server is a significant bottleneck, due to expensive, unreliable, and low-bandwidth wireless connections. As a solution, we propose a novel \n<underline>FL</u>\n framework named \n<italic>QSFL</i>\n, towards \n<italic>optimizing FL uplink (client-to-server) communication at both client and model levels</i>\n. At the client level, we design a \n<italic>Qualification Judgment (<u>Q</u>J)</i>\n algorithm to sample high-qualification clients to upload models. At the model level, we design a \n<italic>Sparse Cyclic Sliding Segmentation (<u>S</u>CSS)</i>\n algorithm to further compress the local model transmitted from the client to the server in the uplink communication. We prove that QSFL can converge over wall-to-wall time, and develop an optimal hyperparameter searching algorithm based on theoretical analysis to enable QSFL to make the best trade-off between model accuracy and communication cost. Experimental results show that QSFL achieves state-of-the-art compression ratios with marginal model accuracy degradation. Since mobile edge devices as FL clients often have heterogeneous system resources, such as communication bandwidth, we propose two novel \n<italic>dynamic segmentation strategies with varied counts or sizes</i>\n based on QSFL to enhance the robustness of QSFL to FL system heterogeneity. For some mobile edge devices joining as FL clients with both limited uplink and downlink communication bandwidths, they can not pull up the global model from the server. To tackle it, we propose a novel \n<italic>symmetric downlink compression</i>\n scheme on top of QSFL to further reduce the downlink (server-to-client) communication costs, hence enabling a bidirectional communication-efficient FL. Theory analysis and experiments demonstrate that QSFL with dynamic segmentation or symmetric downlink compression still keeps convergence and takes a better trade-off between model accuracy and communication efficiency than without them.","PeriodicalId":13255,"journal":{"name":"IEEE Transactions on Services Computing","volume":"17 6","pages":"4166-4182"},"PeriodicalIF":5.8000,"publicationDate":"2024-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Services Computing","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10666282/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
In cross-device horizontal federated learning (FL), the communication cost of transmitting complete models between edge devices and a central server is a significant bottleneck, due to expensive, unreliable, and low-bandwidth wireless connections. As a solution, we propose a novel
FL
framework named
QSFL
, towards
optimizing FL uplink (client-to-server) communication at both client and model levels
. At the client level, we design a
Qualification Judgment (QJ)
algorithm to sample high-qualification clients to upload models. At the model level, we design a
Sparse Cyclic Sliding Segmentation (SCSS)
algorithm to further compress the local model transmitted from the client to the server in the uplink communication. We prove that QSFL can converge over wall-to-wall time, and develop an optimal hyperparameter searching algorithm based on theoretical analysis to enable QSFL to make the best trade-off between model accuracy and communication cost. Experimental results show that QSFL achieves state-of-the-art compression ratios with marginal model accuracy degradation. Since mobile edge devices as FL clients often have heterogeneous system resources, such as communication bandwidth, we propose two novel
dynamic segmentation strategies with varied counts or sizes
based on QSFL to enhance the robustness of QSFL to FL system heterogeneity. For some mobile edge devices joining as FL clients with both limited uplink and downlink communication bandwidths, they can not pull up the global model from the server. To tackle it, we propose a novel
symmetric downlink compression
scheme on top of QSFL to further reduce the downlink (server-to-client) communication costs, hence enabling a bidirectional communication-efficient FL. Theory analysis and experiments demonstrate that QSFL with dynamic segmentation or symmetric downlink compression still keeps convergence and takes a better trade-off between model accuracy and communication efficiency than without them.
期刊介绍:
IEEE Transactions on Services Computing encompasses the computing and software aspects of the science and technology of services innovation research and development. It places emphasis on algorithmic, mathematical, statistical, and computational methods central to services computing. Topics covered include Service Oriented Architecture, Web Services, Business Process Integration, Solution Performance Management, and Services Operations and Management. The transactions address mathematical foundations, security, privacy, agreement, contract, discovery, negotiation, collaboration, and quality of service for web services. It also covers areas like composite web service creation, business and scientific applications, standards, utility models, business process modeling, integration, collaboration, and more in the realm of Services Computing.