{"title":"Efficient and Intelligent Multijob Federated Learning in Wireless Networks","authors":"Jiajin Wang;Ne Wang;Ruiting Zhou;Bo Li","doi":"10.1109/JIOT.2024.3502403","DOIUrl":null,"url":null,"abstract":"Federated learning (FL) has emerged as an innovative paradigm designed to protect privacy by enabling collaborative machine learning (ML) model training across multiple data owners (also known as clients) without the need to access clients’ raw data. The majority of existing FL research concentrates on scenarios where a single job necessitates training. In practical applications, multiple FL jobs can simultaneously undergo training using a common pool of clients, a scenario known as multijob FL. However, the problem of FL training with multiple jobs remains open and presents significant challenges of the escalated heterogeneity of jobs and clients, complex tradeoffs between training latency and energy consumption, uncertainty of client quality, and potential linear switching cost associated with client selection. This work aims to jointly optimize training efficiency in terms of latency, energy consumption, and switching cost for multiple jobs in stochastic and dynamic environments. Specifically, we propose a novel multijob FL framework, named <monospace>EffI-FL</monospace>, incorporating three innovative designs: 1) to reduce switching cost, we extend the client selection interval from every round to multiple rounds, called a block, within which client subset switching is prohibited; 2) we employ multiarmed bandit (MAB) methods to measure clients’ latency and energy cost under uncertainty. Additionally, we utilize the virtual queue technique to trace clients’ battery usage patterns. By integrating the above client-side knowledge, we propose an adaptive client selection policy aimed at balancing latency, energy consumption, and battery condition; and 3) given that multiple jobs may compete for the same client, we devise a greedy algorithm to assign each client to a single job. We rigorously prove that the regret of our client selection policy and the cost of our block-wise client subset switching algorithm are both sublinear. Finally, we implement <monospace>EffI-FL</monospace> using PyTorch and conduct experiments demonstrating that <monospace>EffI-FL</monospace> reduces the weighted sum of latency, energy consumption, and switching cost by up to 52.3% compared to four state-of-the-art FL frameworks.","PeriodicalId":54347,"journal":{"name":"IEEE Internet of Things Journal","volume":"12 7","pages":"8685-8698"},"PeriodicalIF":8.9000,"publicationDate":"2024-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Internet of Things Journal","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10757319/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Federated learning (FL) has emerged as an innovative paradigm designed to protect privacy by enabling collaborative machine learning (ML) model training across multiple data owners (also known as clients) without the need to access clients’ raw data. The majority of existing FL research concentrates on scenarios where a single job necessitates training. In practical applications, multiple FL jobs can simultaneously undergo training using a common pool of clients, a scenario known as multijob FL. However, the problem of FL training with multiple jobs remains open and presents significant challenges of the escalated heterogeneity of jobs and clients, complex tradeoffs between training latency and energy consumption, uncertainty of client quality, and potential linear switching cost associated with client selection. This work aims to jointly optimize training efficiency in terms of latency, energy consumption, and switching cost for multiple jobs in stochastic and dynamic environments. Specifically, we propose a novel multijob FL framework, named EffI-FL, incorporating three innovative designs: 1) to reduce switching cost, we extend the client selection interval from every round to multiple rounds, called a block, within which client subset switching is prohibited; 2) we employ multiarmed bandit (MAB) methods to measure clients’ latency and energy cost under uncertainty. Additionally, we utilize the virtual queue technique to trace clients’ battery usage patterns. By integrating the above client-side knowledge, we propose an adaptive client selection policy aimed at balancing latency, energy consumption, and battery condition; and 3) given that multiple jobs may compete for the same client, we devise a greedy algorithm to assign each client to a single job. We rigorously prove that the regret of our client selection policy and the cost of our block-wise client subset switching algorithm are both sublinear. Finally, we implement EffI-FL using PyTorch and conduct experiments demonstrating that EffI-FL reduces the weighted sum of latency, energy consumption, and switching cost by up to 52.3% compared to four state-of-the-art FL frameworks.
期刊介绍:
The EEE Internet of Things (IoT) Journal publishes articles and review articles covering various aspects of IoT, including IoT system architecture, IoT enabling technologies, IoT communication and networking protocols such as network coding, and IoT services and applications. Topics encompass IoT's impacts on sensor technologies, big data management, and future internet design for applications like smart cities and smart homes. Fields of interest include IoT architecture such as things-centric, data-centric, service-oriented IoT architecture; IoT enabling technologies and systematic integration such as sensor technologies, big sensor data management, and future Internet design for IoT; IoT services, applications, and test-beds such as IoT service middleware, IoT application programming interface (API), IoT application design, and IoT trials/experiments; IoT standardization activities and technology development in different standard development organizations (SDO) such as IEEE, IETF, ITU, 3GPP, ETSI, etc.