{"title":"支持swift的物联网网络中无线联合学习的深度强化学习","authors":"Xinran Zhang, Hui Tian, Wanli Ni, Mengying Sun","doi":"10.1109/VTC2022-Fall57202.2022.10012702","DOIUrl":null,"url":null,"abstract":"As a distributed machine learning paradigm, federated learning (FL) has been regarded as a promising candidate to preserve user privacy in Internet of Things (IoT) networks. Leveraging the waveform superposition property of wireless channels, over-the-air FL (AirFL) achieves fast model aggregation by integrating communication and computation via concurrent analog transmissions. To support sustainable AirFL among energy-constrained IoT devices, we consider that the base station (BS) adopts simultaneous wireless information and power transfer (SWIPT) to distribute global model and charge local devices in each communication round. To maximize the long-term energy efficiency (EE) of AirFL, we investigate a resource allocation problem by jointly optimizing the time division, transceiver beamforming, and power splitting in SWIPT-enabled IoT networks. Considering such multiple closely-coupled continuous valuables, we propose a deep reinforcement learning (DRL) algorithm based on twin delayed deep deterministic (TD3) policy to smartly make downlink and uplink communication strategies with the coordination between the BS and devices. Simulation results show that the proposed TD3 algorithm obtains about 41% EE improvement compared to traditional optimization method and other DRL algorithms.","PeriodicalId":326047,"journal":{"name":"2022 IEEE 96th Vehicular Technology Conference (VTC2022-Fall)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Deep Reinforcement Learning for Over-the-Air Federated Learning in SWIPT-Enabled IoT Networks\",\"authors\":\"Xinran Zhang, Hui Tian, Wanli Ni, Mengying Sun\",\"doi\":\"10.1109/VTC2022-Fall57202.2022.10012702\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"As a distributed machine learning paradigm, federated learning (FL) has been regarded as a promising candidate to preserve user privacy in Internet of Things (IoT) networks. Leveraging the waveform superposition property of wireless channels, over-the-air FL (AirFL) achieves fast model aggregation by integrating communication and computation via concurrent analog transmissions. To support sustainable AirFL among energy-constrained IoT devices, we consider that the base station (BS) adopts simultaneous wireless information and power transfer (SWIPT) to distribute global model and charge local devices in each communication round. To maximize the long-term energy efficiency (EE) of AirFL, we investigate a resource allocation problem by jointly optimizing the time division, transceiver beamforming, and power splitting in SWIPT-enabled IoT networks. Considering such multiple closely-coupled continuous valuables, we propose a deep reinforcement learning (DRL) algorithm based on twin delayed deep deterministic (TD3) policy to smartly make downlink and uplink communication strategies with the coordination between the BS and devices. Simulation results show that the proposed TD3 algorithm obtains about 41% EE improvement compared to traditional optimization method and other DRL algorithms.\",\"PeriodicalId\":326047,\"journal\":{\"name\":\"2022 IEEE 96th Vehicular Technology Conference (VTC2022-Fall)\",\"volume\":\"26 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE 96th Vehicular Technology Conference (VTC2022-Fall)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/VTC2022-Fall57202.2022.10012702\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 96th Vehicular Technology Conference (VTC2022-Fall)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/VTC2022-Fall57202.2022.10012702","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Deep Reinforcement Learning for Over-the-Air Federated Learning in SWIPT-Enabled IoT Networks
As a distributed machine learning paradigm, federated learning (FL) has been regarded as a promising candidate to preserve user privacy in Internet of Things (IoT) networks. Leveraging the waveform superposition property of wireless channels, over-the-air FL (AirFL) achieves fast model aggregation by integrating communication and computation via concurrent analog transmissions. To support sustainable AirFL among energy-constrained IoT devices, we consider that the base station (BS) adopts simultaneous wireless information and power transfer (SWIPT) to distribute global model and charge local devices in each communication round. To maximize the long-term energy efficiency (EE) of AirFL, we investigate a resource allocation problem by jointly optimizing the time division, transceiver beamforming, and power splitting in SWIPT-enabled IoT networks. Considering such multiple closely-coupled continuous valuables, we propose a deep reinforcement learning (DRL) algorithm based on twin delayed deep deterministic (TD3) policy to smartly make downlink and uplink communication strategies with the coordination between the BS and devices. Simulation results show that the proposed TD3 algorithm obtains about 41% EE improvement compared to traditional optimization method and other DRL algorithms.