{"title":"Proximal Policy Optimization With Adaptive Weight Adjustment for Airborne Integrated Sensing and Communication","authors":"Shihang Zhang;Chao Sun;Shuyan Hu;Wei Ni;Xin Wang","doi":"10.1109/LWC.2025.3553316","DOIUrl":null,"url":null,"abstract":"Integrated sensing and communication (ISAC) technology has emerged as a potential enabler for efficient data transmission and sensing in autonomous aerial vehicle (UAV) networks. In this letter, we investigate a UAV-ISAC system, where a UAV furnishes downlink communication and simultaneous sensing for mobile ground nodes (GNs). To address this non-convex optimization problem and prevent retraining required due to dynamic environments, we design a new proximal policy optimization (PPO)-based approach that jointly optimizes the UAV trajectory and resource assignment while guaranteeing the required throughput, sensing fairness (SF) and age of information (AoI) among GNs. We integrate a new adaptive reward weight adjustment (ARWA) mechanism into the PPO framework to guide the UAV operations dynamically. Simulations corroborate that the proposed ARWA-PPO scheme can ameliorate the system throughput by 12% and increase the minimum GN throughput by 19%, compared to the existing baselines, including PPO.","PeriodicalId":13343,"journal":{"name":"IEEE Wireless Communications Letters","volume":"14 6","pages":"1703-1707"},"PeriodicalIF":5.5000,"publicationDate":"2025-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Wireless Communications Letters","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10935637/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Integrated sensing and communication (ISAC) technology has emerged as a potential enabler for efficient data transmission and sensing in autonomous aerial vehicle (UAV) networks. In this letter, we investigate a UAV-ISAC system, where a UAV furnishes downlink communication and simultaneous sensing for mobile ground nodes (GNs). To address this non-convex optimization problem and prevent retraining required due to dynamic environments, we design a new proximal policy optimization (PPO)-based approach that jointly optimizes the UAV trajectory and resource assignment while guaranteeing the required throughput, sensing fairness (SF) and age of information (AoI) among GNs. We integrate a new adaptive reward weight adjustment (ARWA) mechanism into the PPO framework to guide the UAV operations dynamically. Simulations corroborate that the proposed ARWA-PPO scheme can ameliorate the system throughput by 12% and increase the minimum GN throughput by 19%, compared to the existing baselines, including PPO.
期刊介绍:
IEEE Wireless Communications Letters publishes short papers in a rapid publication cycle on advances in the state-of-the-art of wireless communications. Both theoretical contributions (including new techniques, concepts, and analyses) and practical contributions (including system experiments and prototypes, and new applications) are encouraged. This journal focuses on the physical layer and the link layer of wireless communication systems.