Joint UAV Trajectory and RadCom Task Schedule for IVNs: A Game-Embedding Multi-Agent Deep Reinforcement Learning Approach

IF 10.7 1区 计算机科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC IEEE Transactions on Wireless Communications Pub Date : 2024-11-08 DOI:10.1109/TWC.2024.3489624
Sike Cheng;Xiangbo Lin;Xuanheng Li;Jingjing Wang
{"title":"Joint UAV Trajectory and RadCom Task Schedule for IVNs: A Game-Embedding Multi-Agent Deep Reinforcement Learning Approach","authors":"Sike Cheng;Xiangbo Lin;Xuanheng Li;Jingjing Wang","doi":"10.1109/TWC.2024.3489624","DOIUrl":null,"url":null,"abstract":"Integrated sensing and communication (ISAC) technology has been envisioned to revolutionize the future intelligent vehicle networks (IVNs). Recently, due to the mobility and flexible deployment, unmanned aerial vehicle (UAV) has been regarded as a promising aerial ISAC platform in future IVNs. In this paper, comprehensively considering all the performance on the throughput, sensing accuracy, and sensing rate, we propose a Multi-Agent joint Trajectory control and RadCom task schedule (MA-TRC) scheme for the ISAC-UAV assisted IVN. To make UAVs achieve the optimal decisions autonomously and adaptively, we propose a Game-Embedding Multi-Agent Deep Reinforcement Learning (GE-MADRL) approach. Specifically, considering the complex action space with both discrete and continuous decision variables of UAVs, we develop a multi-agent Parametrized deep Q-network (MAPDQN) based solution, which can help UAVs learn the dynamic and uncertain environment to adaptively obtain the MA-TRC scheme. Furthermore, since UAVs work in a distributed decision making manner, the potential conflicting decisions will impact the network performance. To avoid the decision conflicts among UAVs during the network parameter training, a distributed two-stage Game method is designed as an action adjuster embedded in MAPDQN, by which the learning convergence performance will be further improved and the strategy conflicts can be avoided.","PeriodicalId":13431,"journal":{"name":"IEEE Transactions on Wireless Communications","volume":"24 1","pages":"181-196"},"PeriodicalIF":10.7000,"publicationDate":"2024-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Wireless Communications","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10747827/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

Integrated sensing and communication (ISAC) technology has been envisioned to revolutionize the future intelligent vehicle networks (IVNs). Recently, due to the mobility and flexible deployment, unmanned aerial vehicle (UAV) has been regarded as a promising aerial ISAC platform in future IVNs. In this paper, comprehensively considering all the performance on the throughput, sensing accuracy, and sensing rate, we propose a Multi-Agent joint Trajectory control and RadCom task schedule (MA-TRC) scheme for the ISAC-UAV assisted IVN. To make UAVs achieve the optimal decisions autonomously and adaptively, we propose a Game-Embedding Multi-Agent Deep Reinforcement Learning (GE-MADRL) approach. Specifically, considering the complex action space with both discrete and continuous decision variables of UAVs, we develop a multi-agent Parametrized deep Q-network (MAPDQN) based solution, which can help UAVs learn the dynamic and uncertain environment to adaptively obtain the MA-TRC scheme. Furthermore, since UAVs work in a distributed decision making manner, the potential conflicting decisions will impact the network performance. To avoid the decision conflicts among UAVs during the network parameter training, a distributed two-stage Game method is designed as an action adjuster embedded in MAPDQN, by which the learning convergence performance will be further improved and the strategy conflicts can be avoided.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
IVN 的联合无人机轨迹和 RadCom 任务时间表:游戏嵌入式多代理深度强化学习方法
集成传感和通信(ISAC)技术已被设想为未来智能汽车网络(ivn)的革命。近年来,无人机(UAV)由于其机动性和部署灵活性,被认为是未来ivn中很有前途的空中ISAC平台。在综合考虑吞吐量、感知精度和感知速率等性能的基础上,提出了一种ISAC-UAV辅助IVN的多agent联合轨迹控制和RadCom任务调度(MA-TRC)方案。为了使无人机能够自主自适应地实现最优决策,提出了一种嵌入游戏的多智能体深度强化学习(GE-MADRL)方法。具体而言,针对无人机具有离散和连续决策变量的复杂动作空间,提出了一种基于多智能体参数化深度q网络(MAPDQN)的解决方案,该方案可以帮助无人机学习动态和不确定环境,自适应获得MA-TRC方案。此外,由于无人机以分布式决策方式工作,潜在的冲突决策将影响网络性能。为了避免无人机在网络参数训练过程中的决策冲突,在MAPDQN中嵌入了一种分布式两阶段博弈方法作为动作调节器,进一步提高了学习收敛性能,避免了策略冲突。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
18.60
自引率
10.60%
发文量
708
审稿时长
5.6 months
期刊介绍: The IEEE Transactions on Wireless Communications is a prestigious publication that showcases cutting-edge advancements in wireless communications. It welcomes both theoretical and practical contributions in various areas. The scope of the Transactions encompasses a wide range of topics, including modulation and coding, detection and estimation, propagation and channel characterization, and diversity techniques. The journal also emphasizes the physical and link layer communication aspects of network architectures and protocols. The journal is open to papers on specific topics or non-traditional topics related to specific application areas. This includes simulation tools and methodologies, orthogonal frequency division multiplexing, MIMO systems, and wireless over optical technologies. Overall, the IEEE Transactions on Wireless Communications serves as a platform for high-quality manuscripts that push the boundaries of wireless communications and contribute to advancements in the field.
期刊最新文献
Pinching-Antenna Systems with In-Waveguide Attenuation: Performance Analysis and Algorithm Design UAV-Enabled Multi-Source Data Fusion in Vehicular Networks: A Joint Optimization Approach for Reliability and Latency Line-of-Sight Probability in Macrocells: Framework, Statistical Models, and Parametrization From Massive Real-World Datasets in the USA Passive UAV Detection Based on Channel Estimation and Temporal Variation Network Energy-Efficient Resource Orchestration for URLLC in Cell-Free RANs via GNNs With Reliability Enforcement
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1