通过深度强化学习规避通信干扰的认知多载波雷达

IF 7.4 1区 计算机科学 Q1 TELECOMMUNICATIONS IEEE Transactions on Cognitive Communications and Networking Pub Date : 2023-08-21 DOI:10.1109/TCCN.2023.3306854
Zhao Shan;Pengfei Liu;Lei Wang;Yimin Liu
{"title":"通过深度强化学习规避通信干扰的认知多载波雷达","authors":"Zhao Shan;Pengfei Liu;Lei Wang;Yimin Liu","doi":"10.1109/TCCN.2023.3306854","DOIUrl":null,"url":null,"abstract":"Spectrum sharing between the radar and communication systems has become increasingly prevalent in recent years, therefore reducing the communication interference is a critical issue for radar. Deep reinforcement learning (DRL) based frequency allocation is a popular approach to solving the problem, especially in the highly dynamic spectrum. However, most DRL based methods suffer from low training efficiency due to the limited channel state information (CSI). To address the challenge, we propose a cognitive multi-carrier radar (CMCR), which acquires more CSI in one transmission and thus can learn the spectrum evolution faster. The frequency allocation problem for the CMCR is formulated as a partially observable Markov decision process which is hard to solve due to the combinatorial action space. To this end, we use the Iteratively Selecting approach along with the Proximal Policy Optimization (ISPPO) to solve it. To further enhance the performance of the CMCR in a short-term task, we pre-train the policy with model agnostic meta learning (MAML). Simulation results show that the CMCR learns fast and achieves an excellent detection ability in a congested spectrum on the basis of the ISPPO method. Besides, we also illustrate the efficiency of the MAML pre-training.","PeriodicalId":13069,"journal":{"name":"IEEE Transactions on Cognitive Communications and Networking","volume":"9 6","pages":"1561-1578"},"PeriodicalIF":7.4000,"publicationDate":"2023-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Cognitive Multi-Carrier Radar for Communication Interference Avoidance via Deep Reinforcement Learning\",\"authors\":\"Zhao Shan;Pengfei Liu;Lei Wang;Yimin Liu\",\"doi\":\"10.1109/TCCN.2023.3306854\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Spectrum sharing between the radar and communication systems has become increasingly prevalent in recent years, therefore reducing the communication interference is a critical issue for radar. Deep reinforcement learning (DRL) based frequency allocation is a popular approach to solving the problem, especially in the highly dynamic spectrum. However, most DRL based methods suffer from low training efficiency due to the limited channel state information (CSI). To address the challenge, we propose a cognitive multi-carrier radar (CMCR), which acquires more CSI in one transmission and thus can learn the spectrum evolution faster. The frequency allocation problem for the CMCR is formulated as a partially observable Markov decision process which is hard to solve due to the combinatorial action space. To this end, we use the Iteratively Selecting approach along with the Proximal Policy Optimization (ISPPO) to solve it. To further enhance the performance of the CMCR in a short-term task, we pre-train the policy with model agnostic meta learning (MAML). Simulation results show that the CMCR learns fast and achieves an excellent detection ability in a congested spectrum on the basis of the ISPPO method. Besides, we also illustrate the efficiency of the MAML pre-training.\",\"PeriodicalId\":13069,\"journal\":{\"name\":\"IEEE Transactions on Cognitive Communications and Networking\",\"volume\":\"9 6\",\"pages\":\"1561-1578\"},\"PeriodicalIF\":7.4000,\"publicationDate\":\"2023-08-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Cognitive Communications and Networking\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10225362/\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"TELECOMMUNICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Cognitive Communications and Networking","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10225362/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"TELECOMMUNICATIONS","Score":null,"Total":0}
引用次数: 0

摘要

近年来,雷达与通信系统之间的频谱共享越来越普遍,因此减少通信干扰是雷达面临的一个关键问题。基于深度强化学习(DRL)的频率分配是解决这一问题的常用方法,尤其是在高动态频谱中。然而,由于信道状态信息(CSI)有限,大多数基于 DRL 的方法都存在训练效率低的问题。为了应对这一挑战,我们提出了认知多载波雷达(CMCR),它能在一次传输中获取更多的 CSI,从而更快地学习频谱演化。CMCR 的频率分配问题被表述为一个部分可观测的马尔可夫决策过程,由于存在组合行动空间,这个问题很难解决。为此,我们采用迭代选择法和近端策略优化法(ISPPO)来解决这一问题。为了进一步提高 CMCR 在短期任务中的性能,我们使用模型无关元学习(MAML)对策略进行了预训练。仿真结果表明,在 ISPPO 方法的基础上,CMCR 能够快速学习并在拥塞频谱中实现出色的检测能力。此外,我们还说明了 MAML 预训练的效率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
A Cognitive Multi-Carrier Radar for Communication Interference Avoidance via Deep Reinforcement Learning
Spectrum sharing between the radar and communication systems has become increasingly prevalent in recent years, therefore reducing the communication interference is a critical issue for radar. Deep reinforcement learning (DRL) based frequency allocation is a popular approach to solving the problem, especially in the highly dynamic spectrum. However, most DRL based methods suffer from low training efficiency due to the limited channel state information (CSI). To address the challenge, we propose a cognitive multi-carrier radar (CMCR), which acquires more CSI in one transmission and thus can learn the spectrum evolution faster. The frequency allocation problem for the CMCR is formulated as a partially observable Markov decision process which is hard to solve due to the combinatorial action space. To this end, we use the Iteratively Selecting approach along with the Proximal Policy Optimization (ISPPO) to solve it. To further enhance the performance of the CMCR in a short-term task, we pre-train the policy with model agnostic meta learning (MAML). Simulation results show that the CMCR learns fast and achieves an excellent detection ability in a congested spectrum on the basis of the ISPPO method. Besides, we also illustrate the efficiency of the MAML pre-training.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
IEEE Transactions on Cognitive Communications and Networking
IEEE Transactions on Cognitive Communications and Networking Computer Science-Artificial Intelligence
CiteScore
15.50
自引率
7.00%
发文量
108
期刊介绍: The IEEE Transactions on Cognitive Communications and Networking (TCCN) aims to publish high-quality manuscripts that push the boundaries of cognitive communications and networking research. Cognitive, in this context, refers to the application of perception, learning, reasoning, memory, and adaptive approaches in communication system design. The transactions welcome submissions that explore various aspects of cognitive communications and networks, focusing on innovative and holistic approaches to complex system design. Key topics covered include architecture, protocols, cross-layer design, and cognition cycle design for cognitive networks. Additionally, research on machine learning, artificial intelligence, end-to-end and distributed intelligence, software-defined networking, cognitive radios, spectrum sharing, and security and privacy issues in cognitive networks are of interest. The publication also encourages papers addressing novel services and applications enabled by these cognitive concepts.
期刊最新文献
Joint Optimization for Mobile Crowdsensing Systems With Reliability Consideration Application-Aware Real-time Network Resource Allocation for Industry 5.0 Delay-Guaranteed Path Selection and Scheduling in IAB Networks RadioDiff: An Effective Generative Diffusion Model for Sampling-Free Dynamic Radio Map Construction An Optimized Move List for Dynamic Protection Area in CBRS
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1