Youlin Fan;Bo Jiu;Wenqiang Pu;Ziniu Li;Kang Li;Hongwei Liu
Title: Sensing Jamming Strategy From Limited Observations: An Imitation Learning Perspective
Journal: IEEE Transactions on Signal Processing, vol. 72, pp. 4098-4114
Published: 2024-08-13
DOI: 10.1109/TSP.2024.3443121
URL: https://ieeexplore.ieee.org/document/10634527/
Impact factor: 4.6 (JCR Q1, Engineering, Electrical & Electronic)
Open access: no
Citation count: 0
Abstract
This paper studies the problem of sensing mainlobe jamming strategy through interaction samples between a frequency agile radar and a transmit/receive time-sharing jammer. We model this interaction as an episodic Markov decision process, where the jammer's strategy is treated as the state transition probability that needs to be learned. To effectively learn the strategy, we employ two sensing criteria from the imitation learning perspective: Behavioral Cloning (BC) and Generative Adversarial Imitation Learning (GAIL). These criteria enable us to imitate the jammer's strategy based on collected interaction samples. Our theoretical analysis indicates that GAIL provides more accurate strategy sensing performance, while BC offers faster learning. Experimental results corroborate these findings. Additionally, empirical evidence shows that our trained anti-jamming strategies, informed by either BC or GAIL, significantly outperform existing intelligent anti-jamming strategy learning methods in terms of sample efficiency.
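The idea of treating the jammer's strategy as a transition probability learned from interaction samples can be illustrated with a small tabular sketch. It assumes finite state and action spaces indexed by integers and uses the fact that, in this tabular setting, a maximum-likelihood (behavioral cloning style) fit reduces to empirical transition frequencies. The function names, the toy sizes, and the uniform fallback for unvisited pairs are hypothetical choices for illustration; the abstract does not specify the paper's actual state and action definitions or its estimator, and GAIL is not shown here.

```python
import numpy as np


def bc_transition_estimate(samples, n_states, n_actions):
    """Count-based maximum-likelihood estimate of P(s' | s, a).

    samples: iterable of (state, action, next_state) integer triples.
    Assumption: (state, action) pairs that never appear in the samples
    are assigned a uniform next-state distribution.
    """
    counts = np.zeros((n_states, n_actions, n_states))
    for s, a, s_next in samples:
        counts[s, a, s_next] += 1.0
    totals = counts.sum(axis=2, keepdims=True)
    uniform = np.full_like(counts, 1.0 / n_states)
    # Avoid dividing by zero for unvisited pairs.
    safe_totals = np.where(totals > 0, totals, 1.0)
    return np.where(totals > 0, counts / safe_totals, uniform)


def sample_transitions(P, n_samples, rng):
    """Draw (s, a, s') triples with s and a chosen uniformly at random.

    This stands in for radar-jammer interaction data; it is a toy
    sampler, not a model of a real radar or jammer.
    """
    n_states, n_actions, _ = P.shape
    samples = []
    for _ in range(n_samples):
        s = int(rng.integers(n_states))
        a = int(rng.integers(n_actions))
        s_next = int(rng.choice(n_states, p=P[s, a]))
        samples.append((s, a, s_next))
    return samples


rng = np.random.default_rng(0)
n_states, n_actions = 4, 3  # hypothetical toy sizes
# Hypothetical "true" jammer strategy: one distribution per (s, a) pair.
true_P = rng.dirichlet(np.ones(n_states), size=(n_states, n_actions))

for n in (50, 500, 5000):
    est = bc_transition_estimate(
        sample_transitions(true_P, n, rng), n_states, n_actions
    )
    # Mean L1 distance between estimated and true next-state distributions.
    err = np.abs(est - true_P).sum(axis=2).mean()
    print(f"samples={n:5d}  mean L1 error={err:.3f}")
```

Running the loop with increasing sample counts gives a rough feel for how the estimate depends on the amount of interaction data, which is the "limited observations" concern in the title.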
About the journal:
The IEEE Transactions on Signal Processing covers novel theory, algorithms, performance analyses and applications of techniques for the processing, understanding, learning, retrieval, mining, and extraction of information from signals. The term “signal” includes, among others, audio, video, speech, image, communication, geophysical, sonar, radar, medical and musical signals. Examples of topics of interest include, but are not limited to, information processing and the theory and application of filtering, coding, transmitting, estimating, detecting, analyzing, recognizing, synthesizing, recording, and reproducing signals.