{"title":"针对 DoS 攻击下无线电资源分配的深度学习","authors":"Ke Wang;Wanchun Liu;Teng Joon Lim","doi":"10.1109/TMLCN.2024.3403513","DOIUrl":null,"url":null,"abstract":"In this paper, we focus on the problem of remote state estimation in wireless networked cyber-physical systems (CPS). Information from multiple sensors is transmitted to a central gateway over a wireless network with fewer channels than sensors. Channel and power allocation are performed jointly, in the presence of a denial of service (DoS) attack where one or more channels are jammed by an attacker transmitting spurious signals. The attack policy is unknown and the central gateway has the objective of minimizing state estimation error with maximum energy efficiency. The problem involves a combination of discrete and continuous action spaces. In addition, the state and action spaces have high dimensionality, and the channel states are not fully known to the defender. We propose an innovative model-free deep reinforcement learning (DRL) algorithm to address the problem. In addition, we develop a deep learning-based method with a novel deep neural network (DNN) structure for detecting changes in the attack policy post-training. The proposed online policy change detector accelerates the adaptation of the defender to a new attack policy and also saves computational resources compared to continuous training. In short, a complete system featuring a DRL-based defender that is trained initially and adapts continually to changes in attack policy has been developed. Our numerical results show that the proposed intelligent system can significantly enhance the resilience of the system to DoS attacks.","PeriodicalId":100641,"journal":{"name":"IEEE Transactions on Machine Learning in Communications and Networking","volume":"2 ","pages":"703-716"},"PeriodicalIF":0.0000,"publicationDate":"2024-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10535299","citationCount":"0","resultStr":"{\"title\":\"Deep Learning for Radio Resource Allocation Under DoS Attack\",\"authors\":\"Ke Wang;Wanchun Liu;Teng Joon Lim\",\"doi\":\"10.1109/TMLCN.2024.3403513\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we focus on the problem of remote state estimation in wireless networked cyber-physical systems (CPS). Information from multiple sensors is transmitted to a central gateway over a wireless network with fewer channels than sensors. Channel and power allocation are performed jointly, in the presence of a denial of service (DoS) attack where one or more channels are jammed by an attacker transmitting spurious signals. The attack policy is unknown and the central gateway has the objective of minimizing state estimation error with maximum energy efficiency. The problem involves a combination of discrete and continuous action spaces. In addition, the state and action spaces have high dimensionality, and the channel states are not fully known to the defender. We propose an innovative model-free deep reinforcement learning (DRL) algorithm to address the problem. In addition, we develop a deep learning-based method with a novel deep neural network (DNN) structure for detecting changes in the attack policy post-training. The proposed online policy change detector accelerates the adaptation of the defender to a new attack policy and also saves computational resources compared to continuous training. In short, a complete system featuring a DRL-based defender that is trained initially and adapts continually to changes in attack policy has been developed. Our numerical results show that the proposed intelligent system can significantly enhance the resilience of the system to DoS attacks.\",\"PeriodicalId\":100641,\"journal\":{\"name\":\"IEEE Transactions on Machine Learning in Communications and Networking\",\"volume\":\"2 \",\"pages\":\"703-716\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-03-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10535299\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Machine Learning in Communications and Networking\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10535299/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Machine Learning in Communications and Networking","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10535299/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
本文重点讨论无线网络网络物理系统(CPS)中的远程状态估计问题。来自多个传感器的信息通过无线网络传输到中央网关,其信道数量少于传感器数量。在受到拒绝服务(DoS)攻击时,一个或多个信道会受到攻击者发射的虚假信号的干扰,信道和功率分配将联合执行。攻击策略是未知的,中央网关的目标是以最大的能效最小化状态估计误差。该问题涉及离散和连续行动空间的组合。此外,状态和行动空间的维度都很高,而且防御者并不完全了解信道状态。我们提出了一种创新的无模型深度强化学习(DRL)算法来解决这个问题。此外,我们还开发了一种基于深度学习的方法,该方法采用新型深度神经网络(DNN)结构,用于检测训练后攻击策略的变化。所提出的在线策略变化检测器加快了防御者对新攻击策略的适应,与持续训练相比还节省了计算资源。总之,我们开发出了一个完整的系统,它具有基于 DRL 的防御器,该防御器经过初始训练,并能不断适应攻击策略的变化。我们的数值结果表明,所提出的智能系统能显著增强系统对 DoS 攻击的抵御能力。
Deep Learning for Radio Resource Allocation Under DoS Attack
In this paper, we focus on the problem of remote state estimation in wireless networked cyber-physical systems (CPS). Information from multiple sensors is transmitted to a central gateway over a wireless network with fewer channels than sensors. Channel and power allocation are performed jointly, in the presence of a denial of service (DoS) attack where one or more channels are jammed by an attacker transmitting spurious signals. The attack policy is unknown and the central gateway has the objective of minimizing state estimation error with maximum energy efficiency. The problem involves a combination of discrete and continuous action spaces. In addition, the state and action spaces have high dimensionality, and the channel states are not fully known to the defender. We propose an innovative model-free deep reinforcement learning (DRL) algorithm to address the problem. In addition, we develop a deep learning-based method with a novel deep neural network (DNN) structure for detecting changes in the attack policy post-training. The proposed online policy change detector accelerates the adaptation of the defender to a new attack policy and also saves computational resources compared to continuous training. In short, a complete system featuring a DRL-based defender that is trained initially and adapts continually to changes in attack policy has been developed. Our numerical results show that the proposed intelligent system can significantly enhance the resilience of the system to DoS attacks.