Optimal Transmit Antenna Selection Strategy for MIMO Wiretap Channel Based on Deep Reinforcement Learning

Youbing Hu, Lixin Li, Jiaying Yin, Huisheng Zhang, Wei Liang, Ang Gao, Zhu Han
{"title":"Optimal Transmit Antenna Selection Strategy for MIMO Wiretap Channel Based on Deep Reinforcement Learning","authors":"Youbing Hu, Lixin Li, Jiaying Yin, Huisheng Zhang, Wei Liang, Ang Gao, Zhu Han","doi":"10.1109/ICCCHINA.2018.8641085","DOIUrl":null,"url":null,"abstract":"Antenna selection is often used for physical layer security to implement secure communications. However, due to the rapid changes of the main channel and the feedback delay of the channel state information (CSI), the transmitter obtains outdated CSI, and the outdated CSI leads to the outdated optimal transmit antenna. In order to improve the security of the system based on outdated CSI, in this paper, we propose a deep reinforcement learning framework of Deep Q Network (DQN) to predict the optimal transmit antenna in the multiple input multiple output (MIMO) wiretap channel. The legitimate receiver receives the pilot signals from each transmitting antenna, and the signal-to-noise ratio (SNR) of the pilot signals transmitted by each transmitting antenna can be obtained through maximal ratio combining. And then the legitimate receiver uses the DQN to predict the transmitting antenna at the next moment according to these SNRs. The simulation results show that DQN algorithm can effectively predict the optimal antenna at the next moment, and reduce the secrecy outage probability of MIMO wiretap system, compared with the traditional algorithm.","PeriodicalId":170216,"journal":{"name":"2018 IEEE/CIC International Conference on Communications in China (ICCC)","volume":"181 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE/CIC International Conference on Communications in China (ICCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCCHINA.2018.8641085","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7

Abstract

Antenna selection is often used for physical layer security to implement secure communications. However, due to the rapid changes of the main channel and the feedback delay of the channel state information (CSI), the transmitter obtains outdated CSI, and the outdated CSI leads to the outdated optimal transmit antenna. In order to improve the security of the system based on outdated CSI, in this paper, we propose a deep reinforcement learning framework of Deep Q Network (DQN) to predict the optimal transmit antenna in the multiple input multiple output (MIMO) wiretap channel. The legitimate receiver receives the pilot signals from each transmitting antenna, and the signal-to-noise ratio (SNR) of the pilot signals transmitted by each transmitting antenna can be obtained through maximal ratio combining. And then the legitimate receiver uses the DQN to predict the transmitting antenna at the next moment according to these SNRs. The simulation results show that DQN algorithm can effectively predict the optimal antenna at the next moment, and reduce the secrecy outage probability of MIMO wiretap system, compared with the traditional algorithm.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于深度强化学习的MIMO窃听信道发射天线优化选择策略
天线选择通常用于物理层安全,以实现安全通信。然而,由于主信道的快速变化和信道状态信息(CSI)的反馈延迟,发射机得到了过时的CSI,而过时的CSI导致了过时的最优发射天线。为了提高基于过时CSI的系统安全性,本文提出了一种深度Q网络(deep Q Network, DQN)的深度强化学习框架,用于预测多输入多输出(MIMO)窃听信道中的最佳发射天线。合法接收机接收各发射天线发射的导频信号,通过最大比值组合得到各发射天线发射的导频信号的信噪比。然后合法接收机利用DQN根据这些信噪比预测下一时刻的发射天线。仿真结果表明,与传统算法相比,DQN算法可以有效地预测下一时刻的最优天线,降低MIMO窃听系统的保密中断概率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Adaptive Power Allocation for D2D Assisted Cooperative Relaying System with NOMA Hybrid Transmission Time Intervals for TCP Slow Start in Mobile Edge Computing System UE Computation Offloading Based on Task and Channel Prediction of Single User A Modified Unquantized Fano Sequential Decoding Algorithm for Rateless Spinal Codes Cooperative Slotted Aloha with Reservation for Multi-Receiver Satellite IoT Networks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1