{"title":"A MAB-Based Discrete Power Control Approach in Anti-jamming Relay Communication via Three-layer Stackelberg Game","authors":"Zhibin Feng, Yijie Luo, Xueqiang Chen, Wen Li","doi":"10.1109/ICCC51575.2020.9344934","DOIUrl":null,"url":null,"abstract":"In this paper, we investigate the discrete power control problem in anti-jamming relay communication networks. Based on the hierarchical competitive relationships between transmitters (user and relay) and jammer, a three-layer Stackelberg game is formulated, in which user acts as leader, relay acts as vice-leader and jammer acts as follower. From the perspective of hierarchical-game theoretic, we formulate the power optimization problem as a multi-armed bandit (MAB) problem, where user, relay and jammer act as players and each optional power strategy is considered as an arm to select. Based on MAB theory, we give the regret function to express the loss of payoff of the whole communication process. To minimize the regrets of user and relay, we propose a UCB1-based discrete power control online learning algorithm. Simulation results give the power selection rate and logarithmic incremental regrets in the proposed anti-jamming scenario. The user's and relay's utilities are also compared under different algorithms.","PeriodicalId":386048,"journal":{"name":"2020 IEEE 6th International Conference on Computer and Communications (ICCC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 6th International Conference on Computer and Communications (ICCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCC51575.2020.9344934","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In this paper, we investigate the discrete power control problem in anti-jamming relay communication networks. Based on the hierarchical competitive relationships between transmitters (user and relay) and jammer, a three-layer Stackelberg game is formulated, in which user acts as leader, relay acts as vice-leader and jammer acts as follower. From the perspective of hierarchical-game theoretic, we formulate the power optimization problem as a multi-armed bandit (MAB) problem, where user, relay and jammer act as players and each optional power strategy is considered as an arm to select. Based on MAB theory, we give the regret function to express the loss of payoff of the whole communication process. To minimize the regrets of user and relay, we propose a UCB1-based discrete power control online learning algorithm. Simulation results give the power selection rate and logarithmic incremental regrets in the proposed anti-jamming scenario. The user's and relay's utilities are also compared under different algorithms.