{"title":"Power Control for D2D Communication Using Multi-Agent Reinforcement Learning","authors":"Min Zhao, Yifei Wei, Mei Song, Da Guo","doi":"10.1109/ICCCHINA.2018.8641165","DOIUrl":null,"url":null,"abstract":"Device-to-device (D2D) communication is a promising and rapidly evolving technology and it plays a significant role in reducing pressure on the base station. In this paper, we focus on how to achieve the goal of maximizing system throughput by adjusting the transmitted power of each D2D user. Due to the uncertainty of channel states and state transition probabilities, this problem can be modeled as the reinforcement learning (RL) algorithm. We first use the fuzzy clustering algorithm to group D2D users of which the attribute values are in a large dissimilarity so as to reduce interference, then each group is treated as an agent in the RL algorithm. Therefore, a multi-agent RL based on fuzzy clustering algorithm is established. Finally, we verified the superiority of the proposed algorithm through simulations.","PeriodicalId":170216,"journal":{"name":"2018 IEEE/CIC International Conference on Communications in China (ICCC)","volume":"80 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE/CIC International Conference on Communications in China (ICCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCCHINA.2018.8641165","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
Device-to-device (D2D) communication is a promising and rapidly evolving technology and it plays a significant role in reducing pressure on the base station. In this paper, we focus on how to achieve the goal of maximizing system throughput by adjusting the transmitted power of each D2D user. Due to the uncertainty of channel states and state transition probabilities, this problem can be modeled as the reinforcement learning (RL) algorithm. We first use the fuzzy clustering algorithm to group D2D users of which the attribute values are in a large dissimilarity so as to reduce interference, then each group is treated as an agent in the RL algorithm. Therefore, a multi-agent RL based on fuzzy clustering algorithm is established. Finally, we verified the superiority of the proposed algorithm through simulations.