{"title":"基于多智能体强化学习的D2D通信功率控制","authors":"Min Zhao, Yifei Wei, Mei Song, Da Guo","doi":"10.1109/ICCCHINA.2018.8641165","DOIUrl":null,"url":null,"abstract":"Device-to-device (D2D) communication is a promising and rapidly evolving technology and it plays a significant role in reducing pressure on the base station. In this paper, we focus on how to achieve the goal of maximizing system throughput by adjusting the transmitted power of each D2D user. Due to the uncertainty of channel states and state transition probabilities, this problem can be modeled as the reinforcement learning (RL) algorithm. We first use the fuzzy clustering algorithm to group D2D users of which the attribute values are in a large dissimilarity so as to reduce interference, then each group is treated as an agent in the RL algorithm. Therefore, a multi-agent RL based on fuzzy clustering algorithm is established. Finally, we verified the superiority of the proposed algorithm through simulations.","PeriodicalId":170216,"journal":{"name":"2018 IEEE/CIC International Conference on Communications in China (ICCC)","volume":"80 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Power Control for D2D Communication Using Multi-Agent Reinforcement Learning\",\"authors\":\"Min Zhao, Yifei Wei, Mei Song, Da Guo\",\"doi\":\"10.1109/ICCCHINA.2018.8641165\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Device-to-device (D2D) communication is a promising and rapidly evolving technology and it plays a significant role in reducing pressure on the base station. In this paper, we focus on how to achieve the goal of maximizing system throughput by adjusting the transmitted power of each D2D user. Due to the uncertainty of channel states and state transition probabilities, this problem can be modeled as the reinforcement learning (RL) algorithm. We first use the fuzzy clustering algorithm to group D2D users of which the attribute values are in a large dissimilarity so as to reduce interference, then each group is treated as an agent in the RL algorithm. Therefore, a multi-agent RL based on fuzzy clustering algorithm is established. Finally, we verified the superiority of the proposed algorithm through simulations.\",\"PeriodicalId\":170216,\"journal\":{\"name\":\"2018 IEEE/CIC International Conference on Communications in China (ICCC)\",\"volume\":\"80 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 IEEE/CIC International Conference on Communications in China (ICCC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCCHINA.2018.8641165\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE/CIC International Conference on Communications in China (ICCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCCHINA.2018.8641165","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Power Control for D2D Communication Using Multi-Agent Reinforcement Learning
Device-to-device (D2D) communication is a promising and rapidly evolving technology and it plays a significant role in reducing pressure on the base station. In this paper, we focus on how to achieve the goal of maximizing system throughput by adjusting the transmitted power of each D2D user. Due to the uncertainty of channel states and state transition probabilities, this problem can be modeled as the reinforcement learning (RL) algorithm. We first use the fuzzy clustering algorithm to group D2D users of which the attribute values are in a large dissimilarity so as to reduce interference, then each group is treated as an agent in the RL algorithm. Therefore, a multi-agent RL based on fuzzy clustering algorithm is established. Finally, we verified the superiority of the proposed algorithm through simulations.