{"title":"基于D2D通信的NOMA系统联合模式选择与功率分配","authors":"Rui Tang, Ruizhi Zhang, Yuanman Xia, Yihong Zhao, Jinpu He, Yu Long","doi":"10.1109/iccc52777.2021.9580380","DOIUrl":null,"url":null,"abstract":"In this paper, we study the integration of device-to-device communication into a non-orthogonal multiple access system. To deal with the complex co-channel interference resulting from the dense spectral reuse, we aim to maximize the sum proportional bit rate by jointly optimizing mode selection (MS) and power allocation (PA). Considering the high complexity of the original problem and the dynamics of the wireless environment, we propose an online mechanism with a double-layer structure by efficiently combining machine learning with optimization theory. In particular, when the MS scheme is given, the remaining nonconvex PA problem can be equivalently transformed into a convex one under certain manipulations. Based on the above optimum, a deep reinforcement learning-based online mechanism is designed and it constantly refines the output MS scheme generated from a deep neural network by utilizing the recent historical experiences via reinforcement learning. Finally, simulations are conducted to validate the superiority of the proposed mechanism in balancing the fundamental tradeoff between performance and online computational time.","PeriodicalId":425118,"journal":{"name":"2021 IEEE/CIC International Conference on Communications in China (ICCC)","volume":"111 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Joint Mode Selection And Power Allocation for NOMA Systems With D2D Communication\",\"authors\":\"Rui Tang, Ruizhi Zhang, Yuanman Xia, Yihong Zhao, Jinpu He, Yu Long\",\"doi\":\"10.1109/iccc52777.2021.9580380\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we study the integration of device-to-device communication into a non-orthogonal multiple access system. To deal with the complex co-channel interference resulting from the dense spectral reuse, we aim to maximize the sum proportional bit rate by jointly optimizing mode selection (MS) and power allocation (PA). Considering the high complexity of the original problem and the dynamics of the wireless environment, we propose an online mechanism with a double-layer structure by efficiently combining machine learning with optimization theory. In particular, when the MS scheme is given, the remaining nonconvex PA problem can be equivalently transformed into a convex one under certain manipulations. Based on the above optimum, a deep reinforcement learning-based online mechanism is designed and it constantly refines the output MS scheme generated from a deep neural network by utilizing the recent historical experiences via reinforcement learning. Finally, simulations are conducted to validate the superiority of the proposed mechanism in balancing the fundamental tradeoff between performance and online computational time.\",\"PeriodicalId\":425118,\"journal\":{\"name\":\"2021 IEEE/CIC International Conference on Communications in China (ICCC)\",\"volume\":\"111 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-07-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE/CIC International Conference on Communications in China (ICCC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/iccc52777.2021.9580380\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE/CIC International Conference on Communications in China (ICCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/iccc52777.2021.9580380","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Joint Mode Selection And Power Allocation for NOMA Systems With D2D Communication
In this paper, we study the integration of device-to-device communication into a non-orthogonal multiple access system. To deal with the complex co-channel interference resulting from the dense spectral reuse, we aim to maximize the sum proportional bit rate by jointly optimizing mode selection (MS) and power allocation (PA). Considering the high complexity of the original problem and the dynamics of the wireless environment, we propose an online mechanism with a double-layer structure by efficiently combining machine learning with optimization theory. In particular, when the MS scheme is given, the remaining nonconvex PA problem can be equivalently transformed into a convex one under certain manipulations. Based on the above optimum, a deep reinforcement learning-based online mechanism is designed and it constantly refines the output MS scheme generated from a deep neural network by utilizing the recent historical experiences via reinforcement learning. Finally, simulations are conducted to validate the superiority of the proposed mechanism in balancing the fundamental tradeoff between performance and online computational time.