Juan Fang, Zhenzhen Liu, Shuopeng Li, Siqi Chen, Huijing Yang
{"title":"基于NOMA的MEC通信资源分配优化算法","authors":"Juan Fang, Zhenzhen Liu, Shuopeng Li, Siqi Chen, Huijing Yang","doi":"10.1109/CCPQT56151.2022.00046","DOIUrl":null,"url":null,"abstract":"To solve the problem of communication delay and resource shortage when multiple users offload tasks at the same time in mobile edge computing (MEC), the deep reinforcement learning algorithm based on non-orthogonal multiple access (NOMA) technology was proposed to optimize users' communication resource allocation. Firstly, the taboo tag deep Q-network algorithm was used to train the relationship between users and subchannels at the users grouping stage, then the deep deterministic policy gradient algorithm was used to allocate users transmission power who sharing subchannel. The simulation results display that the proposed algorithm perform more stable than other reinforcement learning and traditional algorithm, moreover, the system sum rate have been significantly improved when multiple edge users offload tasks.","PeriodicalId":235893,"journal":{"name":"2022 International Conference on Computing, Communication, Perception and Quantum Technology (CCPQT)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"MEC Communication Resource Allocation Optimization Algorithm Based on NOMA\",\"authors\":\"Juan Fang, Zhenzhen Liu, Shuopeng Li, Siqi Chen, Huijing Yang\",\"doi\":\"10.1109/CCPQT56151.2022.00046\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"To solve the problem of communication delay and resource shortage when multiple users offload tasks at the same time in mobile edge computing (MEC), the deep reinforcement learning algorithm based on non-orthogonal multiple access (NOMA) technology was proposed to optimize users' communication resource allocation. Firstly, the taboo tag deep Q-network algorithm was used to train the relationship between users and subchannels at the users grouping stage, then the deep deterministic policy gradient algorithm was used to allocate users transmission power who sharing subchannel. The simulation results display that the proposed algorithm perform more stable than other reinforcement learning and traditional algorithm, moreover, the system sum rate have been significantly improved when multiple edge users offload tasks.\",\"PeriodicalId\":235893,\"journal\":{\"name\":\"2022 International Conference on Computing, Communication, Perception and Quantum Technology (CCPQT)\",\"volume\":\"13 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 International Conference on Computing, Communication, Perception and Quantum Technology (CCPQT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CCPQT56151.2022.00046\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Computing, Communication, Perception and Quantum Technology (CCPQT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCPQT56151.2022.00046","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
MEC Communication Resource Allocation Optimization Algorithm Based on NOMA
To solve the problem of communication delay and resource shortage when multiple users offload tasks at the same time in mobile edge computing (MEC), the deep reinforcement learning algorithm based on non-orthogonal multiple access (NOMA) technology was proposed to optimize users' communication resource allocation. Firstly, the taboo tag deep Q-network algorithm was used to train the relationship between users and subchannels at the users grouping stage, then the deep deterministic policy gradient algorithm was used to allocate users transmission power who sharing subchannel. The simulation results display that the proposed algorithm perform more stable than other reinforcement learning and traditional algorithm, moreover, the system sum rate have been significantly improved when multiple edge users offload tasks.