{"title":"基于改进madpg算法的多智能体竞争频谱切换","authors":"Li Shufeng, S. Wei, Pen Yunfei, Zhou Min","doi":"10.1109/icicn52636.2021.9673986","DOIUrl":null,"url":null,"abstract":"Spectrum handoff plays an important role in realizing dynamic spectrum management. When there are multiple competing secondary users (SUs) in a cognitive radio network, the dynamics of the environment will increase dramatically, and efficient spectrum handoffwill become difficult. In this paper, the competition of multiple SUs in spectrum handoff is transformed into competition between SU teams, and the Multi-agent deep deterministic policy gradient (MADDPG) method with auxiliary network is proposed to deal with the dynamics of the environment. By predicting the opponent’s actions, the SU can improve the accuracy of the handoff. In simulation experiments, the proposed method is proved to bring SU with higher Quality of Experience (QoE).","PeriodicalId":231379,"journal":{"name":"2021 IEEE 9th International Conference on Information, Communication and Networks (ICICN)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2021-11-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multi-agent Competitive Spectrum Handoff Based on Improved MADDPG Algorithm\",\"authors\":\"Li Shufeng, S. Wei, Pen Yunfei, Zhou Min\",\"doi\":\"10.1109/icicn52636.2021.9673986\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Spectrum handoff plays an important role in realizing dynamic spectrum management. When there are multiple competing secondary users (SUs) in a cognitive radio network, the dynamics of the environment will increase dramatically, and efficient spectrum handoffwill become difficult. In this paper, the competition of multiple SUs in spectrum handoff is transformed into competition between SU teams, and the Multi-agent deep deterministic policy gradient (MADDPG) method with auxiliary network is proposed to deal with the dynamics of the environment. By predicting the opponent’s actions, the SU can improve the accuracy of the handoff. In simulation experiments, the proposed method is proved to bring SU with higher Quality of Experience (QoE).\",\"PeriodicalId\":231379,\"journal\":{\"name\":\"2021 IEEE 9th International Conference on Information, Communication and Networks (ICICN)\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE 9th International Conference on Information, Communication and Networks (ICICN)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/icicn52636.2021.9673986\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 9th International Conference on Information, Communication and Networks (ICICN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/icicn52636.2021.9673986","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Multi-agent Competitive Spectrum Handoff Based on Improved MADDPG Algorithm
Spectrum handoff plays an important role in realizing dynamic spectrum management. When there are multiple competing secondary users (SUs) in a cognitive radio network, the dynamics of the environment will increase dramatically, and efficient spectrum handoffwill become difficult. In this paper, the competition of multiple SUs in spectrum handoff is transformed into competition between SU teams, and the Multi-agent deep deterministic policy gradient (MADDPG) method with auxiliary network is proposed to deal with the dynamics of the environment. By predicting the opponent’s actions, the SU can improve the accuracy of the handoff. In simulation experiments, the proposed method is proved to bring SU with higher Quality of Experience (QoE).