{"title":"动态频谱接入的自适应信道推荐","authors":"Xu Chen, Jianwei Huang, Husheng Li","doi":"10.1109/DYSPAN.2011.5936198","DOIUrl":null,"url":null,"abstract":"We propose a dynamic spectrum access scheme where secondary users recommend “good” channels to each other and access accordingly. We formulate the problem as an average reward based Markov decision process. Since the action space of the Markov decision process is continuous (i.e., transmission probabilities), it is difficult to find the optimal policy by simply discretizing the action space and use the policy iteration, or value iteration. Instead, we propose a new algorithm based on the Model Reference Adaptive Search method, and prove its convergence to the optimal policy. Numerical results show that the proposed algorithm achieves up to 18% performance improvement than the static channel recommendation scheme and up to 63% performance improvement than the random access scheme, and is robust to channel dynamics.","PeriodicalId":119856,"journal":{"name":"2011 IEEE International Symposium on Dynamic Spectrum Access Networks (DySPAN)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2011-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"19","resultStr":"{\"title\":\"Adaptive channel recommendation for dynamic spectrum access\",\"authors\":\"Xu Chen, Jianwei Huang, Husheng Li\",\"doi\":\"10.1109/DYSPAN.2011.5936198\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We propose a dynamic spectrum access scheme where secondary users recommend “good” channels to each other and access accordingly. We formulate the problem as an average reward based Markov decision process. Since the action space of the Markov decision process is continuous (i.e., transmission probabilities), it is difficult to find the optimal policy by simply discretizing the action space and use the policy iteration, or value iteration. Instead, we propose a new algorithm based on the Model Reference Adaptive Search method, and prove its convergence to the optimal policy. Numerical results show that the proposed algorithm achieves up to 18% performance improvement than the static channel recommendation scheme and up to 63% performance improvement than the random access scheme, and is robust to channel dynamics.\",\"PeriodicalId\":119856,\"journal\":{\"name\":\"2011 IEEE International Symposium on Dynamic Spectrum Access Networks (DySPAN)\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2011-02-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"19\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2011 IEEE International Symposium on Dynamic Spectrum Access Networks (DySPAN)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/DYSPAN.2011.5936198\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 IEEE International Symposium on Dynamic Spectrum Access Networks (DySPAN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DYSPAN.2011.5936198","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Adaptive channel recommendation for dynamic spectrum access
We propose a dynamic spectrum access scheme where secondary users recommend “good” channels to each other and access accordingly. We formulate the problem as an average reward based Markov decision process. Since the action space of the Markov decision process is continuous (i.e., transmission probabilities), it is difficult to find the optimal policy by simply discretizing the action space and use the policy iteration, or value iteration. Instead, we propose a new algorithm based on the Model Reference Adaptive Search method, and prove its convergence to the optimal policy. Numerical results show that the proposed algorithm achieves up to 18% performance improvement than the static channel recommendation scheme and up to 63% performance improvement than the random access scheme, and is robust to channel dynamics.