Zhao Hai-tao, Du Ai-Qian, Zhu Hong-bo, Li Dapeng, LI Nan-jie
{"title":"Research on Q-Learning Based Channel Access Control Algorithm for Internet of Vehicles","authors":"Zhao Hai-tao, Du Ai-Qian, Zhu Hong-bo, Li Dapeng, LI Nan-jie","doi":"10.1109/ICS.2016.0104","DOIUrl":null,"url":null,"abstract":"A Q-Learning based back-off algorithm was proposed in this paper because the traditional DCF approach used for IEEE 802.11p MAC protocol to access the channel has some problems of the low packet delivery rate, high delay and the poor scalability in VANETs. The proposed algorithm which is quite different from the traditional BEB algorithm was adopted by the nodes(agents) to interact with surroundings continuously and learn from each other. The vehicle nodes adjust the size of CW(Contention Window) dynamically according to the results learned from the surroundings so that the nodes can access the channel with the optimal CW eventually minimizing the packet collisions and end-to-end delay. The simulation results show that the communication nodes using the proposed algorithm can adapt to the unknown vehicular environment rapidly, and simultaneously the high packet delivery ratio, low end-to-end delay and high fairness can be achieved for vehicular network with various load.","PeriodicalId":281088,"journal":{"name":"2016 International Computer Symposium (ICS)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 International Computer Symposium (ICS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICS.2016.0104","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
A Q-Learning based back-off algorithm was proposed in this paper because the traditional DCF approach used for IEEE 802.11p MAC protocol to access the channel has some problems of the low packet delivery rate, high delay and the poor scalability in VANETs. The proposed algorithm which is quite different from the traditional BEB algorithm was adopted by the nodes(agents) to interact with surroundings continuously and learn from each other. The vehicle nodes adjust the size of CW(Contention Window) dynamically according to the results learned from the surroundings so that the nodes can access the channel with the optimal CW eventually minimizing the packet collisions and end-to-end delay. The simulation results show that the communication nodes using the proposed algorithm can adapt to the unknown vehicular environment rapidly, and simultaneously the high packet delivery ratio, low end-to-end delay and high fairness can be achieved for vehicular network with various load.