{"title":"基于q学习的多路口交通信号控制模型","authors":"Jiong Song, Zhao Jin","doi":"10.1109/ICSSEM.2011.6081298","DOIUrl":null,"url":null,"abstract":"In multi-intersection urban traffic environment, conventional fixed-time traffic signal control methods expose low performance when face with complex and stochastic traffic conditions which caused by the interaction among multiple intersections. A Q-learning based traffic signal control model is proposed to deal with time-varying and stochastic traffic flow problem, which takes advantage of the specialty of autonomous learning inherent in Q-learning. The capacity of discovering autonomously optimal control policy corresponding to varying traffic conditions and no fixed mathematic control model is needed are the major advantages of this method. The experiment results in simulation environment also demonstrate this method is applicable and effective.","PeriodicalId":406311,"journal":{"name":"2011 International Conference on System science, Engineering design and Manufacturing informatization","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Q-learning based multi-intersection traffic signal control model\",\"authors\":\"Jiong Song, Zhao Jin\",\"doi\":\"10.1109/ICSSEM.2011.6081298\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In multi-intersection urban traffic environment, conventional fixed-time traffic signal control methods expose low performance when face with complex and stochastic traffic conditions which caused by the interaction among multiple intersections. A Q-learning based traffic signal control model is proposed to deal with time-varying and stochastic traffic flow problem, which takes advantage of the specialty of autonomous learning inherent in Q-learning. The capacity of discovering autonomously optimal control policy corresponding to varying traffic conditions and no fixed mathematic control model is needed are the major advantages of this method. The experiment results in simulation environment also demonstrate this method is applicable and effective.\",\"PeriodicalId\":406311,\"journal\":{\"name\":\"2011 International Conference on System science, Engineering design and Manufacturing informatization\",\"volume\":\"2 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2011-11-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2011 International Conference on System science, Engineering design and Manufacturing informatization\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICSSEM.2011.6081298\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 International Conference on System science, Engineering design and Manufacturing informatization","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSSEM.2011.6081298","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Q-learning based multi-intersection traffic signal control model
In multi-intersection urban traffic environment, conventional fixed-time traffic signal control methods expose low performance when face with complex and stochastic traffic conditions which caused by the interaction among multiple intersections. A Q-learning based traffic signal control model is proposed to deal with time-varying and stochastic traffic flow problem, which takes advantage of the specialty of autonomous learning inherent in Q-learning. The capacity of discovering autonomously optimal control policy corresponding to varying traffic conditions and no fixed mathematic control model is needed are the major advantages of this method. The experiment results in simulation environment also demonstrate this method is applicable and effective.