{"title":"SoftLight: A Maximum Entropy Deep Reinforcement Learning Approach for Intelligent Traffic Signal Control","authors":"Pengyong Wang, Feng Mao, Zhiheng Li","doi":"10.1109/icaci55529.2022.9837664","DOIUrl":null,"url":null,"abstract":"Intelligent traffic signal control plays a crucial role in alleviating traffic congestion. With increasingly available traffic data, there is a trend to use deep reinforcement learning (DRL) techniques for intelligent traffic signal control. However, a majority of existing DRL methods are based on Q-learning, where the optimal solution is always a deterministic policy, so they may fail to adapt to heterogeneous traffic flow and different environment settings. In this paper, we propose a method called SoftLight based on maximum entropy DRL. Through the regularization of maximum entropy, our method learns a stochastic policy that significantly reduces the queue length at the intersection. At the same time, our method keeps the policy as random as possible, which achieves better adaptability to heterogeneous traffic flow. By conducting comprehensive experiments, we demonstrate that our method outperforms existing DRL methods in both phase selection and phase shift settings. We also compare our method with the prevalent maximum entropy DRL method, soft actor-critic (SAC). The results show that our method can find better solutions than SAC under different model designs and hyper-parameters.","PeriodicalId":412347,"journal":{"name":"2022 14th International Conference on Advanced Computational Intelligence (ICACI)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 14th International Conference on Advanced Computational Intelligence (ICACI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/icaci55529.2022.9837664","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Intelligent traffic signal control plays a crucial role in alleviating traffic congestion. With increasingly available traffic data, there is a trend to use deep reinforcement learning (DRL) techniques for intelligent traffic signal control. However, a majority of existing DRL methods are based on Q-learning, where the optimal solution is always a deterministic policy, so they may fail to adapt to heterogeneous traffic flow and different environment settings. In this paper, we propose a method called SoftLight based on maximum entropy DRL. Through the regularization of maximum entropy, our method learns a stochastic policy that significantly reduces the queue length at the intersection. At the same time, our method keeps the policy as random as possible, which achieves better adaptability to heterogeneous traffic flow. By conducting comprehensive experiments, we demonstrate that our method outperforms existing DRL methods in both phase selection and phase shift settings. We also compare our method with the prevalent maximum entropy DRL method, soft actor-critic (SAC). The results show that our method can find better solutions than SAC under different model designs and hyper-parameters.