Intelligent Traffic Control using Double Deep Q Networks for time-varying Traffic Flows

Priyadharshini Shanmugasundaram, Aakash Sinha
{"title":"Intelligent Traffic Control using Double Deep Q Networks for time-varying Traffic Flows","authors":"Priyadharshini Shanmugasundaram, Aakash Sinha","doi":"10.1109/SPIN52536.2021.9565961","DOIUrl":null,"url":null,"abstract":"Reinforcement learning, a sub-field of Machine Learning has been garnering lot of research attention lately. It helps create intelligent agents that can incrementally learn optimal strategies for challenging environments by interacting with it. Such agents are best suited for solving problems like traffic congestion, which demand solutions that eater to dynamic changes in the traffic throughput. Intelligent transportation systems which use deep reinforcement learning can adapt to varying traffic demands and learn to maintain reduced congestion. In this paper, we propose a solution approach to use Double Deep Q Networks for traffic signal control of varied traffic flows in an isolated intersection. To improve the stability of our proposed method we have used target networks, delayed updates and experience replay mechanisms. We evaluate the performance of our method on different time-varying traffic flows and find that our method learns a robust and optimal strategy which reduces vehicle waiting time and queue length significantly. Our method achieved superior performance compared to traditional traffic signal control strategies. The method has been trained and evaluated through simulations of road networks created on Simulation of Urban Mobility (SUMO).","PeriodicalId":343177,"journal":{"name":"2021 8th International Conference on Signal Processing and Integrated Networks (SPIN)","volume":"60 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 8th International Conference on Signal Processing and Integrated Networks (SPIN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SPIN52536.2021.9565961","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Reinforcement learning, a sub-field of Machine Learning has been garnering lot of research attention lately. It helps create intelligent agents that can incrementally learn optimal strategies for challenging environments by interacting with it. Such agents are best suited for solving problems like traffic congestion, which demand solutions that eater to dynamic changes in the traffic throughput. Intelligent transportation systems which use deep reinforcement learning can adapt to varying traffic demands and learn to maintain reduced congestion. In this paper, we propose a solution approach to use Double Deep Q Networks for traffic signal control of varied traffic flows in an isolated intersection. To improve the stability of our proposed method we have used target networks, delayed updates and experience replay mechanisms. We evaluate the performance of our method on different time-varying traffic flows and find that our method learns a robust and optimal strategy which reduces vehicle waiting time and queue length significantly. Our method achieved superior performance compared to traditional traffic signal control strategies. The method has been trained and evaluated through simulations of road networks created on Simulation of Urban Mobility (SUMO).
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于双深Q网络的时变交通流智能交通控制
强化学习是机器学习的一个子领域,最近受到了很多研究的关注。它有助于创建智能代理,这些代理可以通过与之交互,逐步学习应对挑战环境的最佳策略。此类代理最适合解决交通拥堵等问题,这些问题需要能够适应交通吞吐量动态变化的解决方案。使用深度强化学习的智能交通系统可以适应不同的交通需求,并学习保持减少拥堵。在本文中,我们提出了一种使用双深Q网络来控制孤立交叉口中不同交通流量的交通信号的解决方法。为了提高我们提出的方法的稳定性,我们使用了目标网络、延迟更新和经验重放机制。我们对不同时变交通流的性能进行了评估,发现我们的方法学习了一个鲁棒的最优策略,显著减少了车辆等待时间和队列长度。与传统的交通信号控制策略相比,该方法具有更好的性能。通过模拟城市交通(SUMO)上创建的道路网络,对该方法进行了训练和评估。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Temperature Compensation Circuit for ISFET based pH Sensor Knowledge Adaptation for Cross-Domain Opinion Mining Voltage Profile Enhancement of a 33 Bus System Integrated with Renewable Energy Sources and Electric Vehicle Power Quality Enhancement of Cascaded H Bridge 5 Level and 7 Level Inverters PIC simulation study of Beam Tunnel for W- Band high power Gyrotron
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1