利用强化学习,基于探测器的数据提高 SUMO 中路线生成的质量

Ivan A. Salenek, Yaroslav A. Seliverstov, Svyatoslav A. Seliverstov, Elena A. Sofronova
{"title":"利用强化学习,基于探测器的数据提高 SUMO 中路线生成的质量","authors":"Ivan A. Salenek, Yaroslav A. Seliverstov, Svyatoslav A. Seliverstov, Elena A. Sofronova","doi":"10.20537/2076-7633-2024-16-1-137-146","DOIUrl":null,"url":null,"abstract":"This work provides a new approach for constructing high-precision routes based on data from transport detectors inside the SUMO traffic modeling package. Existing tools such as flowrouter and routeSampler have a number of disadvantages, such as the lack of interaction with the network in the process of building routes. Our rlRouter uses multi-agent reinforcement learning (MARL), where the agents are incoming lanes and the environment is the road network. By performing actions to launch vehicles, agents receive a reward for matching data from transport detectors. Parameter Sharing DQN with the LSTM backbone of the Q-function was used as an algorithm for multi-agent reinforcement learning. Since the rlRouter is trained inside the SUMO simulation, it can restore routes better by taking into account the interaction of vehicles within the network with each other and with the network infrastructure. We have modeled diverse traffic situations on three different junctions in order to compare the performance of SUMO’s routers with the rlRouter . We used Mean Absoluter Error (MAE) as the measure of the deviation from both cumulative detectors and routes data. The rlRouter achieved the highest compliance with the data from the detectors. We also found that by maximizing the reward for matching detectors, the resulting routes also get closer to the real ones. Despite the fact that the routes recovered using rlRouter are superior to the routes obtained using SUMO tools, they do not fully correspond to the real ones, due to the natural limitations of induction-loop detectors. To achieve more plausible routes, it is necessary to equip junctions with other types of transport counters, for example, camera detectors.","PeriodicalId":37429,"journal":{"name":"Computer Research and Modeling","volume":"27 11","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Improving the quality of route generation in SUMO based on data from detectors using reinforcement learning\",\"authors\":\"Ivan A. Salenek, Yaroslav A. Seliverstov, Svyatoslav A. Seliverstov, Elena A. Sofronova\",\"doi\":\"10.20537/2076-7633-2024-16-1-137-146\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This work provides a new approach for constructing high-precision routes based on data from transport detectors inside the SUMO traffic modeling package. Existing tools such as flowrouter and routeSampler have a number of disadvantages, such as the lack of interaction with the network in the process of building routes. Our rlRouter uses multi-agent reinforcement learning (MARL), where the agents are incoming lanes and the environment is the road network. By performing actions to launch vehicles, agents receive a reward for matching data from transport detectors. Parameter Sharing DQN with the LSTM backbone of the Q-function was used as an algorithm for multi-agent reinforcement learning. Since the rlRouter is trained inside the SUMO simulation, it can restore routes better by taking into account the interaction of vehicles within the network with each other and with the network infrastructure. We have modeled diverse traffic situations on three different junctions in order to compare the performance of SUMO’s routers with the rlRouter . We used Mean Absoluter Error (MAE) as the measure of the deviation from both cumulative detectors and routes data. The rlRouter achieved the highest compliance with the data from the detectors. We also found that by maximizing the reward for matching detectors, the resulting routes also get closer to the real ones. Despite the fact that the routes recovered using rlRouter are superior to the routes obtained using SUMO tools, they do not fully correspond to the real ones, due to the natural limitations of induction-loop detectors. To achieve more plausible routes, it is necessary to equip junctions with other types of transport counters, for example, camera detectors.\",\"PeriodicalId\":37429,\"journal\":{\"name\":\"Computer Research and Modeling\",\"volume\":\"27 11\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computer Research and Modeling\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.20537/2076-7633-2024-16-1-137-146\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"Computer Science\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Research and Modeling","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.20537/2076-7633-2024-16-1-137-146","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Computer Science","Score":null,"Total":0}
引用次数: 0

摘要

这项工作提供了一种基于 SUMO 交通建模软件包内交通探测器数据构建高精度路线的新方法。现有的工具(如 flowrouter 和 routeSampler)有许多缺点,如在构建路线的过程中缺乏与网络的交互。我们的 rlRouter 采用了多代理强化学习(MARL)技术,其中代理是来车道,环境是道路网络。通过执行发射车辆的操作,代理可获得与运输检测器数据相匹配的奖励。参数共享 DQN 与 Q 函数的 LSTM 骨干被用作多代理强化学习的算法。由于 rlRouter 是在 SUMO 仿真中进行训练的,因此它可以通过考虑网络内车辆之间以及车辆与网络基础设施之间的相互作用,更好地恢复路线。为了比较 SUMO 路由器和 rlRouter 的性能,我们在三个不同的路口模拟了不同的交通状况。我们使用平均绝对误差(MAE)来衡量累积检测器和路由数据的偏差。rlRouter 与检测器数据的一致性最高。我们还发现,通过最大限度地提高匹配检测器的奖励,得到的路线也更接近真实路线。尽管使用 rlRouter 恢复的路线优于使用 SUMO 工具获得的路线,但由于感应回路检测器的自然限制,这些路线并不完全符合真实路线。为了获得更可信的路线,有必要在路口配备其他类型的交通计数器,例如摄像头检测器。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Improving the quality of route generation in SUMO based on data from detectors using reinforcement learning
This work provides a new approach for constructing high-precision routes based on data from transport detectors inside the SUMO traffic modeling package. Existing tools such as flowrouter and routeSampler have a number of disadvantages, such as the lack of interaction with the network in the process of building routes. Our rlRouter uses multi-agent reinforcement learning (MARL), where the agents are incoming lanes and the environment is the road network. By performing actions to launch vehicles, agents receive a reward for matching data from transport detectors. Parameter Sharing DQN with the LSTM backbone of the Q-function was used as an algorithm for multi-agent reinforcement learning. Since the rlRouter is trained inside the SUMO simulation, it can restore routes better by taking into account the interaction of vehicles within the network with each other and with the network infrastructure. We have modeled diverse traffic situations on three different junctions in order to compare the performance of SUMO’s routers with the rlRouter . We used Mean Absoluter Error (MAE) as the measure of the deviation from both cumulative detectors and routes data. The rlRouter achieved the highest compliance with the data from the detectors. We also found that by maximizing the reward for matching detectors, the resulting routes also get closer to the real ones. Despite the fact that the routes recovered using rlRouter are superior to the routes obtained using SUMO tools, they do not fully correspond to the real ones, due to the natural limitations of induction-loop detectors. To achieve more plausible routes, it is necessary to equip junctions with other types of transport counters, for example, camera detectors.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Computer Research and Modeling
Computer Research and Modeling Computer Science-Computational Theory and Mathematics
CiteScore
0.80
自引率
0.00%
发文量
82
审稿时长
15 weeks
期刊介绍: The journal publishes original research papers and review articles in the field of computer research and mathematical modeling in physics, engineering, biology, ecology, economics, psychology etc. The journal covers research on computer methods and simulation of systems of various nature in the leading scientific schools of Russia and other countries. Of particular interest are papers devoted to simulation in thriving fields of science such as nanotechnology, bioinformatics, and econophysics. The main goal of the journal is to cover the development of computer and mathematical methods for the study of processes in complex structured and developing systems. The primary criterion for publication of papers in the journal is their scientific level. The journal does not charge a publication fee. The decision made on publication is based on the results of an independent review. The journal is oriented towards a wide readership – specialists in mathematical modeling in various areas of science and engineering. The scope of the journal includes: — mathematical modeling and numerical simulation; — numerical methods and the basics of their application; — models in physics and technology; — analysis and modeling of complex living systems; — models of economic and social systems. New sections and headings may be included in the next volumes.
期刊最新文献
On periodic modes of body motion along a horizontal rough plane, performed by moving two internal masses In memory of A. P.Buslaev - friend, scientist and founder of the scientific school of mathematical modeling of traffic flows Cloud interpretation of the entropy model for calculating the trip matrix Double-circuit system with clusters of different lengths and unequal arrangement of two nodes on the circuits Improving the quality of route generation in SUMO based on data from detectors using reinforcement learning
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1