利用强化学习，基于探测器的数据提高 SUMO 中路线生成的质量

Q4 Computer Science Computer Research and Modeling Pub Date : 2024-02-01 DOI:10.20537/2076-7633-2024-16-1-137-146

Ivan A. Salenek, Yaroslav A. Seliverstov, Svyatoslav A. Seliverstov, Elena A. Sofronova

{"title":"利用强化学习，基于探测器的数据提高 SUMO 中路线生成的质量","authors":"Ivan A. Salenek, Yaroslav A. Seliverstov, Svyatoslav A. Seliverstov, Elena A. Sofronova","doi":"10.20537/2076-7633-2024-16-1-137-146","DOIUrl":null,"url":null,"abstract":"This work provides a new approach for constructing high-precision routes based on data from transport detectors inside the SUMO traffic modeling package. Existing tools such as flowrouter and routeSampler have a number of disadvantages, such as the lack of interaction with the network in the process of building routes. Our rlRouter uses multi-agent reinforcement learning (MARL), where the agents are incoming lanes and the environment is the road network. By performing actions to launch vehicles, agents receive a reward for matching data from transport detectors. Parameter Sharing DQN with the LSTM backbone of the Q-function was used as an algorithm for multi-agent reinforcement learning. Since the rlRouter is trained inside the SUMO simulation, it can restore routes better by taking into account the interaction of vehicles within the network with each other and with the network infrastructure. We have modeled diverse traffic situations on three different junctions in order to compare the performance of SUMO’s routers with the rlRouter . We used Mean Absoluter Error (MAE) as the measure of the deviation from both cumulative detectors and routes data. The rlRouter achieved the highest compliance with the data from the detectors. We also found that by maximizing the reward for matching detectors, the resulting routes also get closer to the real ones. Despite the fact that the routes recovered using rlRouter are superior to the routes obtained using SUMO tools, they do not fully correspond to the real ones, due to the natural limitations of induction-loop detectors. To achieve more plausible routes, it is necessary to equip junctions with other types of transport counters, for example, camera detectors.","PeriodicalId":37429,"journal":{"name":"Computer Research and Modeling","volume":"27 11","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Improving the quality of route generation in SUMO based on data from detectors using reinforcement learning\",\"authors\":\"Ivan A. Salenek, Yaroslav A. Seliverstov, Svyatoslav A. Seliverstov, Elena A. Sofronova\",\"doi\":\"10.20537/2076-7633-2024-16-1-137-146\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This work provides a new approach for constructing high-precision routes based on data from transport detectors inside the SUMO traffic modeling package. Existing tools such as flowrouter and routeSampler have a number of disadvantages, such as the lack of interaction with the network in the process of building routes. Our rlRouter uses multi-agent reinforcement learning (MARL), where the agents are incoming lanes and the environment is the road network. By performing actions to launch vehicles, agents receive a reward for matching data from transport detectors. Parameter Sharing DQN with the LSTM backbone of the Q-function was used as an algorithm for multi-agent reinforcement learning. Since the rlRouter is trained inside the SUMO simulation, it can restore routes better by taking into account the interaction of vehicles within the network with each other and with the network infrastructure. We have modeled diverse traffic situations on three different junctions in order to compare the performance of SUMO’s routers with the rlRouter . We used Mean Absoluter Error (MAE) as the measure of the deviation from both cumulative detectors and routes data. The rlRouter achieved the highest compliance with the data from the detectors. We also found that by maximizing the reward for matching detectors, the resulting routes also get closer to the real ones. Despite the fact that the routes recovered using rlRouter are superior to the routes obtained using SUMO tools, they do not fully correspond to the real ones, due to the natural limitations of induction-loop detectors. To achieve more plausible routes, it is necessary to equip junctions with other types of transport counters, for example, camera detectors.\",\"PeriodicalId\":37429,\"journal\":{\"name\":\"Computer Research and Modeling\",\"volume\":\"27 11\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computer Research and Modeling\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.20537/2076-7633-2024-16-1-137-146\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"Computer Science\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Research and Modeling","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.20537/2076-7633-2024-16-1-137-146","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Computer Science","Score":null,"Total":0}

引用次数: 0

摘要

这项工作提供了一种基于 SUMO 交通建模软件包内交通探测器数据构建高精度路线的新方法。现有的工具（如 flowrouter 和 routeSampler）有许多缺点，如在构建路线的过程中缺乏与网络的交互。我们的 rlRouter 采用了多代理强化学习（MARL）技术，其中代理是来车道，环境是道路网络。通过执行发射车辆的操作，代理可获得与运输检测器数据相匹配的奖励。参数共享 DQN 与 Q 函数的 LSTM 骨干被用作多代理强化学习的算法。由于 rlRouter 是在 SUMO 仿真中进行训练的，因此它可以通过考虑网络内车辆之间以及车辆与网络基础设施之间的相互作用，更好地恢复路线。为了比较 SUMO 路由器和 rlRouter 的性能，我们在三个不同的路口模拟了不同的交通状况。我们使用平均绝对误差（MAE）来衡量累积检测器和路由数据的偏差。rlRouter 与检测器数据的一致性最高。我们还发现，通过最大限度地提高匹配检测器的奖励，得到的路线也更接近真实路线。尽管使用 rlRouter 恢复的路线优于使用 SUMO 工具获得的路线，但由于感应回路检测器的自然限制，这些路线并不完全符合真实路线。为了获得更可信的路线，有必要在路口配备其他类型的交通计数器，例如摄像头检测器。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Improving the quality of route generation in SUMO based on data from detectors using reinforcement learning

This work provides a new approach for constructing high-precision routes based on data from transport detectors inside the SUMO traffic modeling package. Existing tools such as flowrouter and routeSampler have a number of disadvantages, such as the lack of interaction with the network in the process of building routes. Our rlRouter uses multi-agent reinforcement learning (MARL), where the agents are incoming lanes and the environment is the road network. By performing actions to launch vehicles, agents receive a reward for matching data from transport detectors. Parameter Sharing DQN with the LSTM backbone of the Q-function was used as an algorithm for multi-agent reinforcement learning. Since the rlRouter is trained inside the SUMO simulation, it can restore routes better by taking into account the interaction of vehicles within the network with each other and with the network infrastructure. We have modeled diverse traffic situations on three different junctions in order to compare the performance of SUMO’s routers with the rlRouter . We used Mean Absoluter Error (MAE) as the measure of the deviation from both cumulative detectors and routes data. The rlRouter achieved the highest compliance with the data from the detectors. We also found that by maximizing the reward for matching detectors, the resulting routes also get closer to the real ones. Despite the fact that the routes recovered using rlRouter are superior to the routes obtained using SUMO tools, they do not fully correspond to the real ones, due to the natural limitations of induction-loop detectors. To achieve more plausible routes, it is necessary to equip junctions with other types of transport counters, for example, camera detectors.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Computer Research and Modeling Computer Science-Computational Theory and Mathematics

CiteScore

0.80

自引率

0.00%

发文量

审稿时长

15 weeks

期刊介绍： The journal publishes original research papers and review articles in the field of computer research and mathematical modeling in physics, engineering, biology, ecology, economics, psychology etc. The journal covers research on computer methods and simulation of systems of various nature in the leading scientific schools of Russia and other countries. Of particular interest are papers devoted to simulation in thriving fields of science such as nanotechnology, bioinformatics, and econophysics. The main goal of the journal is to cover the development of computer and mathematical methods for the study of processes in complex structured and developing systems. The primary criterion for publication of papers in the journal is their scientific level. The journal does not charge a publication fee. The decision made on publication is based on the results of an independent review. The journal is oriented towards a wide readership – specialists in mathematical modeling in various areas of science and engineering. The scope of the journal includes: — mathematical modeling and numerical simulation; — numerical methods and the basics of their application; — models in physics and technology; — analysis and modeling of complex living systems; — models of economic and social systems. New sections and headings may be included in the next volumes.