Solving the train dispatching problem via deep reinforcement learning

Valerio Agasucci, Giorgio Grani, Leonardo Lamorgese

Journal of Rail Transport Planning & Management, Volume 26, Article 100394. Published 2023-06-01.
DOI: 10.1016/j.jrtpm.2023.100394
https://www.sciencedirect.com/science/article/pii/S2210970623000264
Citations: 1

Abstract

Every day, railways experience disturbances and disruptions, on both the network and the fleet side, that affect the stability of rail traffic. The induced delays propagate through the network, leading to a mismatch between supply and demand for goods and passengers and, in turn, to a loss in service quality. In these cases, it is the duty of human traffic controllers, the so-called dispatchers, to do their best to minimize the impact on traffic. However, dispatchers inevitably have a limited perception of the knock-on effects of their decisions, particularly of how they affect areas of the network outside their direct control. In recent years, much work in Decision Science has been devoted to developing methods that solve the problem automatically and support dispatchers in this challenging task. This paper investigates Machine Learning-based methods for tackling the problem and proposes two Deep Q-Learning methods (Decentralized and Centralized). Numerical results show the superiority of these techniques over classical linear Q-Learning based on matrices. Moreover, the Centralized approach is compared with a MILP formulation, with interesting results. The experiments are inspired by data provided by a U.S. Class I railroad.
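
For readers unfamiliar with the matrix-based baseline mentioned above, the following is a minimal sketch of a single tabular Q-Learning update, not the authors' implementation. The abstract does not specify states, actions, rewards, or hyperparameters, so the table size, the reward of -1.0 (standing in for a unit of delay), and the values of `alpha` and `gamma` are all hypothetical.

```python
def q_update(Q, s, a, r, s_next, alpha=0.1, gamma=0.9, done=False):
    """Apply one tabular Q-Learning step to the table Q (a list of rows).

    Q[s][a] is moved toward the target r + gamma * max_a' Q[s_next][a'].
    All argument values used below are illustrative assumptions.
    """
    # No future value is bootstrapped from a terminal state.
    best_next = 0.0 if done else max(Q[s_next])
    target = r + gamma * best_next
    Q[s][a] += alpha * (target - Q[s][a])
    return Q[s][a]


# Hypothetical toy setting: 3 states, 2 actions (e.g. "hold" or "proceed").
n_states, n_actions = 3, 2
Q = [[0.0] * n_actions for _ in range(n_states)]

# One observed transition: in state 0, action 1 incurs a delay penalty of -1.0
# and leads to state 1.
new_value = q_update(Q, s=0, a=1, r=-1.0, s_next=1)
print(new_value)
```

A Deep Q-Learning method replaces the table `Q` with a neural network that maps a state to one value per action, which is what makes larger state spaces tractable. How the paper's Decentralized and Centralized variants structure that network is not described in the abstract.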
