城际高速铁路列车调度的强化学习方法

2020 IEEE 5th International Conference on Intelligent Transportation Engineering (ICITE) Pub Date : 2020-09-01 DOI:10.1109/ICITE50838.2020.9231418

Yiwei Guo

{"title":"城际高速铁路列车调度的强化学习方法","authors":"Yiwei Guo","doi":"10.1109/ICITE50838.2020.9231418","DOIUrl":null,"url":null,"abstract":"This paper describes a reinforcement learning (RL) approach to train timetabling, which takes into account the characteristics of inter-city high speed railway lines in China. A potential advantage of the proposed approach over well-established mathematical programming approaches lies in that it does not rely heavily on domain expertise to define the various timetabling rules and strategies. Specifically, a discrete time Markov Decision Process (MDP) is established to model the studied problem, and a well-designed RL method is proposed to solve the problem, assuming that the fundamental information about the studied lines (minimum running times, headways, stopping patterns, etc.) is known. Four inter-city high speed railway lines that operate on the Beijing-Tianjin corridor are employed as a case study to test the performance of the proposed approach. The obtained results preliminarily demonstrate the effectiveness and applicability of the proposed approach.","PeriodicalId":112371,"journal":{"name":"2020 IEEE 5th International Conference on Intelligent Transportation Engineering (ICITE)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"A Reinforcement Learning Approach to Train Timetabling for Inter-City High Speed Railway Lines\",\"authors\":\"Yiwei Guo\",\"doi\":\"10.1109/ICITE50838.2020.9231418\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper describes a reinforcement learning (RL) approach to train timetabling, which takes into account the characteristics of inter-city high speed railway lines in China. A potential advantage of the proposed approach over well-established mathematical programming approaches lies in that it does not rely heavily on domain expertise to define the various timetabling rules and strategies. Specifically, a discrete time Markov Decision Process (MDP) is established to model the studied problem, and a well-designed RL method is proposed to solve the problem, assuming that the fundamental information about the studied lines (minimum running times, headways, stopping patterns, etc.) is known. Four inter-city high speed railway lines that operate on the Beijing-Tianjin corridor are employed as a case study to test the performance of the proposed approach. The obtained results preliminarily demonstrate the effectiveness and applicability of the proposed approach.\",\"PeriodicalId\":112371,\"journal\":{\"name\":\"2020 IEEE 5th International Conference on Intelligent Transportation Engineering (ICITE)\",\"volume\":\"13 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE 5th International Conference on Intelligent Transportation Engineering (ICITE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICITE50838.2020.9231418\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE 5th International Conference on Intelligent Transportation Engineering (ICITE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICITE50838.2020.9231418","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

摘要

本文介绍了一种考虑中国城际高速铁路特点的列车调度强化学习方法。与已建立的数学规划方法相比，所提出的方法的一个潜在优势在于，它不严重依赖于领域专业知识来定义各种时间表规则和策略。具体来说，建立了离散时间马尔可夫决策过程(MDP)对所研究的问题进行建模，并在已知所研究线路的基本信息(最小运行时间、车头、停车模式等)的情况下，提出了一种设计良好的强化学习方法来解决问题。以京津走廊上运行的四条城际高速铁路线为例，测试了所提出方法的性能。所得结果初步证明了该方法的有效性和适用性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

A Reinforcement Learning Approach to Train Timetabling for Inter-City High Speed Railway Lines

This paper describes a reinforcement learning (RL) approach to train timetabling, which takes into account the characteristics of inter-city high speed railway lines in China. A potential advantage of the proposed approach over well-established mathematical programming approaches lies in that it does not rely heavily on domain expertise to define the various timetabling rules and strategies. Specifically, a discrete time Markov Decision Process (MDP) is established to model the studied problem, and a well-designed RL method is proposed to solve the problem, assuming that the fundamental information about the studied lines (minimum running times, headways, stopping patterns, etc.) is known. Four inter-city high speed railway lines that operate on the Beijing-Tianjin corridor are employed as a case study to test the performance of the proposed approach. The obtained results preliminarily demonstrate the effectiveness and applicability of the proposed approach.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2020 IEEE 5th International Conference on Intelligent Transportation Engineering (ICITE)

自引率

0.00%

发文量