A Novel Approach for Train Tracking in Virtual Coupling Based on Soft Actor-Critic

IF 2.3 3区工程技术 Q2 ENGINEERING, MECHANICAL Actuators Pub Date : 2023-12-01 DOI:10.3390/act12120447

Bin Chen, Lei Zhang, Gaoyun Cheng, Yiqing Liu, Junjie Chen

{"title":"A Novel Approach for Train Tracking in Virtual Coupling Based on Soft Actor-Critic","authors":"Bin Chen, Lei Zhang, Gaoyun Cheng, Yiqing Liu, Junjie Chen","doi":"10.3390/act12120447","DOIUrl":null,"url":null,"abstract":"The development of virtual coupling technology provides solutions to the challenges faced by urban rail transit systems. Train tracking control is a crucial component in the operation of virtual coupling, which plays a pivotal role in ensuring the safe and efficient movement of trains within the train and along the rail network. In order to ensure the high efficiency and safety of train tracking control in virtual coupling, this paper proposes an optimization algorithm based on Soft Actor-Critic for train tracking control in virtual coupling. Firstly, we construct the train tracking model under the reinforcement learning architecture using the operation states of the train, Proportional Integral Derivative (PID) controller output, and train tracking spacing and speed difference as elements of reinforcement learning. The train tracking control reward function is designed. Then, the Soft Actor-Critic (SAC) algorithm is used to train the virtual coupling train tracking reinforcement learning model. Finally, we took the Deep Deterministic Policy Gradient as the comparison algorithm to verify the superiority of the algorithm proposed in this paper.","PeriodicalId":48584,"journal":{"name":"Actuators","volume":"80 4","pages":""},"PeriodicalIF":2.3000,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Actuators","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.3390/act12120447","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, MECHANICAL","Score":null,"Total":0}

引用次数: 0

Abstract

The development of virtual coupling technology provides solutions to the challenges faced by urban rail transit systems. Train tracking control is a crucial component in the operation of virtual coupling, which plays a pivotal role in ensuring the safe and efficient movement of trains within the train and along the rail network. In order to ensure the high efficiency and safety of train tracking control in virtual coupling, this paper proposes an optimization algorithm based on Soft Actor-Critic for train tracking control in virtual coupling. Firstly, we construct the train tracking model under the reinforcement learning architecture using the operation states of the train, Proportional Integral Derivative (PID) controller output, and train tracking spacing and speed difference as elements of reinforcement learning. The train tracking control reward function is designed. Then, the Soft Actor-Critic (SAC) algorithm is used to train the virtual coupling train tracking reinforcement learning model. Finally, we took the Deep Deterministic Policy Gradient as the comparison algorithm to verify the superiority of the algorithm proposed in this paper.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于软演员批判的虚拟耦合中列车追踪新方法

虚拟耦合技术的发展为城市轨道交通系统面临的挑战提供了解决方案。列车跟踪控制是虚拟耦合运行的重要组成部分，对保证列车在列车内和在轨道网络上的安全高效运行起着至关重要的作用。为了保证虚拟耦合下列车跟踪控制的高效性和安全性，本文提出了一种基于软行为者评价的虚拟耦合下列车跟踪控制优化算法。首先，利用列车运行状态、PID控制器输出、列车跟踪间距和速度差作为强化学习的要素，构建强化学习架构下的列车跟踪模型;设计了列车跟踪控制奖励函数。然后，利用软行为者-批评家(SAC)算法对虚拟耦合列车跟踪强化学习模型进行训练。最后，我们以深度确定性策略梯度作为比较算法来验证本文算法的优越性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Actuators Mathematics-Control and Optimization

CiteScore

3.90

自引率

15.40%

发文量

315

审稿时长

11 weeks

期刊介绍： Actuators (ISSN 2076-0825; CODEN: ACTUC3) is an international open access journal on the science and technology of actuators and control systems published quarterly online by MDPI.