A Novel Approach for Train Tracking in Virtual Coupling Based on Soft Actor-Critic

IF 2.2 3区 工程技术 Q2 ENGINEERING, MECHANICAL Actuators Pub Date : 2023-12-01 DOI:10.3390/act12120447
Bin Chen, Lei Zhang, Gaoyun Cheng, Yiqing Liu, Junjie Chen
{"title":"A Novel Approach for Train Tracking in Virtual Coupling Based on Soft Actor-Critic","authors":"Bin Chen, Lei Zhang, Gaoyun Cheng, Yiqing Liu, Junjie Chen","doi":"10.3390/act12120447","DOIUrl":null,"url":null,"abstract":"The development of virtual coupling technology provides solutions to the challenges faced by urban rail transit systems. Train tracking control is a crucial component in the operation of virtual coupling, which plays a pivotal role in ensuring the safe and efficient movement of trains within the train and along the rail network. In order to ensure the high efficiency and safety of train tracking control in virtual coupling, this paper proposes an optimization algorithm based on Soft Actor-Critic for train tracking control in virtual coupling. Firstly, we construct the train tracking model under the reinforcement learning architecture using the operation states of the train, Proportional Integral Derivative (PID) controller output, and train tracking spacing and speed difference as elements of reinforcement learning. The train tracking control reward function is designed. Then, the Soft Actor-Critic (SAC) algorithm is used to train the virtual coupling train tracking reinforcement learning model. Finally, we took the Deep Deterministic Policy Gradient as the comparison algorithm to verify the superiority of the algorithm proposed in this paper.","PeriodicalId":48584,"journal":{"name":"Actuators","volume":"80 4","pages":""},"PeriodicalIF":2.2000,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Actuators","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.3390/act12120447","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, MECHANICAL","Score":null,"Total":0}
引用次数: 0

Abstract

The development of virtual coupling technology provides solutions to the challenges faced by urban rail transit systems. Train tracking control is a crucial component in the operation of virtual coupling, which plays a pivotal role in ensuring the safe and efficient movement of trains within the train and along the rail network. In order to ensure the high efficiency and safety of train tracking control in virtual coupling, this paper proposes an optimization algorithm based on Soft Actor-Critic for train tracking control in virtual coupling. Firstly, we construct the train tracking model under the reinforcement learning architecture using the operation states of the train, Proportional Integral Derivative (PID) controller output, and train tracking spacing and speed difference as elements of reinforcement learning. The train tracking control reward function is designed. Then, the Soft Actor-Critic (SAC) algorithm is used to train the virtual coupling train tracking reinforcement learning model. Finally, we took the Deep Deterministic Policy Gradient as the comparison algorithm to verify the superiority of the algorithm proposed in this paper.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于软演员批判的虚拟耦合中列车追踪新方法
虚拟耦合技术的发展为城市轨道交通系统面临的挑战提供了解决方案。列车跟踪控制是虚拟耦合运行的重要组成部分,对保证列车在列车内和在轨道网络上的安全高效运行起着至关重要的作用。为了保证虚拟耦合下列车跟踪控制的高效性和安全性,本文提出了一种基于软行为者评价的虚拟耦合下列车跟踪控制优化算法。首先,利用列车运行状态、PID控制器输出、列车跟踪间距和速度差作为强化学习的要素,构建强化学习架构下的列车跟踪模型;设计了列车跟踪控制奖励函数。然后,利用软行为者-批评家(SAC)算法对虚拟耦合列车跟踪强化学习模型进行训练。最后,我们以深度确定性策略梯度作为比较算法来验证本文算法的优越性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Actuators
Actuators Mathematics-Control and Optimization
CiteScore
3.90
自引率
15.40%
发文量
315
审稿时长
11 weeks
期刊介绍: Actuators (ISSN 2076-0825; CODEN: ACTUC3) is an international open access journal on the science and technology of actuators and control systems published quarterly online by MDPI.
期刊最新文献
Current State, Needs, and Opportunities for Wearable Robots in Military Medical Rehabilitation and Force Protection. Numerical Investigation on the Evolution Process of Different Vortex Structures and Distributed Blowing Control for Dynamic Stall Suppression of Rotor Airfoils Experimental Research on Avoidance Obstacle Control for Mobile Robots Using Q-Learning (QL) and Deep Q-Learning (DQL) Algorithms in Dynamic Environments Design and Control of a Reconfigurable Robot with Rolling and Flying Locomotion Dynamic Path Planning for Mobile Robots by Integrating Improved Sparrow Search Algorithm and Dynamic Window Approach
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1