基于深度确定性策略梯度的高速列车虚拟耦合控制

Giacomo Basile, Dario Giuseppe Lui, A. Petrillo, S. Santini
{"title":"基于深度确定性策略梯度的高速列车虚拟耦合控制","authors":"Giacomo Basile, Dario Giuseppe Lui, A. Petrillo, S. Santini","doi":"10.1109/ICNSC55942.2022.10004067","DOIUrl":null,"url":null,"abstract":"This work addresses the problem of Virtual Coupling (VC) control for uncertain heterogeneous nonlinear autonomous trains convoys sharing information among each other with Radio Block Center (RBC) and via Train-2-Train (T2T) communication network. To solve the problem we propose a novel no-supervised actor-critic Deep Deterministic Policy Gradient-based (DDPG) controller which drives each train within the convoy to track the reference behaviour, as imposed by the RBC, while maintaining a desired inter-train distance w.r.t. the preceding train. The effectiveness of the proposed approach is evaluated via a numerical analysis which is carried out in Python environment. The first step of validation involves the efficiency of the training process and discloses how the agent has learned the correct behaviour to track the train ahead. Then, we numerically prove how the overall closed-loop trains convoy under the action of the DDPG controller reaches the VC formation despite the presence of external disturbances acting on the train dynamics.","PeriodicalId":230499,"journal":{"name":"2022 IEEE International Conference on Networking, Sensing and Control (ICNSC)","volume":"91 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Deep Deterministic Policy Gradient-based Virtual Coupling Control For High-Speed Train Convoys\",\"authors\":\"Giacomo Basile, Dario Giuseppe Lui, A. Petrillo, S. Santini\",\"doi\":\"10.1109/ICNSC55942.2022.10004067\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This work addresses the problem of Virtual Coupling (VC) control for uncertain heterogeneous nonlinear autonomous trains convoys sharing information among each other with Radio Block Center (RBC) and via Train-2-Train (T2T) communication network. To solve the problem we propose a novel no-supervised actor-critic Deep Deterministic Policy Gradient-based (DDPG) controller which drives each train within the convoy to track the reference behaviour, as imposed by the RBC, while maintaining a desired inter-train distance w.r.t. the preceding train. The effectiveness of the proposed approach is evaluated via a numerical analysis which is carried out in Python environment. The first step of validation involves the efficiency of the training process and discloses how the agent has learned the correct behaviour to track the train ahead. Then, we numerically prove how the overall closed-loop trains convoy under the action of the DDPG controller reaches the VC formation despite the presence of external disturbances acting on the train dynamics.\",\"PeriodicalId\":230499,\"journal\":{\"name\":\"2022 IEEE International Conference on Networking, Sensing and Control (ICNSC)\",\"volume\":\"91 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE International Conference on Networking, Sensing and Control (ICNSC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICNSC55942.2022.10004067\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Networking, Sensing and Control (ICNSC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICNSC55942.2022.10004067","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

本研究解决了不确定异构非线性自主列车车队的虚拟耦合(VC)控制问题,这些车队通过无线电块中心(RBC)和列车-列车(T2T)通信网络相互共享信息。为了解决这个问题,我们提出了一种新的无监督的基于深度确定性策略梯度(Deep Deterministic Policy gradient, DDPG)控制器,它驱动车队内的每列火车跟踪参考行为,就像RBC施加的那样,同时保持与前一列火车相比所需的列车间距离。通过在Python环境中进行的数值分析来评估所提出方法的有效性。验证的第一步涉及训练过程的效率,并揭示智能体如何学习正确的行为来跟踪前方的训练。然后,数值证明了在DDPG控制器的作用下,在存在外部扰动的情况下,整个闭环列车车队是如何到达VC队形的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Deep Deterministic Policy Gradient-based Virtual Coupling Control For High-Speed Train Convoys
This work addresses the problem of Virtual Coupling (VC) control for uncertain heterogeneous nonlinear autonomous trains convoys sharing information among each other with Radio Block Center (RBC) and via Train-2-Train (T2T) communication network. To solve the problem we propose a novel no-supervised actor-critic Deep Deterministic Policy Gradient-based (DDPG) controller which drives each train within the convoy to track the reference behaviour, as imposed by the RBC, while maintaining a desired inter-train distance w.r.t. the preceding train. The effectiveness of the proposed approach is evaluated via a numerical analysis which is carried out in Python environment. The first step of validation involves the efficiency of the training process and discloses how the agent has learned the correct behaviour to track the train ahead. Then, we numerically prove how the overall closed-loop trains convoy under the action of the DDPG controller reaches the VC formation despite the presence of external disturbances acting on the train dynamics.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
High-dimensional Feature Selection in Classification: A Length-Adaptive Evolutionary Approach SFME: Score Fusion from Multiple Experts for Long-tailed Recognition Attention and Cost-Sensitive Graph Neural Network for Imbalanced Node Classification Conditioning Customers' Product Reviews for Accurate Classification Performance Constraint Programming for Modeling and Solving a Hybrid Flow Shop Scheduling Problem
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1