A hybrid ensemble deep reinforcement learning model for locomotive axle temperature using the deterministic and probabilistic strategy

IF 3 4区工程技术 Q2 TRANSPORTATION SCIENCE & TECHNOLOGY Transportation Safety and Environment Pub Date : 2022-12-21 DOI:10.1093/tse/tdac055

Guangxi Yan, Hui Liu, Chengqing Yu, Chengming Yu, Ye Li, Zhu Duan

引用次数: 0

Abstract

This paper proposes a hybrid deep reinforcement learning framework for locomotive axle temperature by combining the wavelet packet decomposition (WPD), long short-term memory (LSTM), the gated recurrent unit (GRU) reinforcement learning, and generalized autoregressive conditional heteroskedasticity (GARCH) algorithms. The WPD is utilized to decompose the raw nonlinear series into subseries. Then the deep learning predictors LSTM and GRU are established to predict the future axle temperatures in each subseries. The Q-learning could generate optimal ensemble weights to integrate the predictors to finish the deterministic forecasting and GARCH is used to conduct the deterministic forecasting based on the deterministic forecasting residual. These parts of the hybrid ensemble structure contributed to optimal modeling accuracy and provided effective support in the real-time monitoring and fault diagnosis of transportation.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于确定性和概率策略的机车轴温混合集成深度强化学习模型

结合小波包分解(WPD)、长短期记忆(LSTM)、门控循环单元(GRU)强化学习和广义自回归条件异方差(GARCH)算法，提出了机车轴温混合深度强化学习框架。利用WPD将原始非线性序列分解为子序列。然后建立深度学习预测器LSTM和GRU来预测每个子系列的未来轴温。Q-learning可以生成最优的集合权值来整合预测因子完成确定性预测，并利用GARCH基于确定性预测残差进行确定性预测。这些部分的混合集成结构有助于优化建模精度，为交通运输实时监测和故障诊断提供有效支持。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊