Using the gradient boosting decision tree (GBDT) algorithm for a train delay prediction model considering the delay propagation feature

IF 2.8 3区 工程技术 Q2 ENGINEERING, MANUFACTURING Advances in Production Engineering & Management Pub Date : 2021-09-30 DOI:10.14743/apem2021.3.400
Y.D. Zhang, L. Liao, Q. Yu, W. Ma, K. Li
{"title":"Using the gradient boosting decision tree (GBDT) algorithm for a train delay prediction model considering the delay propagation feature","authors":"Y.D. Zhang, L. Liao, Q. Yu, W. Ma, K. Li","doi":"10.14743/apem2021.3.400","DOIUrl":null,"url":null,"abstract":"Accurate prediction of train delay is an important basis for the intelligent adjustment of train operation plans. This paper proposes a train delay prediction model that considers the delay propagation feature. The model consists of two parts. The first part is the extraction of delay propagation feature. The best delay classification scheme is determined through the clustering method of delay types for historical data based on the density-based spatial clustering of applications with noise algorithm (DBSCAN), and combining the best delay classification scheme and the k-nearest neighbor (KNN) algorithm to design the classification method of delay type for online data. The delay propagation factor is used to quantify the delay propagation relationship, and on this basis, the horizontal and vertical delay propagation feature are constructed. The second part is the delay prediction, which takes the train operation status feature and delay propagation feature as input feature, and use the gradient boosting decision tree (GBDT) algorithm to complete the prediction. The model was tested and simulated using the actual train operation data, and compared with random forest (RF), support vector regression (SVR) and multilayer perceptron (MLP). The results show that considering the delay propagation feature in the train delay prediction model can further improve the accuracy of train delay prediction. The delay prediction model proposed in this paper can provide a theoretical basis for the intelligentization of railway dispatching, enabling dispatchers to control delays more reasonably, and improve the quality of railway transportation services.","PeriodicalId":48763,"journal":{"name":"Advances in Production Engineering & Management","volume":null,"pages":null},"PeriodicalIF":2.8000,"publicationDate":"2021-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advances in Production Engineering & Management","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.14743/apem2021.3.400","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, MANUFACTURING","Score":null,"Total":0}
引用次数: 3

Abstract

Accurate prediction of train delay is an important basis for the intelligent adjustment of train operation plans. This paper proposes a train delay prediction model that considers the delay propagation feature. The model consists of two parts. The first part is the extraction of delay propagation feature. The best delay classification scheme is determined through the clustering method of delay types for historical data based on the density-based spatial clustering of applications with noise algorithm (DBSCAN), and combining the best delay classification scheme and the k-nearest neighbor (KNN) algorithm to design the classification method of delay type for online data. The delay propagation factor is used to quantify the delay propagation relationship, and on this basis, the horizontal and vertical delay propagation feature are constructed. The second part is the delay prediction, which takes the train operation status feature and delay propagation feature as input feature, and use the gradient boosting decision tree (GBDT) algorithm to complete the prediction. The model was tested and simulated using the actual train operation data, and compared with random forest (RF), support vector regression (SVR) and multilayer perceptron (MLP). The results show that considering the delay propagation feature in the train delay prediction model can further improve the accuracy of train delay prediction. The delay prediction model proposed in this paper can provide a theoretical basis for the intelligentization of railway dispatching, enabling dispatchers to control delays more reasonably, and improve the quality of railway transportation services.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
采用梯度增强决策树(GBDT)算法建立了考虑延迟传播特征的列车延误预测模型
列车延误的准确预测是列车运行计划智能调整的重要依据。提出了一种考虑延迟传播特征的列车延误预测模型。该模型由两部分组成。第一部分是延迟传播特征的提取。基于基于密度的空间聚类应用噪声算法(DBSCAN),通过历史数据延迟类型聚类方法确定最佳延迟分类方案,并将最佳延迟分类方案与k近邻(KNN)算法相结合,设计在线数据延迟类型分类方法。用延迟传播因子来量化延迟传播关系,在此基础上构造水平和垂直延迟传播特征。第二部分是延迟预测,以列车运行状态特征和延迟传播特征作为输入特征,采用梯度增强决策树(GBDT)算法完成预测。利用实际列车运行数据对模型进行了测试和仿真,并与随机森林(RF)、支持向量回归(SVR)和多层感知器(MLP)进行了比较。结果表明,在列车延误预测模型中考虑延误传播特性可以进一步提高列车延误预测的精度。本文提出的延误预测模型可为铁路调度智能化提供理论依据,使调度员更合理地控制延误,提高铁路运输服务质量。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Advances in Production Engineering & Management
Advances in Production Engineering & Management ENGINEERING, MANUFACTURINGMATERIALS SCIENC-MATERIALS SCIENCE, MULTIDISCIPLINARY
CiteScore
5.90
自引率
22.20%
发文量
19
期刊介绍: Advances in Production Engineering & Management (APEM journal) is an interdisciplinary international academic journal published quarterly. The main goal of the APEM journal is to present original, high quality, theoretical and application-oriented research developments in all areas of production engineering and production management to a broad audience of academics and practitioners. In order to bridge the gap between theory and practice, applications based on advanced theory and case studies are particularly welcome. For theoretical papers, their originality and research contributions are the main factors in the evaluation process. General approaches, formalisms, algorithms or techniques should be illustrated with significant applications that demonstrate their applicability to real-world problems. Please note the APEM journal is not intended especially for studying problems in the finance, economics, business, and bank sectors even though the methodology in the paper is quality/project management oriented. Therefore, the papers should include a substantial level of engineering issues in the field of manufacturing engineering.
期刊最新文献
Optimal path planning of a disinfection mobile robot against COVID-19 in a ROS-based research platform A comparative study of different pull control strategies in multi-product manufacturing systems using discrete event simulation The impact of the collaborative workplace on the production system capacity: Simulation modelling vs. real-world application approach Molecular-dynamics study of multi-pulsed ultrafast laser interaction with copper A deep learning-based worker assistance system for error prevention: Case study in a real-world manual assembly
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1