A hybrid data-mining framework for train rescheduling strategy pattern discovery

IF 2.7 | Zone 4, Engineering Technology | Q2, Transportation Science & Technology | Transportation Safety and Environment | Pub Date: 2023-02-16 | DOI: 10.1093/tse/tdad007
Rui Chen, Xu Ge, Ping Huang, Chao Wen
Citations: 0

Abstract

This study presents a hybrid data-mining framework, based on feature selection algorithms and clustering methods, for discovering patterns in high-speed railway train rescheduling strategies (RS). The proposed model consists of two stages. In the first stage, decision tree, random forest, Gradient Boosting Decision Tree (GBDT), and eXtreme Gradient Boosting (XGBoost) models are used to assess feature importance, and the features with a high influence on RS are selected. In the second stage, a K-means clustering method uses the first-stage results to uncover the interdependences between RS and the influencing features. The proposed method can determine the quantitative relationships between RS and the influencing factors. The results show the influence of each factor on RS, the likelihood of different train operation RS under different situations, and the key time periods and key trains to which controllers should pay more attention. This research can help train traffic controllers better understand train operation patterns and provides direction for optimizing rail traffic RS.
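As a rough illustration of the two-step pipeline described in the abstract, the following sketch uses only the Python standard library. It is not the authors' implementation: the data are synthetic, the feature names (`delay_min`, `headway_min`, `hour_of_day`, `noise`) and the labelling rule are invented for the example, and the best single-split Gini gain stands in for the importance scores that the decision tree, random forest, GBDT and XGBoost models would provide. The second step is a plain Lloyd's K-means on the two top-ranked features, followed by the share of each strategy within each cluster.

```python
import random
from collections import Counter

def gini(labels):
    """Gini impurity of a list of class labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())

def best_split_gain(values, labels):
    """Largest Gini decrease achievable by one threshold split on a feature."""
    n = len(labels)
    parent = gini(labels)
    pairs = sorted(zip(values, labels))
    best = 0.0
    for i in range(1, n):
        if pairs[i][0] == pairs[i - 1][0]:
            continue  # no threshold fits between equal values
        left = [lab for _, lab in pairs[:i]]
        right = [lab for _, lab in pairs[i:]]
        child = (len(left) * gini(left) + len(right) * gini(right)) / n
        best = max(best, parent - child)
    return best

def kmeans(points, k, iters=50, seed=0):
    """Plain Lloyd's algorithm; returns (labels, centroids)."""
    rng = random.Random(seed)
    centroids = rng.sample(points, k)
    labels = [0] * len(points)
    for _ in range(iters):
        # Assignment step: nearest centroid by squared Euclidean distance.
        labels = [
            min(range(k), key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centroids[c])))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        new_centroids = []
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                new_centroids.append(tuple(sum(col) / len(members) for col in zip(*members)))
            else:
                new_centroids.append(centroids[c])  # keep an empty cluster's centroid
        if new_centroids == centroids:
            break
        centroids = new_centroids
    return labels, centroids

# Synthetic records; feature names and the labelling rule are hypothetical.
FEATURES = ["delay_min", "headway_min", "hour_of_day", "noise"]
data_rng = random.Random(42)
rows, strategy = [], []
for _ in range(300):
    delay = data_rng.uniform(0, 30)
    headway = data_rng.uniform(3, 15)
    hour = data_rng.uniform(6, 23)
    noise = data_rng.random()
    rows.append((delay, headway, hour, noise))
    # Made-up rule: long delays combined with tight headways lead to strategy 1.
    strategy.append(1 if delay > 10 and headway < 9 else 0)

# Step 1: rank features by single-split Gini gain and keep the top two.
gains = {name: best_split_gain([r[j] for r in rows], strategy)
         for j, name in enumerate(FEATURES)}
selected = sorted(gains, key=gains.get, reverse=True)[:2]

# Step 2: cluster the records on the min-max scaled selected features.
cols = [FEATURES.index(name) for name in selected]
mins = [min(r[j] for r in rows) for j in cols]
maxs = [max(r[j] for r in rows) for j in cols]
points = [tuple((r[j] - mn) / (mx - mn) for j, mn, mx in zip(cols, mins, maxs))
          for r in rows]
labels, centroids = kmeans(points, k=3)

# Relate clusters back to strategies: share of strategy 1 inside each cluster.
for c in range(3):
    members = [s for s, lab in zip(strategy, labels) if lab == c]
    share = sum(members) / len(members) if members else float("nan")
    print(f"cluster {c}: n={len(members)}, share of strategy 1 = {share:.2f}")
```

In practice one would likely swap in library implementations of the tree-based models and of K-means. The per-cluster shares printed at the end are the kind of quantity that links groups of operating conditions to the likelihood of a given RS.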
Source journal: Transportation Safety and Environment (Transportation Science & Technology)
CiteScore: 3.90 | Self-citation rate: 13.60% | Articles published: 32 | Review time: 10 weeks