Railroad accident analysis by machine learning and natural language processing

Raj Bridgelall, Denver D. Tolliver
Journal of Rail Transport Planning & Management, volume 29, Article 100429
DOI: 10.1016/j.jrtpm.2023.100429
Published: 2023-12-09
Impact factor: 2.6 (JCR Q3, Transportation)
URL: https://www.sciencedirect.com/science/article/pii/S2210970623000616

Abstract

The evolving complexities of railroad systems also increase their vulnerability to failure from human error. This study compared the outcomes of two workflows that incorporated 11 different machine learning techniques to identify characteristics of railroad operations that are generally associated with human-caused accidents. The first workflow engineered features from the fixed attribute fields of a large railroad accident database, and the second applied natural language processing to extract features from the unstructured accident narratives. Both workflows applied a Shapley game-theoretic model to rank the importance of features based on their marginal contribution towards predicting accident cause. Among several interesting findings, some of the most unexpected were that human-caused accidents are generally associated with neither high train speeds nor derailment-type accidents, and that shoving cars is riskier than pulling cars. These and other findings from this study can inform management decisions, planning, and policies to minimize the risk of human-caused accidents.
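
The ranking of features by marginal contribution can be illustrated with a minimal sketch of exact Shapley values. This is not the authors' implementation. The feature names (`shoving`, `yard_track`, `high_speed`) and the scores in `toy_value` are hypothetical placeholders chosen only to make the arithmetic easy to follow, and a real workflow would replace `toy_value` with the score of a trained model.

```python
from itertools import combinations
from math import factorial


def shapley_values(features, value):
    """Exact Shapley values for a set function `value` over `features`.

    Each feature's value is its marginal contribution to every coalition
    of the other features, weighted by how often that coalition precedes
    the feature in a random ordering.
    """
    n = len(features)
    result = {}
    for f in features:
        others = [g for g in features if g != f]
        total = 0.0
        for k in range(n):
            weight = factorial(k) * factorial(n - k - 1) / factorial(n)
            for subset in combinations(others, k):
                s = frozenset(subset)
                total += weight * (value(s | {f}) - value(s))
        result[f] = total
    return result


def toy_value(coalition):
    """Hypothetical model score for a coalition of features (made-up numbers)."""
    score = 0.0
    if "shoving" in coalition:
        score += 0.30
    if "yard_track" in coalition:
        score += 0.10
    if "shoving" in coalition and "yard_track" in coalition:
        score += 0.20  # interaction term, shared equally by both features
    return score  # "high_speed" never changes the score


features = ["shoving", "yard_track", "high_speed"]
phi = shapley_values(features, toy_value)

# Rank features from most to least important.
ranking = sorted(phi, key=phi.get, reverse=True)
print(ranking)
```

In this toy setting the values sum to the score of the full feature set, which is the efficiency property that makes Shapley values usable as an additive importance ranking. Exact enumeration grows exponentially with the number of features, so practical workflows rely on approximations.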

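Source journal metrics (Journal of Rail Transport Planning & Management): CiteScore 7.10; self-citation rate 8.10%; articles published: 41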