A semantic embedding methodology for motor vehicle crash records: A case study of traffic safety in Manhattan Borough of New York City

IF 2.4 · Zone 3 (Engineering Technology) · JCR Q3 (Transportation) · Journal of Transportation Safety & Security · Pub Date: 2021-10-27 · DOI: 10.1080/19439962.2021.1994681
Yuxuan Wang, Ruoxin Xiong, Hao Yu, Jie Bao, Zhao Yang
Citations: 3

Abstract

This study introduces a hybrid Latent Dirichlet Allocation (LDA) model to uncover hidden crash patterns in a large-scale crash dataset. External semantic descriptions are attached to the raw GPS coordinates of crash events. The K-means clustering algorithm is first applied to determine the land use characteristics of crash points by grouping the surrounding Points of Interest (POIs). Each crash record is then transformed into a formalized label consisting of land use, Annual Average Daily Traffic (AADT), and time stamp, which allows massive traffic crash data to be analyzed as a document corpus. Finally, an LDA-based, data-driven model is used to discover hidden crash patterns from the crash records combined with the external semantic information. The approach is verified using motor vehicle crash data from the Manhattan borough of New York City. This semantic analysis of crash records provides an effective way to investigate hidden information in traffic crashes. Identifying spatial-temporal patterns in motor vehicle crashes can provide insights into underlying traffic behaviors for intelligent policy-making and resource allocation.
Source journal: CiteScore 6.00; self-citation rate 15.40%; articles published: 38