Exploratory Analysis of Automated Vehicle Crashes Using an NLP Pipeline

Proceedings of the Human Factors and Ergonomics Society ... Annual Meeting. Human Factors and Ergonomics Society. Annual meeting Pub Date : 2023-10-25 DOI:10.1177/21695067231194987

Anjnesh Sharma, Na Du

{"title":"Exploratory Analysis of Automated Vehicle Crashes Using an NLP Pipeline","authors":"Anjnesh Sharma, Na Du","doi":"10.1177/21695067231194987","DOIUrl":null,"url":null,"abstract":"This study utilized a recently released crash dataset of Level 3 automated vehicles (AVs) made publicly available by the National Highway Traffic Safety Administration (NHTSA). The primary objective was to investigate various crash types and identify factors that influence crash severity. To achieve this, we employed a lightweight Natural Language Processing (NLP) pipeline to automatically extract relevant information from crash narratives and categorized the crashes into 15 distinct types. By analyzing the dependency triples derived from the crash narrative using the Stanford CoreNLP library, we determined the similarity between each narrative and the predefined categories. Our findings highlight safety-critical crash scenarios based on real-world data encompassing diverse operational design domains (ODDs), revealing a statistically significant impact of lighting conditions on crash severity. These results contribute to a better understanding of AV crashes and provide valuable insights to enhance the safe testing, integration, and development of AVs in real-world environments.","PeriodicalId":74544,"journal":{"name":"Proceedings of the Human Factors and Ergonomics Society ... Annual Meeting. Human Factors and Ergonomics Society. Annual meeting","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Human Factors and Ergonomics Society ... Annual Meeting. Human Factors and Ergonomics Society. Annual meeting","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/21695067231194987","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

This study utilized a recently released crash dataset of Level 3 automated vehicles (AVs) made publicly available by the National Highway Traffic Safety Administration (NHTSA). The primary objective was to investigate various crash types and identify factors that influence crash severity. To achieve this, we employed a lightweight Natural Language Processing (NLP) pipeline to automatically extract relevant information from crash narratives and categorized the crashes into 15 distinct types. By analyzing the dependency triples derived from the crash narrative using the Stanford CoreNLP library, we determined the similarity between each narrative and the predefined categories. Our findings highlight safety-critical crash scenarios based on real-world data encompassing diverse operational design domains (ODDs), revealing a statistically significant impact of lighting conditions on crash severity. These results contribute to a better understanding of AV crashes and provide valuable insights to enhance the safe testing, integration, and development of AVs in real-world environments.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于NLP管道的自动车辆碰撞探索性分析

这项研究利用了美国国家公路交通安全管理局(NHTSA)最近公布的3级自动驾驶汽车(AVs)的碰撞数据集。主要目的是调查各种碰撞类型，并确定影响碰撞严重程度的因素。为了实现这一点，我们使用了一个轻量级的自然语言处理(NLP)管道来自动从崩溃叙述中提取相关信息，并将崩溃分为15种不同的类型。通过使用斯坦福CoreNLP库分析从崩溃叙述中派生的依赖三元组，我们确定了每个叙述与预定义类别之间的相似性。我们的研究结果强调了基于现实世界数据的安全关键型碰撞场景，包括不同的操作设计域(ODDs)，揭示了照明条件对碰撞严重程度的统计显著影响。这些结果有助于更好地理解自动驾驶汽车碰撞，并为增强自动驾驶汽车在现实环境中的安全测试、集成和开发提供有价值的见解。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Proceedings of the Human Factors and Ergonomics Society ... Annual Meeting. Human Factors and Ergonomics Society. Annual meeting

自引率

0.00%

发文量