{"title":"Exploratory Analysis of Automated Vehicle Crashes Using an NLP Pipeline","authors":"Anjnesh Sharma, Na Du","doi":"10.1177/21695067231194987","DOIUrl":null,"url":null,"abstract":"This study utilized a recently released crash dataset of Level 3 automated vehicles (AVs) made publicly available by the National Highway Traffic Safety Administration (NHTSA). The primary objective was to investigate various crash types and identify factors that influence crash severity. To achieve this, we employed a lightweight Natural Language Processing (NLP) pipeline to automatically extract relevant information from crash narratives and categorized the crashes into 15 distinct types. By analyzing the dependency triples derived from the crash narrative using the Stanford CoreNLP library, we determined the similarity between each narrative and the predefined categories. Our findings highlight safety-critical crash scenarios based on real-world data encompassing diverse operational design domains (ODDs), revealing a statistically significant impact of lighting conditions on crash severity. These results contribute to a better understanding of AV crashes and provide valuable insights to enhance the safe testing, integration, and development of AVs in real-world environments.","PeriodicalId":74544,"journal":{"name":"Proceedings of the Human Factors and Ergonomics Society ... Annual Meeting. Human Factors and Ergonomics Society. Annual meeting","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Human Factors and Ergonomics Society ... Annual Meeting. Human Factors and Ergonomics Society. Annual meeting","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/21695067231194987","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This study utilized a recently released crash dataset of Level 3 automated vehicles (AVs) made publicly available by the National Highway Traffic Safety Administration (NHTSA). The primary objective was to investigate various crash types and identify factors that influence crash severity. To achieve this, we employed a lightweight Natural Language Processing (NLP) pipeline to automatically extract relevant information from crash narratives and categorized the crashes into 15 distinct types. By analyzing the dependency triples derived from the crash narrative using the Stanford CoreNLP library, we determined the similarity between each narrative and the predefined categories. Our findings highlight safety-critical crash scenarios based on real-world data encompassing diverse operational design domains (ODDs), revealing a statistically significant impact of lighting conditions on crash severity. These results contribute to a better understanding of AV crashes and provide valuable insights to enhance the safe testing, integration, and development of AVs in real-world environments.