{"title":"整合机器学习和极值理论,通过基于人工智能的视频分析,按严重程度估算碰撞频率","authors":"Fizza Hussain , Yuefeng Li , Md Mazharul Haque","doi":"10.1016/j.commtr.2024.100147","DOIUrl":null,"url":null,"abstract":"<div><div>Traffic conflict techniques rely heavily on the proper identification of conflict extremes, which directly affects the prediction performance of extreme value models. Two sampling techniques, namely, block maxima and peak over threshold, form the core of these models. Several studies have demonstrated the inefficacy of extreme value models based on these sampling approaches, as their crash estimates are too imprecise, hindering their widespread practical use. Recently, anomaly detection techniques for sampling conflict extremes have been used, but their application has been limited to estimating crash frequency without considering the crash severity aspect. To address this research gap, this study proposes a hybrid model of machine learning and extreme value theory within a bivariate framework of traffic conflict measures to estimate crash frequency by severity level. In particular, modified time-to-collision (MTTC) and expected post-collision change in velocity (Delta-<em>V</em> or Δ<em>V</em>) have been proposed in the hybrid modeling framework to estimate rear-end crash frequency by severity level. Rear-end conflicts were identified through artificial intelligence-based video analytics for three four-legged signalized intersections in Brisbane, Australia, using four days of data. Non-stationary bivariate hybrid generalized extreme value models with different anomaly detection/sampling techniques (isolation forest and minimum covariance determinant) were developed. The non-stationarity of traffic conflict extremes was handled by parameterizing model parameters, including location, scale, and both location and scale parameters simultaneously. The results indicate that the bivariate hybrid models can estimate severe and non-severe crashes when compared with historical crash records, thereby demonstrating the viability of the proposed approach. A comparative analysis of two anomaly techniques reveals that the isolation forest model marginally outperforms the minimum covariance determinant model. Overall, the modeling framework presented in this study advances conflict-based safety assessment, where the severity dimension can be captured via bivariate hybrid models.</div></div>","PeriodicalId":100292,"journal":{"name":"Communications in Transportation Research","volume":"4 ","pages":"Article 100147"},"PeriodicalIF":12.5000,"publicationDate":"2024-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Integrating machine learning and extreme value theory for estimating crash frequency-by-severity via AI-based video analytics\",\"authors\":\"Fizza Hussain , Yuefeng Li , Md Mazharul Haque\",\"doi\":\"10.1016/j.commtr.2024.100147\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Traffic conflict techniques rely heavily on the proper identification of conflict extremes, which directly affects the prediction performance of extreme value models. Two sampling techniques, namely, block maxima and peak over threshold, form the core of these models. Several studies have demonstrated the inefficacy of extreme value models based on these sampling approaches, as their crash estimates are too imprecise, hindering their widespread practical use. Recently, anomaly detection techniques for sampling conflict extremes have been used, but their application has been limited to estimating crash frequency without considering the crash severity aspect. To address this research gap, this study proposes a hybrid model of machine learning and extreme value theory within a bivariate framework of traffic conflict measures to estimate crash frequency by severity level. In particular, modified time-to-collision (MTTC) and expected post-collision change in velocity (Delta-<em>V</em> or Δ<em>V</em>) have been proposed in the hybrid modeling framework to estimate rear-end crash frequency by severity level. Rear-end conflicts were identified through artificial intelligence-based video analytics for three four-legged signalized intersections in Brisbane, Australia, using four days of data. Non-stationary bivariate hybrid generalized extreme value models with different anomaly detection/sampling techniques (isolation forest and minimum covariance determinant) were developed. The non-stationarity of traffic conflict extremes was handled by parameterizing model parameters, including location, scale, and both location and scale parameters simultaneously. The results indicate that the bivariate hybrid models can estimate severe and non-severe crashes when compared with historical crash records, thereby demonstrating the viability of the proposed approach. A comparative analysis of two anomaly techniques reveals that the isolation forest model marginally outperforms the minimum covariance determinant model. Overall, the modeling framework presented in this study advances conflict-based safety assessment, where the severity dimension can be captured via bivariate hybrid models.</div></div>\",\"PeriodicalId\":100292,\"journal\":{\"name\":\"Communications in Transportation Research\",\"volume\":\"4 \",\"pages\":\"Article 100147\"},\"PeriodicalIF\":12.5000,\"publicationDate\":\"2024-11-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Communications in Transportation Research\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2772424724000301\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"TRANSPORTATION\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Communications in Transportation Research","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772424724000301","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"TRANSPORTATION","Score":null,"Total":0}
Integrating machine learning and extreme value theory for estimating crash frequency-by-severity via AI-based video analytics
Traffic conflict techniques rely heavily on the proper identification of conflict extremes, which directly affects the prediction performance of extreme value models. Two sampling techniques, namely, block maxima and peak over threshold, form the core of these models. Several studies have demonstrated the inefficacy of extreme value models based on these sampling approaches, as their crash estimates are too imprecise, hindering their widespread practical use. Recently, anomaly detection techniques for sampling conflict extremes have been used, but their application has been limited to estimating crash frequency without considering the crash severity aspect. To address this research gap, this study proposes a hybrid model of machine learning and extreme value theory within a bivariate framework of traffic conflict measures to estimate crash frequency by severity level. In particular, modified time-to-collision (MTTC) and expected post-collision change in velocity (Delta-V or ΔV) have been proposed in the hybrid modeling framework to estimate rear-end crash frequency by severity level. Rear-end conflicts were identified through artificial intelligence-based video analytics for three four-legged signalized intersections in Brisbane, Australia, using four days of data. Non-stationary bivariate hybrid generalized extreme value models with different anomaly detection/sampling techniques (isolation forest and minimum covariance determinant) were developed. The non-stationarity of traffic conflict extremes was handled by parameterizing model parameters, including location, scale, and both location and scale parameters simultaneously. The results indicate that the bivariate hybrid models can estimate severe and non-severe crashes when compared with historical crash records, thereby demonstrating the viability of the proposed approach. A comparative analysis of two anomaly techniques reveals that the isolation forest model marginally outperforms the minimum covariance determinant model. Overall, the modeling framework presented in this study advances conflict-based safety assessment, where the severity dimension can be captured via bivariate hybrid models.