{"title":"A genetic programming approach for real-time crash prediction to solve trade-off between interpretability and accuracy","authors":"Xiaochi Ma, Jian Lu, Xian Liu, Weibin Qu","doi":"10.1080/19439962.2022.2076756","DOIUrl":null,"url":null,"abstract":"Abstract Real-time crash risk prediction is a hot topic of emerging technology. Due to the lack of basic risk formation theory, previous studies focussed on the application of complex models to improve the accuracy of prediction, ignoring the interpretation of variables, while the traditional statistical analysis method can interpret variables, but the prediction accuracy is poor, which falls into a dilemma of trade-off. In this study, based on the traffic flow information of elevated expressway, an improved genetic programming (GP) approach with elite gene bank is applied to obtain an explicit traffic flow crash risk function to solve the above trade-off problem. Logistic regression and backward-propagation neural network combined with partial dependency plot were used as baseline methods to examine the interpretability and accuracy of GP. It is found that GP prediction model has been proved to be able to select important variables and solve the trade-off dilemma, which has good interpretability and accuracy. The results show that crash risk in the traffic flow mainly comes from the traffic volume, speed of the upstream section, and the speed of the current section. Furthermore, the error of GP comes from the unobserved heterogeneity and crash mechanism theory is proposed.","PeriodicalId":46672,"journal":{"name":"Journal of Transportation Safety & Security","volume":null,"pages":null},"PeriodicalIF":2.4000,"publicationDate":"2022-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Transportation Safety & Security","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1080/19439962.2022.2076756","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"TRANSPORTATION","Score":null,"Total":0}
引用次数: 5
Abstract
Abstract Real-time crash risk prediction is a hot topic of emerging technology. Due to the lack of basic risk formation theory, previous studies focussed on the application of complex models to improve the accuracy of prediction, ignoring the interpretation of variables, while the traditional statistical analysis method can interpret variables, but the prediction accuracy is poor, which falls into a dilemma of trade-off. In this study, based on the traffic flow information of elevated expressway, an improved genetic programming (GP) approach with elite gene bank is applied to obtain an explicit traffic flow crash risk function to solve the above trade-off problem. Logistic regression and backward-propagation neural network combined with partial dependency plot were used as baseline methods to examine the interpretability and accuracy of GP. It is found that GP prediction model has been proved to be able to select important variables and solve the trade-off dilemma, which has good interpretability and accuracy. The results show that crash risk in the traffic flow mainly comes from the traffic volume, speed of the upstream section, and the speed of the current section. Furthermore, the error of GP comes from the unobserved heterogeneity and crash mechanism theory is proposed.