Xiaochi Ma , Zongxin Huo , Jian Lu , Yiik Diew Wong
{"title":"Deep Forest with SHapley additive explanations on detailed risky driving behavior data for freeway crash risk prediction","authors":"Xiaochi Ma , Zongxin Huo , Jian Lu , Yiik Diew Wong","doi":"10.1016/j.engappai.2024.109787","DOIUrl":null,"url":null,"abstract":"<div><div>Freeway crash risk prediction is a critical component of traffic safety management, yet existing crash risk models often fail to capture complex driving behaviors and lack interpretability. This study introduces a novel freeway crash risk prediction framework based on the Deep Forest (DF) algorithm, considering the detailed risky driving behavior data. The DF model integrates multi-grained scanning and cascade forest layers, enabling it to capture the complex relationship between risky driving behavior features. SHapley Additive Explanations (SHAP) are applied to interpret the model's predictions, including both SHAP summary and interaction results. Additionally, ablation studies are conducted to evaluate the contributions of key components like multi-grained scanning and cascade structures to the model's performance. The experimental results demonstrate that the DF model outperforms traditional machine learning models. The DF model achieves an area under the receiver operating characteristic curve of 0.825, with a balanced Sensitivity of 0.75 and Specificity of 0.816, surpassing other models. The ablation studies show that removing multi-grained scanning, cascade layers, or completely random tree forest leads to performance declines, confirming the importance of each component. The SHAP analysis highlights that sharp acceleration and braking behaviors have the most significant impact on crash risk, offering clear, interpretable insights into how driving behaviors contribute to risk. Overall, the DF model's superior performance and SHAP-based interpretability provide a powerful tool for traffic safety management. These findings emphasize the value of incorporating both driving behavior intensity and model interpretability into crash risk prediction, offering practical applications for reducing crash rates.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"141 ","pages":"Article 109787"},"PeriodicalIF":7.5000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197624019468","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Freeway crash risk prediction is a critical component of traffic safety management, yet existing crash risk models often fail to capture complex driving behaviors and lack interpretability. This study introduces a novel freeway crash risk prediction framework based on the Deep Forest (DF) algorithm, considering the detailed risky driving behavior data. The DF model integrates multi-grained scanning and cascade forest layers, enabling it to capture the complex relationship between risky driving behavior features. SHapley Additive Explanations (SHAP) are applied to interpret the model's predictions, including both SHAP summary and interaction results. Additionally, ablation studies are conducted to evaluate the contributions of key components like multi-grained scanning and cascade structures to the model's performance. The experimental results demonstrate that the DF model outperforms traditional machine learning models. The DF model achieves an area under the receiver operating characteristic curve of 0.825, with a balanced Sensitivity of 0.75 and Specificity of 0.816, surpassing other models. The ablation studies show that removing multi-grained scanning, cascade layers, or completely random tree forest leads to performance declines, confirming the importance of each component. The SHAP analysis highlights that sharp acceleration and braking behaviors have the most significant impact on crash risk, offering clear, interpretable insights into how driving behaviors contribute to risk. Overall, the DF model's superior performance and SHAP-based interpretability provide a powerful tool for traffic safety management. These findings emphasize the value of incorporating both driving behavior intensity and model interpretability into crash risk prediction, offering practical applications for reducing crash rates.
期刊介绍:
Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.