A machine learning approach to understanding the road and traffic environments of crashes involving driver distraction and inattention (DDI) on rural multilane highways
Chenxuan Yang , Jun Liu , Zihe Zhang , Emmanuel Kofi Adanu , Praveena Penmetsa , Steven Jones
{"title":"A machine learning approach to understanding the road and traffic environments of crashes involving driver distraction and inattention (DDI) on rural multilane highways","authors":"Chenxuan Yang , Jun Liu , Zihe Zhang , Emmanuel Kofi Adanu , Praveena Penmetsa , Steven Jones","doi":"10.1016/j.jsr.2024.11.011","DOIUrl":null,"url":null,"abstract":"<div><div><em>Introduction</em>: Driver distraction and inattention (DDI) are major causes of road crashes, especially on rural highways. However, not all instances of distracted or inattentive driving lead to crashes. Previous studies indicate that DDI-related driving behavior is closely associated with low-traffic and less complex driving environments. Nevertheless, it is unclear if these traffic or road environments also increase the likelihood of crashes involving DDI. <em>Method</em>: This study employed machine learning algorithms to identify the factors contributing to DDI-involved crashes on rural highways. This study applied multiple machine learning models including the Light Gradient Boosting Model (LGBM), Random Forest (RF), and Neural Network (NN) to quantify the correlations of DDI-involved crashes related to road and traffic environments. The study leveraged a statewide crash database with unique roadway data that contains variables for median type (e.g., 4-ft flush medians) and roadside access point density. To deal with the extreme imbalance of data, two sampling methods (over and under-sampling) were used to balance the data for machine learning<em>. Results</em>: Modeling results indicated that the road and traffic environments that are strongly linked to DDI-involved crashes in general overlap with the environments that lead to DDI-related driving behavior, except for the truck volumes in traffic. Crashes that involved DDI were more likely to occur in environments with non-traversable medians (compared to 4-ft flush medians), lower-volume traffic, and greater access spacing on roadsides. With regard to truck volumes, a non-linear relationship with the occurrence of DDI-involved crashes was uncovered. Traffic with about 8 to 10% of trucks is associated with the highest likelihood of DDI-involved crashes. <em>Practical Applications:</em> This study provides valuable information for drivers who need to be careful while driving in certain environments with a risk of DDI-involved crashes and for agencies who need to take actions to address the issue of DDI under such environments.</div></div>","PeriodicalId":48224,"journal":{"name":"Journal of Safety Research","volume":"92 ","pages":"Pages 14-26"},"PeriodicalIF":3.9000,"publicationDate":"2024-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Safety Research","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0022437524001609","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ERGONOMICS","Score":null,"Total":0}
引用次数: 0
Abstract
Introduction: Driver distraction and inattention (DDI) are major causes of road crashes, especially on rural highways. However, not all instances of distracted or inattentive driving lead to crashes. Previous studies indicate that DDI-related driving behavior is closely associated with low-traffic and less complex driving environments. Nevertheless, it is unclear if these traffic or road environments also increase the likelihood of crashes involving DDI. Method: This study employed machine learning algorithms to identify the factors contributing to DDI-involved crashes on rural highways. This study applied multiple machine learning models including the Light Gradient Boosting Model (LGBM), Random Forest (RF), and Neural Network (NN) to quantify the correlations of DDI-involved crashes related to road and traffic environments. The study leveraged a statewide crash database with unique roadway data that contains variables for median type (e.g., 4-ft flush medians) and roadside access point density. To deal with the extreme imbalance of data, two sampling methods (over and under-sampling) were used to balance the data for machine learning. Results: Modeling results indicated that the road and traffic environments that are strongly linked to DDI-involved crashes in general overlap with the environments that lead to DDI-related driving behavior, except for the truck volumes in traffic. Crashes that involved DDI were more likely to occur in environments with non-traversable medians (compared to 4-ft flush medians), lower-volume traffic, and greater access spacing on roadsides. With regard to truck volumes, a non-linear relationship with the occurrence of DDI-involved crashes was uncovered. Traffic with about 8 to 10% of trucks is associated with the highest likelihood of DDI-involved crashes. Practical Applications: This study provides valuable information for drivers who need to be careful while driving in certain environments with a risk of DDI-involved crashes and for agencies who need to take actions to address the issue of DDI under such environments.
期刊介绍:
Journal of Safety Research is an interdisciplinary publication that provides for the exchange of ideas and scientific evidence capturing studies through research in all areas of safety and health, including traffic, workplace, home, and community. This forum invites research using rigorous methodologies, encourages translational research, and engages the global scientific community through various partnerships (e.g., this outreach includes highlighting some of the latest findings from the U.S. Centers for Disease Control and Prevention).