Ziqian Zhang, Haojie Li, Tiantian Chen, N N Sze, Wenzhang Yang, Yihao Zhang, Gang Ren
{"title":"自动驾驶汽车在与乱穿马路者互动时的决策制定:风险感知深度强化学习方法。","authors":"Ziqian Zhang, Haojie Li, Tiantian Chen, N N Sze, Wenzhang Yang, Yihao Zhang, Gang Ren","doi":"10.1016/j.aap.2024.107843","DOIUrl":null,"url":null,"abstract":"<p><p>Jaywalking, as a hazardous crossing behavior, leaves little time for drivers to anticipate and respond promptly, resulting in high crossing risks. The prevalence of Autonomous Vehicle (AV) technologies has offered new solutions for mitigating jaywalking risks. In this study, we propose a risk-aware deep reinforcement learning (DRL) approach for AVs to make decisions safely and efficiently in jaywalker-vehicle interactions. Notably, a risk prediction module is incorporated into the traditional DRL framework, making the AV agent risk-aware. Considering the complexity of jaywalker-vehicle conflicts, an encoder-decoder model is adopted as the risk prediction module, which comprehensively integrates multi-source data and predicts probabilities of the final conflict severity levels. The risk-aware DRL approach is applied in a simulated environment established in Anylogic, where the motion features of jaywalkers and vehicles are calibrated using real-world survey data. The trained driving policies are evaluated from perspectives of safety and efficiency across three scenarios with escalading levels of jaywalker volume. Regarding safety performance, the Baseline policy performs the worst in \"medium jaywalker volume\" scenario and \"high jaywalker volume\" scenario, while our Proposed risk-aware method outperforms the other methods, with the \"low TTC ratio\" metric stabilizing near 0.08. Moreover, as the scenario gets more complex, the superiority of our Proposed risk-aware policy gets more evident. In terms of efficiency performance, our Proposed risk-aware policy ranks the second best, achieving an \"AV delay\" metric around 8.1 s in the \"medium jaywalker volume\" scenario and 8.5 s in the \"high jaywalker volume\" scenario. In practice, the proposed risk-aware DRL approach can help AV agents perceive potential risks in advance and navigate through potential jaywalking areas safely and efficiently, further enhancing pedestrian safety.</p>","PeriodicalId":6926,"journal":{"name":"Accident; analysis and prevention","volume":"210 ","pages":"107843"},"PeriodicalIF":5.7000,"publicationDate":"2024-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Decision-making of autonomous vehicles in interactions with jaywalkers: A risk-aware deep reinforcement learning approach.\",\"authors\":\"Ziqian Zhang, Haojie Li, Tiantian Chen, N N Sze, Wenzhang Yang, Yihao Zhang, Gang Ren\",\"doi\":\"10.1016/j.aap.2024.107843\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Jaywalking, as a hazardous crossing behavior, leaves little time for drivers to anticipate and respond promptly, resulting in high crossing risks. The prevalence of Autonomous Vehicle (AV) technologies has offered new solutions for mitigating jaywalking risks. In this study, we propose a risk-aware deep reinforcement learning (DRL) approach for AVs to make decisions safely and efficiently in jaywalker-vehicle interactions. Notably, a risk prediction module is incorporated into the traditional DRL framework, making the AV agent risk-aware. Considering the complexity of jaywalker-vehicle conflicts, an encoder-decoder model is adopted as the risk prediction module, which comprehensively integrates multi-source data and predicts probabilities of the final conflict severity levels. The risk-aware DRL approach is applied in a simulated environment established in Anylogic, where the motion features of jaywalkers and vehicles are calibrated using real-world survey data. The trained driving policies are evaluated from perspectives of safety and efficiency across three scenarios with escalading levels of jaywalker volume. Regarding safety performance, the Baseline policy performs the worst in \\\"medium jaywalker volume\\\" scenario and \\\"high jaywalker volume\\\" scenario, while our Proposed risk-aware method outperforms the other methods, with the \\\"low TTC ratio\\\" metric stabilizing near 0.08. Moreover, as the scenario gets more complex, the superiority of our Proposed risk-aware policy gets more evident. In terms of efficiency performance, our Proposed risk-aware policy ranks the second best, achieving an \\\"AV delay\\\" metric around 8.1 s in the \\\"medium jaywalker volume\\\" scenario and 8.5 s in the \\\"high jaywalker volume\\\" scenario. In practice, the proposed risk-aware DRL approach can help AV agents perceive potential risks in advance and navigate through potential jaywalking areas safely and efficiently, further enhancing pedestrian safety.</p>\",\"PeriodicalId\":6926,\"journal\":{\"name\":\"Accident; analysis and prevention\",\"volume\":\"210 \",\"pages\":\"107843\"},\"PeriodicalIF\":5.7000,\"publicationDate\":\"2024-11-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Accident; analysis and prevention\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1016/j.aap.2024.107843\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ERGONOMICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Accident; analysis and prevention","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1016/j.aap.2024.107843","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ERGONOMICS","Score":null,"Total":0}
Decision-making of autonomous vehicles in interactions with jaywalkers: A risk-aware deep reinforcement learning approach.
Jaywalking, as a hazardous crossing behavior, leaves little time for drivers to anticipate and respond promptly, resulting in high crossing risks. The prevalence of Autonomous Vehicle (AV) technologies has offered new solutions for mitigating jaywalking risks. In this study, we propose a risk-aware deep reinforcement learning (DRL) approach for AVs to make decisions safely and efficiently in jaywalker-vehicle interactions. Notably, a risk prediction module is incorporated into the traditional DRL framework, making the AV agent risk-aware. Considering the complexity of jaywalker-vehicle conflicts, an encoder-decoder model is adopted as the risk prediction module, which comprehensively integrates multi-source data and predicts probabilities of the final conflict severity levels. The risk-aware DRL approach is applied in a simulated environment established in Anylogic, where the motion features of jaywalkers and vehicles are calibrated using real-world survey data. The trained driving policies are evaluated from perspectives of safety and efficiency across three scenarios with escalading levels of jaywalker volume. Regarding safety performance, the Baseline policy performs the worst in "medium jaywalker volume" scenario and "high jaywalker volume" scenario, while our Proposed risk-aware method outperforms the other methods, with the "low TTC ratio" metric stabilizing near 0.08. Moreover, as the scenario gets more complex, the superiority of our Proposed risk-aware policy gets more evident. In terms of efficiency performance, our Proposed risk-aware policy ranks the second best, achieving an "AV delay" metric around 8.1 s in the "medium jaywalker volume" scenario and 8.5 s in the "high jaywalker volume" scenario. In practice, the proposed risk-aware DRL approach can help AV agents perceive potential risks in advance and navigate through potential jaywalking areas safely and efficiently, further enhancing pedestrian safety.
期刊介绍:
Accident Analysis & Prevention provides wide coverage of the general areas relating to accidental injury and damage, including the pre-injury and immediate post-injury phases. Published papers deal with medical, legal, economic, educational, behavioral, theoretical or empirical aspects of transportation accidents, as well as with accidents at other sites. Selected topics within the scope of the Journal may include: studies of human, environmental and vehicular factors influencing the occurrence, type and severity of accidents and injury; the design, implementation and evaluation of countermeasures; biomechanics of impact and human tolerance limits to injury; modelling and statistical analysis of accident data; policy, planning and decision-making in safety.