Journal of Intelligent and Connected Vehicles最新文献_第2页

Automatic Labeling and Qualification of Functional Scenarios on the Basis of Sparse Field Observations 基于稀疏场观测的功能场景自动标注与定性

IF 7.8

Journal of Intelligent and Connected Vehicles

Pub Date : 2025-09-01 DOI: 10.26599/JICV.2025.9210058

Hugues Blache;Pierre-Antoine Laharotte;Nour-Eddin El Faouzi

At the dawn of the deployment of connected and automated vehicles (CAVs) on our roads, assessing the safety of new systems is crucial. Given the overwhelming number of situations to test, focusing efforts on the most relevant ones for the system is essential. Qualifying scenarios with respect to their relevance is a challenging task. The scope of relevancy must be defined, and a labeling process applicable to any scenario must be developed. However, gathering information on various scenarios to label them poses a challenge because the flagrant lacks field data. In this study, we assume that relevancy is depicted by a safety criticality level on the basis of time-to-collision. We develop a labeling process for scenarios. It learns latent connections between the words generating scenarios and takes advantage of the latent structure to associate criticality levels with any scenario. Such a prediction model enables one to cope with the lack of data by ensuring the prior qualification of any scenario regardless of the quantity of field observations. This process is applied to scenarios described at a high level of abstraction, called functional scenarios. Criticality levels might be used to guide the application of the sampling strategy to select the scenarios under consideration when testing CAVs. Compared with field observations, the results of our automated process are highly correlated, with $R^2$ values of up to 0.835 on average.

在道路上部署联网和自动驾驶汽车（cav）之初，评估新系统的安全性至关重要。考虑到要测试的情况数量庞大，将精力集中在与系统最相关的情况上是至关重要的。根据它们的相关性来确定场景是一项具有挑战性的任务。必须定义相关的范围，并且必须开发适用于任何场景的标签流程。然而，由于缺乏现场数据，收集各种情景的信息并对其进行标记是一项挑战。在本研究中，我们假设相关性是基于碰撞时间的安全临界水平来描述的。我们为场景开发了一个标签流程。它学习生成场景的单词之间的潜在联系，并利用潜在结构将临界水平与任何场景关联起来。这种预测模式使人们能够处理缺乏数据的问题，因为它确保无论实地观测的数量多少，都能事先确定任何情景。此过程应用于在高层次抽象上描述的场景，称为功能场景。在测试cav时，临界水平可用于指导抽样策略的应用，以选择所考虑的场景。与野外观测结果相比，自动化过程的结果具有较高的相关性，平均R^2$值可达0.835。

{"title":"Automatic Labeling and Qualification of Functional Scenarios on the Basis of Sparse Field Observations","authors":"Hugues Blache;Pierre-Antoine Laharotte;Nour-Eddin El Faouzi","doi":"10.26599/JICV.2025.9210058","DOIUrl":"https://doi.org/10.26599/JICV.2025.9210058","url":null,"abstract":"At the dawn of the deployment of connected and automated vehicles (CAVs) on our roads, assessing the safety of new systems is crucial. Given the overwhelming number of situations to test, focusing efforts on the most relevant ones for the system is essential. Qualifying scenarios with respect to their relevance is a challenging task. The scope of relevancy must be defined, and a labeling process applicable to any scenario must be developed. However, gathering information on various scenarios to label them poses a challenge because the flagrant lacks field data. In this study, we assume that relevancy is depicted by a safety criticality level on the basis of time-to-collision. We develop a labeling process for scenarios. It learns latent connections between the words generating scenarios and takes advantage of the latent structure to associate criticality levels with any scenario. Such a prediction model enables one to cope with the lack of data by ensuring the prior qualification of any scenario regardless of the quantity of field observations. This process is applied to scenarios described at a high level of abstraction, called functional scenarios. Criticality levels might be used to guide the application of the sampling strategy to select the scenarios under consideration when testing CAVs. Compared with field observations, the results of our automated process are highly correlated, with <tex>$R^2$</tex> values of up to 0.835 on average.","PeriodicalId":100793,"journal":{"name":"Journal of Intelligent and Connected Vehicles","volume":"8 3","pages":"9210058-1-9210058-16"},"PeriodicalIF":7.8,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11215930","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145339713","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Young Urban Dwellers' Acceptance of Autonomous Vehicle-Induced Landscape Changes: An Eye-Tracking Study 年轻城市居民对自动驾驶汽车引发的景观变化的接受程度：一项眼动追踪研究

IF 7.8

Journal of Intelligent and Connected Vehicles

Pub Date : 2025-09-01 DOI: 10.26599/JICV.2025.9210062

Miklós Lukovics;Barbara Nagy

Using an autonomous vehicle fleet instead of owning a car might significantly reduce the number of vehicles in cities, which may have important consequences related to land use and the urban landscape. More information is available about these possibilities; at the same time, much less is known about whether urban residents would accept them. Moreover, the majority of research addressing the preferences of urban residents presents findings on the entire population rather than on its specific sections; thus, scarce information is available about the urban landscape preferences of young people, who are highly exposed to autonomous vehicle-driven future mobility. This study aims to determine how much young city dwellers accept potential specific urban landscape changes triggered by autonomous vehicles. The research applied real-time eye-tracking tests and supplementary questionnaires to a sample of 102 participants. The tests were carried out under laboratory conditions, during which the subjects looked at before/after urban landscape pairs of images that depicted the potential urban landscape and land use changes triggered by the mass adoption of autonomous vehicles. The examination of the total fixation duration, the average fixation duration and the average number of fixations indicates that the “after” images were collectively more appealing to the subjects. An analysis of the reasons behind the eye-tracking results revealed that safety and human-centered design were identified as the most significant factors across various image pairs.

使用自动驾驶车队而不是拥有汽车可能会大大减少城市中的车辆数量，这可能会对土地使用和城市景观产生重要影响。关于这些可能性可以获得更多信息；与此同时，城市居民是否会接受它们却知之甚少。此外，大多数关于城市居民偏好的研究都是关于整个人口而不是具体部分的调查结果；因此，关于年轻人的城市景观偏好的信息是稀缺的，他们高度暴露于自动驾驶汽车驱动的未来交通。这项研究旨在确定有多少年轻的城市居民接受由自动驾驶汽车引发的潜在的特定城市景观变化。该研究对102名参与者进行了实时眼球追踪测试和补充问卷调查。测试是在实验室条件下进行的，在此期间，受试者观察了城市景观之前/之后的成对图像，这些图像描绘了大规模采用自动驾驶汽车引发的潜在城市景观和土地利用变化。对总注视时间、平均注视时间和平均注视次数的检测表明，“后”图像总体上对被试更有吸引力。对眼球追踪结果背后原因的分析显示，在各种图像对中，安全性和以人为本的设计被认为是最重要的因素。

{"title":"Young Urban Dwellers' Acceptance of Autonomous Vehicle-Induced Landscape Changes: An Eye-Tracking Study","authors":"Miklós Lukovics;Barbara Nagy","doi":"10.26599/JICV.2025.9210062","DOIUrl":"https://doi.org/10.26599/JICV.2025.9210062","url":null,"abstract":"Using an autonomous vehicle fleet instead of owning a car might significantly reduce the number of vehicles in cities, which may have important consequences related to land use and the urban landscape. More information is available about these possibilities; at the same time, much less is known about whether urban residents would accept them. Moreover, the majority of research addressing the preferences of urban residents presents findings on the entire population rather than on its specific sections; thus, scarce information is available about the urban landscape preferences of young people, who are highly exposed to autonomous vehicle-driven future mobility. This study aims to determine how much young city dwellers accept potential specific urban landscape changes triggered by autonomous vehicles. The research applied real-time eye-tracking tests and supplementary questionnaires to a sample of 102 participants. The tests were carried out under laboratory conditions, during which the subjects looked at before/after urban landscape pairs of images that depicted the potential urban landscape and land use changes triggered by the mass adoption of autonomous vehicles. The examination of the total fixation duration, the average fixation duration and the average number of fixations indicates that the “after” images were collectively more appealing to the subjects. An analysis of the reasons behind the eye-tracking results revealed that safety and human-centered design were identified as the most significant factors across various image pairs.","PeriodicalId":100793,"journal":{"name":"Journal of Intelligent and Connected Vehicles","volume":"8 3","pages":"9210062-1-9210062-13"},"PeriodicalIF":7.8,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11215928","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145339731","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Enhancing Driver Emotion Recognition Through Deep Ensemble Classification 通过深度集成分类增强驾驶员情绪识别

Journal of Intelligent and Connected Vehicles

Pub Date : 2025-06-01 DOI: 10.26599/JICV.2025.9210055

Faizan Zaman;Zhigang Xu;Adil Hussain;Anees Ullah;Khalid Zaman

This research addresses the challenging task of classifying drivers' emotions to increase their awareness of their driving behaviors. It recognizes the common issue of driver emotions, which often leads to the neglect of poor driving practices. By automatically detecting and identifying these behaviors, drivers can proactively obtain valuable insights to reduce potential accidents. This study proposes a comprehensive facial recognition model for drivers that uses a unified architecture comprising a convolutional neural network (CNN), a recurrent neural network (RNN), and a multilayer perceptron (MLP) classification model. Initially, a faster region-based convolutional neural network (R-CNN) was employed for accurate and efficient facial detection of drivers in live and recorded videos. Features are extracted from three CNN models and merged via advanced techniques to create an ensemble classification model. Moreover, the improved Faster R-CNN feature learning module is replaced with a new convolutional neural network module, VGG16, which maximizes the precision and effectiveness of facial detection in our system. Significant accuracy results of 89.2%, 97.20%, 99.01%, 93.65%, and 98.61% are shown in evaluations of our suggested facial detection and facial expression recognition (DFER) datasets, including the EMOTIC, CK+, FERPLUS, AffectNet, and custom datasets. These datasets were meticulously acquired in a simulated environment, necessitating the creation of several custom datasets. This research highlights the potential of deep ensemble classification in improving driver emotion recognition, thereby contributing to enhanced road safety.

本研究解决了驾驶员情绪分类的挑战性任务，以提高他们对驾驶行为的认识。它认识到司机情绪的普遍问题，这往往导致忽视不良驾驶习惯。通过自动检测和识别这些行为，驾驶员可以主动获得有价值的见解，以减少潜在的事故。本研究提出了一种综合的驾驶员面部识别模型，该模型使用由卷积神经网络（CNN）、递归神经网络（RNN）和多层感知器（MLP）分类模型组成的统一架构。最初，采用更快的基于区域的卷积神经网络（R-CNN）对直播和录制视频中的驾驶员进行准确高效的面部检测。从三个CNN模型中提取特征，并通过先进的技术进行合并，创建一个集成分类模型。此外，改进的Faster R-CNN特征学习模块被新的卷积神经网络模块VGG16所取代，使我们的系统中人脸检测的精度和有效性最大化。在我们建议的面部检测和面部表情识别（DFER）数据集（包括EMOTIC， CK+， FERPLUS， AffectNet和自定义数据集）的评估中，准确率分别为89.2%,97.20%,99.01%，93.65%和98.61%。这些数据集是在模拟环境中精心获取的，因此需要创建几个自定义数据集。本研究强调了深度集成分类在提高驾驶员情绪识别方面的潜力，从而有助于提高道路安全。

{"title":"Enhancing Driver Emotion Recognition Through Deep Ensemble Classification","authors":"Faizan Zaman;Zhigang Xu;Adil Hussain;Anees Ullah;Khalid Zaman","doi":"10.26599/JICV.2025.9210055","DOIUrl":"https://doi.org/10.26599/JICV.2025.9210055","url":null,"abstract":"This research addresses the challenging task of classifying drivers' emotions to increase their awareness of their driving behaviors. It recognizes the common issue of driver emotions, which often leads to the neglect of poor driving practices. By automatically detecting and identifying these behaviors, drivers can proactively obtain valuable insights to reduce potential accidents. This study proposes a comprehensive facial recognition model for drivers that uses a unified architecture comprising a convolutional neural network (CNN), a recurrent neural network (RNN), and a multilayer perceptron (MLP) classification model. Initially, a faster region-based convolutional neural network (R-CNN) was employed for accurate and efficient facial detection of drivers in live and recorded videos. Features are extracted from three CNN models and merged via advanced techniques to create an ensemble classification model. Moreover, the improved Faster R-CNN feature learning module is replaced with a new convolutional neural network module, VGG16, which maximizes the precision and effectiveness of facial detection in our system. Significant accuracy results of 89.2%, 97.20%, 99.01%, 93.65%, and 98.61% are shown in evaluations of our suggested facial detection and facial expression recognition (DFER) datasets, including the EMOTIC, CK+, FERPLUS, AffectNet, and custom datasets. These datasets were meticulously acquired in a simulated environment, necessitating the creation of several custom datasets. This research highlights the potential of deep ensemble classification in improving driver emotion recognition, thereby contributing to enhanced road safety.","PeriodicalId":100793,"journal":{"name":"Journal of Intelligent and Connected Vehicles","volume":"8 2","pages":"9210055-1-9210055-16"},"PeriodicalIF":0.0,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144646570","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Decision-Making of Drivers Following Autonomous Vehicles: Developing a Bayesian Network on the Basis of Field Tests and Questionnaire Data 驾驶人跟随自动驾驶汽车的决策：基于现场测试和问卷调查数据的贝叶斯网络

Journal of Intelligent and Connected Vehicles

Pub Date : 2025-06-01 DOI: 10.26599/JICV.2025.9210057

Fang Zong;Huan Wu;Meng Zeng;Won Kim;Qiaowen Bai;Yafeng Gong;Ruifeng Duan;Ying Guo

With the development of autonomous driving technology, traffic mixed with human-driven vehicles (HDVs) and autonomous vehicles (AVs) has dominated transportation systems for a long period of time. Drivers' car-following decision-making in mixed traffic needs to be considered for traffic simulation and management policy formulation. This study aims to explore the differences in drivers' decision-making mechanisms when following AVs and HDVs. Data from a questionnaire survey and a field test are collected and employed to establish a Bayesian network for car-following decision-making process analysis and inference. The influences of driving habits and recognition of AVs on car-following decisions and the correlations among the four decision variables are analyzed. The four decision variables consist of the vehicle gap and acceleration in both the acceleration and deceleration phases. The results show that there are direct correlations among the four internal decision variables. Among the external variables, overspeeding and honking have distinct impacts on decisions made while following an AV. Moreover, regardless of whether they are in an acceleration or deceleration phase, most drivers tend to make gentler decisions when following AVs than when following HDVs. On the basis of the results, we propose some strategies for the traffic management of mixed traffic that are beneficial to traffic efficiency: (1) Improving drivers' recognition of AVs; (2) embedding the external sensing devices of AVs internally to make them visually similar to HDVs; and (3) establishing dedicated lanes for AVs. The research results have important reference significance for simulating car-following behavior, designing traffic control facilities and formulating policies under mixed traffic scenarios.

随着自动驾驶技术的发展，人类驾驶车辆（HDVs）和自动驾驶车辆（AVs）混合交通在很长一段时间内主导着交通系统。混合交通中驾驶员的跟车决策是交通仿真和管理政策制定需要考虑的问题。本研究旨在探讨驾驶员跟随自动驾驶汽车和自动驾驶汽车时决策机制的差异。收集问卷调查和现场试验数据，建立贝叶斯网络，进行跟车决策过程分析和推理。分析了驾驶习惯和自动驾驶汽车识别对汽车跟随决策的影响以及四个决策变量之间的相关性。这四个决策变量包括加速和减速阶段的车辆间隙和加速度。结果表明，四个内部决策变量之间存在直接相关关系。在外部变量中，超速和鸣笛对跟随自动驾驶时的决策有明显的影响。而且，无论处于加速还是减速阶段，大多数驾驶员在跟随自动驾驶时的决策倾向于比跟随hdv时更温和。在此基础上，提出了有利于提高交通效率的混合交通交通管理策略：(1)提高驾驶员对自动驾驶汽车的识别；(2)在自动驾驶汽车内部嵌入外部传感装置，使其在视觉上与hdv相似；(3)建立自动驾驶汽车专用车道。研究结果对混合交通场景下车辆跟随行为模拟、交通控制设施设计和政策制定具有重要的参考意义。

{"title":"Decision-Making of Drivers Following Autonomous Vehicles: Developing a Bayesian Network on the Basis of Field Tests and Questionnaire Data","authors":"Fang Zong;Huan Wu;Meng Zeng;Won Kim;Qiaowen Bai;Yafeng Gong;Ruifeng Duan;Ying Guo","doi":"10.26599/JICV.2025.9210057","DOIUrl":"https://doi.org/10.26599/JICV.2025.9210057","url":null,"abstract":"With the development of autonomous driving technology, traffic mixed with human-driven vehicles (HDVs) and autonomous vehicles (AVs) has dominated transportation systems for a long period of time. Drivers' car-following decision-making in mixed traffic needs to be considered for traffic simulation and management policy formulation. This study aims to explore the differences in drivers' decision-making mechanisms when following AVs and HDVs. Data from a questionnaire survey and a field test are collected and employed to establish a Bayesian network for car-following decision-making process analysis and inference. The influences of driving habits and recognition of AVs on car-following decisions and the correlations among the four decision variables are analyzed. The four decision variables consist of the vehicle gap and acceleration in both the acceleration and deceleration phases. The results show that there are direct correlations among the four internal decision variables. Among the external variables, overspeeding and honking have distinct impacts on decisions made while following an AV. Moreover, regardless of whether they are in an acceleration or deceleration phase, most drivers tend to make gentler decisions when following AVs than when following HDVs. On the basis of the results, we propose some strategies for the traffic management of mixed traffic that are beneficial to traffic efficiency: (1) Improving drivers' recognition of AVs; (2) embedding the external sensing devices of AVs internally to make them visually similar to HDVs; and (3) establishing dedicated lanes for AVs. The research results have important reference significance for simulating car-following behavior, designing traffic control facilities and formulating policies under mixed traffic scenarios.","PeriodicalId":100793,"journal":{"name":"Journal of Intelligent and Connected Vehicles","volume":"8 2","pages":"9210057-1-9210057-18"},"PeriodicalIF":0.0,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144646574","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

APM-SLAM: Visual Localization for Fixed Routes with Tightly Coupled a Priori Map APM-SLAM：基于紧耦合先验地图的固定路线视觉定位

Journal of Intelligent and Connected Vehicles

Pub Date : 2025-06-01 DOI: 10.26599/JICV.2025.9210063

Linsong Xue;Qi Luo;Kai Zhang

Localization along fixed routes is the fundamental function of transportation applications, including patrol vehicles, shuttles, buses, and even passenger vehicles. To achieve accurate and reliable localization, we propose a tightly coupled A Priori Map Simultaneous Localization and Mapping (APM-SLAM) system. APM-SLAM provides a comprehensive and heterogeneous framework, encompassing both mapping and localization processes. The mapping stage leverages Global Navigation Satellite System (GNSS)-aided Structure from Motion (SfM) to establish reliable a priori maps with coarse-and fine-level components. The localization process integrates coarse-to-fine matching with Maximum A Posteriori (MAP) Probability estimation to refine pose accuracy. By incorporating deep learning-based features and point descriptors, our system maintains robustness even in scenarios with significant visual variation. Unlike traditional map-based approaches, APM-SLAM models the a priori map's point structures as probabilistic distributions and incorporates them into the optimization process. Extensive experiments on public datasets demonstrate the superiority of our method in both mapping precision and localization accuracy, achieving decimeter-level translation precision. Ablation studies further validate the effectiveness of each component within our system. This work contributes to establishing maps and utilizing a priori information for localization simultaneously.

沿着固定路线定位是交通应用的基本功能，包括巡逻车、班车、公共汽车，甚至客运车辆。为了实现准确可靠的定位，我们提出了一种紧密耦合的先验地图同步定位与制图（APM-SLAM）系统。APM-SLAM提供了一个全面的异构框架，包括映射和定位过程。制图阶段利用全球导航卫星系统（GNSS）辅助运动结构（SfM）建立可靠的先验地图，包括粗级和精细级组件。定位过程结合了粗精匹配和最大后验概率（MAP）估计来提高姿态精度。通过结合基于深度学习的特征和点描述符，我们的系统即使在具有显著视觉变化的场景中也能保持鲁棒性。与传统的基于地图的方法不同，APM-SLAM将先验地图的点结构建模为概率分布，并将其纳入优化过程。在公共数据集上的大量实验证明了我们的方法在制图精度和定位精度上的优势，达到了分米级的翻译精度。消融研究进一步验证了我们系统中每个组件的有效性。这项工作有助于同时建立地图和利用先验信息进行定位。

{"title":"APM-SLAM: Visual Localization for Fixed Routes with Tightly Coupled a Priori Map","authors":"Linsong Xue;Qi Luo;Kai Zhang","doi":"10.26599/JICV.2025.9210063","DOIUrl":"https://doi.org/10.26599/JICV.2025.9210063","url":null,"abstract":"Localization along fixed routes is the fundamental function of transportation applications, including patrol vehicles, shuttles, buses, and even passenger vehicles. To achieve accurate and reliable localization, we propose a tightly coupled A Priori Map Simultaneous Localization and Mapping (APM-SLAM) system. APM-SLAM provides a comprehensive and heterogeneous framework, encompassing both mapping and localization processes. The mapping stage leverages Global Navigation Satellite System (GNSS)-aided Structure from Motion (SfM) to establish reliable a priori maps with coarse-and fine-level components. The localization process integrates coarse-to-fine matching with Maximum A Posteriori (MAP) Probability estimation to refine pose accuracy. By incorporating deep learning-based features and point descriptors, our system maintains robustness even in scenarios with significant visual variation. Unlike traditional map-based approaches, APM-SLAM models the a priori map's point structures as probabilistic distributions and incorporates them into the optimization process. Extensive experiments on public datasets demonstrate the superiority of our method in both mapping precision and localization accuracy, achieving decimeter-level translation precision. Ablation studies further validate the effectiveness of each component within our system. This work contributes to establishing maps and utilizing a priori information for localization simultaneously.","PeriodicalId":100793,"journal":{"name":"Journal of Intelligent and Connected Vehicles","volume":"8 2","pages":"9210063-1-9210063-15"},"PeriodicalIF":0.0,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144646571","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Multimodal Large-Language Model Empowering Next-Generation Autonomous Driving Systems 支持下一代自动驾驶系统的多模态大语言模型

Journal of Intelligent and Connected Vehicles

Pub Date : 2025-06-01 DOI: 10.26599/JICV.2025.9210059

Zhiqiang Hu;Mingxing Xu;Qixiu Cheng

Autonomous driving technology has made significant advancements in recent years. The evolution of autonomous driving systems from traditional modular designs to end-to-end learning paradigms has led to comprehensive improvements in driving capabilities. In modular designs, driving tasks are segmented into independent modules, such as perception, decision-making, planning, and control. This modular structure offers high explainability and safety in simple scenarios but is hindered by limited generalizability in complex traffic environments, and the sequential connection of multiple modules often leads to error accumulation. In contrast, end-to-end methods process perception data directly to produce control outputs, thereby mitigating information loss and sequential error accumulation, ultimately improving scene generalization in diverse environments. However, this approach is limited by strong data dependency, low interpretability, and inadequate handling of long-tail scenarios (Zhao et al., 2024).

近年来，自动驾驶技术取得了重大进展。自动驾驶系统从传统的模块化设计向端到端学习范式的演变，导致了驾驶能力的全面提高。在模块化设计中，驾驶任务被分割成独立的模块，如感知、决策、规划和控制。这种模块化结构在简单场景下具有较高的可解释性和安全性，但在复杂交通环境下泛化能力有限，且多个模块的顺序连接往往导致错误积累。相比之下，端到端方法直接处理感知数据以产生控制输出，从而减少信息丢失和顺序误差积累，最终提高不同环境下的场景泛化。然而，这种方法受到数据依赖性强、可解释性低以及对长尾场景处理不足的限制（Zhao et al., 2024）。

引用次数: 0

InVDriver: Intra-Instance Aware Vectorized Query-Based Autonomous Driving Transformer InVDriver：基于实例感知的矢量化查询的自动驾驶变压器

Journal of Intelligent and Connected Vehicles

Pub Date : 2025-06-01 DOI: 10.26599/JICV.2025.9210060

Bo Zhang;Heye Huang;Chunyang Liu;Yaqin Zhang;Zhenhua Xu

End-to-end autonomous driving, with its holistic optimization capabilities, has gained increasing traction in academia and industry. Vectorized representations, which preserve instance-level topological information while reducing computational overhead, have emerged as promising paradigms. However, existing vectorized query-based frameworks often overlook the inherent spatial correlations among intra-instance points, resulting in geometrically inconsistent outputs (e.g., fragmented HD map elements or oscillatory trajectories). To address these limitations, we propose intra-instance vectorized driving transformer (InVDriver), a novel vectorized query-based system that systematically models intra-instance spatial dependencies through masked self-attention layers, thereby enhancing planning accuracy and trajectory smoothness. Across all core modules, i.e., perception, prediction, and planning, InVDriver incorporates masked self-attention mechanisms that restrict attention to intra-instance point interactions, enabling coordinated refinement of structural elements while suppressing irrelevant inter-instance noise. The experimental results on the nuScenes benchmark demonstrate that InVDriver achieves state-of-the-art performance, surpassing prior methods in both accuracy and safety, while maintaining high computational efficiency.

端到端自动驾驶以其整体优化能力在学术界和产业界获得了越来越多的关注。向量化表示在减少计算开销的同时保留实例级拓扑信息，已成为一种很有前途的范式。然而，现有的基于矢量化查询的框架往往忽略了实例内点之间固有的空间相关性，导致几何上不一致的输出（例如，碎片化的高清地图元素或振荡轨迹）。为了解决这些限制，我们提出了实例内矢量化驱动变压器（InVDriver），这是一种新颖的基于矢量化查询的系统，通过屏蔽自关注层系统地建模实例内空间依赖关系，从而提高规划精度和轨迹平滑度。在所有核心模块（即感知、预测和规划）中，InVDriver结合了隐藏的自关注机制，将注意力限制在实例内点交互上，从而在抑制不相关的实例间噪声的同时，实现对结构元素的协调细化。在nuScenes基准测试上的实验结果表明，InVDriver达到了最先进的性能，在精度和安全性方面超过了先前的方法，同时保持了较高的计算效率。

{"title":"InVDriver: Intra-Instance Aware Vectorized Query-Based Autonomous Driving Transformer","authors":"Bo Zhang;Heye Huang;Chunyang Liu;Yaqin Zhang;Zhenhua Xu","doi":"10.26599/JICV.2025.9210060","DOIUrl":"https://doi.org/10.26599/JICV.2025.9210060","url":null,"abstract":"End-to-end autonomous driving, with its holistic optimization capabilities, has gained increasing traction in academia and industry. Vectorized representations, which preserve instance-level topological information while reducing computational overhead, have emerged as promising paradigms. However, existing vectorized query-based frameworks often overlook the inherent spatial correlations among intra-instance points, resulting in geometrically inconsistent outputs (e.g., fragmented HD map elements or oscillatory trajectories). To address these limitations, we propose intra-instance vectorized driving transformer (InVDriver), a novel vectorized query-based system that systematically models intra-instance spatial dependencies through masked self-attention layers, thereby enhancing planning accuracy and trajectory smoothness. Across all core modules, i.e., perception, prediction, and planning, InVDriver incorporates masked self-attention mechanisms that restrict attention to intra-instance point interactions, enabling coordinated refinement of structural elements while suppressing irrelevant inter-instance noise. The experimental results on the nuScenes benchmark demonstrate that InVDriver achieves state-of-the-art performance, surpassing prior methods in both accuracy and safety, while maintaining high computational efficiency.","PeriodicalId":100793,"journal":{"name":"Journal of Intelligent and Connected Vehicles","volume":"8 2","pages":"9210060-1-9210060-8"},"PeriodicalIF":0.0,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144646572","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A Trajectory Planning and Tracking Method Based on Deep Hierarchical Reinforcement Learning 一种基于深度层次强化学习的轨迹规划与跟踪方法

Journal of Intelligent and Connected Vehicles

Pub Date : 2025-06-01 DOI: 10.26599/JICV.2025.9210056

Jiajie Zhang;Bao-Lin Ye;Xin Wang;Lingxi Li;Bo Song

To improve the driving efficiency of unmanned vehicles in a complex urban traffic flow environment and the safety and passenger comfort of vehicles when changing lanes, we propose a hierarchical reinforcement learning (HRL)-based vehicle trajectory planning and tracking method. First, we present a hierarchical control framework for vehicle trajectory tracking that is based on deep reinforcement learning (DRL) and model predictive control (MPC). We design an upper-level decision model based on the trust region policy optimization algorithm integrated with long short-term memory to obtain more accurate strategies. Second, to improve stability and passenger comfort, we constructed a lower controller that combines the Bezier curve fitting method and an MPC controller. Finally, the proposed method was simulated via the car learning to act (CARLA) simulator, which is based on an unreal engine. Random urban traffic-flow test scenarios were used to simulate a real urban road-traffic environment. The simulation results illustrate that the proposed method can complete the vehicle trajectory planning and tracking task well. Compared with the existing RL methods, our proposed method has the lowest collision rate of 1.5% and achieves an average speed improvement of 7.04%. Moreover, our proposed method has better comfort performance and lower fuel consumption during the driving process.

为了提高无人驾驶汽车在复杂城市交通流环境下的行驶效率以及车辆变道时的安全性和乘客舒适性，提出了一种基于分层强化学习（HRL）的车辆轨迹规划与跟踪方法。首先，我们提出了一种基于深度强化学习（DRL）和模型预测控制（MPC）的车辆轨迹跟踪分层控制框架。为了获得更精确的策略，我们设计了一个基于信任域策略优化算法并结合长短期记忆的上层决策模型。其次，为了提高稳定性和乘客舒适度，我们构建了一个结合Bezier曲线拟合方法和MPC控制器的下控制器。最后，通过基于虚幻引擎的汽车行为学习（CARLA）模拟器对所提方法进行了仿真。采用随机城市交通流测试场景模拟真实城市道路交通环境。仿真结果表明，该方法能较好地完成车辆轨迹规划和跟踪任务。与现有的RL方法相比，本文方法的碰撞率最低，为1.5%，平均速度提高7.04%。此外，该方法在驾驶过程中具有更好的舒适性和更低的油耗。

{"title":"A Trajectory Planning and Tracking Method Based on Deep Hierarchical Reinforcement Learning","authors":"Jiajie Zhang;Bao-Lin Ye;Xin Wang;Lingxi Li;Bo Song","doi":"10.26599/JICV.2025.9210056","DOIUrl":"https://doi.org/10.26599/JICV.2025.9210056","url":null,"abstract":"To improve the driving efficiency of unmanned vehicles in a complex urban traffic flow environment and the safety and passenger comfort of vehicles when changing lanes, we propose a hierarchical reinforcement learning (HRL)-based vehicle trajectory planning and tracking method. First, we present a hierarchical control framework for vehicle trajectory tracking that is based on deep reinforcement learning (DRL) and model predictive control (MPC). We design an upper-level decision model based on the trust region policy optimization algorithm integrated with long short-term memory to obtain more accurate strategies. Second, to improve stability and passenger comfort, we constructed a lower controller that combines the Bezier curve fitting method and an MPC controller. Finally, the proposed method was simulated via the car learning to act (CARLA) simulator, which is based on an unreal engine. Random urban traffic-flow test scenarios were used to simulate a real urban road-traffic environment. The simulation results illustrate that the proposed method can complete the vehicle trajectory planning and tracking task well. Compared with the existing RL methods, our proposed method has the lowest collision rate of 1.5% and achieves an average speed improvement of 7.04%. Moreover, our proposed method has better comfort performance and lower fuel consumption during the driving process.","PeriodicalId":100793,"journal":{"name":"Journal of Intelligent and Connected Vehicles","volume":"8 2","pages":"9210056-1-9210056-9"},"PeriodicalIF":0.0,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144646569","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Smart Prediction-Planning Algorithm for Connected and Autonomous Vehicle Based on Social Value Orientation 基于社会价值取向的车联网自动驾驶智能预测规划算法

Journal of Intelligent and Connected Vehicles

Pub Date : 2025-03-01 DOI: 10.26599/JICV.2024.9210053

Donglei Rong;Yuefeng Wu;Wenjun Du;Chengcheng Yang;Sheng Jin;Min Xu;Fujian Wang

To improve the adaptability of Connected and Automated Vehicles (CAVs) in mixed traffic, this study proposes a prediction model training indicator that comprehensively considers drivers' Social Value Orientation (SVO) and planning goals. Active Influence Factor (AIF) is used as the goal to predict the future safety loss and consistency loss of CAVs. Second, an objective function based on SVO is constructed to understand the driver's characteristics to evaluate the safety, comfort, efficiency, and consistency of candidate trajectories. The results showed that integrating SVO and consistency functions can help ensure that CAVs drive under a more stable risk potential energy field. The prediction planning model that considers SVO can improve the reliability of the CAV output trajectory to a certain extent. The prediction planning under the AIF has better accuracy and stability of the output trajectory; however, it still has strong adaptability and superiority under different sensitivity parameters. The minimum and maximum standard deviations of our model are 0.78 and 0.78 m, respectively, whereas the minimum and maximum standard deviations of the comparative model reach 2.07 and 4.56 m, respectively. The minimum standard deviation of the other comparative model reaches 1.35 m, and the maximum standard deviation reaches 4.45 m.

为了提高网联自动驾驶汽车在混合交通中的适应性，本研究提出了综合考虑驾驶员社会价值取向（SVO）和规划目标的预测模型训练指标。以主动影响因子（Active Influence Factor， AIF）为目标，预测cav未来的安全性损失和一致性损失。其次，构建基于SVO的目标函数，了解驾驶员特征，评价候选轨迹的安全性、舒适性、效率性和一致性；结果表明，将SVO与一致性函数相结合，可以保证自动驾驶汽车在更稳定的风险势能场下行驶。考虑SVO的预测规划模型可以在一定程度上提高CAV输出轨迹的可靠性。AIF下的预测规划具有更好的输出轨迹精度和稳定性；但在不同的灵敏度参数下仍具有较强的适应性和优越性。我们模型的最小和最大标准差分别为0.78和0.78 m，而比较模型的最小和最大标准差分别为2.07和4.56 m。另一个比较模型的最小标准差为1.35 m，最大标准差为4.45 m。

{"title":"Smart Prediction-Planning Algorithm for Connected and Autonomous Vehicle Based on Social Value Orientation","authors":"Donglei Rong;Yuefeng Wu;Wenjun Du;Chengcheng Yang;Sheng Jin;Min Xu;Fujian Wang","doi":"10.26599/JICV.2024.9210053","DOIUrl":"https://doi.org/10.26599/JICV.2024.9210053","url":null,"abstract":"To improve the adaptability of Connected and Automated Vehicles (CAVs) in mixed traffic, this study proposes a prediction model training indicator that comprehensively considers drivers' Social Value Orientation (SVO) and planning goals. Active Influence Factor (AIF) is used as the goal to predict the future safety loss and consistency loss of CAVs. Second, an objective function based on SVO is constructed to understand the driver's characteristics to evaluate the safety, comfort, efficiency, and consistency of candidate trajectories. The results showed that integrating SVO and consistency functions can help ensure that CAVs drive under a more stable risk potential energy field. The prediction planning model that considers SVO can improve the reliability of the CAV output trajectory to a certain extent. The prediction planning under the AIF has better accuracy and stability of the output trajectory; however, it still has strong adaptability and superiority under different sensitivity parameters. The minimum and maximum standard deviations of our model are 0.78 and 0.78 m, respectively, whereas the minimum and maximum standard deviations of the comparative model reach 2.07 and 4.56 m, respectively. The minimum standard deviation of the other comparative model reaches 1.35 m, and the maximum standard deviation reaches 4.45 m.","PeriodicalId":100793,"journal":{"name":"Journal of Intelligent and Connected Vehicles","volume":"8 1","pages":"1-17"},"PeriodicalIF":0.0,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10960599","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143808988","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Safe and Efficient DRL Driving Policies Using Fuzzy Logic for Urban Lane Changing Scenarios 城市变道场景下基于模糊逻辑的安全高效DRL驾驶策略研究

Journal of Intelligent and Connected Vehicles

Pub Date : 2025-03-01 DOI: 10.26599/JICV.2024.9210054

Ling Han;Xiangyu Ma;Yiren Wang;Lei He;Yipeng Li;Lele Zhang;Qiang Yi

Lane changing is common in driving. Thus, the possibility of traffic accidents occurring during lane changes is high given the complexity of this process. One of the primary objectives of intelligent driving is to increase a vehicle's behavior, making it more similar to that of a real driver. This study proposes a decision-making framework based on deep reinforcement learning (DRL) in a lane-changing scenario, which seeks to find a driving strategy that simultaneously considers the expected lane-changing risks and gains. First, a fuzzy logic lane-changing controller is designed. It outputs the corresponding safety and lane-change gain weights by inputting relevant driving parameters. Second, the obtained weights are brought into the constructed reward function of DRL. The model parameters are designed and trained on the basis of lane-changing behavior. Finally, we conducted experiments in a simulator to evaluate the performance of our developed algorithm in urban scenarios. To visualize and validate the estimated driving intentions, lane-changing strategies were tested under four scenarios. The results show that the average improvement in travel efficiency in the four scenarios is 19%. In addition, the average accident rate in the four scenarios increased by only 4%. We combine fuzzy logic and DRL reward functions to personify the lane-changing behavior of intelligent driving. Compared with conservative strategies that prioritize only safety, this method can considerably improve the number of lane changes and travel efficiency for autonomous vehicles (AVs) on the premise of ensuring safety. The approach provides an effective and explainable method designed for facilitating intelligent driving lane-changing behavior.

换道在驾驶中很常见。因此，考虑到变道过程的复杂性，变道过程中发生交通事故的可能性很高。智能驾驶的主要目标之一是提高车辆的行为，使其更像真正的司机。本研究提出了一种基于深度强化学习（DRL）的变道场景决策框架，寻求同时考虑预期变道风险和收益的驾驶策略。首先，设计了模糊逻辑变道控制器。通过输入相关驾驶参数，输出相应的安全增益权和变道增益权。其次，将得到的权值带入构建的DRL奖励函数中。基于变道行为对模型参数进行设计和训练。最后，我们在模拟器中进行了实验，以评估我们开发的算法在城市场景中的性能。为了可视化和验证估计的驾驶意图，在四种场景下测试了变道策略。结果表明，四种场景下出行效率的平均提升幅度为19%。此外，四种情况下的平均事故率仅增加了4%。将模糊逻辑和DRL奖励函数相结合，拟人化智能驾驶变道行为。与只考虑安全的保守策略相比，该方法在保证安全的前提下，可以显著提高自动驾驶汽车的变道次数和行驶效率。该方法为智能驾驶变道行为的实现提供了一种有效的、可解释的方法。

{"title":"Safe and Efficient DRL Driving Policies Using Fuzzy Logic for Urban Lane Changing Scenarios","authors":"Ling Han;Xiangyu Ma;Yiren Wang;Lei He;Yipeng Li;Lele Zhang;Qiang Yi","doi":"10.26599/JICV.2024.9210054","DOIUrl":"https://doi.org/10.26599/JICV.2024.9210054","url":null,"abstract":"Lane changing is common in driving. Thus, the possibility of traffic accidents occurring during lane changes is high given the complexity of this process. One of the primary objectives of intelligent driving is to increase a vehicle's behavior, making it more similar to that of a real driver. This study proposes a decision-making framework based on deep reinforcement learning (DRL) in a lane-changing scenario, which seeks to find a driving strategy that simultaneously considers the expected lane-changing risks and gains. First, a fuzzy logic lane-changing controller is designed. It outputs the corresponding safety and lane-change gain weights by inputting relevant driving parameters. Second, the obtained weights are brought into the constructed reward function of DRL. The model parameters are designed and trained on the basis of lane-changing behavior. Finally, we conducted experiments in a simulator to evaluate the performance of our developed algorithm in urban scenarios. To visualize and validate the estimated driving intentions, lane-changing strategies were tested under four scenarios. The results show that the average improvement in travel efficiency in the four scenarios is 19%. In addition, the average accident rate in the four scenarios increased by only 4%. We combine fuzzy logic and DRL reward functions to personify the lane-changing behavior of intelligent driving. Compared with conservative strategies that prioritize only safety, this method can considerably improve the number of lane changes and travel efficiency for autonomous vehicles (AVs) on the premise of ensuring safety. The approach provides an effective and explainable method designed for facilitating intelligent driving lane-changing behavior.","PeriodicalId":100793,"journal":{"name":"Journal of Intelligent and Connected Vehicles","volume":"8 1","pages":"1-13"},"PeriodicalIF":0.0,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10960600","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143808987","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0