首页 > 最新文献

IEEE Transactions on Intelligent Vehicles最新文献

英文 中文
Run-Time Introspection of 2D Object Detection in Automated Driving Systems Using Learning Representations 使用学习表征对自动驾驶系统中的二维物体检测进行运行时自省
IF 14 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-04-08 DOI: 10.1109/TIV.2024.3385531
Hakan Yekta Yatbaz;Mehrdad Dianati;Konstantinos Koufos;Roger Woodman
Reliable detection of various objects and road users in the surrounding environment is crucial for the safe operation of automated driving systems (ADS). Despite recent progresses in developing highly accurate object detectors based on Deep Neural Networks (DNNs), they still remain prone to detection errors, which can lead to fatal consequences in safety-critical applications such as ADS. An effective remedy to this problem is to equip the system with run-time monitoring, named as introspection in the context of autonomous systems. Motivated by this, we introduce a novel introspection solution, which operates at the frame level for DNN-based 2D object detection and leverages neural network activation patterns. The proposed approach pre-processes the neural activation patterns of the object detector's backbone using several different modes. To provide extensive comparative analysis and fair comparison, we also adapt and implement several state-of-the-art (SOTA) introspection mechanisms for error detection in 2D object detection, using one-stage and two-stage object detectors evaluated on KITTI and BDD datasets. We compare the performance of the proposed solution in terms of error detection, adaptability to dataset shift, and, computational and memory resource requirements. Our performance evaluation shows that the proposed introspection solution outperforms SOTA methods, achieving an absolute reduction in the missed error ratio of 9% to 17% in the BDD dataset.
可靠地检测周围环境中的各种物体和道路使用者对于自动驾驶系统(ADS)的安全运行至关重要。尽管最近在开发基于深度神经网络(DNN)的高精度物体检测器方面取得了进展,但它们仍然容易出现检测错误,这可能会在自动驾驶系统等对安全至关重要的应用中导致致命后果。解决这一问题的有效方法是为系统配备运行时监控功能,在自主系统中称为自省。受此启发,我们引入了一种新颖的自省解决方案,该方案在帧级进行基于 DNN 的 2D 物体检测,并利用神经网络激活模式。所提出的方法使用几种不同的模式对物体检测器主干的神经激活模式进行预处理。为了提供广泛的对比分析和公平的比较,我们还调整并实施了几种最先进的(SOTA)自省机制,用于二维物体检测中的错误检测,并在 KITTI 和 BDD 数据集上使用单级和双级物体检测器进行了评估。我们从错误检测、对数据集转移的适应性、计算和内存资源需求等方面比较了所提解决方案的性能。我们的性能评估结果表明,所提出的自省解决方案优于 SOTA 方法,在 BDD 数据集中,漏检错误率绝对值降低了 9% 到 17%。
{"title":"Run-Time Introspection of 2D Object Detection in Automated Driving Systems Using Learning Representations","authors":"Hakan Yekta Yatbaz;Mehrdad Dianati;Konstantinos Koufos;Roger Woodman","doi":"10.1109/TIV.2024.3385531","DOIUrl":"https://doi.org/10.1109/TIV.2024.3385531","url":null,"abstract":"Reliable detection of various objects and road users in the surrounding environment is crucial for the safe operation of automated driving systems (ADS). Despite recent progresses in developing highly accurate object detectors based on Deep Neural Networks (DNNs), they still remain prone to detection errors, which can lead to fatal consequences in safety-critical applications such as ADS. An effective remedy to this problem is to equip the system with run-time monitoring, named as \u0000<italic>introspection</i>\u0000 in the context of autonomous systems. Motivated by this, we introduce a novel introspection solution, which operates at the frame level for DNN-based 2D object detection and leverages neural network activation patterns. The proposed approach pre-processes the neural activation patterns of the object detector's backbone using several different modes. To provide extensive comparative analysis and fair comparison, we also adapt and implement several state-of-the-art (SOTA) introspection mechanisms for error detection in 2D object detection, using one-stage and two-stage object detectors evaluated on KITTI and BDD datasets. We compare the performance of the proposed solution in terms of error detection, adaptability to dataset shift, and, computational and memory resource requirements. Our performance evaluation shows that the proposed introspection solution outperforms SOTA methods, achieving an absolute reduction in the missed error ratio of 9% to 17% in the BDD dataset.","PeriodicalId":36532,"journal":{"name":"IEEE Transactions on Intelligent Vehicles","volume":"9 6","pages":"5033-5046"},"PeriodicalIF":14.0,"publicationDate":"2024-04-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141965420","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Share Your Preprint Research with the World! 与世界分享您的预印本研究成果
IF 8.2 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-04-01 DOI: 10.1109/TIV.2024.3413588
{"title":"Share Your Preprint Research with the World!","authors":"","doi":"10.1109/TIV.2024.3413588","DOIUrl":"https://doi.org/10.1109/TIV.2024.3413588","url":null,"abstract":"","PeriodicalId":36532,"journal":{"name":"IEEE Transactions on Intelligent Vehicles","volume":"9 4","pages":"4820-4820"},"PeriodicalIF":8.2,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10555435","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141315166","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sora for Hierarchical Parallel Motion Planner: A Safe End-to-End Method Against OOD Events 分层平行运动规划器 Sora:针对 OOD 事件的安全端到端方法
IF 8.2 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-04-01 DOI: 10.1109/TIV.2024.3392647
Siyu Teng;Ran Yan;Xiaotong Zhang;Yuchen Li;Xingxia Wang;Yutong Wang;Yonglin Tian;Hui Yu;Lingxi Li;Long Chen;Fei-Yue Wang
End-to-end motion planners have shown great potential for enabling fully autonomous driving. However, when facing out-of-distribution (OOD) events, these planners might not guarantee the optimal prediction of control commands. To better enhance safety, an end-to-end method that benefits robust and general policy learning from potential OOD events is urgently desirable. In this perspective, Sore4PMP, a hierarchical parallel motion planner, is presented as a suitable solution. Based on raw perception data and descriptive prompts, Sore4PMP can first leverage the advanced generative capabilities of Sora to generate virtual OOD events, and then integrate these events into the decision-making process, thereby enhancing the robustness and generalization of autonomous vehicles (AVs) in emergency scenarios. With a comprehensive outlook, this perspective aims to provide a potential direction for the development of foundation models coupled with autonomous driving and finally promote the safety, efficiency, reliability, and sustainability of AVs.
端到端运动规划器在实现完全自主驾驶方面展现出巨大潜力。然而,当面临失控(OOD)事件时,这些规划器可能无法保证控制指令的最优预测。为了更好地提高安全性,迫切需要一种能从潜在的 OOD 事件中获得稳健而通用的策略学习的端到端方法。从这个角度出发,分层并行运动规划器 Sore4PMP 就是一个合适的解决方案。基于原始感知数据和描述性提示,Sore4PMP 可首先利用 Sora 的高级生成功能生成虚拟 OOD 事件,然后将这些事件整合到决策过程中,从而增强自动驾驶汽车(AV)在紧急情况下的鲁棒性和通用性。本视角以全面的视角,旨在为与自动驾驶相结合的基础模型的发展提供一个潜在的方向,并最终促进自动驾驶汽车的安全性、效率、可靠性和可持续性。
{"title":"Sora for Hierarchical Parallel Motion Planner: A Safe End-to-End Method Against OOD Events","authors":"Siyu Teng;Ran Yan;Xiaotong Zhang;Yuchen Li;Xingxia Wang;Yutong Wang;Yonglin Tian;Hui Yu;Lingxi Li;Long Chen;Fei-Yue Wang","doi":"10.1109/TIV.2024.3392647","DOIUrl":"https://doi.org/10.1109/TIV.2024.3392647","url":null,"abstract":"End-to-end motion planners have shown great potential for enabling fully autonomous driving. However, when facing out-of-distribution (OOD) events, these planners might not guarantee the optimal prediction of control commands. To better enhance safety, an end-to-end method that benefits robust and general policy learning from potential OOD events is urgently desirable. In this perspective, Sore4PMP, a hierarchical parallel motion planner, is presented as a suitable solution. Based on raw perception data and descriptive prompts, Sore4PMP can first leverage the advanced generative capabilities of Sora to generate virtual OOD events, and then integrate these events into the decision-making process, thereby enhancing the robustness and generalization of autonomous vehicles (AVs) in emergency scenarios. With a comprehensive outlook, this perspective aims to provide a potential direction for the development of foundation models coupled with autonomous driving and finally promote the safety, efficiency, reliability, and sustainability of AVs.","PeriodicalId":36532,"journal":{"name":"IEEE Transactions on Intelligent Vehicles","volume":"9 4","pages":"4573-4576"},"PeriodicalIF":8.2,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141315146","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Transactions on Intelligent Vehicles Information 智能车辆信息论文集
IF 8.2 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-04-01 DOI: 10.1109/TIV.2024.3400796
{"title":"The Transactions on Intelligent Vehicles Information","authors":"","doi":"10.1109/TIV.2024.3400796","DOIUrl":"https://doi.org/10.1109/TIV.2024.3400796","url":null,"abstract":"","PeriodicalId":36532,"journal":{"name":"IEEE Transactions on Intelligent Vehicles","volume":"9 4","pages":"C3-C3"},"PeriodicalIF":8.2,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10555389","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141315161","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IEEE Transactions on Intelligent Vehicles Publication Information 电气和电子工程师学会智能车辆论文集》出版信息
IF 8.2 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-04-01 DOI: 10.1109/TIV.2024.3400798
{"title":"IEEE Transactions on Intelligent Vehicles Publication Information","authors":"","doi":"10.1109/TIV.2024.3400798","DOIUrl":"https://doi.org/10.1109/TIV.2024.3400798","url":null,"abstract":"","PeriodicalId":36532,"journal":{"name":"IEEE Transactions on Intelligent Vehicles","volume":"9 4","pages":"C2-C2"},"PeriodicalIF":8.2,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10555433","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141315145","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
LLM-Based Operating Systems for Automated Vehicles: A New Perspective 基于 LLM 的自动驾驶汽车操作系统:新视角
IF 8.2 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-04-01 DOI: 10.1109/TIV.2024.3399813
Jingwei Ge;Cheng Chang;Jiawei Zhang;Lingxi Li;Xiaoxiang Na;Yilun Lin;Li Li;Fei-Yue Wang
The deployment of large language models (LLMs) brings challenges to intelligent systems because its capability of integrating large-scale training data facilitates contextual reasoning. This paper envisions a revolution of the LLM based (Artificial) Intelligent Operating Systems (IOS, or AIOS) to support the core of automated vehicles. We explain the structure of this LLM-OS and discuss the resulting benefits and implementation difficulties.
大型语言模型(LLMs)的部署给智能系统带来了挑战,因为其整合大规模训练数据的能力有助于上下文推理。本文设想了一场基于 LLM 的(人工)智能操作系统(IOS,或 AIOS)革命,以支持自动驾驶汽车的核心。我们解释了这种 LLM-OS 的结构,并讨论了由此带来的好处和实施上的困难。
{"title":"LLM-Based Operating Systems for Automated Vehicles: A New Perspective","authors":"Jingwei Ge;Cheng Chang;Jiawei Zhang;Lingxi Li;Xiaoxiang Na;Yilun Lin;Li Li;Fei-Yue Wang","doi":"10.1109/TIV.2024.3399813","DOIUrl":"https://doi.org/10.1109/TIV.2024.3399813","url":null,"abstract":"The deployment of large language models (LLMs) brings challenges to intelligent systems because its capability of integrating large-scale training data facilitates contextual reasoning. This paper envisions a revolution of the LLM based (Artificial) Intelligent Operating Systems (IOS, or AIOS) to support the core of automated vehicles. We explain the structure of this LLM-OS and discuss the resulting benefits and implementation difficulties.","PeriodicalId":36532,"journal":{"name":"IEEE Transactions on Intelligent Vehicles","volume":"9 4","pages":"4563-4567"},"PeriodicalIF":8.2,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141315157","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Probabilistic Graph-Based Real-Time Ground Segmentation for Urban Robotics 基于概率图的城市机器人实时地面分割技术
IF 14 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-04-01 DOI: 10.1109/TIV.2024.3383599
Iván del Pino;Angel Santamaria-Navarro;Anaís Garrell Zulueta;Fernando Torres;Juan Andrade-Cetto
Terrain analysis is of paramount importance for the safe navigation of autonomous robots. In this study, we introduce GATA, a probabilistic real-time graph-based method for segmentation and traversability analysis of point clouds. In the method, we iteratively refine the parameters of a ground plane model and identify regions imaged by a LiDAR as traversable and non-traversable. The method excels in delivering rapid, high-precision obstacle detection, surpassing existing state-of-the-art methods. Furthermore, our method addresses the need to distinguish between surfaces with varying traversability, such as vegetation or unpaved roads, depending on the specific application. To achieve this, we integrate a shallow neural network, which operates on features extracted from the ground model. This enhancement not only boosts performance but also maintains real-time efficiency, without the need for GPUs. The method is rigorously evaluated using the SemanticKitti dataset and its practicality is showcased through real-world experiments with an urban last-mile delivery autonomous robot.
地形分析对于自主机器人的安全导航至关重要。在本研究中,我们介绍了一种基于概率的实时图方法 GATA,用于对点云进行分割和可穿越性分析。在该方法中,我们迭代完善地平面模型的参数,并将激光雷达成像的区域识别为可穿越和不可穿越区域。该方法在提供快速、高精度障碍物检测方面表现出色,超越了现有的先进方法。此外,我们的方法还能根据具体应用,区分不同可穿越性的表面,如植被或未铺设路面的道路。为此,我们整合了一个浅层神经网络,该网络根据从地面模型中提取的特征运行。这一改进不仅提高了性能,而且保持了实时效率,无需 GPU。我们使用 SemanticKitti 数据集对该方法进行了严格评估,并通过城市最后一英里配送自主机器人的实际实验展示了该方法的实用性。
{"title":"Probabilistic Graph-Based Real-Time Ground Segmentation for Urban Robotics","authors":"Iván del Pino;Angel Santamaria-Navarro;Anaís Garrell Zulueta;Fernando Torres;Juan Andrade-Cetto","doi":"10.1109/TIV.2024.3383599","DOIUrl":"https://doi.org/10.1109/TIV.2024.3383599","url":null,"abstract":"Terrain analysis is of paramount importance for the safe navigation of autonomous robots. In this study, we introduce GATA, a probabilistic real-time graph-based method for segmentation and traversability analysis of point clouds. In the method, we iteratively refine the parameters of a ground plane model and identify regions imaged by a LiDAR as traversable and non-traversable. The method excels in delivering rapid, high-precision obstacle detection, surpassing existing state-of-the-art methods. Furthermore, our method addresses the need to distinguish between surfaces with varying traversability, such as vegetation or unpaved roads, depending on the specific application. To achieve this, we integrate a shallow neural network, which operates on features extracted from the ground model. This enhancement not only boosts performance but also maintains real-time efficiency, without the need for GPUs. The method is rigorously evaluated using the SemanticKitti dataset and its practicality is showcased through real-world experiments with an urban last-mile delivery autonomous robot.","PeriodicalId":36532,"journal":{"name":"IEEE Transactions on Intelligent Vehicles","volume":"9 5","pages":"4989-5002"},"PeriodicalIF":14.0,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10487036","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141964761","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
VistaRAG: Toward Safe and Trustworthy Autonomous Driving Through Retrieval-Augmented Generation VistaRAG:通过检索增强生成实现安全可信的自动驾驶
IF 8.2 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-04-01 DOI: 10.1109/TIV.2024.3396450
Xingyuan Dai;Chao Guo;Yun Tang;Haichuan Li;Yutong Wang;Jun Huang;Yonglin Tian;Xin Xia;Yisheng Lv;Fei-Yue Wang
Autonomous driving based on foundation models has recently garnered widespread attention. However, the risk of hallucinations inherent in foundation models could compromise the safety and reliability of autonomous driving systems. This letter, as part of a series of reports from the Distributed/Decentralized Hybrid Workshop on Foundation/Infrastructure Intelligence (DHW-FII), aims to tackle these issues. We introduce VistaRAG, which integrates retrieval-augmented generation (RAG) technologies into autonomous driving systems based on foundation models, to address the inherent reliability challenges in decision-making. VistaRAG employs a dynamic retrieval mechanism to access highly relevant driving experience, real-time road network status, and other contextual information from external databases. This aids foundation models in informed reasoning and decision-making, thereby enhancing the safety and trustworthiness of foundation-model-based autonomous driving systems under complex traffic scenarios.
基于基础模型的自动驾驶最近受到广泛关注。然而,基础模型固有的幻觉风险可能会损害自动驾驶系统的安全性和可靠性。作为基础/基础设施智能分布式/分散式混合研讨会(DHW-FII)系列报告的一部分,这封信旨在解决这些问题。我们介绍了 VistaRAG,它将检索增强生成(RAG)技术集成到基于地基模型的自动驾驶系统中,以解决决策中固有的可靠性挑战。VistaRAG 采用动态检索机制,从外部数据库获取高度相关的驾驶经验、实时路网状态和其他上下文信息。这有助于基础模型进行知情推理和决策,从而提高基于基础模型的自动驾驶系统在复杂交通场景下的安全性和可信度。
{"title":"VistaRAG: Toward Safe and Trustworthy Autonomous Driving Through Retrieval-Augmented Generation","authors":"Xingyuan Dai;Chao Guo;Yun Tang;Haichuan Li;Yutong Wang;Jun Huang;Yonglin Tian;Xin Xia;Yisheng Lv;Fei-Yue Wang","doi":"10.1109/TIV.2024.3396450","DOIUrl":"https://doi.org/10.1109/TIV.2024.3396450","url":null,"abstract":"Autonomous driving based on foundation models has recently garnered widespread attention. However, the risk of hallucinations inherent in foundation models could compromise the safety and reliability of autonomous driving systems. This letter, as part of a series of reports from the Distributed/Decentralized Hybrid Workshop on Foundation/Infrastructure Intelligence (DHW-FII), aims to tackle these issues. We introduce VistaRAG, which integrates retrieval-augmented generation (RAG) technologies into autonomous driving systems based on foundation models, to address the inherent reliability challenges in decision-making. VistaRAG employs a dynamic retrieval mechanism to access highly relevant driving experience, real-time road network status, and other contextual information from external databases. This aids foundation models in informed reasoning and decision-making, thereby enhancing the safety and trustworthiness of foundation-model-based autonomous driving systems under complex traffic scenarios.","PeriodicalId":36532,"journal":{"name":"IEEE Transactions on Intelligent Vehicles","volume":"9 4","pages":"4579-4582"},"PeriodicalIF":8.2,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141315147","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Scenario-Function System for Automotive Intelligent Cockpits: Framework, Research Progress and Perspectives 汽车智能驾驶舱的情景功能系统:框架、研究进展和前景
IF 14 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-03-29 DOI: 10.1109/TIV.2024.3382995
Hongchang Chen;Ruiyang Gao;Lili Fan;Erxuan Liu;Wenbo Li;Ruichen Tan;Ying Li;Lei He;Dongpu Cao
The innovative development of intelligent cockpit scenarios and functions brings increasingly enhanced user experiences to drivers and passengers in intelligent vehicles. However, existing research lacks a precise definition of intelligent cockpit scenarios and functions, let alone an understanding of their relationship. In this article, we first define concepts related to scenario and function. Then, we construct the scenario-function system framework. Specifically, the scenarios are divided based on the spatial-temporal dimension, and both scenarios and functions are stratified by their attributes. Finally, the progress and perspectives on scenario understanding are discussed in relation to existing research, especially for emotion and motion sickness recognition.
智能驾驶舱场景和功能的创新发展为智能汽车中的驾驶员和乘客带来了日益增强的用户体验。然而,现有研究缺乏对智能驾驶舱场景和功能的准确定义,更不用说对它们之间关系的理解了。在本文中,我们首先定义了场景和功能的相关概念。然后,我们构建了场景-功能系统框架。具体而言,根据时空维度对场景进行划分,根据属性对场景和功能进行分层。最后,结合现有研究,特别是情绪和晕车识别方面的研究,讨论了场景理解的进展和前景。
{"title":"Scenario-Function System for Automotive Intelligent Cockpits: Framework, Research Progress and Perspectives","authors":"Hongchang Chen;Ruiyang Gao;Lili Fan;Erxuan Liu;Wenbo Li;Ruichen Tan;Ying Li;Lei He;Dongpu Cao","doi":"10.1109/TIV.2024.3382995","DOIUrl":"https://doi.org/10.1109/TIV.2024.3382995","url":null,"abstract":"The innovative development of intelligent cockpit scenarios and functions brings increasingly enhanced user experiences to drivers and passengers in intelligent vehicles. However, existing research lacks a precise definition of intelligent cockpit scenarios and functions, let alone an understanding of their relationship. In this article, we first define concepts related to scenario and function. Then, we construct the scenario-function system framework. Specifically, the scenarios are divided based on the spatial-temporal dimension, and both scenarios and functions are stratified by their attributes. Finally, the progress and perspectives on scenario understanding are discussed in relation to existing research, especially for emotion and motion sickness recognition.","PeriodicalId":36532,"journal":{"name":"IEEE Transactions on Intelligent Vehicles","volume":"9 5","pages":"4890-4904"},"PeriodicalIF":14.0,"publicationDate":"2024-03-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141964756","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Progressive Growth for Point Cloud Completion by Surface-Projection Optimization 通过曲面投影优化逐步提高点云完成度
IF 14 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Pub Date : 2024-03-29 DOI: 10.1109/TIV.2024.3383108
Ben Fei;Rui Zhang;Weidong Yang;Zhijun Li;Wen-Ming Chen
Point cloud completion concentrates on completing geometric and topological shapes from incomplete 3D shapes. Nevertheless, the unordered nature of point clouds will hamper the generation of high-quality point clouds without predicting structured and topological information of the complete shapes and introducing noisy points. To effectively address the challenges posed by missing topology and noisy points, we introduce SPOFormer, a novel topology-aware model that utilizes surface-projection optimization in a progressive growth manner. SPOFormer consists of three distinct steps for completing the missing topology: (1) Missing Keypoints Prediction. A topology-aware transformer auto-encoder is integrated for missing keypoint prediction. (2) Skeleton Generation. The skeleton generation module produces a new type of representation named skeletons with the aid of keypoints predicted by topology-aware transformer auto-encoder and the partial input. (3) Progressively Growth. We design a progressive growth module to predict final output under Multi-scale Supervision and Surface-projection Optimization. Surface-projection Optimization is firstly devised for point cloud completion, aiming to enforce the generated points to align with the underlying object surface. Experimentally, SPOFormer model achieves an impressive Chamfer Distance-$ell _{1}$ (CD) score of 8.11 on PCN dataset. Furthermore, it attains average CD-$ell _{2}$ scores of 1.13, 1.14, and 1.70 on ShapeNet-55, ShapeNet-34, and ShapeNet-Unseen21 datasets, respectively. Additionally, the model achieves a Maximum Mean Discrepancy (MMD) of 0.523 on the real-world KITTI dataset. These outstanding qualitative and quantitative performances surpass previous approaches by a significant margin, firmly establishing new state-of-the-art performance across various benchmark datasets.
点云补全主要是根据不完整的三维形状补全几何和拓扑形状。然而,如果不预测完整形状的结构和拓扑信息并引入噪声点,点云的无序性将阻碍高质量点云的生成。为了有效应对拓扑缺失和噪声点带来的挑战,我们引入了 SPOFormer,这是一种新型拓扑感知模型,以渐进增长的方式利用曲面投影优化。SPOFormer 包括三个不同的步骤来完成缺失拓扑:(1)缺失关键点预测。为缺失关键点预测集成了拓扑感知变换器自动编码器。(2) 骨架生成。骨架生成模块借助拓扑感知变换器自动编码器预测的关键点和部分输入,生成一种名为骨架的新型表示法。(3) 逐步增长。我们设计了一个渐进增长模块,用于预测多尺度监督和曲面投影优化下的最终输出。表面投影优化首先用于完成点云,目的是强制生成的点与底层对象表面对齐。实验结果表明,SPOFormer 模型在 PCN 数据集上的倒角距离-$ell _{1}$(CD)得分高达 8.11,令人印象深刻。此外,该模型在 ShapeNet-55、ShapeNet-34 和 ShapeNet-Unseen21 数据集上的平均 CD-$ell _{2}$ 分数分别为 1.13、1.14 和 1.70。此外,该模型在真实世界的 KITTI 数据集上实现了 0.523 的最大平均差异 (MMD)。这些出色的定性和定量性能大大超越了以前的方法,在各种基准数据集上牢固地确立了新的一流性能。
{"title":"Progressive Growth for Point Cloud Completion by Surface-Projection Optimization","authors":"Ben Fei;Rui Zhang;Weidong Yang;Zhijun Li;Wen-Ming Chen","doi":"10.1109/TIV.2024.3383108","DOIUrl":"https://doi.org/10.1109/TIV.2024.3383108","url":null,"abstract":"Point cloud completion concentrates on completing geometric and topological shapes from incomplete 3D shapes. Nevertheless, the unordered nature of point clouds will hamper the generation of high-quality point clouds without predicting structured and topological information of the complete shapes and introducing noisy points. To effectively address the challenges posed by missing topology and noisy points, we introduce SPOFormer, a novel topology-aware model that utilizes surface-projection optimization in a progressive growth manner. SPOFormer consists of three distinct steps for completing the missing topology: (1) \u0000<bold>Missing Keypoints Prediction.</b>\u0000 A topology-aware transformer auto-encoder is integrated for missing keypoint prediction. (2) \u0000<bold>Skeleton Generation.</b>\u0000 The skeleton generation module produces a new type of representation named skeletons with the aid of keypoints predicted by topology-aware transformer auto-encoder and the partial input. (3) \u0000<bold>Progressively Growth.</b>\u0000 We design a progressive growth module to predict final output under \u0000<bold>Multi-scale Supervision</b>\u0000 and \u0000<bold>Surface-projection Optimization</b>\u0000. Surface-projection Optimization is firstly devised for point cloud completion, aiming to enforce the generated points to align with the underlying object surface. Experimentally, SPOFormer model achieves an impressive Chamfer Distance-\u0000<inline-formula><tex-math>$ell _{1}$</tex-math></inline-formula>\u0000 (CD) score of 8.11 on PCN dataset. Furthermore, it attains average CD-\u0000<inline-formula><tex-math>$ell _{2}$</tex-math></inline-formula>\u0000 scores of 1.13, 1.14, and 1.70 on ShapeNet-55, ShapeNet-34, and ShapeNet-Unseen21 datasets, respectively. Additionally, the model achieves a Maximum Mean Discrepancy (MMD) of 0.523 on the real-world KITTI dataset. These outstanding qualitative and quantitative performances surpass previous approaches by a significant margin, firmly establishing new state-of-the-art performance across various benchmark datasets.","PeriodicalId":36532,"journal":{"name":"IEEE Transactions on Intelligent Vehicles","volume":"9 5","pages":"4931-4945"},"PeriodicalIF":14.0,"publicationDate":"2024-03-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141964750","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Transactions on Intelligent Vehicles
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1