
Autonomous Vehicles and Machines: Latest Publications

The design and validation of a rain model for a simulated automotive environment
Pub Date: 2023-01-16 DOI: 10.2352/ei.2023.35.16.avm-116
T. Brophy, B. Deegan, J. Salado, A. Tena, Patrick Denny, M. Glavin, Enda Ward, J. Horgan, E. Jones
{"title":"The design and validation of a rain model for a simulated automotive environment","authors":"T. Brophy, B. Deegan, J. Salado, A. Tena, Patrick Denny, M. Glavin, Enda Ward, J. Horgan, E. Jones","doi":"10.2352/ei.2023.35.16.avm-116","DOIUrl":"https://doi.org/10.2352/ei.2023.35.16.avm-116","url":null,"abstract":"","PeriodicalId":177462,"journal":{"name":"Autonomous Vehicles and Machines","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-01-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128839484","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
How much depth information can radar contribute to a depth estimation model?
Pub Date: 2022-02-26 DOI: 10.2352/EI.2023.35.16.AVM-122
Chen-Chou Lo, P. Vandewalle
Recently, several works have proposed fusing radar data into monocular depth estimation models as an additional perceptual signal, because radar is robust against varying light and weather conditions. Although improved performance was reported in prior works, it is still hard to tell how much depth information radar can contribute to a depth estimation model. In this paper, we propose radar inference and supervision experiments to investigate the intrinsic depth potential of radar data, using state-of-the-art depth estimation models on the nuScenes dataset. In the inference experiment, the model predicts depth by taking only radar as input, demonstrating the inference capability of radar data. In the supervision experiment, a monocular depth estimation model is trained under radar supervision to show the intrinsic depth information that radar can contribute. Our experiments demonstrate that a model using only sparse radar as input can detect the shape of the surroundings to a certain extent in the predicted depth. Furthermore, the monocular depth estimation model supervised by preprocessed radar achieves good performance compared to a baseline model trained with sparse lidar supervision.
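The supervision experiment hinges on the sparsity of radar: a loss can only be evaluated at pixels where a projected radar return exists. Below is a minimal sketch of such a masked depth loss in PyTorch; the function name and the choice of an L1 penalty are illustrative assumptions, not the authors' exact formulation.
```python
import torch

def sparse_radar_loss(pred_depth, radar_depth):
    """L1 depth loss evaluated only where radar returns exist.

    pred_depth:  (B, 1, H, W) network output in meters.
    radar_depth: (B, 1, H, W) radar points projected onto the image
                 plane; pixels without a return are 0.
    """
    valid = radar_depth > 0                      # pixels with a radar return
    if valid.sum() == 0:                         # guard against empty radar frames
        return pred_depth.new_tensor(0.0)
    return torch.abs(pred_depth[valid] - radar_depth[valid]).mean()

# Toy example: a 10-point "radar scan" supervising a dense prediction.
pred = torch.rand(1, 1, 160, 320) * 80.0
radar = torch.zeros(1, 1, 160, 320)
rows, cols = torch.randint(0, 160, (10,)), torch.randint(0, 320, (10,))
radar[0, 0, rows, cols] = torch.rand(10) * 80.0
print(sparse_radar_loss(pred, radar))
```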
Citations: 1
A review of IEEE P2020 noise metrics
Pub Date: 2022-01-16 DOI: 10.2352/ei.2022.34.16.avm-109
O. Skorka, Paul Romanczyk
The IEEE P2020 Noise standard builds upon methodology discussed in other photography and camera standards, with extensions and adjustments to support operating modes and conditions that are relevant to automotive cameras. This work presents the methods and procedures covered by the IEEE P2020 Noise standard for deriving sensor-level and camera-level noise image quality factors from dark statistics, photon-transfer and signal-to-noise ratio curves, and signal falloff. Example implementations and experimental results are shown from automotive cameras activated and tested under conditions relevant to automotive applications.
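As a rough illustration of the photon-transfer analysis the abstract mentions, a sensor's conversion gain can be estimated from flat-field frame pairs by regressing temporal variance against mean signal. The sketch below uses synthetic data from an idealized linear sensor model; it is an assumption-laden illustration, not the IEEE P2020 procedure itself.
```python
import numpy as np

rng = np.random.default_rng(0)

# Idealized linear sensor: gain K [DN/e-], read noise in electrons.
K, read_noise_e, full_well = 0.25, 3.0, 20000
means, variances = [], []
for electrons in np.linspace(200, full_well, 20):
    # Two flat-field frames per exposure; differencing removes fixed-pattern noise.
    f1 = K * rng.poisson(electrons, 10000) + K * rng.normal(0, read_noise_e, 10000)
    f2 = K * rng.poisson(electrons, 10000) + K * rng.normal(0, read_noise_e, 10000)
    means.append((f1.mean() + f2.mean()) / 2)
    variances.append(np.var(f1 - f2) / 2)       # single-frame temporal variance

# Photon-transfer relation: var = K * mean + const, so the slope is the gain.
slope, _ = np.polyfit(means, variances, 1)
print(f"estimated gain: {slope:.3f} DN/e-  (true: {K})")

# Sensor-level SNR curve (signal over temporal noise) per exposure point.
snr = np.array(means) / np.sqrt(variances)
print(f"SNR at max signal: {snr[-1]:.1f}")
```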
Citations: 2
Efficient in-cabin monitoring solution using TI TDA2PxSOCs
Pub Date: 2022-01-16 DOI: 10.2352/ei.2022.34.16.avm-116
Mangla Mayank, Mihir Mody, Chitnis Kedar, Goswami Piyali, Pande Tarkesh, D. Shashank, Jagannathan Shyam, Haas Stefan, Hua Gang, G. Hrushikesh, Kumar Desappan, Shankar Prithvi, N. Niraj
{"title":"Efficient in-cabin monitoring solution using TI TDA2PxSOCs","authors":"Mangla Mayank, Mihir Mody, Chitnis Kedar, Goswami Piyali, Pande Tarkesh, D. Shashank, Jagannathan Shyam, Haas Stefan, Hua Gang, G. Hrushikesh, Kumar Desappan, Shankar Prithvi, N. Niraj","doi":"10.2352/ei.2022.34.16.avm-116","DOIUrl":"https://doi.org/10.2352/ei.2022.34.16.avm-116","url":null,"abstract":"","PeriodicalId":177462,"journal":{"name":"Autonomous Vehicles and Machines","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-01-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133939528","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 1
Spatial precision and recall indices to assess the performance of instance segmentation algorithms
Pub Date: 2022-01-16 DOI: 10.2352/ei.2022.34.16.avm-101
Mattis Brummel, Patrick Müller, Alexander Braun
Since it is essential for Computer Vision systems to perform reliably in safety-critical applications such as autonomous vehicles, there is a need to evaluate their robustness to naturally occurring image perturbations. More specifically, the performance of Computer Vision systems needs to be linked to image quality, which has not received much research attention so far. In fact, the aberrations of a camera system always vary spatially over the Field of View, which may influence the performance of Computer Vision systems depending on the degree of local aberration. Therefore, the goal is to evaluate the performance of Computer Vision systems under the effects of defocus while taking the spatial domain into account. Large-scale Autonomous Driving datasets are degraded by a parameterized optical model to simulate driving scenes under physically realistic effects of defocus. Using standard evaluation metrics, the Spatial Recall Index (SRI) and the new Spatial Precision Index (SPI), the performance of Computer Vision systems on these degraded datasets is compared with the optical performance of the applied optical model. A correlation could be observed between the spatially varying optical performance and the spatial performance of Instance Segmentation systems.
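The core idea of the spatial indices is to resolve detection quality over the image plane rather than reporting a single scalar. A simplified per-pixel recall map, accumulated over a dataset of ground-truth and predicted instance masks, might look like the sketch below; the binary-mask union and plain hit counter are my simplifying assumptions, and the published SRI/SPI definitions are more involved.
```python
import numpy as np

def spatial_recall_map(gt_masks, pred_masks, shape):
    """Per-pixel recall over a dataset.

    gt_masks, pred_masks: lists (one entry per image) of boolean
    arrays of `shape`, each the union of all instance masks per image.
    """
    covered = np.zeros(shape)   # GT pixels matched by a prediction
    total = np.zeros(shape)     # GT pixels seen at this location
    for gt, pred in zip(gt_masks, pred_masks):
        total += gt
        covered += gt & pred
    with np.errstate(invalid="ignore"):
        return np.where(total > 0, covered / total, np.nan)

# Toy example: predictions systematically miss the right image half,
# so the recall map degrades toward the right border.
H, W = 64, 128
gt = [np.random.rand(H, W) > 0.5 for _ in range(50)]
pred = [g & (np.arange(W) < W // 2)[None, :] for g in gt]
recall = spatial_recall_map(gt, pred, (H, W))
print(np.nanmean(recall[:, :W // 2]), np.nanmean(recall[:, W // 2:]))
```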
Citations: 0
Real-time LIDAR imaging by solid-state single chip beam scanner
Pub Date: 2022-01-16 DOI: 10.2352/ei.2022.34.16.avm-172
Jisan Lee, Kyunghyun Son, Chang-Bum Lee, Inoh Hwang, Bongyong Jang, Eunkyung Lee, Dongshik Shim, H. Byun, C. Shin, Tatsuhiro Otsuka, Yongchul Cho, Kyoungho Ha, H. Choo
{"title":"Real-time LIDAR imaging by solid-state single chip beam scanner","authors":"Jisan Lee, Kyunghyun Son, Chang-Bum Lee, Inoh Hwang, Bongyong Jang, Eunkyung Lee, Dongshik Shim, H. Byun, C. Shin, Tatsuhiro Otsuka, Yongchul Cho, Kyoungho Ha, H. Choo","doi":"10.2352/ei.2022.34.16.avm-172","DOIUrl":"https://doi.org/10.2352/ei.2022.34.16.avm-172","url":null,"abstract":"","PeriodicalId":177462,"journal":{"name":"Autonomous Vehicles and Machines","volume":"100 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2022-01-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116998115","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 1
Adversarial Attacks on Multi-task Visual Perception for Autonomous Driving
Pub Date: 2021-07-15 DOI: 10.2352/j.imagingsci.technol.2021.65.6.060408
Ibrahim Sobh, Ahmed Hamed, V. Kumar, S. Yogamani
In recent years, deep neural networks (DNNs) have achieved impressive success in various applications, including autonomous driving perception tasks. However, current deep neural networks are easily deceived by adversarial attacks. This vulnerability raises significant concerns, particularly in safety-critical applications. As a result, research into attacking and defending DNNs has gained much attention. In this work, detailed adversarial attacks are applied to a diverse multi-task visual perception deep network spanning distance estimation, semantic segmentation, motion detection, and object detection. The experiments consider both white-box and black-box attacks in targeted and untargeted settings, attacking one task and inspecting the effect on all the others, in addition to inspecting the effect of applying a simple defense method. We conclude the paper by comparing and discussing the experimental results and proposing insights and future work. Visualizations of the attacks are available at https://youtu.be/6AixN90budY.
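For readers unfamiliar with the attack family, the simplest white-box instance is the fast gradient sign method (FGSM), which perturbs the input along the sign of the loss gradient. The sketch below is a generic single-step, untargeted FGSM in PyTorch on a stand-in classifier; it is not the multi-task attack suite evaluated in the paper.
```python
import torch

def fgsm(model, x, y, loss_fn, eps=4 / 255):
    """Single-step untargeted FGSM: move the input in the direction
    that increases the task loss, clamped to a valid image range."""
    x_adv = x.clone().detach().requires_grad_(True)
    loss = loss_fn(model(x_adv), y)
    loss.backward()
    with torch.no_grad():
        x_adv = x_adv + eps * x_adv.grad.sign()
        return x_adv.clamp(0.0, 1.0)

# Toy usage on a stand-in linear classifier.
model = torch.nn.Sequential(torch.nn.Flatten(), torch.nn.Linear(3 * 32 * 32, 10))
x = torch.rand(8, 3, 32, 32)           # batch of images in [0, 1]
y = torch.randint(0, 10, (8,))
x_adv = fgsm(model, x, y, torch.nn.functional.cross_entropy)
print((x_adv - x).abs().max())         # bounded by eps
```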
Citations: 10
AVM Conference Overview and Papers Program
Pub Date: 2021-01-18 DOI: 10.2352/issn.2470-1173.2021.17.avm-a17
{"title":"AVM Conference Overview and Papers Program","authors":"","doi":"10.2352/issn.2470-1173.2021.17.avm-a17","DOIUrl":"https://doi.org/10.2352/issn.2470-1173.2021.17.avm-a17","url":null,"abstract":"","PeriodicalId":177462,"journal":{"name":"Autonomous Vehicles and Machines","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2021-01-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126120794","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Citations: 0
Single Chip Auto-Valet Parking System with TDA4VMID SoC
Pub Date: 2021-01-18 DOI: 10.2352/issn.2470-1173.2021.17.avm-113
Mihir Mody, Kedar Chitnis, H. Hariyani, Shyam Jagannathan, Jason Jones, G. Shurtz, Abhishek Shankar, Ankur, Mayank Mangla, Sriramakrishnan Govindarajan, Aish Dubey, K. Chirca
Auto-valet parking is a key emerging function for Advanced Driver Assistance Systems (ADAS), enhancing the traditional surround-view system by providing more autonomy during parking scenarios. An auto-valet parking system is typically built using multiple hardware components, e.g. ISP, microcontrollers, FPGAs, GPU, and Ethernet/PCIe switch. Texas Instruments' new Jacinto7 platform is one of the industry's most highly integrated SoCs, replacing these components with a single TDA4VMID chip. The TDA4VMID SoC can concurrently run analytics (traditional computer vision as well as deep learning) and sophisticated 3D surround view, making it a cost-effective and power-optimized solution. TDA4VMID is a truly heterogeneous architecture, and it can be programmed using an efficient and easy-to-use OpenVX-based middleware framework to distribute software components across cores. This paper explains typical analytics and 3D surround-view functions in an auto-valet parking system, along with their data flow and mapping to the multiple cores of the TDA4VMID SoC. The auto-valet parking system can be realized on the TDA4VMID SoC with processing completely offloaded from the host ARM to the rest of the SoC cores, providing ample headroom for future-proofing as well as the ability to add customer-specific differentiation.
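OpenVX itself is a C API, but the core idea, expressing the pipeline as a graph of nodes and pinning each node to a target core, can be illustrated language-agnostically. The sketch below is a toy Python model of such a node-to-core mapping; the node names and core labels are illustrative assumptions, not TI's actual middleware API.
```python
# Toy model of an OpenVX-style graph: each processing node is pinned
# to a target core, mirroring how work is spread across a
# heterogeneous SoC (names below are illustrative, not TI's API).
from dataclasses import dataclass

@dataclass
class Node:
    name: str
    target: str       # core the node is assigned to
    inputs: list      # names of upstream nodes

graph = [
    Node("capture",       "csi_rx",  []),
    Node("isp",           "isp_hwa", ["capture"]),
    Node("ldc_dewarp",    "ldc_hwa", ["isp"]),
    Node("surround_view", "gpu",     ["ldc_dewarp"]),
    Node("cnn_detect",    "dsp",     ["ldc_dewarp"]),
    Node("fusion",        "dsp",     ["surround_view", "cnn_detect"]),
    Node("display",       "dss",     ["fusion"]),
]

# A host scheduler would enqueue a node only once its inputs are done;
# here we just print the static mapping.
for node in graph:
    print(f"{node.name:14s} -> {node.target:8s} (after {node.inputs or 'none'})")
```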
Citations: 0
FisheyeDistanceNet++: Self-Supervised Fisheye Distance Estimation with Self-Attention, Robust Loss Function and Camera View Generalization
Pub Date: 2021-01-18 DOI: 10.2352/issn.2470-1173.2021.17.avm-181
V. Kumar, S. Yogamani, Stefan Milz, Patrick Mäder
FisheyeDistanceNet [1] proposed a self-supervised monocular depth estimation method for fisheye cameras with a large field of view (> 180°). To achieve scale-invariant depth estimation, FisheyeDistanceNet supervises depth map predictions over multiple scales during training. To overcome this bottleneck, we incorporate self-attention layers and a robust loss function [2] into FisheyeDistanceNet. A general adaptive robust loss function helps obtain sharp depth maps without the need to train over multiple scales, and allows us to learn the loss function's hyperparameters to aid optimization in terms of convergence speed and accuracy. We also ablate the importance of Instance Normalization over Batch Normalization in the network architecture. Finally, we generalize the network to be invariant to camera views by training on multiple perspectives using front, rear, and side cameras. The proposed algorithm improvements, FisheyeDistanceNet++, result in a 30% relative improvement in RMSE while reducing training time by 25% on the WoodScape dataset. We also obtain state-of-the-art results on the KITTI dataset in comparison to other self-supervised monocular methods.
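The "general adaptive robust loss function" referenced as [2] is commonly given (Barron, 2019) as rho(x, alpha, c) = (|alpha - 2| / alpha) * [((x/c)^2 / |alpha - 2| + 1)^(alpha/2) - 1], which interpolates smoothly between L2 (alpha = 2), Charbonnier (alpha = 1), and heavier-tailed losses as alpha falls. Below is a sketch of the general branch with alpha and c treated as learnable; how it is wired into the depth network is my assumption, not the paper's exact setup.
```python
import torch

def adaptive_robust_loss(x, alpha, c):
    """General branch of Barron's adaptive robust loss (valid for
    alpha not in {0, 2}; those limits need dedicated branches)."""
    b = torch.abs(alpha - 2.0)
    return (b / alpha) * (((x / c) ** 2 / b + 1.0) ** (alpha / 2.0) - 1.0)

# alpha and c can be optimized jointly with the network weights,
# letting the loss shape adapt during training.
alpha = torch.tensor(1.0, requires_grad=True)   # Charbonnier-like start
c = torch.tensor(0.1, requires_grad=True)       # scale parameter, keep > 0
residual = torch.randn(1000) * 0.05             # e.g. photometric errors
loss = adaptive_robust_loss(residual, alpha, c).mean()
loss.backward()
print(loss.item(), alpha.grad, c.grad)
```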
Citations: 10