Monocular 3D Object Detection for Autonomous Driving

2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Pub Date : 2016-06-27 DOI:10.1109/CVPR.2016.236

Xiaozhi Chen, Kaustav Kundu, Ziyu Zhang, Huimin Ma, S. Fidler, R. Urtasun

引用次数: 795

Abstract

The goal of this paper is to perform 3D object detection from a single monocular image in the domain of autonomous driving. Our method first aims to generate a set of candidate class-specific object proposals, which are then run through a standard CNN pipeline to obtain high-quality object detections. The focus of this paper is on proposal generation. In particular, we propose an energy minimization approach that places object candidates in 3D using the fact that objects should be on the ground-plane. We then score each candidate box projected to the image plane via several intuitive potentials encoding semantic segmentation, contextual information, size and location priors and typical object shape. Our experimental evaluation demonstrates that our object proposal generation approach significantly outperforms all monocular approaches, and achieves the best detection performance on the challenging KITTI benchmark, among published monocular competitors.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

用于自动驾驶的单目3D目标检测

本文的目标是从自动驾驶领域的单眼图像中进行3D目标检测。我们的方法首先旨在生成一组候选类特定对象建议，然后通过标准的CNN管道运行以获得高质量的对象检测。本文的研究重点是提案生成。特别是，我们提出了一种能量最小化方法，利用物体应该在地平面上的事实，将候选物体放置在3D中。然后，我们通过编码语义分割、上下文信息、大小和位置先验以及典型物体形状的几个直观电位对投影到图像平面上的每个候选框进行评分。我们的实验评估表明，我们的目标建议生成方法明显优于所有单眼方法，并在具有挑战性的KITTI基准测试中取得了最佳的检测性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

自引率

0.00%

发文量

期刊最新文献

Sketch Me That Shoe Multivariate Regression on the Grassmannian for Predicting Novel Domains How Hard Can It Be? Estimating the Difficulty of Visual Search in an Image Discovering the Physical Parts of an Articulated Object Class from Multiple Videos Simultaneous Optical Flow and Intensity Estimation from an Event Camera