DeLay: Robust Spatial Layout Estimation for Cluttered Indoor Scenes

2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Pub Date : 2016-06-27 DOI:10.1109/CVPR.2016.73

Saumitro Dasgupta, Kuan Fang, Kevin Chen, S. Savarese

引用次数: 116

Abstract

We consider the problem of estimating the spatial layout of an indoor scene from a monocular RGB image, modeled as the projection of a 3D cuboid. Existing solutions to this problem often rely strongly on hand-engineered features and vanishing point detection, which are prone to failure in the presence of clutter. In this paper, we present a method that uses a fully convolutional neural network (FCNN) in conjunction with a novel optimization framework for generating layout estimates. We demonstrate that our method is robust in the presence of clutter and handles a wide range of highly challenging scenes. We evaluate our method on two standard benchmarks and show that it achieves state of the art results, outperforming previous methods by a wide margin.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

延迟:混沌室内场景的鲁棒空间布局估计

我们考虑从单眼RGB图像估计室内场景的空间布局问题，建模为三维长方体的投影。这个问题的现有解决方案通常强烈依赖于手工设计的特征和消失点检测，这在存在杂乱的情况下容易失败。在本文中，我们提出了一种使用全卷积神经网络(FCNN)和一种新的优化框架来生成布局估计的方法。我们证明了我们的方法在杂乱的存在下是鲁棒的，并且可以处理各种极具挑战性的场景。我们在两个标准基准上评估我们的方法，并表明它达到了最先进的结果，远远优于以前的方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

自引率

0.00%

发文量

期刊最新文献

Sketch Me That Shoe Multivariate Regression on the Grassmannian for Predicting Novel Domains How Hard Can It Be? Estimating the Difficulty of Visual Search in an Image Discovering the Physical Parts of an Articulated Object Class from Multiple Videos Simultaneous Optical Flow and Intensity Estimation from an Event Camera