Object-based affordances detection with Convolutional Neural Networks and dense Conditional Random Fields

2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) Pub Date : 2017-09-01 DOI:10.1109/IROS.2017.8206484

Anh Nguyen, D. Kanoulas, D. Caldwell, N. Tsagarakis

引用次数: 113

Abstract

We present a new method to detect object affordances in real-world scenes using deep Convolutional Neural Networks (CNN), an object detector and dense Conditional Random Fields (CRF). Our system first trains an object detector to generate bounding box candidates from the images. A deep CNN is then used to learn the depth features from these bounding boxes. Finally, these feature maps are post-processed with dense CRF to improve the prediction along class boundaries. The experimental results on our new challenging dataset show that the proposed approach outperforms recent state-of-the-art methods by a substantial margin. Furthermore, from the detected affordances we introduce a grasping method that is robust to noisy data. We demonstrate the effectiveness of our framework on the full-size humanoid robot WALK-MAN using different objects in real-world scenarios.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于卷积神经网络和密集条件随机场的对象启示检测

我们提出了一种使用深度卷积神经网络(CNN)、对象检测器和密集条件随机场(CRF)来检测现实场景中对象的可视性的新方法。我们的系统首先训练一个物体检测器从图像中生成候选边界框。然后使用深度CNN从这些边界框中学习深度特征。最后，使用密集CRF对这些特征映射进行后处理，以改进沿类边界的预测。在我们新的具有挑战性的数据集上的实验结果表明，所提出的方法在很大程度上优于最近最先进的方法。此外，根据检测到的启示，我们引入了一种对噪声数据具有鲁棒性的抓取方法。我们在真实场景中使用不同的物体在全尺寸人形机器人WALK-MAN上展示了我们的框架的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

自引率

0.00%

发文量

期刊最新文献

Direct visual SLAM fusing proprioception for a humanoid robot Upper limb motion intent recognition using tactile sensing Soft fluidic rotary actuator with improved actuation properties Underwater 3D structures as semantic landmarks in SONAR mapping Adaptive perception: Learning from sensory predictions to extract object shape with a biomimetic fingertip