A Task-Oriented Approach for Cost-Sensitive Recognition

2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) Pub Date : 2016-06-27 DOI:10.1109/CVPR.2016.242

Roozbeh Mottaghi, Hannaneh Hajishirzi, Ali Farhadi

引用次数: 4

Abstract

With the recent progress in visual recognition, we have already started to see a surge of vision related real-world applications. These applications, unlike general scene understanding, are task oriented and require specific information from visual data. Considering the current growth in new sensory devices, feature designs, feature learning methods, and algorithms, the search in the space of features and models becomes combinatorial. In this paper, we propose a novel cost-sensitive task-oriented recognition method that is based on a combination of linguistic semantics and visual cues. Our task-oriented framework is able to generalize to unseen tasks for which there is no training data and outperforms state-of-the-art cost-based recognition baselines on our new task-based dataset.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

面向任务的成本敏感识别方法

随着最近在视觉识别方面的进展，我们已经开始看到与视觉相关的实际应用激增。与一般的场景理解不同，这些应用是面向任务的，需要来自视觉数据的特定信息。考虑到当前新感官设备、特征设计、特征学习方法和算法的增长，特征和模型空间中的搜索变得组合。本文提出了一种基于语言语义和视觉线索相结合的成本敏感任务导向识别方法。我们的面向任务的框架能够推广到没有训练数据的看不见的任务，并且在我们新的基于任务的数据集上优于最先进的基于成本的识别基线。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

自引率

0.00%

发文量

期刊最新文献

Sketch Me That Shoe Multivariate Regression on the Grassmannian for Predicting Novel Domains How Hard Can It Be? Estimating the Difficulty of Visual Search in an Image Discovering the Physical Parts of an Articulated Object Class from Multiple Videos Simultaneous Optical Flow and Intensity Estimation from an Event Camera