Auxiliary Training Information Assisted Visual Recognition

Q1 Computer Science IPSJ Transactions on Computer Vision and Applications Pub Date : 2015-01-01 DOI:10.2197/ipsjtcva.7.138

Qilin Zhang, G. Hua, W. Liu, Zicheng Liu, Zhengyou Zhang

{"title":"Auxiliary Training Information Assisted Visual Recognition","authors":"Qilin Zhang, G. Hua, W. Liu, Zicheng Liu, Zhengyou Zhang","doi":"10.2197/ipsjtcva.7.138","DOIUrl":null,"url":null,"abstract":"In the realm of multi-modal visual recognition, the reliability of the data acquisition system is often a concern due to the increased complexity of the sensors. One of the major issues is the accidental loss of one or more sensing channels, which poses a major challenge to current learning systems. In this paper, we examine one of these specific missing data problems, where we have a main modality/view along with an auxiliary modality/view present in the training data, but merely the main modality/view in the test data. To effectively leverage the auxiliary information to train a stronger classifier, we propose a collaborative auxiliary learning framework based on a new discriminative canonical correlation analysis. This framework reveals a common semantic space shared across both modalities/views through enforcing a series of nonlinear projections. Such projections automatically embed the discriminative cues hidden in both modalities/views into the common space, and better visual recognition is thus achieved on the test data. The efficacy of our proposed auxiliary learning approach is demonstrated through four challenging visual recognition tasks with different kinds of auxiliary information.","PeriodicalId":38957,"journal":{"name":"IPSJ Transactions on Computer Vision and Applications","volume":"75 1","pages":"138-150"},"PeriodicalIF":0.0000,"publicationDate":"2015-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"19","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IPSJ Transactions on Computer Vision and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2197/ipsjtcva.7.138","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Computer Science","Score":null,"Total":0}

引用次数: 19

Abstract

In the realm of multi-modal visual recognition, the reliability of the data acquisition system is often a concern due to the increased complexity of the sensors. One of the major issues is the accidental loss of one or more sensing channels, which poses a major challenge to current learning systems. In this paper, we examine one of these specific missing data problems, where we have a main modality/view along with an auxiliary modality/view present in the training data, but merely the main modality/view in the test data. To effectively leverage the auxiliary information to train a stronger classifier, we propose a collaborative auxiliary learning framework based on a new discriminative canonical correlation analysis. This framework reveals a common semantic space shared across both modalities/views through enforcing a series of nonlinear projections. Such projections automatically embed the discriminative cues hidden in both modalities/views into the common space, and better visual recognition is thus achieved on the test data. The efficacy of our proposed auxiliary learning approach is demonstrated through four challenging visual recognition tasks with different kinds of auxiliary information.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

辅助训练信息辅助视觉识别

在多模态视觉识别领域，由于传感器的复杂性不断增加，数据采集系统的可靠性经常受到关注。其中一个主要问题是一个或多个传感通道的意外丢失，这对当前的学习系统构成了重大挑战。在本文中，我们研究了这些特定的缺失数据问题之一，其中我们在训练数据中有一个主模态/视图以及一个辅助模态/视图，但在测试数据中只有主模态/视图。为了有效地利用辅助信息来训练更强的分类器，我们提出了一种基于新的判别典型相关分析的协同辅助学习框架。该框架通过执行一系列非线性投影，揭示了两种模式/视图之间共享的公共语义空间。这种投影自动将隐藏在两种模式/视图中的判别线索嵌入到公共空间中，从而在测试数据上实现更好的视觉识别。我们提出的辅助学习方法的有效性通过四个具有不同类型的辅助信息的具有挑战性的视觉识别任务来证明。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IPSJ Transactions on Computer Vision and Applications Computer Science-Computer Vision and Pattern Recognition

自引率

0.00%

发文量