Semantically Supervised Maximal Correlation For Cross-Modal Retrieval

2020 IEEE International Conference on Image Processing (ICIP) Pub Date : 2020-10-01 DOI:10.1109/ICIP40778.2020.9190873

Mingyang Li, Yongni Li, Shao-Lun Huang, Lin Zhang

引用次数: 6

Abstract

With the rapid growth of multimedia data, the cross-modal retrieval problem has attracted a lot of interest in both research and industry in recent years. However, the inconsistency of data distribution from different modalities makes such task challenging. In this paper, we propose Semantically Supervised Maximal Correlation (S2MC) method for cross-modal retrieval by incorporating semantic label information into the traditional maximal correlation framework. Combining with maximal correlation based method for extracting unsupervised pairing information, our method effectively exploits supervised semantic information on both common feature space and label space. Extensive experiments show that our method outperforms other current state-of-the-art methods on cross-modal retrieval tasks on three widely used datasets.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

跨模态检索的语义监督最大相关

随着多媒体数据量的快速增长，跨模式检索问题近年来引起了学术界和工业界的广泛关注。然而，来自不同模式的数据分布的不一致性给这一任务带来了挑战。本文将语义标签信息整合到传统的最大相关框架中，提出了语义监督最大相关(S2MC)跨模态检索方法。该方法结合基于最大相关的无监督配对信息提取方法，有效地利用了公共特征空间和标签空间上的监督语义信息。大量的实验表明，在三个广泛使用的数据集上，我们的方法在跨模态检索任务上优于其他当前最先进的方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2020 IEEE International Conference on Image Processing (ICIP)

自引率

0.00%

发文量