Memorable basis: towards human-centralized sparse representation

Proceedings of the 20th ACM international conference on Multimedia Pub Date : 2012-10-29 DOI:10.1145/2393347.2396306

Xiaoshuai Sun, H. Yao

引用次数: 0

Abstract

Previous studies of sparse representation in multimedia research focus on developing reliable and efficient dictionary learning algorithms. Despite the sparse prior, how to integrate other related perceptual factors of human being into dictionary learning process was seldom studied. In this paper, we investigate the influence of image memorability for human-centralized sparse representation. Based on the results of a photo memory game, we are able to quantitatively characterize an image's memorability which allows us to train sparse bases from the most memorable images instead of randomly selected natural images. We believed that such kind of basis is more consistent with neural networks in human brain and hence can better predict where human looks. To test our hypothesis, we choose human eye-fixation prediction problem for quantitative evaluation. The experimental results demonstrate the superior performance of our Memorable Basis compared to traditional sparse basis trained from unselected images.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

可记忆基础:向以人为中心的稀疏表示方向发展

多媒体研究中稀疏表示的研究主要集中在开发可靠、高效的字典学习算法。尽管存在稀疏先验，但如何将人类的其他相关感知因素整合到字典学习过程中却鲜有研究。在本文中，我们研究了图像可记忆性对人类集中式稀疏表示的影响。基于照片记忆游戏的结果，我们能够定量表征图像的可记忆性，这使我们能够从最令人难忘的图像中训练稀疏基础，而不是随机选择的自然图像。我们认为这种基础更符合人类大脑中的神经网络，因此可以更好地预测人类的视线。为了验证我们的假设，我们选择人眼注视预测问题进行定量评价。实验结果表明，与从未选择图像中训练的传统稀疏基础相比，我们的记忆基具有更好的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Proceedings of the 20th ACM international conference on Multimedia

自引率

0.00%

发文量

期刊最新文献

ROI-based protection scheme for high definition interactive video applications TouchPaper: making print interactive A genetic algorithm for audio retargeting Mining in-class social networks for large-scale pedagogical analysis Plug&touch: a mobile interaction solution for large display via vision-based hand gesture detection