基于稀疏表示和多标签学习的图像自动标注

2012 International Conference on Virtual Reality and Visualization Pub Date : 2012-09-14 DOI:10.1109/ICVRV.2012.11

Feng Tian, Sheng Xu-kun, Shang Fu-hua, Zhou Kai

{"title":"基于稀疏表示和多标签学习的图像自动标注","authors":"Feng Tian, Sheng Xu-kun, Shang Fu-hua, Zhou Kai","doi":"10.1109/ICVRV.2012.11","DOIUrl":null,"url":null,"abstract":"Automatic image annotation has emerged as an important research topic due to its potential application on both image understanding and web image search. Due to the inherent ambiguity of image-label mapping, the annotation task has become a challenge to systematically develop robust annotation models with better performance. In this paper, we present an image annotation framework based on Sparse Representation and Multi-Label Learning (SCMLL), which aims at taking full advantage of Image Sparse representation and multi-label learning mechanism to address the annotation problem. We first treat each image as a sparse linear combination of other images, and then consider the component images as the nearest neighbors of the target image based on a sparse representation computed by L-1 minimization. Based on statistical information gained from the label sets of these neighbors, a multiple label learning algorithm based on a posteriori (MAP) principle is presented to determine the tags for the unlabeled image. The experiments over the well known data set demonstrate that the proposed method is beneficial in the image annotation task and outperforms most existing image annotation algorithms.","PeriodicalId":421789,"journal":{"name":"2012 International Conference on Virtual Reality and Visualization","volume":"72 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Automatic Image Annotation Based on Sparse Representation and Multiple Label Learning\",\"authors\":\"Feng Tian, Sheng Xu-kun, Shang Fu-hua, Zhou Kai\",\"doi\":\"10.1109/ICVRV.2012.11\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Automatic image annotation has emerged as an important research topic due to its potential application on both image understanding and web image search. Due to the inherent ambiguity of image-label mapping, the annotation task has become a challenge to systematically develop robust annotation models with better performance. In this paper, we present an image annotation framework based on Sparse Representation and Multi-Label Learning (SCMLL), which aims at taking full advantage of Image Sparse representation and multi-label learning mechanism to address the annotation problem. We first treat each image as a sparse linear combination of other images, and then consider the component images as the nearest neighbors of the target image based on a sparse representation computed by L-1 minimization. Based on statistical information gained from the label sets of these neighbors, a multiple label learning algorithm based on a posteriori (MAP) principle is presented to determine the tags for the unlabeled image. The experiments over the well known data set demonstrate that the proposed method is beneficial in the image annotation task and outperforms most existing image annotation algorithms.\",\"PeriodicalId\":421789,\"journal\":{\"name\":\"2012 International Conference on Virtual Reality and Visualization\",\"volume\":\"72 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-09-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 International Conference on Virtual Reality and Visualization\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICVRV.2012.11\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 International Conference on Virtual Reality and Visualization","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICVRV.2012.11","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

自动图像标注由于其在图像理解和网络图像搜索方面的潜在应用而成为一个重要的研究课题。由于图像标签映射固有的模糊性，如何系统地开发性能更好的鲁棒标注模型成为标注任务的一大挑战。本文提出了一种基于稀疏表示和多标签学习(SCMLL)的图像标注框架，旨在充分利用图像稀疏表示和多标签学习机制来解决图像标注问题。我们首先将每个图像视为其他图像的稀疏线性组合，然后基于L-1最小化计算的稀疏表示将组件图像视为目标图像的最近邻居。基于这些邻域标签集的统计信息，提出了一种基于后验(MAP)原理的多标签学习算法来确定未标记图像的标签。在已知数据集上的实验表明，该方法有利于图像标注任务，并且优于大多数现有的图像标注算法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Automatic Image Annotation Based on Sparse Representation and Multiple Label Learning

Automatic image annotation has emerged as an important research topic due to its potential application on both image understanding and web image search. Due to the inherent ambiguity of image-label mapping, the annotation task has become a challenge to systematically develop robust annotation models with better performance. In this paper, we present an image annotation framework based on Sparse Representation and Multi-Label Learning (SCMLL), which aims at taking full advantage of Image Sparse representation and multi-label learning mechanism to address the annotation problem. We first treat each image as a sparse linear combination of other images, and then consider the component images as the nearest neighbors of the target image based on a sparse representation computed by L-1 minimization. Based on statistical information gained from the label sets of these neighbors, a multiple label learning algorithm based on a posteriori (MAP) principle is presented to determine the tags for the unlabeled image. The experiments over the well known data set demonstrate that the proposed method is beneficial in the image annotation task and outperforms most existing image annotation algorithms.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2012 International Conference on Virtual Reality and Visualization

自引率

0.00%

发文量

期刊最新文献

Real-time Continuous Geometric Calibration for Projector-Camera System under Ambient Illumination Automatic generation of large scale 3D cloud based on weather forecast data Enhancing Touch Screen Games Through a Cable-driven Force Feedback Device 3D Face Reconstruction Based on Geometric Transformation GPU Based Compression and Rendering of Massive Aircraft CAD Models