基于稀疏表示和多标签学习的图像自动标注

Feng Tian, Sheng Xu-kun, Shang Fu-hua, Zhou Kai
{"title":"基于稀疏表示和多标签学习的图像自动标注","authors":"Feng Tian, Sheng Xu-kun, Shang Fu-hua, Zhou Kai","doi":"10.1109/ICVRV.2012.11","DOIUrl":null,"url":null,"abstract":"Automatic image annotation has emerged as an important research topic due to its potential application on both image understanding and web image search. Due to the inherent ambiguity of image-label mapping, the annotation task has become a challenge to systematically develop robust annotation models with better performance. In this paper, we present an image annotation framework based on Sparse Representation and Multi-Label Learning (SCMLL), which aims at taking full advantage of Image Sparse representation and multi-label learning mechanism to address the annotation problem. We first treat each image as a sparse linear combination of other images, and then consider the component images as the nearest neighbors of the target image based on a sparse representation computed by L-1 minimization. Based on statistical information gained from the label sets of these neighbors, a multiple label learning algorithm based on a posteriori (MAP) principle is presented to determine the tags for the unlabeled image. The experiments over the well known data set demonstrate that the proposed method is beneficial in the image annotation task and outperforms most existing image annotation algorithms.","PeriodicalId":421789,"journal":{"name":"2012 International Conference on Virtual Reality and Visualization","volume":"72 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Automatic Image Annotation Based on Sparse Representation and Multiple Label Learning\",\"authors\":\"Feng Tian, Sheng Xu-kun, Shang Fu-hua, Zhou Kai\",\"doi\":\"10.1109/ICVRV.2012.11\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Automatic image annotation has emerged as an important research topic due to its potential application on both image understanding and web image search. Due to the inherent ambiguity of image-label mapping, the annotation task has become a challenge to systematically develop robust annotation models with better performance. In this paper, we present an image annotation framework based on Sparse Representation and Multi-Label Learning (SCMLL), which aims at taking full advantage of Image Sparse representation and multi-label learning mechanism to address the annotation problem. We first treat each image as a sparse linear combination of other images, and then consider the component images as the nearest neighbors of the target image based on a sparse representation computed by L-1 minimization. Based on statistical information gained from the label sets of these neighbors, a multiple label learning algorithm based on a posteriori (MAP) principle is presented to determine the tags for the unlabeled image. The experiments over the well known data set demonstrate that the proposed method is beneficial in the image annotation task and outperforms most existing image annotation algorithms.\",\"PeriodicalId\":421789,\"journal\":{\"name\":\"2012 International Conference on Virtual Reality and Visualization\",\"volume\":\"72 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-09-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 International Conference on Virtual Reality and Visualization\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICVRV.2012.11\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 International Conference on Virtual Reality and Visualization","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICVRV.2012.11","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

自动图像标注由于其在图像理解和网络图像搜索方面的潜在应用而成为一个重要的研究课题。由于图像标签映射固有的模糊性,如何系统地开发性能更好的鲁棒标注模型成为标注任务的一大挑战。本文提出了一种基于稀疏表示和多标签学习(SCMLL)的图像标注框架,旨在充分利用图像稀疏表示和多标签学习机制来解决图像标注问题。我们首先将每个图像视为其他图像的稀疏线性组合,然后基于L-1最小化计算的稀疏表示将组件图像视为目标图像的最近邻居。基于这些邻域标签集的统计信息,提出了一种基于后验(MAP)原理的多标签学习算法来确定未标记图像的标签。在已知数据集上的实验表明,该方法有利于图像标注任务,并且优于大多数现有的图像标注算法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Automatic Image Annotation Based on Sparse Representation and Multiple Label Learning
Automatic image annotation has emerged as an important research topic due to its potential application on both image understanding and web image search. Due to the inherent ambiguity of image-label mapping, the annotation task has become a challenge to systematically develop robust annotation models with better performance. In this paper, we present an image annotation framework based on Sparse Representation and Multi-Label Learning (SCMLL), which aims at taking full advantage of Image Sparse representation and multi-label learning mechanism to address the annotation problem. We first treat each image as a sparse linear combination of other images, and then consider the component images as the nearest neighbors of the target image based on a sparse representation computed by L-1 minimization. Based on statistical information gained from the label sets of these neighbors, a multiple label learning algorithm based on a posteriori (MAP) principle is presented to determine the tags for the unlabeled image. The experiments over the well known data set demonstrate that the proposed method is beneficial in the image annotation task and outperforms most existing image annotation algorithms.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Real-time Continuous Geometric Calibration for Projector-Camera System under Ambient Illumination Automatic generation of large scale 3D cloud based on weather forecast data Enhancing Touch Screen Games Through a Cable-driven Force Feedback Device 3D Face Reconstruction Based on Geometric Transformation GPU Based Compression and Rendering of Massive Aircraft CAD Models
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1