Unsupervised Face Annotation by Mining the Web

Duy-Dinh Le, S. Satoh
{"title":"Unsupervised Face Annotation by Mining the Web","authors":"Duy-Dinh Le, S. Satoh","doi":"10.1109/ICDM.2008.47","DOIUrl":null,"url":null,"abstract":"Searching for images of people is an essential task for image and video search engines. However, current search engines have limited capabilities for this task since they rely on text associated with images and video, and such text is likely to return many irrelevant results. We propose a method for retrieving relevant faces of one person by learning the visual consistency among results retrieved from text correlation-based search engines. The method consists of two steps. In the first step, each candidate face obtained from a text-based search engine is ranked with a score that measures the distribution of visual similarities among the faces. Faces that are possibly very relevant or irrelevant are ranked at the top or bottom of the list, respectively. The second step improves this ranking by treating this problem as a classification problem in which input faces are classified as psilaperson-Xpsila or psilanon-person-Xpsila; and the faces are re-ranked according to their relevant score inferred from the classifierpsilas probability output. To train this classifier, we use a bagging-based framework to combine results from multiple weak classifiers trained using different subsets. These training subsets are extracted and labeled automatically from the rank list produced from the classifier trained from the previous step. In this way, the accuracy of the ranked list increases after a number of iterations. Experimental results on various face sets retrieved from captions of news photos show that the retrieval performance improved after each iteration, with the final performance being higher than those of the existing algorithms.","PeriodicalId":252958,"journal":{"name":"2008 Eighth IEEE International Conference on Data Mining","volume":"58 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"35","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 Eighth IEEE International Conference on Data Mining","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDM.2008.47","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 35

Abstract

Searching for images of people is an essential task for image and video search engines. However, current search engines have limited capabilities for this task since they rely on text associated with images and video, and such text is likely to return many irrelevant results. We propose a method for retrieving relevant faces of one person by learning the visual consistency among results retrieved from text correlation-based search engines. The method consists of two steps. In the first step, each candidate face obtained from a text-based search engine is ranked with a score that measures the distribution of visual similarities among the faces. Faces that are possibly very relevant or irrelevant are ranked at the top or bottom of the list, respectively. The second step improves this ranking by treating this problem as a classification problem in which input faces are classified as psilaperson-Xpsila or psilanon-person-Xpsila; and the faces are re-ranked according to their relevant score inferred from the classifierpsilas probability output. To train this classifier, we use a bagging-based framework to combine results from multiple weak classifiers trained using different subsets. These training subsets are extracted and labeled automatically from the rank list produced from the classifier trained from the previous step. In this way, the accuracy of the ranked list increases after a number of iterations. Experimental results on various face sets retrieved from captions of news photos show that the retrieval performance improved after each iteration, with the final performance being higher than those of the existing algorithms.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于网络挖掘的无监督人脸标注
人物图像的搜索是图像和视频搜索引擎的重要任务。然而,目前的搜索引擎在这项任务上的能力有限,因为它们依赖于与图像和视频相关的文本,而这样的文本很可能返回许多不相关的结果。我们提出了一种通过学习基于文本关联搜索引擎检索结果之间的视觉一致性来检索某个人相关面孔的方法。该方法包括两个步骤。在第一步中,从基于文本的搜索引擎中获得的每一张候选面孔都用一个分数来衡量这些面孔之间的视觉相似性分布。可能非常相关或不相关的面孔分别排在列表的顶部或底部。第二步通过将这个问题作为一个分类问题来改进这个排序,在这个分类问题中,输入的人脸被分类为psilanon- xpsila或psilanon-person-Xpsila;根据从分类器的概率输出中推断出的相关分数,对人脸进行重新排序。为了训练这个分类器,我们使用基于bagging的框架来组合使用不同子集训练的多个弱分类器的结果。这些训练子集从上一步训练的分类器生成的秩表中自动提取和标记。这样,经过多次迭代后,排名列表的准确性就会提高。从新闻图片的说明文字中检索到的各种人脸集的实验结果表明,每次迭代后检索性能都有所提高,最终的检索性能都高于现有算法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
SeqStream: Mining Closed Sequential Patterns over Stream Sliding Windows Support Vector Regression for Censored Data (SVRc): A Novel Tool for Survival Analysis A Probability Model for Projective Clustering on High Dimensional Data Text Cube: Computing IR Measures for Multidimensional Text Database Analysis A Hierarchical Algorithm for Clustering Uncertain Data via an Information-Theoretic Approach
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1