An Approach to Large Scale Interactive Retrieval of Cultural Heritage

Masato Takami, Peter Bell, B. Ommer
{"title":"An Approach to Large Scale Interactive Retrieval of Cultural Heritage","authors":"Masato Takami, Peter Bell, B. Ommer","doi":"10.2312/gch.20141307","DOIUrl":null,"url":null,"abstract":"Large scale digitization campaigns are simplifying the accessibility of a rapidly increasing number of images from cultural heritage. However, digitization alone is not sufficient to effectively open up these valuable resources. Retrieval and analysis within these datasets is currently mainly based on manual annotation and laborious preprocessing. This is not only a tedious task, which rapidly becomes infeasible due to the enormous data load. We also risk to be biased to only see what an annotator beforehand has focused on. Thus a lot of potential is being wasted. \n \nOne of the most prevalent tasks is that of discovering similar objects in a dataset to find relations therein. The majority of existing systems for this task are detecting similar objects using visual feature keypoints. While having a low processing time, these methods are limited to detect only close duplicates due to their keypoint based representation. In this work we propose a search method which can detect similar objects even if they exhibit considerable variability. Our procedure learns models of the appearance of objects and trains a classifier to find related instances. We address a central problem of such learning-based methods, the need for appropriate negative and positive training samples. To avoid a highly complicated hard negative mining stage we propose a pooling procedure for gathering generic negatives. Moreover, a bootstrap approach is presented to aggregate positive training samples. Comparison of existing search methods in cultural heritage benchmark problems demonstrates that our approach yields significantly improved detection performance. Moreover, we show examples of searching across different types of datasets, e.g., drafts and photographs.","PeriodicalId":203827,"journal":{"name":"Eurographics Workshop on Graphics and Cultural Heritage","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Eurographics Workshop on Graphics and Cultural Heritage","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2312/gch.20141307","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

Abstract

Large scale digitization campaigns are simplifying the accessibility of a rapidly increasing number of images from cultural heritage. However, digitization alone is not sufficient to effectively open up these valuable resources. Retrieval and analysis within these datasets is currently mainly based on manual annotation and laborious preprocessing. This is not only a tedious task, which rapidly becomes infeasible due to the enormous data load. We also risk to be biased to only see what an annotator beforehand has focused on. Thus a lot of potential is being wasted. One of the most prevalent tasks is that of discovering similar objects in a dataset to find relations therein. The majority of existing systems for this task are detecting similar objects using visual feature keypoints. While having a low processing time, these methods are limited to detect only close duplicates due to their keypoint based representation. In this work we propose a search method which can detect similar objects even if they exhibit considerable variability. Our procedure learns models of the appearance of objects and trains a classifier to find related instances. We address a central problem of such learning-based methods, the need for appropriate negative and positive training samples. To avoid a highly complicated hard negative mining stage we propose a pooling procedure for gathering generic negatives. Moreover, a bootstrap approach is presented to aggregate positive training samples. Comparison of existing search methods in cultural heritage benchmark problems demonstrates that our approach yields significantly improved detection performance. Moreover, we show examples of searching across different types of datasets, e.g., drafts and photographs.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
一种大规模文化遗产交互式检索方法
大规模的数字化活动简化了快速增长的文化遗产图像的可访问性。然而,仅靠数字化还不足以有效地开放这些宝贵的资源。这些数据集的检索和分析目前主要基于人工标注和费力的预处理。这不仅是一项繁琐的任务,而且由于巨大的数据负载,它很快变得不可行。我们也冒着偏见的风险,只看到注释者事先关注的内容。因此,大量的潜力被浪费了。最常见的任务之一是在数据集中发现相似的对象以找到其中的关系。大多数现有的系统都是使用视觉特征关键点来检测相似的物体。虽然处理时间较短,但由于这些方法基于关键点表示,因此仅限于检测接近的重复项。在这项工作中,我们提出了一种搜索方法,可以检测相似的对象,即使它们表现出相当大的可变性。我们的程序学习对象的外观模型,并训练分类器来查找相关实例。我们解决了这种基于学习的方法的一个核心问题,需要适当的负和正训练样本。为了避免高度复杂的硬否定挖掘阶段,我们提出了一种收集通用否定的池化程序。此外,还提出了一种自举方法来聚合正训练样本。与现有的文化遗产基准问题搜索方法的比较表明,我们的方法显著提高了检测性能。此外,我们还展示了跨不同类型数据集(例如草稿和照片)进行搜索的示例。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Direct Elastic Unrollings of Painted Pottery Surfaces from Sparse Image Sets 3D for Studying Reuse in 19th Century Cairo: the Case of Saint-Maurice Residence Reimagining a 2D Painted Portrait as a Kinetic 3D Sculpture Interactive 3D Artefact Puzzles to Support Engagement Beyond the Museum Environment Riedones3D: a celtic coin dataset for registration and fine-grained clustering
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1