Automated generation of high-quality training data for appearance-based object models

S. Becker, Arno Voelker, H. Kieritz, W. Hübner, Michael Arens
{"title":"Automated generation of high-quality training data for appearance-based object models","authors":"S. Becker, Arno Voelker, H. Kieritz, W. Hübner, Michael Arens","doi":"10.1117/12.2028714","DOIUrl":null,"url":null,"abstract":"Methods for automated person detection and person tracking are essential core components in modern security and surveillance systems. Most state-of-the-art person detectors follow a statistical approach, where prototypical appearances of persons are learned from training samples with known class labels. Selecting appropriate learning samples has a significant impact on the quality of the generated person detectors. For example, training a classifier on a rigid body model using training samples with strong pose variations is in general not effective, irrespective of the classifiers capabilities. Generation of high-quality training data is, apart from performance issues, a very time consuming process, comprising a significant amount of manual work. Furthermore, due to inevitable limitations of freely available training data, corresponding classifiers are not always transferable to a given sensor and are only applicable in a well-defined narrow variety of scenes and camera setups. Semi-supervised learning methods are a commonly used alternative to supervised training, in general requiring only few labeled samples. However, as a drawback semi-supervised methods always include a generative component, which is known to be difficult to learn. Therefore, automated processes for generating training data sets for supervised methods are needed. Such approaches could either help to better adjust classifiers to respective hardware, or serve as a complement to existing data sets. Towards this end, this paper provides some insights into the quality requirements of automatically generated training data for supervised learning methods. Assuming a static camera, labels are generated based on motion detection by background subtraction with respect to weak constraints on the enclosing bounding box of the motion blobs. Since this labeling method consists of standard components, we illustrate the effectiveness by adapting a person detector to cameras of a sensor network. While varying the training data and keeping the detection framework identical, we derive statements about the sample quality.","PeriodicalId":344928,"journal":{"name":"Optics/Photonics in Security and Defence","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Optics/Photonics in Security and Defence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.2028714","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

Methods for automated person detection and person tracking are essential core components in modern security and surveillance systems. Most state-of-the-art person detectors follow a statistical approach, where prototypical appearances of persons are learned from training samples with known class labels. Selecting appropriate learning samples has a significant impact on the quality of the generated person detectors. For example, training a classifier on a rigid body model using training samples with strong pose variations is in general not effective, irrespective of the classifiers capabilities. Generation of high-quality training data is, apart from performance issues, a very time consuming process, comprising a significant amount of manual work. Furthermore, due to inevitable limitations of freely available training data, corresponding classifiers are not always transferable to a given sensor and are only applicable in a well-defined narrow variety of scenes and camera setups. Semi-supervised learning methods are a commonly used alternative to supervised training, in general requiring only few labeled samples. However, as a drawback semi-supervised methods always include a generative component, which is known to be difficult to learn. Therefore, automated processes for generating training data sets for supervised methods are needed. Such approaches could either help to better adjust classifiers to respective hardware, or serve as a complement to existing data sets. Towards this end, this paper provides some insights into the quality requirements of automatically generated training data for supervised learning methods. Assuming a static camera, labels are generated based on motion detection by background subtraction with respect to weak constraints on the enclosing bounding box of the motion blobs. Since this labeling method consists of standard components, we illustrate the effectiveness by adapting a person detector to cameras of a sensor network. While varying the training data and keeping the detection framework identical, we derive statements about the sample quality.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
为基于外观的对象模型自动生成高质量的训练数据
自动化人员检测和跟踪方法是现代安防监控系统中必不可少的核心组成部分。大多数最先进的人检测器都采用统计方法,其中从具有已知类别标签的训练样本中学习人的原型外观。选择合适的学习样本对生成的人检测器的质量有重要影响。例如,无论分类器的能力如何,使用具有强姿态变化的训练样本在刚体模型上训练分类器通常都是无效的。除了性能问题外,生成高质量的训练数据是一个非常耗时的过程,包括大量的手工工作。此外,由于可免费获得的训练数据的不可避免的局限性,相应的分类器并不总是可转移到给定的传感器,并且仅适用于定义良好的各种场景和相机设置。半监督学习方法是一种常用的替代监督训练的方法,通常只需要很少的标记样本。然而,作为缺点,半监督方法总是包含生成组件,这是众所周知的难以学习。因此,需要为监督方法生成训练数据集的自动化过程。这种方法既可以帮助更好地调整分类器以适应各自的硬件,也可以作为现有数据集的补充。为此,本文对监督学习方法自动生成训练数据的质量要求提出了一些见解。假设相机是静态的,基于运动检测,根据运动blob包围框的弱约束,通过背景减法生成标签。由于这种标记方法由标准组件组成,我们通过将人探测器适应于传感器网络的摄像机来说明其有效性。在改变训练数据和保持检测框架相同的情况下,我们得出了关于样本质量的陈述。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Compact camera technologies for real-time false-color imaging in the SWIR band Arbitrary waveform generation using optical direct digital synthesis Advances in AlGaInN laser diode technology for defence applications Design of high sensitivity detector for underwater communication system Automated generation of high-quality training data for appearance-based object models
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1