A Dataset for Persistent Multi-target Multi-camera Tracking in RGB-D

Ryan Layne, S. Hannuna, M. Camplani, Jake Hall, Timothy M. Hospedales, T. Xiang, M. Mirmehdi, D. Damen
{"title":"A Dataset for Persistent Multi-target Multi-camera Tracking in RGB-D","authors":"Ryan Layne, S. Hannuna, M. Camplani, Jake Hall, Timothy M. Hospedales, T. Xiang, M. Mirmehdi, D. Damen","doi":"10.1109/CVPRW.2017.189","DOIUrl":null,"url":null,"abstract":"Video surveillance systems are now widely deployed to improve our lives by enhancing safety, security, health monitoring and business intelligence. This has motivated extensive research into automated video analysis. Nevertheless, there is a gap between the focus of contemporary research, and the needs of end users of video surveillance systems. Many existing benchmarks and methodologies focus on narrowly defined problems in detection, tracking, re-identification or recognition. In contrast, end users face higher-level problems such as long-term monitoring of identities in order to build a picture of a person's activity across the course of a day, producing usage statistics of a particular area of space, and that these capabilities should be robust to challenges such as change of clothing. To achieve this effectively requires less widely studied capabilities such as spatio-temporal reasoning about people identities and locations within a space partially observed by multiple cameras over an extended time period. To bridge this gap between research and required capabilities, we propose a new dataset LIMA that encompasses the challenges of monitoring a typical home / office environment. LIMA contains 4.5 hours of RGB-D video from three cameras monitoring a four room house. To reflect the challenges of a realistic practical application, the dataset includes clothes changes and visitors to ensure the global reasoning is a realistic open-set problem. In addition to raw data, we provide identity annotation for benchmarking, and tracking results from a contemporary RGB-D tracker – thus allowing focus on the higher level monitoring problems.","PeriodicalId":6668,"journal":{"name":"2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)","volume":"53 1","pages":"1462-1470"},"PeriodicalIF":0.0000,"publicationDate":"2017-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CVPRW.2017.189","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8

Abstract

Video surveillance systems are now widely deployed to improve our lives by enhancing safety, security, health monitoring and business intelligence. This has motivated extensive research into automated video analysis. Nevertheless, there is a gap between the focus of contemporary research, and the needs of end users of video surveillance systems. Many existing benchmarks and methodologies focus on narrowly defined problems in detection, tracking, re-identification or recognition. In contrast, end users face higher-level problems such as long-term monitoring of identities in order to build a picture of a person's activity across the course of a day, producing usage statistics of a particular area of space, and that these capabilities should be robust to challenges such as change of clothing. To achieve this effectively requires less widely studied capabilities such as spatio-temporal reasoning about people identities and locations within a space partially observed by multiple cameras over an extended time period. To bridge this gap between research and required capabilities, we propose a new dataset LIMA that encompasses the challenges of monitoring a typical home / office environment. LIMA contains 4.5 hours of RGB-D video from three cameras monitoring a four room house. To reflect the challenges of a realistic practical application, the dataset includes clothes changes and visitors to ensure the global reasoning is a realistic open-set problem. In addition to raw data, we provide identity annotation for benchmarking, and tracking results from a contemporary RGB-D tracker – thus allowing focus on the higher level monitoring problems.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
RGB-D中持久多目标多摄像机跟踪数据集
视频监控系统现已广泛部署,通过加强安全,安全,健康监控和商业智能来改善我们的生活。这激发了对自动视频分析的广泛研究。然而,当代研究的重点与视频监控系统最终用户的需求之间存在差距。许多现有的基准和方法侧重于检测、跟踪、再识别或识别方面的狭义问题。相比之下,终端用户面临着更高层次的问题,例如长期监控身份,以便建立一个人在一天中的活动图像,生成特定空间区域的使用统计数据,并且这些功能应该能够应对诸如更换衣服之类的挑战。为了有效地实现这一点,需要较少被广泛研究的能力,例如在长时间内由多个摄像头部分观察到的空间内,对人的身份和位置进行时空推理。为了弥合研究与所需能力之间的差距,我们提出了一个新的数据集LIMA,该数据集涵盖了监测典型家庭/办公环境的挑战。LIMA包含了4个小时的RGB-D视频,这些视频来自3个摄像头对一栋有4个房间的房子的监控。为了反映现实实际应用的挑战,数据集包括衣服变化和访客,以确保全局推理是一个现实的开集问题。除了原始数据之外,我们还提供了用于基准测试的标识注释,并跟踪来自现代RGB-D跟踪器的结果—从而允许关注更高级别的监视问题。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Measuring Energy Expenditure in Sports by Thermal Video Analysis Court-Based Volleyball Video Summarization Focusing on Rally Scene Generating 5D Light Fields in Scattering Media for Representing 3D Images Application of Computer Vision and Vector Space Model for Tactical Movement Classification in Badminton A Taxonomy and Evaluation of Dense Light Field Depth Estimation Algorithms
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1