Rene Kaiser, M. Thaler, Andreas Kriechbaum, Hannes Fassold, W. Bailer, Jakub Rosner
{"title":"用于自动广播制作的高分辨率全景视频中的实时人物跟踪","authors":"Rene Kaiser, M. Thaler, Andreas Kriechbaum, Hannes Fassold, W. Bailer, Jakub Rosner","doi":"10.1109/CVMP.2011.9","DOIUrl":null,"url":null,"abstract":"For enabling immersive user experiences for interactive TV services and automating camera view selection and framing, knowledge of the location of persons in a scene is essential. We describe an architecture for detecting and tracking persons in high-resolution panoramic video streams, obtained from the Omni Cam, a panoramic camera stitching video streams from 6 HD resolution tiles. We use a CUDA accelerated feature point tracker, a blob detector and a CUDA HOG person detector, which are used for region tracking in each of the tiles before fusing the results for the entire panorama. In this paper we focus on the application of the HOG person detector in real-time and the speedup of the feature point tracker by porting it to NVIDIA's Fermi architecture. Evaluations indicate significant speedup for our feature point tracker implementation, enabling the entire process in a real-time system.","PeriodicalId":167135,"journal":{"name":"2011 Conference for Visual Media Production","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"28","resultStr":"{\"title\":\"Real-time Person Tracking in High-resolution Panoramic Video for Automated Broadcast Production\",\"authors\":\"Rene Kaiser, M. Thaler, Andreas Kriechbaum, Hannes Fassold, W. Bailer, Jakub Rosner\",\"doi\":\"10.1109/CVMP.2011.9\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"For enabling immersive user experiences for interactive TV services and automating camera view selection and framing, knowledge of the location of persons in a scene is essential. We describe an architecture for detecting and tracking persons in high-resolution panoramic video streams, obtained from the Omni Cam, a panoramic camera stitching video streams from 6 HD resolution tiles. We use a CUDA accelerated feature point tracker, a blob detector and a CUDA HOG person detector, which are used for region tracking in each of the tiles before fusing the results for the entire panorama. In this paper we focus on the application of the HOG person detector in real-time and the speedup of the feature point tracker by porting it to NVIDIA's Fermi architecture. Evaluations indicate significant speedup for our feature point tracker implementation, enabling the entire process in a real-time system.\",\"PeriodicalId\":167135,\"journal\":{\"name\":\"2011 Conference for Visual Media Production\",\"volume\":\"23 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2011-11-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"28\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2011 Conference for Visual Media Production\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CVMP.2011.9\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 Conference for Visual Media Production","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CVMP.2011.9","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 28
摘要
为了实现交互式电视服务的沉浸式用户体验和自动相机视图选择和取景,了解场景中人物的位置至关重要。我们描述了一种用于在高分辨率全景视频流中检测和跟踪人员的架构,该视频流来自Omni Cam, Omni Cam是一种全景相机,从6个高清分辨率块拼接视频流。我们使用CUDA加速特征点跟踪器,blob检测器和CUDA HOG人检测器,在融合整个全景图的结果之前,它们用于每个瓷砖的区域跟踪。本文重点研究了HOG人检测器的实时应用,并将HOG特征点跟踪器移植到NVIDIA的Fermi架构中,提高了HOG特征点跟踪器的速度。评估表明我们的特征点跟踪器实现显著加速,使整个过程在实时系统中实现。
Real-time Person Tracking in High-resolution Panoramic Video for Automated Broadcast Production
For enabling immersive user experiences for interactive TV services and automating camera view selection and framing, knowledge of the location of persons in a scene is essential. We describe an architecture for detecting and tracking persons in high-resolution panoramic video streams, obtained from the Omni Cam, a panoramic camera stitching video streams from 6 HD resolution tiles. We use a CUDA accelerated feature point tracker, a blob detector and a CUDA HOG person detector, which are used for region tracking in each of the tiles before fusing the results for the entire panorama. In this paper we focus on the application of the HOG person detector in real-time and the speedup of the feature point tracker by porting it to NVIDIA's Fermi architecture. Evaluations indicate significant speedup for our feature point tracker implementation, enabling the entire process in a real-time system.