2013 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB): Latest Articles

Parallel Hand Shape Classification

2013 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB)

Pub Date : 2013-12-09 DOI: 10.1109/ISM.2013.76

J. Nalepa, M. Kawulok

This paper introduces a new parallel algorithm (PA) for fast hand shape classification. This problem is challenging because a hand has a high number of degrees of freedom. Our objective is to design and implement a robust algorithm suitable for real-time applications. We show how parallelization can decrease the analysis time while increasing the classification accuracy. We also propose combining the shape contexts approach with appearance-based techniques to increase the efficacy of the PA. An extensive experimental study confirms the effectiveness of the proposed PA compared with other state-of-the-art methods.

Citations: 4

Towards Sketch-Based Motion Queries in Sports Videos

2013 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB)

Pub Date : 2013-12-09 DOI: 10.1109/ISM.2013.60

Ihab Al Kabary, H. Schuldt

The advent of pen-based user interfaces has enabled several natural forms of human-computer interaction. One example is sketch-based retrieval, i.e., the search for (multimedia) objects on the basis of sketches as query input. So far, work has focused mainly on sketch-based image retrieval. However, more and more application domains also benefit from sketches as query input for searching in video collections. Spatial search in videos, in the form of sketch-based motion queries, is increasingly demanded by coaches and analysts in team sports as a novel tool for game analysis. Even though game analysis is already a major activity in this domain, it is still mostly based on manual selection of video sequences. In this paper, we present Sport Sense, a first approach to enabling intuitive and efficient video retrieval using sketch-based motion queries. This is accomplished by using videos of games in team sports, together with an overlay of metadata that incorporates spatio-temporal information about various events. Sport Sense exploits spatio-temporal databases to store, index, and retrieve the tracked information at interactive response times. Moreover, it provides first intuitive user input interfaces for sketches representing motion paths. A particular challenge is to convert the users' sketches into spatial queries and to execute these queries in a flexible way that allows for some controlled deviation between the sketched path and the actual movement of the players and/or the ball. The evaluation results of Sport Sense show that this approach to sketch-based retrieval in sports videos is both very effective and efficient.
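
The idea of allowing a controlled deviation between a sketched path and tracked movement can be illustrated with a minimal sketch. This is not the Sport Sense implementation: the function names, the coordinates, and the single fixed tolerance are assumptions made for illustration. Here a tracked trajectory matches a sketch when every tracked point lies within the tolerance of the sketched polyline.

```python
import math

def point_segment_distance(p, a, b):
    """Euclidean distance from point p to the line segment a-b."""
    (px, py), (ax, ay), (bx, by) = p, a, b
    dx, dy = bx - ax, by - ay
    seg_len_sq = dx * dx + dy * dy
    if seg_len_sq == 0.0:
        # Degenerate segment: a and b coincide.
        return math.hypot(px - ax, py - ay)
    # Project p onto the segment and clamp the projection to its endpoints.
    t = ((px - ax) * dx + (py - ay) * dy) / seg_len_sq
    t = max(0.0, min(1.0, t))
    return math.hypot(px - (ax + t * dx), py - (ay + t * dy))

def distance_to_path(p, path):
    """Distance from p to the nearest segment of a polyline (needs >= 2 points)."""
    return min(point_segment_distance(p, a, b) for a, b in zip(path, path[1:]))

def matches_sketch(trajectory, sketch, tolerance):
    """True if every tracked point stays within `tolerance` of the sketch."""
    return all(distance_to_path(p, sketch) <= tolerance for p in trajectory)

# Hypothetical pitch coordinates, chosen only for this example.
sketch = [(0.0, 0.0), (10.0, 0.0), (10.0, 10.0)]
near = [(1.0, 0.5), (9.0, -0.5), (10.5, 5.0)]
far = [(1.0, 0.5), (5.0, 4.0)]
```

A production system would more likely push this test into a spatial database query (for example, a buffer around the sketched line) rather than scanning points in application code.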

Citations: 10

Efficient Multi-stage Image Classification for Mobile Sensing in Urban Environments

2013 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB)

Pub Date : 2013-12-09 DOI: 10.1109/ISM.2013.45

Shashank Mujumdar, N. Rajamani, L. V. Subramaniam, Dror Porat

With the recent dramatic increase in the popularity of mobile electronic devices equipped with cameras, there is a growing number of real-world applications for image classification. Nevertheless, some of these real-world applications aim to classify images captured in an unconstrained manner and in complex environments where existing image classification techniques may not perform well. We propose an efficient image classification system that is robust enough to cope with challenging imaging conditions, and demonstrate its effectiveness in the context of classification of real-world images of dumpsters captured by mobile phones in the Indian metropolitan city of Hyderabad. Our system is able to achieve accurate classification of the cleanliness state of the dumpsters despite the challenging uncontrolled urban environment by utilizing a multi-stage approach, where the first stage is the efficient detection of the dumpster, and the second stage is the classification of its state. We analyze the performance of the system and provide comprehensive experimental results on a real-world public dataset.

Citations: 2

Audio Feature and Classifier Analysis for Efficient Recognition of Environmental Sounds

2013 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB)

Pub Date : 2013-12-09 DOI: 10.1109/ISM.2013.29

C. Okuyucu, M. Sert, A. Yazıcı

Environmental sounds (ES) have different characteristics, such as an unstructured nature and typically noise-like, flat spectra, which make the recognition task difficult compared to speech or music sounds. Here, we perform an exhaustive feature and classifier analysis for the recognition of considerably similar ES categories and propose a best representative feature to yield higher recognition accuracy. In the experiments, thirteen (13) ES categories, namely emergency alarm, car horn, gun, explosion, automobile, helicopter, water, wind, rain, applause, crowd, and laughter, are detected and tested based on eleven (11) audio features (MPEG-7 family, ZCR, MFCC, and combinations) by using the HMM and SVM classifiers. Extensive experiments have been conducted to demonstrate the effectiveness of these joint features for ES classification. Our experiments show that the joint feature set ASFCS-H (Audio Spectrum Flatness, Centroid, Spread, and Audio Harmonicity) is the best representative feature set, with an average F-measure value of 80.6%.
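
For readers unfamiliar with the metric, the following minimal sketch shows how a per-category F-measure and its average can be computed. It assumes the standard balanced F-measure (the harmonic mean of precision and recall) and an unweighted average over categories; the abstract does not state which averaging the authors used, and the counts below are hypothetical.

```python
def f_measure(tp, fp, fn):
    """Balanced F-measure from true-positive, false-positive, false-negative counts."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall == 0.0:
        return 0.0
    return 2 * precision * recall / (precision + recall)

def macro_f_measure(counts):
    """Unweighted mean of per-category F-measures.

    `counts` maps a category name to a (tp, fp, fn) tuple.
    """
    return sum(f_measure(*c) for c in counts.values()) / len(counts)

# Hypothetical confusion counts, not taken from the paper.
example_counts = {"rain": (8, 2, 2), "wind": (6, 4, 0)}
```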

Citations: 20

Dichotomic Decision Cascading for Video Shot Boundary Detection

2013 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB)

Pub Date : 2013-12-09 DOI: 10.1109/ISM.2013.43

Mennan Güder, N. Cicekli

In this paper, we present a shot boundary decision fusion strategy which implements a multi-modal cascaded dichotomic search on the boundary space. The initial and core step of the proposed method is narrowing the shot boundary decision space as long as the accuracy is improved. Instead of the default sequential change detection, a dichotomic change strategy, supervised by a cascaded fusion, is implemented to achieve higher accuracy and lower algorithmic complexity. The main decision sources are image color histograms, object recognizer results, motion comparators, audio pattern analyzers, key point extractors, and edge descriptors, which are selectively employed in a cascaded manner. We propose a shot boundary detection algorithm which is noise-tolerant, adaptable to video genre, context-aware, and computationally efficient. In order to reduce computational complexity, we construct a shot boundary search heuristic for pruning the set of candidate shot boundary frames. We employ both statistical and rule-based approaches in a cascaded fashion to decide the size of the search space to be pruned, for the purpose of improving computational efficiency. The TRECVid 2006 and 2007 data sets are used in the evaluation process, and performance results are given for both cuts and gradual transitions.
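
The contrast between sequential and dichotomic change detection can be sketched as follows. This is a simplified illustration, not the authors' cascaded multi-modal method: it assumes each frame is reduced to one comparable feature and that a hypothetical `differs` predicate decides whether two frames belong to different shots. An interval is split only when its end frames differ, so long runs of similar frames are pruned without comparing every consecutive pair.

```python
def find_cuts(features, differs, lo=0, hi=None):
    """Return indices i such that a cut lies between frame i and frame i + 1.

    features: per-frame feature values (any type accepted by `differs`).
    differs:  predicate(a, b) -> True if the two frames look like different shots.
    """
    if hi is None:
        hi = len(features) - 1
    if hi <= lo or not differs(features[lo], features[hi]):
        # End frames look alike: assume no boundary inside (a pruning heuristic).
        return []
    if hi - lo == 1:
        return [lo]
    mid = (lo + hi) // 2
    return find_cuts(features, differs, lo, mid) + find_cuts(features, differs, mid, hi)

# Toy one-dimensional "features" standing in for, e.g., histogram summaries.
frames = [0, 0, 0, 0, 5, 5, 5, 5, 9, 9]
cuts = find_cuts(frames, lambda a, b: abs(a - b) > 2)
```

The pruning step is what makes the search cheap, and it is also its weakness: if a shot is inserted between two visually similar shots, the end frames of the enclosing interval look alike and the boundaries inside are missed. The additional decision sources described in the abstract can be read as a way to make such pruning decisions more reliable.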

Citations: 6

Efficient Execution of Conjunctive Complex Queries on Big Multimedia Databases

2013 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB)

Pub Date : 2013-12-09 DOI: 10.1109/ISM.2013.112

Karina Fasolin, Renato Fileto, Marcelo Krüger, D. S. Kaster, Mônica Ribeiro Porto Ferreira, R. Cordeiro, A. Traina, C. Traina

This paper proposes an approach to efficiently execute conjunctive queries on big complex data together with their related conventional data. The basic idea is to horizontally fragment the database according to criteria frequently used in query predicates. The collection of fragments is indexed to efficiently find the fragment(s) whose contents satisfy some query predicate(s). The contents of each fragment are then indexed as well, to support efficient filtering of the fragment data according to other query predicate(s) conjunctively connected to the former. This strategy has been applied to a collection of more than 106 million images together with their related conventional data. Experimental results show considerable performance gain of the proposed approach for queries with conventional and similarity-based predicates, compared to the use of a unique metric index for the entire database contents.
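
The two-level strategy described above can be sketched in a few lines. The class and field names here are hypothetical, and the per-fragment "index" is a plain linear scan standing in for the metric index a real system would use; the sketch only shows how a conventional predicate selects a fragment before the similarity predicate is evaluated inside it.

```python
import math
from collections import defaultdict

class FragmentedStore:
    """Records are horizontally fragmented by a frequently queried attribute."""

    def __init__(self, fragment_key):
        self.fragment_key = fragment_key      # function: record -> fragment id
        self.fragments = defaultdict(list)    # fragment id -> list of records

    def insert(self, record):
        self.fragments[self.fragment_key(record)].append(record)

    def query(self, key_value, center, radius, distance=math.dist):
        """Conjunctive query: fragment predicate AND similarity range predicate."""
        # Step 1: locate the fragment satisfying the conventional predicate.
        candidates = self.fragments.get(key_value, [])
        # Step 2: filter inside the fragment by similarity (linear scan here).
        return [r for r in candidates if distance(r["feature"], center) <= radius]

# Hypothetical image records with a conventional attribute and a feature vector.
store = FragmentedStore(lambda r: r["city"])
store.insert({"id": 1, "city": "A", "feature": (0.0, 0.0)})
store.insert({"id": 2, "city": "A", "feature": (5.0, 5.0)})
store.insert({"id": 3, "city": "B", "feature": (0.1, 0.1)})
hits = store.query("A", center=(0.0, 0.0), radius=1.0)
```

Record 3 is close to the query center but is never examined, because it lives in a different fragment. That skipped work is the source of the gain over a single index covering the whole collection.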

Citations: 10

Multi Category Content Selection in Spaced Repetition Based Mobile Learning Games

2013 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB)

Pub Date : 2013-12-09 DOI: 10.1109/ISM.2013.90

Florian Schimanke, R. Mertens, O. Vornberger, Stephanie Vollmer

Learning requires repetition. Spaced repetition algorithms aim to reduce the number of times a learner has to access a learning item by scheduling item presentation based on psychological models. These models take into account learner performance in previous interactions with the learning item and the rate at which humans forget what they have learned. In recent years, spaced repetition learning software has become popular for simple learning tasks such as flash cards used for learning vocabulary. This paper presents a prototype application that extends the spaced repetition learning approach to more complex content of the kind usually found in learning games. One major difference between this content and flash cards is that learning games usually contain a number of different tasks that convey the same underlying concept categories. To complicate matters, one task might even be classified as belonging to a number of independent or orthogonal categories. This paper explores how these categories can be modeled on the basis of a mobile game designed for training in the field of relational databases. We have chosen a mobile approach to leverage its anytime/anyplace availability, which allows more precise scheduling by the spaced repetition algorithm.
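
One way to picture scheduling over categories rather than individual cards is the following minimal sketch. It is not the prototype described in the paper: it assumes a Leitner-style box system, illustrative review intervals, and hypothetical task and category names. Each task belongs to one or more categories, an answer updates every category the task covers, and the next task is the one whose categories are most overdue.

```python
from dataclasses import dataclass, field

INTERVALS = [1, 2, 4, 8, 16]  # days until next review per box; illustrative values

@dataclass
class CategoryState:
    box: int = 0      # current Leitner box of the category
    due_day: int = 0  # day on which the category is due for review

@dataclass
class Scheduler:
    tasks: dict                                  # task name -> set of category names
    states: dict = field(default_factory=dict)   # category name -> CategoryState

    def __post_init__(self):
        for categories in self.tasks.values():
            for c in categories:
                self.states.setdefault(c, CategoryState())

    def record(self, task, correct, today):
        """Update every category the task belongs to."""
        for c in self.tasks[task]:
            s = self.states[c]
            # Promote one box on success, fall back to the first box on failure.
            s.box = min(s.box + 1, len(INTERVALS) - 1) if correct else 0
            s.due_day = today + INTERVALS[s.box]

    def next_task(self, today):
        """Pick the task whose categories are most overdue, or None if none is due."""
        def overdue(task):
            return sum(max(0, today - self.states[c].due_day + 1)
                       for c in self.tasks[task])
        best = max(self.tasks, key=overdue)
        return best if overdue(best) > 0 else None

# Hypothetical tasks from a relational-database learning game.
scheduler = Scheduler({"join_quiz": {"joins", "keys"}, "select_quiz": {"select"}})
```

Because a task that covers two due categories scores higher than a task that covers one, multi-category tasks are preferred when several concepts need refreshing at once. Whether that preference is desirable is a design choice this sketch does not settle.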

Citations: 10

The LectureSight System in Production Scenarios and Its Impact on Learning from Video Recorded Lectures

2013 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB)

Pub Date : 2013-12-09 DOI: 10.1109/ISM.2013.91

Benjamin Wulff, L. Rupp, Alexander Fecke, Kai-Christoph Hamborg

The LectureSight project aims to develop a cost-effective solution for automatic camera control for lecture recordings. An earlier work presented the prototypical implementation of the system. In this work, we present the results from testing the system in two experiments: LectureSight instances were deployed in two rooms, and the performance of the system was assessed in live lectures. Furthermore, an experimental study has been conducted to investigate the usefulness of videos with presenter tracking for the learner. The experiment involved participants from two universities who were put into a simulated exam situation.

Citations: 3

Spherical Panorama Construction Using Multi Sensor Registration Priors and Its Real-Time Hardware

2013 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB)

Pub Date : 2013-12-09 DOI: 10.1109/ISM.2013.35

Omer Cogal, Vladan Popovic, Y. Leblebici

In this work, a novel method is presented to improve the quality of panoramic images on a spherically arranged multi-sensor imaging system. The new method is composed of two parts. The first approach is based on mapping the panorama generation problem onto a Markov Random Field (MRF) and then estimating posterior probabilities from initial likelihoods. The novelty of the approach lies in extracting the prior evidence from the registration information of multiple cameras and estimating the expected value on an undirected graph. The second part of the method is a geometrical approach targeting a better estimation of the initial priors, which has also not been applied before. The aim of both approaches is to decrease the parallax errors and ghosting effects that occur due to the nature of multi-camera systems. It is shown that, instead of directly using independent intensity coefficients extracted from registration information, applying a neighborhood-based local probability distribution for each pixel of the panorama, with the registration information as prior, gives better results. Visual comparisons are provided to show the achieved quality enhancement in terms of seamless and more natural panoramic images with fewer ghosting effects. Since the registration priors are used effectively with a single iteration step in a 4-connected neighborhood, the need for an intensity-based loopy and iterative inference method is eliminated. Hence, the proposed methods are suitable for real-time hardware implementation. A hardware implementation of the method for real-time operation is proposed.

Citations: 10

Multidimensional QoE Assessment of Multi-view Video and Selectable Audio (MVV-SA) IP Transmission

2013 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB)

Pub Date : 2013-12-09 DOI: 10.1109/ISM.2013.109

Takuya Ishida, Toshiro Nunome

This paper deals with Multi-View Video and Selectable Audio (MVV-SA) IP transmission, in which users can switch not only video but also audio according to a viewpoint change request. We evaluate the QoE of MVV-SA through a subjective experiment. The evaluation is performed by the Semantic Differential (SD) method with 13 adjective pairs. In the subjective experiment, we ask assessors to evaluate 40 stimuli, which consist of two kinds of UDP load traffic, two kinds of fixed additional delay, five kinds of playout buffering time, and selectable or un-selectable audio (i.e., MVV-SA or the previous MVV-A). As a result, MVV-SA gives the user a higher sense of presence than MVV-A and thereby enhances QoE. We also conduct factor analysis to clarify the component factors of QoE.

Citations: 2
