Takahiro Hayashi, N. Ishii, Masato Ishimori, K. Abe
This paper reports an improvement to indirect matching, a fast CBMIR (content-based music information retrieval) framework proposed in our previous study. Indirect matching achieves fast retrieval by combining offline search with representative queries and quick online similarity estimation based on the offline search results. We have found that the retrieval accuracy of indirect matching decreases when the representative queries have little variation. This paper proposes a method for selecting representative queries with wide variation. To ensure wide variation between representative queries, the proposed method combines MDS (multi-dimensional scaling) and Ward's clustering. Experimental results show that the proposed method stabilizes the retrieval accuracy of indirect matching.
"Stability Improvement of Indirect Matching for Music Information Retrieval," Takahiro Hayashi, N. Ishii, Masato Ishimori, K. Abe. 2015 IEEE International Symposium on Multimedia (ISM), December 2015. DOI: 10.1109/ISM.2015.29 (https://doi.org/10.1109/ISM.2015.29).
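The MDS-plus-Ward selection step can be sketched as follows. This is an illustrative reconstruction, not the authors' code: the embedding dimension, the use of `fcluster`, and the pick-nearest-to-centroid rule are all assumptions.

```python
import numpy as np
from scipy.cluster.hierarchy import linkage, fcluster

def classical_mds(dist, dim=2):
    """Embed a pairwise distance matrix into `dim` dimensions (classical MDS)."""
    n = dist.shape[0]
    J = np.eye(n) - np.ones((n, n)) / n           # centering matrix
    B = -0.5 * J @ (dist ** 2) @ J                # double-centered Gram matrix
    vals, vecs = np.linalg.eigh(B)
    idx = np.argsort(vals)[::-1][:dim]            # keep the largest eigenvalues
    return vecs[:, idx] * np.sqrt(np.maximum(vals[idx], 0))

def select_representatives(dist, k):
    """Pick k widely separated representatives: MDS-embed the candidates,
    Ward-cluster them, then take the member closest to each centroid."""
    X = classical_mds(dist)
    labels = fcluster(linkage(X, method='ward'), t=k, criterion='maxclust')
    reps = []
    for c in np.unique(labels):
        members = np.where(labels == c)[0]
        centroid = X[members].mean(axis=0)
        reps.append(int(members[np.argmin(
            np.linalg.norm(X[members] - centroid, axis=1))]))
    return sorted(reps)

# toy check: two tight groups far apart should yield one rep from each group
pts = np.array([[0., 0.], [0.1, 0.], [5., 5.], [5.1, 5.]])
D = np.linalg.norm(pts[:, None] - pts[None, :], axis=-1)
print(select_representatives(D, 2))
```

Because Ward's method merges clusters by minimum variance increase, the chosen representatives end up spread across the query space, which is exactly the "wide variation" property the paper targets.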
In this paper we present an open, secure and flexible architectural approach for integrating the Content Decryption Module (CDM) and the Encrypted Media Extensions (EME) interface with a Trusted Execution Environment (TEE). This hardens playback of Digital Rights Management (DRM) protected content without the need for a dedicated secure processor. The article does not focus on specific TEE approaches; rather, it considers that any TEE can potentially be integrated with a CDM through the EME interface. Our work introduces approaches for integrating a CDM with the Intel SGX and ARM TrustZone TEE technologies.
"Towards Security of Native DRM Execution in HTML5," D. Livshits, A. Mikityuk, S. Pham, A. Shabtai. 2015 IEEE International Symposium on Multimedia (ISM), December 2015. DOI: 10.1109/ISM.2015.48 (https://doi.org/10.1109/ISM.2015.48).
Real-time detection and tracking of multiple vehicles in airborne videos is still a challenging problem due to camera motion and low resolution. In this paper, a real-time technique for simultaneously detecting, tracking and counting vehicles in airborne and stationary camera videos is proposed. First, feature points are extracted and tracked through the video frames. A new strategy removes non-stationary background points by measuring how the histogram of the pixels around each feature point changes over time. The remaining foreground features are clustered and grouped into separately trackable vehicles based on their motion properties. Experiments on videos from both airborne and fixed cameras confirm the excellent properties of the proposed algorithm.
"A Novel Algorithm for Vehicle Detection and Tracking in Airborne Videos," M. Abdelwahab, M. Abdelwahab. 2015 IEEE International Symposium on Multimedia (ISM), December 2015. DOI: 10.1109/ISM.2015.77 (https://doi.org/10.1109/ISM.2015.77).
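The histogram-change test for separating moving vehicles from background points might look like the following sketch. The patch size, bin count, L1 distance and threshold are all assumptions; the paper does not specify them.

```python
import numpy as np

def patch_hist(frame, x, y, half=8, bins=16):
    """Normalized grayscale histogram of the patch around a feature point."""
    patch = frame[max(y - half, 0):y + half, max(x - half, 0):x + half]
    h, _ = np.histogram(patch, bins=bins, range=(0, 256))
    return h / max(h.sum(), 1)

def is_moving(hists, thresh=0.2):
    """Flag a tracked point as foreground when the histogram of its
    surroundings changes over time (L1 distance and `thresh` are assumed)."""
    diffs = [np.abs(hists[i + 1] - hists[i]).sum()
             for i in range(len(hists) - 1)]
    return float(np.mean(diffs)) > thresh

# toy check: a patch whose intensities stay put vs. one that changes
f_a = np.full((32, 32), 10, np.uint8)
f_b = np.full((32, 32), 200, np.uint8)
print(is_moving([patch_hist(f_a, 16, 16)] * 3))                       # static
print(is_moving([patch_hist(f_a, 16, 16), patch_hist(f_b, 16, 16)]))  # changed
```

The appeal of a histogram criterion is that it is insensitive to small tracking jitter: a background point keeps roughly the same surroundings even as the airborne camera moves, while a point on a vehicle sweeps over new pixels.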
Seethal Paluri, K. Kambhatla, M. Medley, J. Matyjas, Sunil Kumar
We present a real-time, priority-aware joint packet fragmentation and error protection scheme for transmitting H.264/AVC compressed video over Rayleigh fading channels. A priority is assigned to every video slice based on the predicted cumulative mean squared error (CMSE) contributed by its loss, relative to all the slices of the group of pictures (GOP). Here the CMSE of a slice, predicted by our low-complexity generalized linear model, represents the degradation in video quality caused by its loss. The slices of a frame belonging to the same priority are aggregated to form video packets. We simulate the fragment error rates (FERs) for combinations of different fragment sizes and rate-compatible punctured convolutional (RCPC) code rates. These FERs are then used to determine the optimal fragment sizes and code rates for the packets of each priority class by minimizing the expected normalized predicted CMSE of all priority classes per GOP in the H.264 video bitstream. We observed a significant improvement in received video quality over the conventional and priority-agnostic packet fragmentation schemes.
"Priority-Aware Joint Packet Fragmentation and Error Protection Scheme for H.264 Video over Wireless Channels," Seethal Paluri, K. Kambhatla, M. Medley, J. Matyjas, Sunil Kumar. 2015 IEEE International Symposium on Multimedia (ISM), December 2015. DOI: 10.1109/ISM.2015.47 (https://doi.org/10.1109/ISM.2015.47).
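The per-class search over fragment sizes and RCPC code rates can be sketched as a brute-force minimization. The FER table, the bandwidth penalty `lam`, and the cost form are illustrative assumptions standing in for the paper's simulated FERs and normalized-CMSE objective.

```python
# Illustrative (assumed) simulated fragment error rates indexed by
# (fragment size in bytes, RCPC code rate); real values would come from
# the Rayleigh-channel simulation described in the paper.
FER = {
    (256, '1/2'): 0.01, (256, '2/3'): 0.03, (256, '4/5'): 0.08,
    (512, '1/2'): 0.02, (512, '2/3'): 0.05, (512, '4/5'): 0.12,
}
RATE = {'1/2': 1 / 2, '2/3': 2 / 3, '4/5': 4 / 5}

def packet_loss_prob(packet_bytes, frag_size, code_rate):
    """A packet is lost if any fragment is lost (fragments assumed independent)."""
    n_frags = -(-packet_bytes // frag_size)           # ceiling division
    return 1.0 - (1.0 - FER[(frag_size, code_rate)]) ** n_frags

def best_config(packet_bytes, cmse, lam=0.01):
    """Minimize expected CMSE plus an assumed linear bandwidth penalty for
    parity overhead; without some rate penalty the strongest code always wins."""
    def cost(cfg):
        frag_size, code_rate = cfg
        loss = packet_loss_prob(packet_bytes, frag_size, code_rate)
        return cmse * loss + lam * packet_bytes / RATE[code_rate]
    return min(FER, key=cost)

# higher-priority (higher-CMSE) classes earn stronger protection
for cmse in (5.0, 40.0, 400.0):
    print(cmse, best_config(1024, cmse))
```

With these numbers, the chosen code strengthens from rate 4/5 through 2/3 to 1/2 as the class CMSE grows, which is the qualitative behavior the scheme relies on.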
The use of Voice over Internet Protocol (VoIP) is becoming ubiquitous due to the multiple shortcomings of traditional Public Switched Telephone Network (PSTN) systems. As a result, the development of secure key establishment protocols is becoming increasingly important. In response to this demand, the Communications-Electronics Security Group (CESG) has published new key agreement protocols for the Multimedia Internet KEYing (MIKEY) protocol that provide low-cost secure VoIP communications supported by Identity-based Public-Key Cryptography (IDPKC). In IDPKC, users' identities are used to derive their public keys, which eliminates the expense of maintaining a Public-Key Infrastructure (PKI). However, IDPKC systems suffer from inefficient user revocation and key renewal. In this paper, we exploit the fact that users must be connected to the Internet to communicate and introduce a SEcurity Mediator (SEM) that holds a share of each user's private key and with which users must cooperate to sign and decrypt cryptograms. Building on this sharing, we introduce mechanisms that provide immediate user revocation and key renewal.
"Featuring Immediate Revocation in Mikey-Sakke (FIRM)," P. Martins, L. Sousa, P. Chawan. 2015 IEEE International Symposium on Multimedia (ISM), December 2015. DOI: 10.1109/ISM.2015.101 (https://doi.org/10.1109/ISM.2015.101).
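The mediator principle can be illustrated with a toy mediated-RSA example: the private exponent is additively split between the user and the SEM, so neither can sign alone and the SEM can revoke a user instantly by withholding its half-signature. Note this is only an analogy with textbook parameters; the paper works with MIKEY-SAKKE's identity-based primitives, not RSA.

```python
p, q = 61, 53                          # toy textbook primes -- NOT secure
n, phi = p * q, (p - 1) * (q - 1)      # n = 3233, phi = 3120
e, d = 17, 2753                        # e*d = 1 (mod phi)

d_user = 1000                          # user's share of the private exponent
d_sem = d - d_user                     # SEM's complementary share

def half_sign(m, d_share):
    """Each party exponentiates by its own share; m^d1 * m^d2 = m^d (mod n)."""
    return pow(m, d_share, n)

m = 65                                 # message representative (already hashed)
s = (half_sign(m, d_user) * half_sign(m, d_sem)) % n   # combine the halves
print(pow(s, e, n) == m)               # the signature verifies under (n, e)
```

The revocation property falls out directly: once the SEM deletes `d_sem`, the user's remaining share is useless, with no certificate revocation list needed.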
In this paper, we present the design and implementation of a toy car localization and navigation system, which enables toy cars and "passengers" to learn and exchange their fine-grained locations in an indoor environment with the help of a projected-light localization technique. The projected light consists of a sequence of Gray code images that assign each pixel in the projection area a unique Gray code identifying its coordinates. Light sensors installed on the toy cars and on a potential "passenger" receive the light streams from the projector, from which their locations are inferred. The toy car then uses the A* algorithm to plan a route based on its own location and orientation, the target's location and the map of "roads". The speed of the projected-light localization technique enables the toy car to adjust its orientation while "driving" and keep itself on the "roads". The toy car system demonstrates that the localization technique and the client-server architecture can benefit similar applications that require fine-grained location information for multiple objects simultaneously.
"Smart Toy Car Localization and Navigation Using Projected Light," Mingming Fan, Qiong Liu, Shang Ma, Patrick Chiu. 2015 IEEE International Symposium on Multimedia (ISM), December 2015. DOI: 10.1109/ISM.2015.113 (https://doi.org/10.1109/ISM.2015.113).
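The Gray-code encoding behind the projected frames is standard and can be sketched directly; the 10-bit width and the MSB-first frame order are assumptions about the projector setup.

```python
def to_gray(n):
    """Binary-reflected Gray code: consecutive values differ in exactly one
    bit, which makes the projected patterns robust at stripe boundaries."""
    return n ^ (n >> 1)

def from_gray(g):
    """Invert the Gray code by a running prefix XOR."""
    n = 0
    while g:
        n ^= g
        g >>= 1
    return n

# Each projector column gets a unique Gray code; a light sensor that reads
# the bit sequence over successive frames recovers its own coordinate.
bits = 10                                   # e.g. 1024 projector columns
column = 357
frames = [(to_gray(column) >> b) & 1 for b in reversed(range(bits))]
decoded = from_gray(int(''.join(map(str, frames)), 2))
print(decoded)   # recovers 357
```

Decoding is O(number of frames) per sensor, which is what lets the car re-localize fast enough to correct its orientation while driving.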
This paper presents a GPU-based image registration algorithm that uses the Hough transform and least-squares optimization to calculate the transformation between two images. In our approach, we calculate the transformation parameters of all possible combinations of matched feature points by exploiting the parallel processing power of the GPU. We applied our algorithm to a variety of images, including the problem of mosaic image generation. Experimental results show that our method is robust to outliers (incorrect matches) and achieves very accurate registration results (numeric and visual) while running much faster (up to 20 times) than a CPU implementation.
"GPU Based Robust Image Registration for Composite Translational, Rotational and Scale Transformations," S. Dinç, R. S. Aygün, F. Fahimi. 2015 IEEE International Symposium on Multimedia (ISM), December 2015. DOI: 10.1109/ISM.2015.51 (https://doi.org/10.1109/ISM.2015.51).
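The least-squares step for a composite translation/rotation/scale (similarity) transform can be linearized and solved in closed form; this CPU sketch shows the refinement stage only, with the Hough voting and GPU parallelism omitted.

```python
import numpy as np

def fit_similarity(src, dst):
    """Least-squares similarity transform (scale s, rotation t, translation)
    mapping src -> dst, linearized as x' = a*x - b*y + tx, y' = b*x + a*y + ty
    with a = s*cos(t), b = s*sin(t)."""
    A, y = [], []
    for (x, yv), (xp, yp) in zip(src, dst):
        A.append([x, -yv, 1, 0]); y.append(xp)
        A.append([yv,  x, 0, 1]); y.append(yp)
    a, b, tx, ty = np.linalg.lstsq(np.asarray(A, float),
                                   np.asarray(y, float), rcond=None)[0]
    return np.hypot(a, b), np.arctan2(b, a), (tx, ty)

# synthetic check: rotate 30 degrees, scale 2, translate (5, -3)
theta = np.pi / 6
R = np.array([[np.cos(theta), -np.sin(theta)],
              [np.sin(theta),  np.cos(theta)]])
src = np.array([[0., 0.], [1., 0.], [0., 1.], [2., 3.]])
dst = src @ (2.0 * R).T + np.array([5.0, -3.0])
scale, angle, trans = fit_similarity(src, dst)
print(round(scale, 3), round(np.degrees(angle), 1))   # ~2.0 and ~30.0
```

Because each hypothesized point-pair combination yields an independent system of this form, the exhaustive evaluation the paper describes maps naturally onto one GPU thread per combination.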
C. Beecks, Marwan Hassani, Florian Obeloer, T. Seidl
One of the most fundamental challenges when accessing gestural patterns in 3D motion capture databases is the definition of spatiotemporal similarity. While distance-based similarity models such as the Gesture Matching Distance on gesture signatures can leverage the spatial and temporal characteristics of gestural patterns, their applicability to large 3D motion capture databases is limited by their high computational complexity. To this end, we present a lower-bound approximation of the Gesture Matching Distance that can be used in an optimal multi-step architecture for efficient query processing. We investigate accuracy and efficiency on 3D motion capture databases and show that our approach achieves an efficiency gain of more than one order of magnitude with a negligible loss in accuracy.
"Efficient Query Processing in 3D Motion Capture Databases via Lower Bound Approximation of the Gesture Matching Distance," C. Beecks, Marwan Hassani, Florian Obeloer, T. Seidl. 2015 IEEE International Symposium on Multimedia (ISM), December 2015. DOI: 10.1109/ISM.2015.86 (https://doi.org/10.1109/ISM.2015.86).
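The optimal multi-step (filter-and-refine) idea is independent of the particular distance: rank candidates by the cheap lower bound and stop computing exact distances once the bound exceeds the current k-th best, which provably avoids false dismissals. A generic sketch, with a toy geometric example standing in for gesture signatures:

```python
import heapq, math

def multi_step_knn(query, database, lower_bound, exact_dist, k=1):
    """Optimal multi-step kNN: requires lower_bound(q, o) <= exact_dist(q, o)
    for every object o; objects are pruned once the bound alone rules them out."""
    candidates = sorted(database, key=lambda o: lower_bound(query, o))
    result = []                            # max-heap via negated distances
    for obj in candidates:
        if len(result) == k and lower_bound(query, obj) > -result[0][0]:
            break                          # bound prunes the rest of the list
        heapq.heappush(result, (-exact_dist(query, obj), obj))
        if len(result) > k:
            heapq.heappop(result)          # drop the current worst
    return sorted((-d, o) for d, o in result)

# toy stand-in: 2-D points with Euclidean exact distance and the x-axis
# projection |dx| as a valid (never larger) lower bound
db = [(0, 0), (1, 5), (2, 1), (9, 9)]
lb = lambda q, o: abs(q[0] - o[0])
ed = lambda q, o: math.hypot(q[0] - o[0], q[1] - o[1])
print(multi_step_knn((2, 0), db, lb, ed, k=1))   # -> [(1.0, (2, 1))]
```

In the paper's setting, `exact_dist` would be the Gesture Matching Distance and `lower_bound` its cheap approximation; the order-of-magnitude speedup comes from how rarely the expensive distance is evaluated.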
This paper proposes a concept for exploring virtual worlds using a pico projector. To demonstrate the concept, a game called Gone Fishing was created on top of the vuSpot infrastructure. In Gone Fishing, a mobile device projects a virtual seabed over a room. Each user owns an aquarium; fish, flora and decorations can be collected and exchanged, and fish must be pursued to be captured. Pico projectors are becoming popular, and more applications are needed. We intend to create applications that use pico projectors and mobile devices to play games and explore spaces using vuSpot, an infrastructure adaptable to a space that uses existing video camera networks to support interactive experiences. With this game we aim to explore the application of pico projectors in gaming, investigating new forms of interaction and social experience with mobile devices and pico projectors.
"Using Pico Projectors to Explore Other Worlds," Rossana Santos, N. Correia. 2015 IEEE International Symposium on Multimedia (ISM), December 2015. DOI: 10.1109/ISM.2015.59 (https://doi.org/10.1109/ISM.2015.59).
Luiz Olmes Carvalho, Lúcio F. D. Santos, Willian D. Oliveira, A. Traina, C. Traina
Near-duplicate image detection plays an important role in several real applications. The task is usually accomplished by applying a clustering algorithm followed by refinement steps, a computationally expensive process. In this paper we introduce a framework based on a novel similarity join operator that both replaces and speeds up the clustering step while also removing the need for further refinement. It is based on absolute and relative similarity ratios, ensuring that the top-ranked image pairs are in the final result. Experiments on real datasets show that our proposal is up to three orders of magnitude faster than the best techniques in the literature, while always returning a high-quality result set.
"Self Similarity Wide-Joins for Near-Duplicate Image Detection," Luiz Olmes Carvalho, Lúcio F. D. Santos, Willian D. Oliveira, A. Traina, C. Traina. 2015 IEEE International Symposium on Multimedia (ISM), December 2015. DOI: 10.1109/ISM.2015.114 (https://doi.org/10.1109/ISM.2015.114).
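The combination of absolute and relative criteria can be sketched as follows. The exact operator definition is in the paper; here the thresholds `xi`/`rho` and the rule "within the absolute threshold AND within a factor of either element's best match" are assumptions made for illustration.

```python
import itertools

def wide_join(items, dist, xi=0.3, rho=1.5):
    """Sketch of a self similarity wide-join: keep a pair when its distance is
    within the absolute threshold xi AND within a factor rho of either
    element's best match, so only each item's near-best neighbours survive
    and near-duplicate pairs fall out without a separate clustering step."""
    best = {i: min(dist(items[i], items[j])
                   for j in range(len(items)) if j != i)
            for i in range(len(items))}
    pairs = []
    for i, j in itertools.combinations(range(len(items)), 2):
        d = dist(items[i], items[j])
        if d <= xi and (d <= rho * best[i] or d <= rho * best[j]):
            pairs.append((i, j, d))
    return pairs

# 1-D toy "features": two near-duplicate groups plus one outlier; only the
# intra-group pairs (0,1) and (2,3) should survive
items = [0.0, 0.05, 1.0, 1.02, 5.0]
print(wide_join(items, lambda a, b: abs(a - b)))
```

Computing the join in a single pass over candidate pairs is what lets the operator replace the clustering-plus-refinement pipeline the abstract describes.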