2011 IEEE International Symposium on Multimedia最新文献

英文中文

A Low Memory Requirements Execution Flow for the Non-Uniform Grid Projection Super-Resolution Algorithm 一种低内存要求的非均匀网格投影超分辨率算法执行流程

2011 IEEE International Symposium on Multimedia

Pub Date : 2011-12-05 DOI: 10.1109/ISM.2011.22

T. Szydzik, G. Callicó, A. Núñez

In this work we present a novel execution flow for the super-resolution image restoration (SRIR) non-uniform grid projection algorithm -- the macroblock-level flow. The novel flow is compared with the reference frame-level flow. The frame-level flow is characterized by the fact that transitions from one step of the algorithm to another occur only after the current step is carried out for all macro blocks (MBs) of the frame being currently processed. The novel flow carries out complete processing of one MB before the processing of another MB starts. The memory requirements of both schemes are evaluated in detail and compared. The study on the achievable memory reduction in total memory requirements was carried out for different values of the algorithm parameters: the MB size, scale factor, search area size and number of reference frames included in the sliding frame window. The results show quantitatively that the parameter that influences storage instantiation the most and has the greatest influence on the total memory size is the number of reference frames in the sliding frame window. The conducted study shows that, for a QCIF frame format, switching from frame-to macroblock-level is feasible and fully validated functionally and that the new execution flow can lead to memory reduction by a factor of 6.8 to 40, depending on the algorithm parameters values. Memory reduction greatly facilitates hardware implementations of the algorithm and this is the main result claimed. But the reduction in memory size comes at the cost of increasing the number of memory accesses and therefore communications traffic. The increase noted in memory accesses it to be quantified in future work as well as the potential impact on power consumption. The reduction in memory size might also make it fit on chip without turning to external memory, thereby reducing power consumption. This trade off in power is yet to be quantified.

在这项工作中，我们提出了一种新的超分辨率图像恢复(SRIR)非均匀网格投影算法的执行流程——宏块级流程。将该流程与参考帧级流程进行了比较。帧级流的特点是，只有对当前正在处理的帧的所有宏块(mb)执行当前步骤之后，才会发生从算法的一个步骤到另一个步骤的转换。新流程在开始处理另一个MB之前完成一个MB的处理。对两种方案的内存需求进行了详细的评估和比较。对滑动帧窗口中包含的MB大小、比例因子、搜索区域大小和参考帧数量等不同的算法参数值，进行了在总内存需求方面可实现的内存缩减研究。结果定量地表明，对存储实例化影响最大、对总内存大小影响最大的参数是滑动帧窗口中的参考帧数。所进行的研究表明，对于QCIF帧格式，从帧级切换到宏块级是可行的，并且在功能上得到了充分验证，并且根据算法参数值的不同，新的执行流程可以使内存减少6.8到40倍。内存减少极大地促进了算法的硬件实现，这是所声称的主要结果。但是内存大小的减小是以增加内存访问次数和通信流量为代价的。内存的增加将在未来的工作中进行量化，并对功耗产生潜在影响。内存大小的减小也可能使其适合芯片而无需转向外部存储器，从而降低功耗。这种权力上的权衡还有待量化。

{"title":"A Low Memory Requirements Execution Flow for the Non-Uniform Grid Projection Super-Resolution Algorithm","authors":"T. Szydzik, G. Callicó, A. Núñez","doi":"10.1109/ISM.2011.22","DOIUrl":"https://doi.org/10.1109/ISM.2011.22","url":null,"abstract":"In this work we present a novel execution flow for the super-resolution image restoration (SRIR) non-uniform grid projection algorithm -- the macroblock-level flow. The novel flow is compared with the reference frame-level flow. The frame-level flow is characterized by the fact that transitions from one step of the algorithm to another occur only after the current step is carried out for all macro blocks (MBs) of the frame being currently processed. The novel flow carries out complete processing of one MB before the processing of another MB starts. The memory requirements of both schemes are evaluated in detail and compared. The study on the achievable memory reduction in total memory requirements was carried out for different values of the algorithm parameters: the MB size, scale factor, search area size and number of reference frames included in the sliding frame window. The results show quantitatively that the parameter that influences storage instantiation the most and has the greatest influence on the total memory size is the number of reference frames in the sliding frame window. The conducted study shows that, for a QCIF frame format, switching from frame-to macroblock-level is feasible and fully validated functionally and that the new execution flow can lead to memory reduction by a factor of 6.8 to 40, depending on the algorithm parameters values. Memory reduction greatly facilitates hardware implementations of the algorithm and this is the main result claimed. But the reduction in memory size comes at the cost of increasing the number of memory accesses and therefore communications traffic. The increase noted in memory accesses it to be quantified in future work as well as the potential impact on power consumption. The reduction in memory size might also make it fit on chip without turning to external memory, thereby reducing power consumption. This trade off in power is yet to be quantified.","PeriodicalId":339410,"journal":{"name":"2011 IEEE International Symposium on Multimedia","volume":"6 3","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114122751","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 1

3D Image Browsing on Mobile Devices 移动设备上的3D图像浏览

2011 IEEE International Symposium on Multimedia

Pub Date : 2011-12-05 DOI: 10.1109/ISM.2011.60

Klaus Schöffmann, David Ahlström, C. Beecks

We present an intuitive user interface for the exploration of images on mobile multi-touch devices. Our interface uses a novel cylindrical 3D visualization of visually sorted images as well as touch gestures and tilting operations to support mobile users in interactive browsing of images by providing convenient navigation/interaction and intuitive visualization capabilities.

我们提出了一个直观的用户界面，用于探索移动多点触摸设备上的图像。我们的界面采用新颖的圆柱形3D可视化视觉分类图像，以及触摸手势和倾斜操作，通过提供方便的导航/交互和直观的可视化功能，支持移动用户交互式浏览图像。

引用次数: 16

Perceptually-Driven Scalable MDCT Enhancement of Compressed Audio Based on Statistical Conversion 基于统计转换的压缩音频感知驱动可扩展MDCT增强

2011 IEEE International Symposium on Multimedia

Pub Date : 2011-12-05 DOI: 10.1109/ISM.2011.16

D. Cantzos, A. Mouchtaris, C. Kyriakakis

Many state-of-the-art audio codecs operating in a transform domain provide scalability as a core function by allowing to selectively subtract bits -- usually according to a nonperceptual criterion from the full bit rate data stream. This work presents a different, or even reverse, scalability approach in which a scalable codec can selectively add perceptually significant bits to a low bit rate data stream. The scalable enhancement algorithm presented here operates in the Modified Discrete Cosine Transform domain, which is popular among perceptual audio transform encoders, but its extension on other domains is straightforward. By exploiting the information of an existing low bit rate base layer, the algorithm adds perceptually significant data to the data stream according to a psycho acoustic model, and improves the audio quality at a fraction of the bit rate that would normally be required for the encoding or transmission of the whole audio piece of the same quality. Applications of this can be found in packet retransmission schemes of compressed audio networks and in remote audio enhancement.

许多在变换域中运行的最先进的音频编解码器通过允许选择性地减去比特(通常根据全比特率数据流的非感知标准)作为核心功能提供可扩展性。这项工作提出了一种不同的，甚至是相反的可扩展性方法，在这种方法中，可扩展的编解码器可以选择性地向低比特率数据流添加具有感知意义的比特。本文提出的可扩展增强算法在改进离散余弦变换域运行，这在感知音频变换编码器中很流行，但它在其他领域的扩展是直接的。通过利用现有的低比特率基础层的信息，该算法根据心理声学模型向数据流中添加感知上重要的数据，并以通常编码或传输相同质量的整个音频片段所需的比特率的一小部分提高音频质量。这种方法可以应用于压缩音频网络的分组重传方案和远程音频增强。

引用次数: 1

Exploiting of Flickr Note and its Applications for Social Image Sharing and Search Flickr Note的开发及其在社交图片分享和搜索中的应用

2011 IEEE International Symposium on Multimedia

Pub Date : 2011-12-05 DOI: 10.1109/ISM.2011.34

Jin-Woo Jeong, Hyun-Ki Hong, Dong-Ho Lee

In this paper, we present analytical information about Flickr notes and propose further directions of note based image search. Compared to a tag that is used for traditional social image search, Flickr note is a kind of text directly assigned on the image regions. Even though note has various information that may help intelligent social image sharing and search, there is no significant research that focuses on the potential and the impact of note for image search. In order to reveal the useful information and potential of Flickr notes, we have collected a number of images and analyzed them with regard to various aspects. Additionally, from the analytical results about Flickr notes, we show various possible research issues to which note information can be applied.

在本文中，我们提供了关于Flickr笔记的分析信息，并提出了基于笔记的图像搜索的进一步方向。与传统社交图片搜索使用的标签相比，Flickr注释是一种直接分配在图片区域上的文本。尽管笔记有各种各样的信息可以帮助智能社交图像共享和搜索，但没有重要的研究关注笔记对图像搜索的潜力和影响。为了揭示Flickr笔记的有用信息和潜力，我们收集了一些图片，并从各个方面进行了分析。此外，从Flickr笔记的分析结果中，我们展示了可以应用笔记信息的各种可能的研究问题。

引用次数: 3

A Visual Analytics Multimedia Mobile System for Emergency Response 面向应急响应的可视化分析多媒体移动系统

2011 IEEE International Symposium on Multimedia

Pub Date : 2011-12-05 DOI: 10.1109/ISM.2011.61

Steven Luis, Fausto Fleites, Yimin Yang, Hsin-Yu Ha, Shu‐Ching Chen

We present a novel visual analytics system and multimedia enabled mobile application that allows emergency management (EM) personnel access to timely and relevant disaster situation information. The system is able to semantically integrate text-based emergency management disaster situation reports with related disaster imagery taken in the field by EM responders and community residents. In addition, through an intuitive and seamless Apple iPad application, users are able to interact with the system in diverse places and conditions and thus provide a more effective response. The system is demonstrated via its iPad application which aims at providing relevant and actionable information.

我们提出了一种新颖的可视化分析系统和多媒体移动应用程序，使应急管理人员能够及时获取相关的灾害情况信息。该系统能够将基于文本的应急管理灾害情况报告与EM响应者和社区居民在现场拍摄的相关灾害图像在语义上集成。此外，通过直观无缝的Apple iPad应用程序，用户可以在不同的地点和条件下与系统进行交互，从而提供更有效的响应。该系统通过其iPad应用程序进行演示，旨在提供相关和可操作的信息。

引用次数: 12

Library of Labs - A European Project on the Dissemination of Remote Experiments and Virtual Laboratories 实验室图书馆-远程实验和虚拟实验室传播的欧洲项目

2011 IEEE International Symposium on Multimedia

Pub Date : 2011-12-05 DOI: 10.1109/ISM.2011.96

T. Richter, Yvonne Tetour, D. Boehringer

In this paper, we provide background information on the EC funded Lila Project ("Library of Labs"), describe its goals and purposes, provide some insight into its software design and provide first experiences, made at the University of Stuttgart using the eLearning content deployed by the project.

在本文中，我们提供了欧共体资助的Lila项目(“实验室图书馆”)的背景信息，描述了它的目标和目的，提供了一些对其软件设计的见解，并提供了在斯图加特大学使用该项目部署的电子学习内容的第一次体验。

引用次数: 35

RFID-based Solutions for User Profiling in Interactive Exhibits 交互式展览中基于rfid的用户分析解决方案

2011 IEEE International Symposium on Multimedia

Pub Date : 2011-12-05 DOI: 10.1109/ISM.2011.42

Gianpaolo D'Amico, A. Bimbo, Andrea Ferracani, Lea Landucci, Daniele Pezzatini, Luca Santi

In this paper we present a work-in-progress interactive exhibit for the museum of Onna, a little town near to L'Aquila (Italy), almost completely destroyed by the earthquake of April 2009. The installation will be developed as an environment in which visitors of the museum can interact with a natural interaction system and then discover the history of the disaster via rich multimedia contents. Visitors are detected through the adoption of an RFID-based technology, which allows to store their interaction history and build an interest profile used to enrich the experience. Different scenarios have been implemented and tested in order to evaluate the effectiveness of the proposed solution.

在本文中，我们为意大利拉奎拉(L'Aquila)附近的小镇Onna博物馆展示了一个正在进行中的互动展览，这个小镇在2009年4月的地震中几乎被完全摧毁。该装置将被开发成一个环境，在这个环境中，博物馆的参观者可以与一个自然的互动系统互动，然后通过丰富的多媒体内容发现灾难的历史。通过采用基于rfid的技术来检测访客，该技术允许存储他们的互动历史并建立兴趣档案，以丰富体验。为了评估所建议的解决方案的有效性，已经实现和测试了不同的场景。

引用次数: 2

PicoLife: A Computer Vision-based Gesture Recognition and 3D Gaming System for Android Mobile Devices PicoLife:基于计算机视觉的Android移动设备手势识别和3D游戏系统

2011 IEEE International Symposium on Multimedia

Pub Date : 2011-12-05 DOI: 10.1109/ISM.2011.13

Mahesh Babu Mariappan, X. Guo, B. Prabhakaran

Pico Life is envisioned to be an augmented reality game in which 3D characters will be controlled by hand gestures on Android smart phones. Pico Life is currently powered by two mobile optimized engines: (1) The computer vision engine that runs our advanced object tracking program for hand tracking and (2) The 3D engine that runs our 3D models for the characters in the game. In the near future, we will be adding yet another mobile optimized engine, namely, the augmented reality engine. In this paper, we will present our work on object tracking and 3D modeling for Pico Life and contrast the performances of the two engines on three different mobile platforms, namely, Texas Instruments' OMAP3630 (Motorola Droid X running Android Gingerbread), Qualcomm's MSM8660 Snapdragon (HTC Evo 3D running Android Gingerbread) and the Texas Instruments' OMAP4430 (Blaze Development platform running Android Gingerbread).

Pico Life被设想为一款增强现实游戏，其中3D角色将通过Android智能手机上的手势控制。《Pico Life》目前由两个移动优化引擎提供支持:(1)运行我们用于手部追踪的高级目标追踪程序的计算机视觉引擎，以及(2)运行游戏中角色的3D模型的3D引擎。在不久的将来，我们将添加另一个移动优化引擎，即增强现实引擎。在本文中，我们将介绍我们在Pico Life的目标跟踪和3D建模方面的工作，并对比两种引擎在三个不同的移动平台上的性能，即德州仪器的OMAP3630(运行Android Gingerbread的摩托罗拉Droid X)，高通的MSM8660骁龙(运行Android Gingerbread的HTC Evo 3D)和德州仪器的OMAP4430(运行Android Gingerbread的Blaze开发平台)。

引用次数: 6

On the Properties of Mean Opinion Scores for Quality of Experience Management 体验管理质量平均意见分数的性质研究

2011 IEEE International Symposium on Multimedia

Pub Date : 2011-12-05 DOI: 10.1109/ISM.2011.88

Jie Xu, Liyuan Xing, A. Perkis, Yuming Jiang

For research on quality of experience (QoE), mean opinion scores (MOS) are widely chosen as the results of subjective tests and the ground-truth reference for further research on objective quality modeling. Furthermore, the results of objective quality modeling are used for QoE management subsequently. Therefore, the performance of QoE management process actually depends heavily on MOS. However, the rationality of MOS for QoE management is not yet technically proven in the literature. In this paper, we first prove that subject homogeneity is implicitly assumed for obtaining MOS by modeling the arithmetic averaging process from a systematic viewpoint. However, we point out that actually subjects exhibit variability in terms of quality assessment. Then we elaborate that this mismatch may results in failures if we conduct QoE management based on MOS. Finally we propose a utility-based averaging method (uMOS) which improves the performance of QoE management.

在经验质量(QoE)的研究中，普遍采用平均意见分数(MOS)作为主观检验的结果和进一步研究客观质量模型的基础真值参考。在此基础上，将客观质量建模的结果用于质量质量评价(QoE)的管理。因此，质量质量管理过程的绩效实际上在很大程度上取决于质量管理体系。然而，在文献中，质量质量管理的合理性尚未得到技术上的证实。本文首先从系统的角度对算法平均过程进行建模，证明了主体均匀性是获得MOS的隐式假设条件。然而，我们指出，实际上受试者在质量评估方面表现出可变性。然后详细阐述了这种不匹配可能导致基于MOS的QoE管理失败。最后提出了一种基于效用的平均方法(uMOS)，提高了QoE管理的性能。

引用次数: 27

What Cooks Needs from Multimedia and Textually Enhanced Recipes 什么厨师需要从多媒体和文本增强食谱

2011 IEEE International Symposium on Multimedia

Pub Date : 2011-12-05 DOI: 10.1109/ISM.2011.70

Lucy Buykx, H. Petrie

Using recipes in a step-by-step format with multimedia enhancements has been found to increase confidence and enjoyment of cooking but the field lacks research with cooks on the problems they encounter, so it is unclear what granularity of recipe step and associated multimedia would best support them. The current study observed 16 cooks prepare 3 dishes using recipes in 3 different formats to understand what problems cooks have with recipes. Recipe format had a significant effect on the ratings given to the recipe for clarity and ease of use but not on time to complete the recipe. Analysis of cooking activity and cooks' feedback shows that cooks want (i) step-by-step recipes with ingredient quantities in the recipe step, (ii) pictures of the interim states of the recipe, (iii) videos of preparation of unfamiliar ingredients, and (iv) videos of preparation techniques with different types of utensils.

人们发现，使用带有多媒体增强功能的分步食谱可以增加烹饪的信心和乐趣，但该领域缺乏对厨师遇到的问题的研究，因此不清楚食谱步骤和相关多媒体的粒度如何才能最好地支持他们。目前的研究观察了16名厨师用3种不同格式的食谱准备3道菜，以了解厨师在食谱上遇到的问题。配方格式对配方的清晰度和易用性评分有显著影响，但对完成配方的时间没有影响。对烹饪活动和厨师反馈的分析表明，厨师想要的是(i)一步一步的食谱，以及食谱步骤中配料的数量，(ii)食谱中间状态的图片，(iii)制作不熟悉的食材的视频，以及(iv)使用不同类型的器具制作技术的视频。

引用次数: 13

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2011 IEEE International Symposium on Multimedia

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀