Helping people with cognitive dysfunctions improve the social skills they need for independent living is an important issue. This paper focuses on their everyday cooking activities and proposes a cooperative cooking navigation system that supports their social skills training. The system consists of two components: cooperative behavior expression support and cooperative behavior evaluation support. We evaluate the results of experiments in which the proposed system was applied to patients with cognitive dysfunctions and identify the essential conditions for the system to work well.
{"title":"Social Skills Training Support of Cognitive Dysfunctions by Cooperative Cooking Navigation System","authors":"Kenzaburo Miyawaki, Mutsuo Sano, Syunichi Yonemura, M. Ode","doi":"10.1109/ISM.2011.73","DOIUrl":"https://doi.org/10.1109/ISM.2011.73","url":null,"abstract":"We have an important issue that the people with cognitive dysfunctions should improve social skills for self supporting. This paper notices their fundamental cooking activities and proposes a cooperative cooking navigation system supporting their social skills training. We have econstructed this system to be composed of cooperative behavior expression support and cooperative behavior evaluation support. We evaluate the experimental results of applying our proposed system to patients with cognitive dysfunctions and extract the essential conditions for working this system well.","PeriodicalId":339410,"journal":{"name":"2011 IEEE International Symposium on Multimedia","volume":"71 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123833452","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Real-time creation of video mosaics needs fast and accurate motion computation. While most mosaicing methods can use 2D image motion, the creation of multi-view stereo mosaics needs more accurate 3D motion computation. Fast and accurate computation of 3D motion is challenging for unstabilized cameras moving in 3D scenes, which is always the case when stereo mosaics are used. Efficient blending of the mosaic strips is also essential. Most cases of stereo mosaicing satisfy the assumption of limited camera motion, with no forward motion and no change in internal parameters. Under these assumptions, uniform sideways motion creates straight epipolar lines. When the 3D motion is computed correctly, images can be aligned in a space-time volume to give straight epipolar lines, a method which is depth invariant. We propose to align the video sequence in a space-time volume based on efficient feature tracking; in this paper we use kernel tracking. Computation is fast because the motion is computed only for a few regions of the image, yet it yields accurate 3D motion, and it is both faster and more accurate than the previously used direct approach. We also present "Barcode Blending", a new and very efficient approach to pyramid blending in video mosaics. Barcode Blending avoids the complexity of building pyramids for multiple narrow strips by combining all strips in a single blending step. The entire stereo mosaicing process is highly efficient in computation and memory, and can be performed on mobile devices.
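The alignment step lends itself to a simple illustration. The sketch below estimates the sideways displacement between consecutive frames from a handful of tracked regions; it uses OpenCV's pyramidal Lucas-Kanade tracker as a stand-in for the kernel tracking used in the paper, and all function and parameter names are ours rather than the authors'.

```python
# Minimal sketch: per-frame horizontal displacement from a few tracked regions,
# as a stand-in for the paper's kernel tracking. All parameters are illustrative.
import cv2
import numpy as np

def estimate_displacements(frames, max_corners=50):
    """Return the median horizontal shift between consecutive BGR frames."""
    displacements = []
    prev_gray = cv2.cvtColor(frames[0], cv2.COLOR_BGR2GRAY)
    prev_pts = cv2.goodFeaturesToTrack(prev_gray, maxCorners=max_corners,
                                       qualityLevel=0.01, minDistance=10)
    for frame in frames[1:]:
        if prev_pts is None or len(prev_pts) == 0:
            break  # lost all tracked regions; a real system would re-detect
        gray = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)
        next_pts, status, _ = cv2.calcOpticalFlowPyrLK(prev_gray, gray,
                                                       prev_pts, None)
        good_new = next_pts[status.ravel() == 1]
        good_old = prev_pts[status.ravel() == 1]
        # Median of the x-shifts is robust to a few badly tracked regions.
        dx = np.median(good_new[:, 0, 0] - good_old[:, 0, 0]) if len(good_new) else 0.0
        displacements.append(float(dx))
        prev_gray, prev_pts = gray, good_new.reshape(-1, 1, 2)
    return displacements
```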
{"title":"Real-Time Stereo Mosaicing Using Feature Tracking","authors":"Marc Vivet, Shmuel Peleg, Xavier Binefa","doi":"10.1109/ISM.2011.102","DOIUrl":"https://doi.org/10.1109/ISM.2011.102","url":null,"abstract":"Real-time creation of video mosaics needs fast and accurate motion computation. While most mosaicing methods can use 2D image motion, the creation of multi view stereo mosaics needs more accurate 3D motion computation. Fast and accurate computation of 3D motion is challenging in the case of unstabilized cameras moving in 3D scenes, which is always the case when stereo mosaics are used. Efficient blending of the mosaic strip is also essential. Most cases of stereo mosaicing satisfy the assumption of limited camera motion, with no forward motion and no change in internal parameters. Under these assumptions uniform sideways motion creates straight epipolar lines. When the 3D motion is computed correctly, images can be aligned in space-time volume to give straight epipolar lines, a method which is depth invariant. We propose to align the video sequence in a space-time volume based on efficient feature tracking, and in this paper we used Kernel Tracking. Computation is fast as the motion in computed only for a few regions of the image, yet giving accurate 3D motion. This computation is faster and more accurate than the previously used direct approach. We also present \"Barcode Blending\", a new approach for using pyramid blending in video mosaics, which is very efficient. Barcode Blending overcomes the complexity of building pyramids for multiple narrow strips, combining all strips in a single blending step. The entire stereo mosaicing process is highly efficient in computation and in memory, and can be performed on mobile devices.","PeriodicalId":339410,"journal":{"name":"2011 IEEE International Symposium on Multimedia","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114883496","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
This paper presents the AH+-tree, a balanced, tree-based index structure that efficiently supports Content-Based Image Retrieval (CBIR) through similarity queries. The proposed index structure addresses the problems of semantic gap and user subjectivity by considering the high-level semantics of multimedia data during the retrieval process. The AH+-tree provides the same functionality as the Affinity-Hybrid Tree (AH-Tree) but utilizes the high-level semantics in a novel way to eliminate the I/O overhead incurred by the AH-Tree due to the process of affinity propagation, which requires a complete traversal of the tree. The novel structure of the tree is explained, and detailed range and nearest neighbor algorithms are implemented and analyzed. Extensive discussions and experiments demonstrate the superior efficiency of the AH+-tree over the AH-Tree and the M-tree. Results show the AH+-tree significantly reduces I/O cost during similarity searches. The I/O efficiency of the AH+-tree and its ability to incorporate high-level semantics from different machine learning mechanisms make the AH+-tree a promising index access method for large multimedia databases.
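The AH+-tree structure itself is not reproduced here, but the idea it indexes, ranking candidates by a low-level feature distance modulated by a high-level semantic affinity, can be illustrated with a flat, non-indexed search. The weighting scheme and names below are illustrative assumptions, not the paper's algorithm.

```python
# Illustrative only: fuse low-level feature distance with high-level affinity.
import numpy as np

def affinity_aware_knn(query_vec, features, affinity, k=5, alpha=0.5):
    """Rank database items by a blend of normalized feature distance and
    semantic affinity to the query's concept.

    features : (N, D) array of low-level feature vectors.
    affinity : (N,) array in [0, 1], e.g. learned from relevance feedback.
    alpha    : weight of the low-level distance term vs. the semantic term.
    """
    dists = np.linalg.norm(features - query_vec, axis=1)
    dists = dists / (dists.max() + 1e-9)           # normalize distances to [0, 1]
    score = alpha * dists + (1.0 - alpha) * (1.0 - affinity)
    return np.argsort(score)[:k]                   # indices of the k best items
```

An index such as the AH+-tree exists precisely to avoid this linear scan while producing the same kind of affinity-aware ranking with far fewer I/O operations.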
{"title":"AH+-Tree: An Efficient Multimedia Indexing Structure for Similarity Queries","authors":"Fausto Fleites, Shu‐Ching Chen, Kasturi Chatterjee","doi":"10.1109/ISM.2011.20","DOIUrl":"https://doi.org/10.1109/ISM.2011.20","url":null,"abstract":"This paper presents the AH+-tree, a balanced, tree-based index structure that efficiently supports Content-Based Image Retrieval (CBIR) through similarity queries. The proposed index structure addresses the problems of semantic gap and user subjectivity by considering the high-level semantics of multimedia data during the retrieval process. The AH+-tree provides the same functionality as the Affinity-Hybrid Tree (AH-Tree) but utilizes the high-level semantics in a novel way to eliminate the I/O overhead incurred by the AH-Tree due to the process of affinity propagation, which requires a complete traversal of the tree. The novel structure of the tree is explained, and detailed range and nearest neighbor algorithms are implemented and analyzed. Extensive discussions and experiments demonstrate the superior efficiency of the AH+-tree over the AH-Tree and the M-tree. Results show the AH+-tree significantly reduces I/O cost during similarity searches. The I/O efficiency of the AH+-tree and its ability to incorporate high-level semantics from different machine learning mechanisms make the AH+-tree a promising index access method for large multimedia databases.","PeriodicalId":339410,"journal":{"name":"2011 IEEE International Symposium on Multimedia","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129573197","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
This paper presents a novel adaptive pattern-driven approach for compressing large-area high-resolution terrain data. Using a pattern-driven model, the proposed approach achieves efficient terrain data reduction by modeling and encoding disparate visual patterns with a compact set of extracted features. The feasibility and efficiency of the proposed technique are corroborated by experiments on various terrain datasets and comparisons with state-of-the-art compression techniques. Since different visual patterns are separated and modeled explicitly during compression, the proposed technique also holds great potential for synergy between compression and compressed-domain analysis.
{"title":"Adaptive Pattern-driven Compression of Large-Area High-Resolution Terrain Data","authors":"Hai Wei, S. Zabuawala, Lei Zhang, Jiejie Zhu, J. Yadegar, J. D. Cruz, Hector J. Gonzalez","doi":"10.1109/ISM.2011.62","DOIUrl":"https://doi.org/10.1109/ISM.2011.62","url":null,"abstract":"This paper presents a novel adaptive pattern-driven approach for compressing large-area high-resolution terrain data. Utilizing a pattern-driven model, the proposed approach achieves efficient terrain data reduction by modeling and encoding disparate visual patterns using a compact set of extracted features. The feasibility and efficiency of the proposed technique were corroborated by experiments using various terrain datasets and comparisons with the state-of-the-art compression techniques. Since different visual patterns are separated and modeled explicitly during the compression process, the proposed technique also holds a great potential for providing a good synergy between compression and compressed-domain analysis.","PeriodicalId":339410,"journal":{"name":"2011 IEEE International Symposium on Multimedia","volume":"303 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123045106","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
This work aims to build a recognition system for a software engine that automatically generates a quiz from video content and reinserts it into the video, thus turning any available foreign-language video (such as news or TV series) into a remarkable learning tool. Our system includes a face tracking application that integrates the eigenface method with a temporal tracking approach. The main part of our work is to detect and identify faces in movies and to associate specific quizzes with each recognized character. The proposed approach labels the detected faces and maintains face tracking along the video stream. This task is challenging because characters show significant variation in their appearance. Therefore, we employ eigenfaces to reconstruct the original image from training models, and we developed a new technique based on frame buffering for continuous tracking under unfavorable environmental conditions. Many tests were conducted and showed that our system is able to identify multiple characters. The results demonstrate the performance and effectiveness of the proposed method.
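As a rough illustration of the eigenface component described above (the frame-buffering tracker is not shown), the sketch below builds an eigenface subspace with a plain SVD and identifies a probe face by nearest projection; the names and the rejection threshold are our own assumptions.

```python
# Minimal eigenface sketch: train a PCA subspace, then identify by nearest
# projection in that subspace. Illustrative, not the authors' implementation.
import numpy as np

def train_eigenfaces(faces, num_components=20):
    """faces: (N, H*W) matrix of flattened grayscale training faces."""
    mean = faces.mean(axis=0)
    centered = faces - mean
    # SVD of the centered data yields the principal directions (eigenfaces).
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    eigenfaces = vt[:num_components]               # (K, H*W)
    coeffs = centered @ eigenfaces.T               # training projections (N, K)
    return mean, eigenfaces, coeffs

def identify(face, mean, eigenfaces, coeffs, labels, max_dist=None):
    """Project a probe face and return the label of the nearest training face,
    or None (unknown character) if the distance exceeds max_dist."""
    proj = (face - mean) @ eigenfaces.T
    dists = np.linalg.norm(coeffs - proj, axis=1)
    best = int(np.argmin(dists))
    if max_dist is not None and dists[best] > max_dist:
        return None
    return labels[best]
```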
{"title":"Characters Identification in TV Series","authors":"Madjid Maidi, Veronica Scurtu, M. Preda","doi":"10.1109/ISM.2011.31","DOIUrl":"https://doi.org/10.1109/ISM.2011.31","url":null,"abstract":"This work aims to realize a recognition system for a software engine that will automatically generate a quiz starting from a video content and reinsert it into the video, turning thus any available foreign-language video (such as news or TV series) into a remarkable learning tool. Our system includes a face tracking application which integrates the eigen face method with a temporal tracking approach. The main part of our work is to detect and identify faces from movies and to associate specific quizzes for each recognized character. The proposed approach allows to label the detected faces and maintains face tracking along the video stream. This task is challenging since characters present significant variation in their appearance. Therefore, we employed eigen faces to reconstruct the original image from training models and we developed a new technique based on frames buffering for continuous tracking in unfavorable environment conditions. Many tests were conducted and proved that our system is able to identify multiple characters. The obtained results showed the performance and the effectiveness of the proposed method.","PeriodicalId":339410,"journal":{"name":"2011 IEEE International Symposium on Multimedia","volume":"619 ","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120869721","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
This paper presents a novel method for shot boundary detection that optimizes traditional scoring-based metrics using a genetic algorithm search heuristic. The advantage of this approach is that it detects shots without requiring the direct use of thresholds. The methodology is described using the edge-change ratio metric and applied to several test video segments from the TREC 2002 video track and contemporary television shows. The shot boundary detection results are evaluated using recall, precision, and F1 metrics, which demonstrate that the proposed approach provides superior overall performance compared to the standard edge-change ratio method. In addition, the convergence of the genetic algorithm is examined to show that the proposed method is both efficient and stable.
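For reference, a minimal sketch of the edge-change ratio metric that the genetic algorithm optimizes is given below (the GA itself is not shown); the Canny thresholds and dilation radius are illustrative assumptions, not values from the paper.

```python
# Classic edge-change ratio (ECR) between two grayscale frames; higher values
# suggest a shot boundary. Parameters are illustrative, not tuned.
import cv2
import numpy as np

def edge_change_ratio(prev_gray, curr_gray, canny_lo=100, canny_hi=200, dilate_r=5):
    kernel = np.ones((dilate_r, dilate_r), np.uint8)
    e_prev = cv2.Canny(prev_gray, canny_lo, canny_hi)
    e_curr = cv2.Canny(curr_gray, canny_lo, canny_hi)
    n_prev, n_curr = np.count_nonzero(e_prev), np.count_nonzero(e_curr)
    if n_prev == 0 or n_curr == 0:
        return 0.0
    d_prev = cv2.dilate(e_prev, kernel)            # tolerance band around old edges
    d_curr = cv2.dilate(e_curr, kernel)            # tolerance band around new edges
    entering = np.count_nonzero(e_curr & ~d_prev)  # new edges far from any old edge
    exiting = np.count_nonzero(e_prev & ~d_curr)   # old edges with no nearby new edge
    return max(entering / n_curr, exiting / n_prev)
```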
{"title":"Shot Boundary Detection Using Genetic Algorithm Optimization","authors":"Calvin Chan, A. Wong","doi":"10.1109/ISM.2011.58","DOIUrl":"https://doi.org/10.1109/ISM.2011.58","url":null,"abstract":"This paper presents a novel method for shot boundary detection via an optimization of traditional scoring based metrics using a genetic algorithm search heuristic. The advantage of this approach is that it allows for the detection of shots without requiring the direct use of thresholds. The methodology is described using the edge-change ratio metric and applied to several test video segments from the TREC 2002 video track and contemporary television shows. The shot boundary detection results are evaluated using recall, precision and F1 metrics, which demonstrate that the proposed approach provides superior overall performance when compared to the effective edge-change ratio method. In addition, the convergence of the genetic algorithm is examined to show that the proposed method is both efficient and stable.","PeriodicalId":339410,"journal":{"name":"2011 IEEE International Symposium on Multimedia","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124387259","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Application-layer overlay networks enjoy considerable popularity due to their flexibility and readily deployable nature, providing support for a plethora of peer-to-peer (P2P) applications. Currently, real-world deployments of Internet-scale P2P media streaming systems use a tracker server for content discovery in an on-demand model with asynchronous interactivity. The inherent drawbacks of the tracker-server approach are its scalability and bottleneck issues, which prompted us to pursue structured P2P substrates such as Distributed Hash Tables (DHTs), which have already proved to be stable. The challenge of accommodating a large number of update operations as users' playing positions continuously change in a DHT-based overlay was addressed in our previous work by the concept of Temporal-DHT, which exploits the temporal dynamics of the content to estimate playing positions. In this paper, we incorporate the notion of popularity awareness into the Temporal-DHT framework, adapting the query resolution mechanism to the skewness of content popularity typically found in real multimedia access patterns. The essential objective of the popularity awareness mechanism is to increase the overall performance of Temporal-DHT by optimizing the search cost across the entire content set in the system. We formulate the problem and provide practical solutions, with extensive simulation results that demonstrate the effectiveness of popularity-aware Temporal-DHT in achieving optimized query resolution cost and high streaming quality for on-demand systems in a dynamic network environment where users are free to join and leave the system asynchronously.
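The core Temporal-DHT idea, extrapolating a peer's playing position from its last published record instead of updating the DHT on every position change, together with a popularity-weighted refresh policy, can be sketched as follows. This is a simplified illustration under our own assumptions, not the authors' protocol.

```python
# Illustrative sketch: extrapolated playing position plus a popularity-aware
# refresh interval. All names and formulas are our assumptions.
import time

class PeerEntry:
    """A DHT record for a streaming peer: the last reported playing position
    and the time it was published."""
    def __init__(self, peer_id, reported_segment, report_time, playback_rate=1.0):
        self.peer_id = peer_id
        self.reported_segment = reported_segment
        self.report_time = report_time
        self.playback_rate = playback_rate         # segments per second

    def estimated_segment(self, now=None):
        """Extrapolate the current position instead of requiring the peer to
        publish an update for every segment it plays."""
        now = time.time() if now is None else now
        return self.reported_segment + self.playback_rate * (now - self.report_time)

def refresh_interval(popularity, base=30.0, min_interval=5.0):
    """Popularity-aware refresh: records for popular content are re-published
    more often, so estimates stay accurate where most queries land."""
    return max(min_interval, base / (1.0 + popularity))
```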
{"title":"Popularity Awareness in Temporal-DHT for P2P-based Media Streaming Applications","authors":"Abhishek Bhattacharya, Zhenyu Yang, Deng Pan","doi":"10.1109/ISM.2011.46","DOIUrl":"https://doi.org/10.1109/ISM.2011.46","url":null,"abstract":"Application-layer overlay networks are receiving considerable popularity due to its flexibility and readily deployable nature thereby providing support for a plethora of Peer-to-Peer (P2P) applications. Currently, the real-world deployment of Internet-scale P2P media streaming systems involve the usage of tracker server for content discovery in on-demand model with asynchronous interactivity. The inherent drawbacks of tracker-server based approach are obvious due to scalability and bottleneck issues, which prompted us to pursue a structured P2P based proposition such as Distributed Hash Tables (DHT) which are already proved to be stable substrates. The challenging issue of accommodating a large number of update operations with the continuous change of user's playing position in DHT-based overlay is addressed in our previous work by the concept of Temporal-DHT which exploits the temporal dynamics of the content to estimate playing position. In this paper, we incorporate the notion of popularity awareness in the Temporal-DHT framework which will help to adapt the query resolution mechanism by addressing the skew ness of content popularity typically found in real multimedia user access patterns. The essential objective of popularity awareness mechanism is to increase the overall performance of Temporal-DHT by optimizing the search cost among the entire content set within the system. We formulate the problem and provide practical solutions with extensive simulation results that demonstrates the effectiveness of popularity-aware Temporal-DHT by achieving optimized query resolution cost and high streaming quality for on-demand systems in a dynamic network environment where user's are free to asynchronously join/leave the system.","PeriodicalId":339410,"journal":{"name":"2011 IEEE International Symposium on Multimedia","volume":"91 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124242260","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
In the area of cultural heritage there is a strong push to aggregate content metadata from institutions (such as museums, universities, archives, libraries, and foundations) to make them widely accessible. This reduces fragmentation, allows aggregation, and brings valuable collections together in a single place. For example, Europeana (the so-called European digital library) collects only metadata, while content files are referenced via URLs. These URLs point to the original content owner and/or to the content aggregator that facilitated the collection. That model leaves room for the content aggregator to provide additional services on its enriched models. The proposed content aggregation model attempts to satisfy specific requirements with a semantic model and tools that support executable aggregations such as playlists, collections, e-learning courses, and media annotations/synchronizations. The produced aggregations can also be exposed by mapping semantic concepts to Europeana. The paper also analyzes the semantic models mentioned and their difficulties, including some comments on the adoption of linked open data and media models. The results have been produced in the ECLAP project (ICT PSP), funded by the European Commission, http://www.eclap.eu.
{"title":"Models and Tools for Aggregating and Annotating Content on ECLAP","authors":"P. Bellini, P. Nesi, M. Paolucci, Marco Serena","doi":"10.1109/ISM.2011.41","DOIUrl":"https://doi.org/10.1109/ISM.2011.41","url":null,"abstract":"In the area of cultural heritage there is a strong push on aggregating content metadata from institutions (such as museums, university, archives, library, foundations, etc.) to make them widely accessible. This action is going to reduce fragmentation, allows aggregation and integrates valuable collections in a unique place. For example, European a (the so called European digital library) collects only metadata, while content files are referred via some URL. These URLs refer to the original content owner and/or to the Content Aggregator, facilitating the collection. That model leaves space to the Content Aggregator to provide additional services on their enriched models. The proposed Content Aggregation model attempts to satisfy specific requirements with a semantic model and tools providing support for executable aggregations such as: play lists, collections, e-learning courses, and media annotations/synchronizations. The produced aggregations may also be provided by mapping semantic concepts to European a. The paper also performs an analysis of semantics models mentioned and of their difficulties including some comments about the adoption of linked open data and media model. The results have been produced in the project ECLAP ICT PSP founded by the European Commission, http://www.eclap.eu.","PeriodicalId":339410,"journal":{"name":"2011 IEEE International Symposium on Multimedia","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131677825","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
We propose a system that provides cooperative capture assistance by recognizing camera manipulation. Based on experimental results with inexperienced users, we previously proposed an incremental interaction model in which the system and the user shoot cooperatively. The system based on this model compensates for the user's lack of cinematographic knowledge or skill by relating affective information, such as atmosphere or mood, to capture techniques. When the user captures a shot after selecting a specific atmosphere, the system analyzes the current image and the camera operation, including the camera angle and zooming speed, and then gives guidance for better capture according to the analysis. The proposed system thus achieves an incremental interaction between the user and the system, evolving beyond the user's unidirectional manipulation of the camera. The system helps the user reflect their intention for the scene appropriately, so the user can capture scenes more appropriately and effectively without specific cinematographic knowledge or skills. As a result, the user can acquire basic shooting skills smoothly and shoot more effectively.
{"title":"Shooting Assistance by Recognizing User's Camera Manipulation for Intelligible Video Production","authors":"H. Mitarai, A. Yoshitaka","doi":"10.1109/ISM.2011.33","DOIUrl":"https://doi.org/10.1109/ISM.2011.33","url":null,"abstract":"We propose a system which achieves cooperative capture assistance by camera manipulation recognition. Based on an experimental result on inexperienced users, the incremental interaction model, which the system and a user cooperatively shoot, was formerly proposed. The system based on the model compensates for user's lack of cinematographic knowledge or skills by relating affective information such as atmosphere or mood to capture techniques. When a user captures a shot after selecting a specific atmosphere, the system analyzes the current shooting image and the camera operation including the camera angle and the zooming speed. Then it gives guidance for better capture according to the analysis. The proposed system based on the model achieves an incremental interaction between the user and the system, evolving from user's unidirectional manipulation of the system. The system assists the user in reflecting user intention of the scene appropriately, therefore it enables the user to capture scenes more appropriately and effectively without specific cinematographic knowledge or skills. As a result, the user can acquire basic shooting skills smoothly and shoot more effectively.","PeriodicalId":339410,"journal":{"name":"2011 IEEE International Symposium on Multimedia","volume":"58 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124339689","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
It is a great challenge to detect an object that is overlapped or occluded by other objects in images. For moving objects in a video sequence, their movements bring extra spatio-temporal information across successive frames, which helps object detection, especially for occluded objects. This paper proposes a moving object detection approach for occluded objects in a video sequence with the assistance of the SPCPE (Simultaneous Partition and Class Parameter Estimation) unsupervised video segmentation method. Based on the preliminary foreground estimation from SPCPE and the object detection information from the previous frame, an N-step search (NSS) method is used to locate the moving objects, followed by a size-adjustment method that adjusts the bounding boxes of the objects. Several experimental results show that the proposed approach achieves good detection performance under object occlusion in successive frames of a video sequence.
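The paper names an N-step search for locating the moving objects; the classic coarse-to-fine block-matching version of that search can be sketched as below. The SAD cost and all names are illustrative assumptions, and the size-adjustment step is not shown.

```python
# Classic N-step (e.g. three-step) block-matching search for the displacement
# of a bounding box between two grayscale frames. Illustrative sketch only.
import numpy as np

def n_step_search(prev_frame, curr_frame, box, steps=3, init_step=8):
    """box = (x, y, w, h) in prev_frame; returns the best (dx, dy)."""
    x, y, w, h = box
    template = prev_frame[y:y + h, x:x + w].astype(np.int32)
    best_dx, best_dy = 0, 0
    step = init_step
    for _ in range(steps):
        # Examine the 3x3 grid of candidates around the current best center.
        candidates = [(best_dx + i * step, best_dy + j * step)
                      for i in (-1, 0, 1) for j in (-1, 0, 1)]
        best_cost = None
        for dx, dy in candidates:
            nx, ny = x + dx, y + dy
            if nx < 0 or ny < 0 or ny + h > curr_frame.shape[0] or nx + w > curr_frame.shape[1]:
                continue                           # candidate falls outside the frame
            patch = curr_frame[ny:ny + h, nx:nx + w].astype(np.int32)
            cost = np.abs(patch - template).sum()  # sum of absolute differences
            if best_cost is None or cost < best_cost:
                best_cost, best_dx, best_dy = cost, dx, dy
        step = max(1, step // 2)                   # refine around the new center
    return best_dx, best_dy
```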
{"title":"Moving Object Detection under Object Occlusion Situations in Video Sequences","authors":"Dianting Liu, M. Shyu, Qiusha Zhu, Shu‐Ching Chen","doi":"10.1109/ISM.2011.50","DOIUrl":"https://doi.org/10.1109/ISM.2011.50","url":null,"abstract":"It is a great challenge to detect an object that is overlapped or occluded by other objects in images. For moving objects in a video sequence, their movements can bring extra spatio-temporal information of successive frames, which helps object detection, especially for occluded objects. This paper proposes a moving object detection approach for occluded objects in a video sequence with the assist of the SPCPE (Simultaneous Partition and Class Parameter Estimation) unsupervised video segmentation method. Based on the preliminary foreground estimation result from SPCPE and object detection information from the previous frame, an n-steps search (NSS) method is utilized to identify the location of the moving objects, followed by a size-adjustment method that adjusts the bounding boxes of the objects. Several experimental results show that our proposed approach achieves good detection performance under object occlusion situations in serial frames of a video sequence.","PeriodicalId":339410,"journal":{"name":"2011 IEEE International Symposium on Multimedia","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2011-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115566752","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}