Proceedings IEEE International Conference on Multimedia Computing and Systems

Development of a legible deaf-signing virtual human
F. Pezeshkpour, I. Marshall, R. Elliott, J. Bangham
Pub Date: 1999-06-07 | DOI: 10.1109/MMCS.1999.779226
Many deaf people rely on sign language as their primary mode of communication, and they would enjoy enhanced information access if media applications could provide signed commentaries. The advent of multimedia makes such provision possible. We outline a prototype real-time subtitle-to-signing translation system based on the adaptation and integration of existing software components. We describe the development of a framework, using the Tcl/Tk environment, that supports the integration of distributed system components over a basic communications infrastructure. We also discuss the development of a virtual human (avatar), deployed in this framework, to perform the signing.
A contour analysis based technique to extract objects for MPEG-4
E. Edirisinghe, Jianmin Jiang
Pub Date: 1999-06-07 | DOI: 10.1109/MMCS.1999.779232
MPEG-4 is an emerging global standard for digital multimedia services. The core of this video standard is a content-based video data structure consisting of arbitrarily shaped video objects; data compression is defined in terms of these video objects rather than frames. It is therefore a key requirement in the development of MPEG-4 to explore possible means of extracting objects from video frames. We propose a contour-based technique to extract video objects for MPEG-4. Object contours are extracted by convolving a given frame with a Laplacian-of-Gaussian operator, followed by an edge detection process. The contours are then blocked and finally filled with the aid of a pixel-based, parity-check filling algorithm. Experimental results for several object extractions on the 'Lena' and 'Peppers' images are included.
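The contour pipeline above (Laplacian-of-Gaussian convolution, edge detection by zero-crossing, parity-check filling) can be sketched as follows. This is a minimal illustration, not the authors' implementation: the `sigma` value and the simple scanline fill are assumptions.

```python
import numpy as np
from scipy import ndimage

def log_zero_crossings(image, sigma=2.0):
    """Convolve with a Laplacian-of-Gaussian and mark zero-crossings
    as candidate object contours (the edge-detection stage above)."""
    log = ndimage.gaussian_laplace(image.astype(float), sigma=sigma)
    # A pixel is a zero-crossing if its LoG sign differs from a neighbour's.
    edges = np.zeros_like(log, dtype=bool)
    edges[:-1, :] |= np.signbit(log[:-1, :]) != np.signbit(log[1:, :])
    edges[:, :-1] |= np.signbit(log[:, :-1]) != np.signbit(log[:, 1:])
    return edges

def parity_fill(edges):
    """Scanline parity-check fill: a pixel is inside an object when an
    odd number of contour pixels lie to its left on the same row."""
    crossings = np.cumsum(edges, axis=1)
    return (crossings % 2 == 1) & ~edges
```

For a frame containing a single bright object on a dark background, `parity_fill(log_zero_crossings(frame))` yields a binary object mask; real contours need the blocking and cleanup steps the paper describes.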
Tape-less video technologies: issues in workflow transitions
F. Arman
Pub Date: 1999-06-07 | DOI: 10.1109/MMCS.1999.779120
This paper provides an overview of the issues involved in migrating from tape-based video systems to completely tape-less systems. The main focus is on workflow and how each system sub-component may affect it. Workflow is discussed in detail because of its overwhelming effect on any organization that decides to undertake and implement such a change. The sub-systems discussed are storage, networking, media asset management, and databases.
Semantic video model for content-based retrieval
Jia-Ling Koh, Chin-Sung Lee, Arbee L. P. Chen
Pub Date: 1999-06-07 | DOI: 10.1109/MMCS.1999.778508
Traditional research on video data retrieval follows two general approaches: one based on text annotation and the other on content-based comparison. However, these approaches do not fully exploit the meaning implied in a video stream. To improve on them, we study a semantic video model that cooperates with a knowledge database. We propose a new semantic video model focused on representing the semantic meaning implied in a video. According to the granularity of that meaning, we propose a five-level layered structure for modeling a video stream, along with a mechanism for constructing the five levels from the knowledge categories defined in the knowledge database. The layered structure consists of raw-data levels and semantic-data levels, and a uniform semantics representation is proposed for the semantic-data levels. This representation allows the similarity of two video streams with different durations to be measured, so that an interactive interface can support efficient browsing and querying of video data.
Fast forward and fast rewind play system based on the MPEG system stream with new concept
Sang Y. Doh, Min Jang
Pub Date: 1999-06-07 | DOI: 10.1109/MMCS.1999.778597
In this paper, we present a system and algorithms for fast-forward and fast-rewind play of motion picture streams encoded in the standard MPEG format. Many traditional MPEG system decoders cannot easily control the MPEG system stream; in particular, they have difficulty with fast-forward and fast-rewind play. We adopt a new approach for our system: we use index data in the MPEG system stream. The approach is compatible with the standard MPEG decoding method and also addresses the problems of other models.
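The index-based trick-play idea can be illustrated with a small sketch: given a table of I-frame byte offsets keyed by presentation time, a player samples the index at the desired speed instead of decoding every frame. The entry layout, the 0.5 s display step, and the planning function are illustrative assumptions, not the paper's design.

```python
import bisect
from dataclasses import dataclass

@dataclass
class IndexEntry:
    timestamp: float   # presentation time in seconds
    byte_offset: int   # byte position of an independently decodable I-frame

def trick_play_plan(index, start, duration, speed):
    """Return byte offsets to decode for |speed|x fast-forward (speed > 0)
    or fast-rewind (speed < 0), showing one indexed frame per 0.5 s of
    wall-clock time and skipping everything in between."""
    entries = sorted(index, key=lambda e: e.timestamp)
    times = [e.timestamp for e in entries]
    plan, wall = [], 0.0
    while wall <= duration:
        t = start + wall * speed
        if t < times[0] or t > times[-1]:
            break                                   # ran off either end of the stream
        i = bisect.bisect_right(times, t) - 1       # nearest I-frame at or before t
        plan.append(entries[i].byte_offset)
        wall += 0.5
    return plan
```

A decoder can then seek directly to each planned offset, which is what makes trick play cheap compared with scanning the whole system stream.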
"What is in that video anyway?": in search of better browsing
S. Srinivasan, D. Ponceleón, A. Amir, D. Petkovic
Pub Date: 1999-06-07 | DOI: 10.1109/MMCS.1999.779235
Effective use of digital video can be greatly improved by combining two technologies: computer vision for automated video analysis and information visualization for data presentation. The unstructured spatio-temporal nature of video poses tough challenges for extracting semantics with fully automated techniques. In the CueVideo project, we combine these automated technologies with a user interface designed for rapid filtering and comprehension of video content. Our interface introduces two new techniques for viewing video and builds upon existing techniques to provide synergistic views of the video content. We also report on a preliminary user study comparing the efficacy of these views in supporting comprehension of video content.
Extracting Web design knowledge: the Web De-Compiler
M. Chan, G. Yu
Pub Date: 1999-06-07 | DOI: 10.1109/MMCS.1999.778544
We introduce a Web De-Compiler (WDC) that extracts Web design information for reuse. Given a Web site, the system extracts design knowledge at several levels: site organization and navigation, page layout, and objects. Objects are regular structures within a page, including paragraphs, tables, and images. Page layout includes the use of color, fonts, background images, and placement of objects. Design information is extracted by analyzing the HTML tags and images of a Web page. An autonomous agent utilizing the WDC is cataloging all of the designs on the Internet. The design information can be reused in automated and semi-automated Web site design, re-design, and analysis.
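Tag-level design extraction of the kind described above can be sketched with the standard library's HTML parser. The particular cues collected here (object counts, `bgcolor`/`color` attributes, `<font face>` values, typical of 1999-era HTML) are illustrative choices, not the WDC's actual feature set.

```python
from collections import Counter
from html.parser import HTMLParser

class DesignExtractor(HTMLParser):
    """Collect page-layout cues (colors, fonts, structural objects)
    by walking a page's HTML start tags."""
    def __init__(self):
        super().__init__()
        self.objects = Counter()   # paragraphs, tables, images found on the page
        self.colors = set()        # values of bgcolor/color attributes
        self.fonts = set()         # values of <font face="..."> attributes

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag in ("p", "table", "img"):
            self.objects[tag] += 1
        for key in ("bgcolor", "color"):
            if key in attrs:
                self.colors.add(attrs[key])
        if tag == "font" and "face" in attrs:
            self.fonts.add(attrs["face"])

page = ('<body bgcolor="#ffffff"><font face="Arial"><p>Hello</p></font>'
        '<table><tr><td><img src="logo.gif"></td></tr></table></body>')
extractor = DesignExtractor()
extractor.feed(page)
```

After `feed`, `extractor.objects`, `extractor.colors`, and `extractor.fonts` summarize the page's design vocabulary, which is the kind of record an agent could catalog per site.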
A human-assisted system to build 3-D models from a single image
A. François, G. Medioni
Pub Date: 1999-06-07 | DOI: 10.1109/MMCS.1999.779218
We present a system at the junction of Computer Vision and Computer Graphics that produces a 3-D model of an object as observed in a single image, with a minimum of high-level interaction from the user. The input to our system is a single image. First, the user points, coarsely, at image features (edges) that are then automatically and reproducibly extracted in real time. The user then performs high-level labeling of the curves (e.g. limb edge, cross-section) and specifies relations between edges (e.g. symmetry, surface, or part). NURBS are used as the working representation of image edges. The objects described by the user-specified, qualitative relationships are then reconstructed either as a set of connected parts modeled as Generalized Cylinders, or as a set of 3-D surfaces for bilaterally symmetric 3-D objects. In both cases, the texture is also extracted from the image.
Using Food Web as an evolution computing model for Internet-based multimedia agents
T. Shih
Pub Date: 1999-06-07 | DOI: 10.1109/MMCS.1999.778551
The ecosystem is an evolutionary result of natural laws, and the Food Web (or Food Chain) embeds a set of computation rules of natural balance. Based on the concepts of the Food Web, one of the mechanisms we may learn from nature besides neural networks and genetic algorithms, we propose a theoretical computation model for mobile agent evolution on the Internet. We define an agent niche overlap graph and agent evolution states, and we propose a set of algorithms, used in our multimedia search programs, to simulate agent evolution. Agents are cloned to live on a remote host station according to three different strategies: the brute-force strategy, the semi-brute-force strategy, and the selective strategy. Evaluations of the different strategies are discussed, and guidelines for writing mobile agent programs are proposed. The technique can be used in distributed information retrieval, where it adds computation load to servers but significantly reduces network communication traffic.
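The three cloning strategies named in the abstract can be contrasted with a small sketch. The strategy names come from the paper; the host-scoring function, threshold, and `top_k` cutoff are assumptions introduced here for illustration, since the abstract does not define the selection criteria.

```python
def plan_clones(hosts, score, strategy, threshold=0.5, top_k=3):
    """Choose remote hosts on which to clone a mobile agent.
      brute      : clone to every known host
      semi-brute : clone to every host whose (hypothetical) fitness
                   score passes a threshold
      selective  : clone only to the top_k best-scoring hosts
    """
    if strategy == "brute":
        return list(hosts)
    if strategy == "semi-brute":
        return [h for h in hosts if score(h) >= threshold]
    if strategy == "selective":
        return sorted(hosts, key=score, reverse=True)[:top_k]
    raise ValueError(f"unknown strategy: {strategy}")
```

The trade-off the paper evaluates shows up directly: brute force maximizes coverage at maximum cloning cost, while the selective strategy spends cloning effort only where the niche fit is best.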
Temporal synchronization in multimedia presentations
I. Cruz, Parag S. Mahalley
Pub Date: 1999-06-07 | DOI: 10.1109/MMCS.1999.778598
Designing high-quality multimedia presentations is a tedious and time-consuming task, even for skilled authors. This is particularly true when temporal media such as speech and animation are involved. The focus of our research is to determine whether a multimedia presentation is synchronized or amenable to synchronization. We define a formal framework for verifying the temporal synchronization of a presentation, based on a modified all-pairs shortest-path algorithm.
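The shortest-path connection can be made concrete with the textbook version of this check: model presentation events as nodes in a simple temporal network, encode each timing requirement as an edge constraint t[j] - t[i] <= w, and run all-pairs shortest paths; a negative cycle means the constraints are unsatisfiable. This is the standard formulation shown for illustration, not the paper's modified algorithm.

```python
INF = float("inf")

def synchronizable(n, constraints):
    """Check temporal consistency of n presentation events under
    constraints (i, j, w), each meaning t[j] - t[i] <= w.
    Floyd-Warshall over the constraint graph; a negative-weight
    cycle (d[i][i] < 0) means no schedule satisfies all bounds."""
    d = [[0.0 if i == j else INF for j in range(n)] for i in range(n)]
    for i, j, w in constraints:
        d[i][j] = min(d[i][j], w)   # keep the tightest bound per edge
    for k in range(n):
        for i in range(n):
            for j in range(n):
                if d[i][k] + d[k][j] < d[i][j]:
                    d[i][j] = d[i][k] + d[k][j]
    return all(d[i][i] >= 0 for i in range(n))
```

For example, requiring a caption to start at least 2 s after its audio but also within 1 s of it encodes as edges (0, 1, 1) and (1, 0, -2), a negative cycle, so the presentation is not amenable to synchronization.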