MULTIMEDIA '93: Latest Papers

Design of an information skimming space
Pub Date : 1993-09-01 DOI: 10.1145/166266.168424
M. Ohkubo, N. Kobayashi, Toru Nakagawa
This paper proposes the Information Skimming Space (ISS), which gives users a window through which interesting news reports can be culled from the large number of reports created daily by mass media such as newspapers and TV stations, and which also serves as a powerful information retrieval system. Our idea is that information acquisition is best performed by enhancing the efficiency with which information is presented to and accessed by the user. To realize this concept, newspapers are analyzed as an information acquisition tool in order to derive an effective presentation scheme suited to rapid cognitive understanding. A News Transmission Model is proposed to define navigation paths that allow the user to access other reports according to his or her interests.
Citations: 12
CMIFed: a presentation environment for portable hypermedia documents
Pub Date : 1993-09-01 DOI: 10.1145/166266.166287
G. Rossum, Jack Jansen, K. S. Mullender, D. Bulterman
This paper discusses the architecture and implementation of CMIFed, an editing and presentation environment for hypermedia documents. Typically such documents contain a mixture of text, images, audio, and video (and possibly other media), augmented with user interaction. CMIFed allows the author flexibility in specifying what is presented when, using multiple simultaneous output channels. Unlike systems that use a timeline or scripting metaphor to control the presentation, in CMIFed the user manipulates a collection of events and timing constraints among those events. Common timing requirements can be specified by grouping events together in a tree whose nodes indicate sequential and parallel composition. More general timing constraints between events can be added in the form of synchronization arcs. User interaction is supported in the form of hyperlinks. We place CMIFed in the context of the CMIF model for hypermedia documents, which formalizes the properties of hypermedia presentations in a platform-independent manner. CR Subject Classification (1991): H.5.1, H.5.2, I.7.2, I.3.6, I.3.4, D.4.1, D.4.4, D.4.7.
Citations: 150
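The CMIFed abstract above describes timing as a tree whose nodes indicate sequential and parallel composition of events. The following is a minimal sketch of that idea only, not CMIFed's actual data model: the `Node` class, the `kind` labels, and the `schedule` function are hypothetical names introduced here, leaf durations are assumed to be known in advance, and synchronization arcs and hyperlinks are not modelled.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Tuple


@dataclass
class Node:
    """Hypothetical tree node: a leaf event, or a seq/par composition."""
    name: str
    kind: str = "leaf"              # "leaf", "seq", or "par"
    duration: float = 0.0           # used only by leaves
    children: List["Node"] = field(default_factory=list)


def schedule(node: Node, start: float = 0.0,
             out: Optional[Dict[str, Tuple[float, float]]] = None):
    """Return (end_time, {leaf_name: (start, end)}) for the subtree."""
    if out is None:
        out = {}
    if node.kind == "leaf":
        end = start + node.duration
        out[node.name] = (start, end)
    elif node.kind == "seq":
        # Sequential composition: each child starts when the previous ends.
        end = start
        for child in node.children:
            end, _ = schedule(child, end, out)
    elif node.kind == "par":
        # Parallel composition: all children start together; the group
        # ends when its longest child ends.
        end = start
        for child in node.children:
            child_end, _ = schedule(child, start, out)
            end = max(end, child_end)
    else:
        raise ValueError(f"unknown node kind: {node.kind}")
    return end, out
```

A document built as a sequence of a title, a parallel video/audio group, and credits would be scheduled by calling `schedule(root)` on its root node; the returned dictionary maps each leaf event to its start and end time.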
Integrating video into an application framework
Pub Date : 1993-09-01 DOI: 10.1145/166266.168439
Peter Schnorf
Object-oriented application frameworks help software engineers develop sophisticated interactive applications quickly and reduce maintenance costs significantly. An application framework provides, among other things, predefined visual classes for text or graphics display and for user interaction elements, as well as integrated control of these elements. Designed before motion video display was technically feasible on desktop computers, these frameworks’ architectures are built around the assumption that relatively slowly changing bitmap images are displayed on a computer screen. Motion video screen update rates were not anticipated. We extended a typical application framework in a multitasking environment to support motion video display in full generality. The internal architecture of the framework was changed to remove the subtle obstacles to high screen update rates, and a data type ‘video’ was designed and integrated seamlessly with the existing visual classes. Our video objects display motion video generated by autonomous hardware or software processes. They may appear in any shape or number mixed with other visual objects in scrollable views, as building blocks in graphics editors, as characters in text editors, as items in list or pop-up menus, etc. Video objects support the full visual class protocol for client-transparent double buffering, cut/copy/paste/undo operations, and output to or input from files. In this paper, we describe the features of these video objects and our changes to the architecture of the application framework necessary to support the new paradigms of motion video.
Citations: 9
Optimistic strategies for large-scale dissemination of multimedia information
Pub Date : 1993-09-01 DOI: 10.1145/166266.166267
R. Yavatkar, Leelanivas Manoj
We are investigating alternative transport protocol strategies for realizing large-scale dissemination services across a wide area network. Communication requirements of such applications are distinct from those based on conventional client-server interactions. Conventional flow and error control methods based on the retransmissions-with-timeout paradigm are not appropriate for such applications. Instead, we are interested in using optimistic flow and error control strategies that take into account application-specific error tolerance and media rates of multimedia applications. This paper describes transport level policies that use a combination of redundant transmissions, rate-based flow control, and selective feedback from receivers. A simulation-based performance evaluation demonstrates that relatively simple techniques succeed well in meeting the QoS requirements of a multimedia multicast and in scaling to hundreds of recipients.
Citations: 40
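The abstract above combines redundant transmissions with rate-based flow control. As a rough illustration of those two ideas, and not the authors' protocol, the sketch below paces packets at a fixed rate and sends each one a fixed number of times. The function names, the back-to-back placement of copies, and the assumption that losses are independent are all introduced here for illustration; selective receiver feedback is not modelled.

```python
from typing import List, Tuple


def paced_schedule(num_packets: int, rate_pps: float,
                   copies: int = 2) -> List[Tuple[float, int]]:
    """Return (send_time_seconds, sequence_number) pairs.

    Rate-based flow control: one transmission per 1/rate_pps seconds,
    independent of acknowledgements. Redundancy: each packet occupies
    `copies` consecutive slots (an assumed, simplistic placement).
    """
    interval = 1.0 / rate_pps
    schedule = []
    slot = 0
    for seq in range(num_packets):
        for _ in range(copies):
            schedule.append((slot * interval, seq))
            slot += 1
    return schedule


def residual_loss(loss_prob: float, copies: int) -> float:
    """Probability that every copy of a packet is lost, assuming each
    copy is lost independently with probability loss_prob."""
    return loss_prob ** copies
```

Under the independence assumption, doubling every packet turns a 10% loss rate into roughly a 1% residual loss rate at the cost of twice the bandwidth, which is the kind of trade-off an error-tolerant media stream can accept without retransmission timers.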
Facial image retrieval, identification, and inference system
Pub Date : 1993-09-01 DOI: 10.1145/166266.166271
Jian-Kang Wu, Y. H. Ang, P. Lam, S. Moorthy, A. D. Narasimhalu
Recognition of a human face is very easy even for a child, but is extremely difficult for computers. Here we present a Computer Aided Facial Image Identification, Retrieval and Inference System (CAFIIRIS) for criminal identification. The system stores and manages facial images and criminal records, providing the necessary image and text processing and editing tools. Inference of facial images of a person at different ages is also possible. Facial images can be accessed via key words, fuzzy descriptions, and visual browsing. A facial image database system stores and manages a large number of facial images together with text-based criminal records. It provides users with a flexible means to manipulate, archive, retrieve, and make use of facial images and text data. The facial images are visual rather than descriptive. Each digital image is a large array of pixels of various sizes, and a facial image database contains thousands or even hundreds of thousands of images. Therefore, this huge visual database needs special techniques for its management, namely, embedded functions for image pre-processing, feature extraction, and presentation (screen display and report formatter); visual access to image data via special indexing techniques; and application-specific image inference to derive new images based on images and other available information.
Citations: 27
Architectures for multi-source multi-user video compositing
Pub Date : 1993-09-01 DOI: 10.1145/166266.166291
L. C. Yun, D. G. Messerschmidt
Video compositing is the editing and integrating of many video images into a single presentation. Several single-user compositing systems have already been suggested, but the multi-user problem remains unstudied. We propose two new architectures for digital video compositing in a multiuser environment that are both memory efficient and can operate in real time. We show that under hard throughput and bandwidth constraints, a memoryless solution for transferring data from many video sources to many users does not exist. We overcome this using (i) a dynamic memory buffering architecture; and (ii) a constant memory bandwidth solution that transforms the sources-to-users transfer schedule into two schedules, then pipelines the computation. The architectures support opaque overlapping of images, arbitrarily shaped images, and images whose shapes dynamically change from frame to frame.
Citations: 8
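The compositing abstract above mentions opaque overlapping of arbitrarily shaped images. The sketch below shows only that per-frame operation for a single user, in software; it says nothing about the two memory architectures the paper proposes. The `composite` function, the use of `None` to mark pixels outside an image's shape, and the back-to-front layer order are assumptions made for illustration.

```python
from typing import List, Optional, Tuple

Pixel = Optional[int]            # None marks a pixel outside the image's shape
Image = List[List[Pixel]]
Layer = Tuple[Image, int, int]   # (image, x offset, y offset)


def composite(width: int, height: int, layers: List[Layer]) -> List[List[int]]:
    """Compose layers, given back to front, into one frame.

    Overlap is opaque: a later layer overwrites earlier ones wherever
    its shape covers a pixel. Background pixels are 0.
    """
    frame = [[0] * width for _ in range(height)]
    for image, ox, oy in layers:
        for r, row in enumerate(image):
            for c, px in enumerate(row):
                x, y = ox + c, oy + r
                inside_frame = 0 <= x < width and 0 <= y < height
                if px is not None and inside_frame:
                    frame[y][x] = px
    return frame
```

In a multi-user setting each user would have a separate layer list (a different choice, position, and stacking order of sources), and a shape that changes from frame to frame is simply a different `None` mask on each call.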
Media scaling for audiovisual communication with the Heidelberg transport system
Pub Date : 1993-09-01 DOI: 10.1145/166266.166277
L. Delgrossi, C. Halstrick, D. Hehmann, R. Herrtwich, O. Krone, J. Sandvoss, C. Vogt
HeiTS, the Heidelberg Transport System, is a multimedia communication system for real-time delivery of digital audio and video. HeiTS operates on top of guaranteed-performance networks that apply resource reservation techniques. To make HeiTS also work with networks for which no reservation scheme can be realized (for example, Ethernet or existing internetworks), we implement an extension to HeiTS which performs media scaling at the transport level: the media encoding is modified according to the bandwidth available in the underlying networks. Both transparent and non-transparent scaling methods are examined. HeiTS lends itself to implementing transparent temporal and spatial scaling of media streams. At the HeiTS interface, functions are provided which report information on the available resource bandwidth to the application so that non-transparent scaling methods may be used, too. Both a continuous and a discrete scaling solution for HeiTS are presented. The continuous solution uses feedback messages to adjust the data flow. The discrete solution also exploits the multipoint network connection mechanism of HeiTS. Whereas the first method is more flexible, the second technique is better suited for multicast scenarios. The combination of resource reservation and media scaling seems to be particularly well-suited to meet the varying demands of distributed multimedia applications.
Citations: 168
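The HeiTS abstract above says the continuous scaling solution uses feedback messages to adjust the data flow, and that temporal scaling is one of the transparent methods. The sketch below illustrates that loop with an assumed policy: the frame rate is halved when the receiver reports congestion and raised by one frame per second otherwise. The policy, the function names, and the rate bounds are hypothetical and are not taken from HeiTS.

```python
from typing import Iterable, List


def adjust_frame_rate(current_fps: int, congested: bool,
                      min_fps: int = 1, max_fps: int = 25,
                      step: int = 1) -> int:
    """Return the next sending frame rate after one feedback message.

    Assumed policy: scale down multiplicatively on congestion,
    scale back up additively while the path is clear.
    """
    if congested:
        return max(min_fps, current_fps // 2)
    return min(max_fps, current_fps + step)


def run_feedback_loop(feedback: Iterable[bool],
                      start_fps: int = 25) -> List[int]:
    """Apply a sequence of feedback messages; return the rate after each."""
    fps = start_fps
    history = []
    for congested in feedback:
        fps = adjust_frame_rate(fps, congested)
        history.append(fps)
    return history
```

Because the sender only drops or restores whole frames, the application does not need to change its encoding, which is what makes this kind of temporal scaling transparent in the sense the abstract uses.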
Automatic temporal layout mechanisms
Pub Date : 1993-09-01 DOI: 10.1145/166266.168415
M. C. Buchanan, P. Zellweger
A traditional static document has a spatial layout that indicates where objects in the document appear. Because multimedia documents incorporate time, they also require a temporal layout, or schedule, that indicates when events in the document occur. This paper argues that multimedia document systems should provide mechanisms for automatically producing temporal layouts for documents. The major advantage of this approach is that it makes it easier for authors to create and modify multimedia documents. This paper constructs a framework for understanding automatic temporal formatters and explores the basic issues surrounding them. It also describes the Firefly multimedia document system, which has been developed to test the potential of automatic temporal formatting.
Citations: 163
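The abstract above argues for automatically producing a temporal layout, that is, a schedule stating when each event occurs. One simple way to picture an automatic temporal formatter, under assumptions that are much narrower than what Firefly supports, is earliest-start scheduling over "must finish before" constraints. The `temporal_layout` function and its input format are hypothetical; durations are assumed to be fixed and known, and every event named in a constraint is assumed to have a duration.

```python
from graphlib import TopologicalSorter
from typing import Dict, Set


def temporal_layout(durations: Dict[str, float],
                    after: Dict[str, Set[str]]) -> Dict[str, float]:
    """Return the earliest start time of every event.

    durations: event name -> duration in seconds.
    after:     event name -> events that must finish before it starts.
    Raises graphlib.CycleError if the constraints are circular.
    """
    graph = {event: after.get(event, set()) for event in durations}
    start: Dict[str, float] = {}
    # static_order() yields each event only after all its predecessors.
    for event in TopologicalSorter(graph).static_order():
        start[event] = max(
            (start[p] + durations[p] for p in graph[event]),
            default=0.0,
        )
    return start
```

The author states only the ordering constraints; the start times are derived. Changing one duration and rerunning the function shifts every dependent event, which is the authoring advantage the abstract points to.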
Image processing on compressed data for large video databases
Pub Date : 1993-09-01 DOI: 10.1145/166266.166297
F. Arman, A. Hsu, M. Chiu
This paper presents a novel approach to processing encoded video sequences prior to decoding. Scene changes may be easily detected using DCT coefficients in JPEG and MPEG encoded video sequences. In addition, by analyzing the DCT coefficients, regions of interest may be isolated prior to decompression, increasing efficiency of any subsequent image processing steps, such as edge detection. The results are currently used in a video browser, and are part of an ongoing research project in creating large video databases. The procedure is presented in detail and several examples are exhibited.
Citations: 272
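The abstract above states that scene changes can be detected from DCT coefficients without decoding the video. The sketch below assumes each frame has already been reduced to a vector of selected DCT coefficients (extracting them from a JPEG or MPEG bitstream is not shown) and flags a scene change when consecutive vectors are dissimilar. The cosine-similarity measure and the 0.9 threshold are illustrative assumptions, not values from the paper.

```python
import math
from typing import List, Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Normalized inner product of two coefficient vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        # Two all-zero vectors count as identical; one zero vector
        # against a non-zero one counts as completely different.
        return 1.0 if norm_a == norm_b else 0.0
    return dot / (norm_a * norm_b)


def scene_changes(frames: Sequence[Sequence[float]],
                  threshold: float = 0.9) -> List[int]:
    """Return indices i where frame i starts a new scene, i.e. where
    its similarity to frame i - 1 falls below the threshold."""
    return [
        i for i in range(1, len(frames))
        if cosine_similarity(frames[i - 1], frames[i]) < threshold
    ]
```

Since only a handful of coefficients per frame are compared, the cost per frame is far below that of full decompression, which is the efficiency argument the abstract makes for working in the compressed domain.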
Programming the multimodal interface
Pub Date : 1993-09-01 DOI: 10.1145/166266.166288
E. Glinert, M. Blattner
Fundamental problems will confront those who wish to take full advantage of the power of tomorrow's multimodal environments. We argue that our recently introduced concept of meta-widget, when embedded within a high level, networked user interface server, can support the effective implementation of complex multimedia applications. We develop algorithms which enable a multimodal system to select the "best" combination of representations for the various "information packets" in a display at any moment. If no acceptable combination of available representations can be found, strategies are provided for creating a new and useful, if not beautiful, representation to resolve the impasse. A running example is provided to motivate and clarify the discussion.
Citations: 11
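The abstract above describes selecting the "best" combination of representations for the information packets in a display. The sketch below shows one naive reading of that selection problem, not the authors' algorithms: every packet has candidate representations, each tied to a modality and given a score, and an exhaustive search picks the highest-scoring combination that respects per-modality capacity. The data layout, the scoring, and the capacity model are all assumptions; returning `None` stands in for the impasse case, for which the paper provides strategies that are not modelled here.

```python
from itertools import product
from typing import Dict, List, Optional, Tuple

Candidate = Tuple[str, str, float]   # (representation, modality, score)


def best_combination(candidates: Dict[str, List[Candidate]],
                     capacity: Dict[str, int]) -> Optional[Dict[str, str]]:
    """Pick one representation per packet, maximizing the total score.

    capacity: modality -> how many packets may use it at once.
    Returns packet -> representation, or None if no combination fits.
    """
    packets = sorted(candidates)
    best: Optional[Dict[str, str]] = None
    best_score = float("-inf")
    for combo in product(*(candidates[p] for p in packets)):
        used: Dict[str, int] = {}
        for _, modality, _ in combo:
            used[modality] = used.get(modality, 0) + 1
        if any(n > capacity.get(m, 0) for m, n in used.items()):
            continue                  # modality over-subscribed
        score = sum(s for _, _, s in combo)
        if score > best_score:
            best = {p: c[0] for p, c in zip(packets, combo)}
            best_score = score
    return best
```

Exhaustive search is exponential in the number of packets, so this only suits small displays; it is meant to make the selection problem concrete rather than to suggest how it should be solved at scale.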