Pub Date : 1996-06-17DOI: 10.1109/MMCS.1996.535006
Junehwa Song, Y. Doganata, Michelle Y. Kim, A. Tantawi
Interactive multimedia documents (or systems) can be characterized by active user participation and the diversity of multimedia information accessed at various levels of granularity. They need to support extensive user participation in selecting and tailoring the information and its presentation. Multimedia information fragments may vary from portions of video, pieces of audio, newspaper quotations or chapters of a book. Effectively managing the creation, evolution and complexity of multimedia documents formed by combining media fragments is an essential capability. We consider the hyperstory model of a multimedia document, where a document is structured hierarchically in three dimensions: time, space and asynchrony. The model provides a layered approach in structuring multimedia documents, thus reducing the complexities of large systems. The hyperstory model supports user interactions that are timed, and also supports preemptive resuming. Timed user interactions with the document are modeled with a newly introduced timed Petri-net (TPN*). TPN* is used to infer the behavior of the system. This paper describes TPN*'s modeling and analysis and its application to the hyperstory model.
{"title":"Modeling timed user-interactions in multimedia documents","authors":"Junehwa Song, Y. Doganata, Michelle Y. Kim, A. Tantawi","doi":"10.1109/MMCS.1996.535006","DOIUrl":"https://doi.org/10.1109/MMCS.1996.535006","url":null,"abstract":"Interactive multimedia documents (or systems) can be characterized by active user participation and the diversity of multimedia information accessed at various levels of granularity. They need to support extensive user participation in selecting and tailoring the information and its presentation. Multimedia information fragments may vary from portions of video, pieces of audio, newspaper quotations or chapters of a book. Effectively managing the creation, evolution and complexity of multimedia documents formed by combining media fragments is an essential capability. We consider the hyperstory model of a multimedia document, where a document is structured hierarchically in three dimensions: time, space and asynchrony. The model provides a layered approach in structuring multimedia documents, thus reducing the complexities of large systems. The hyperstory model supports user interactions that are timed, and also supports preemptive resuming. Timed user interactions with the document are modeled with a newly introduced timed Petri-net (TPN*). TPN* is used to infer the behavior of the system. This paper describes TPN*'s modeling and analysis and its application to the hyperstory model.","PeriodicalId":371043,"journal":{"name":"Proceedings of the Third IEEE International Conference on Multimedia Computing and Systems","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1996-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133229378","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1996-06-17DOI: 10.1109/MMCS.1996.534951
L. D. Silva, T. Miyasato, F. Kishino
We investigate the unique advantages of our proposed virtual space teleconferencing system (VST) in the area of multimedia teleconferencing, with emphasis on facial emotion transmission and recognition. We show that, using this concept, emotions of a local participant can be transmitted to the remote party with a higher recognition rate by enhancing the emotions using some intelligence processing between the local and the remote participants. This leads to a kind of emotion enhanced teleconferencing system which can supersede face to face meetings, by effectively alleviating the barriers in recognizing emotions between different nations. We consider a concept known as a virtual person, which is a better alternative to blurred or mosaic facial images that one can find in some television interviews with people who are not willing to be exposed in public.
{"title":"Emotion enhanced multimedia meetings using the concept of virtual space teleconferencing","authors":"L. D. Silva, T. Miyasato, F. Kishino","doi":"10.1109/MMCS.1996.534951","DOIUrl":"https://doi.org/10.1109/MMCS.1996.534951","url":null,"abstract":"We investigate the unique advantages of our proposed virtual space teleconferencing system (VST) in the area of multimedia teleconferencing, with emphasis on facial emotion transmission and recognition. We show that, using this concept, emotions of a local participant can be transmitted to the remote party with a higher recognition rate by enhancing the emotions using some intelligence processing between the local and the remote participants. This leads to a kind of emotion enhanced teleconferencing system which can supersede face to face meetings, by effectively alleviating the barriers in recognizing emotions between different nations. We consider a concept known as a virtual person, which is a better alternative to blurred or mosaic facial images that one can find in some television interviews with people who are not willing to be exposed in public.","PeriodicalId":371043,"journal":{"name":"Proceedings of the Third IEEE International Conference on Multimedia Computing and Systems","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1996-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124701166","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1996-06-17DOI: 10.1109/MMCS.1996.535007
A. Caloini, Eiichiro Tanaka
In this paper, we extend the notion of style, widely used in modern desktop publishing systems, to hypermedia documents. We also show how such styles can be used to provide different views of the same hyperdocument (e.g. English vs. Japanese).
{"title":"Extending styles to hypermedia documents","authors":"A. Caloini, Eiichiro Tanaka","doi":"10.1109/MMCS.1996.535007","DOIUrl":"https://doi.org/10.1109/MMCS.1996.535007","url":null,"abstract":"In this paper, we extend the notion of style, widely used in modern desktop publishing systems, to hypermedia documents. We also show how such styles can be used to provide different views of the same hyperdocument (e.g. English vs. Japanese).","PeriodicalId":371043,"journal":{"name":"Proceedings of the Third IEEE International Conference on Multimedia Computing and Systems","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1996-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125234511","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1996-06-17DOI: 10.1109/MMCS.1996.534985
Michelle Y. Kim
QA Builder is a toolkit for building interactive knowledge-based systems for domains in which the domain knowledge can be expressed as a sequence of questions and answers. It integrates knowledge acquisition, knowledge-base validation, multimedia authoring, and user interface design and validation in a unifying framework. It is unique in that it treats non-textual information such as video, audio, and graphical images as integral parts of the domain knowledge, and that it provides authors with immediate feedback. This paper describes the QA Builder with an emphasis on the object-oriented visual editors.
{"title":"QA Builder: a visual toolkit for building multimedia knowledge-based systems with immediate feedback","authors":"Michelle Y. Kim","doi":"10.1109/MMCS.1996.534985","DOIUrl":"https://doi.org/10.1109/MMCS.1996.534985","url":null,"abstract":"QA Builder is a toolkit for building interactive knowledge-based systems for domains in which the domain knowledge can be expressed as a sequence of questions and answers. It integrates knowledge acquisition, knowledge-base validation, multimedia authoring, and user interface design and validation in a unifying framework. It is unique in that it treats non-textual information such as video, audio, and graphical images as integral parts of the domain knowledge, and that it provides authors with immediate feedback. This paper describes the QA Builder with an emphasis on the object-oriented visual editors.","PeriodicalId":371043,"journal":{"name":"Proceedings of the Third IEEE International Conference on Multimedia Computing and Systems","volume":"94 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1996-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125240380","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1996-06-17DOI: 10.1109/MMCS.1996.535016
C. Asakawa
After describing the problems that blind users face in accessing conventional printed documents, we give an overview of a document-processing reader for OS/2. The discussion concerns mainly the user interface of the system, since GUIs present serious problems for blind users. Instead of icons and a mouse, the system interface is built around voice menus and a numeric keypad. A structured file is also created to allow users to read a text quickly after the scanning and character recognition is complete.
{"title":"User interface of a document-processing reader for the blind","authors":"C. Asakawa","doi":"10.1109/MMCS.1996.535016","DOIUrl":"https://doi.org/10.1109/MMCS.1996.535016","url":null,"abstract":"After describing the problems that blind users face in accessing conventional printed documents, we give an overview of a document-processing reader for OS/2. The discussion concerns mainly the user interface of the system, since GUIs present serious problems for blind users. Instead of icons and a mouse, the system interface is built around voice menus and a numeric keypad. A structured file is also created to allow users to read a text quickly after the scanning and character recognition is complete.","PeriodicalId":371043,"journal":{"name":"Proceedings of the Third IEEE International Conference on Multimedia Computing and Systems","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1996-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127944222","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1996-06-17DOI: 10.1109/MMCS.1996.535900
Hain-Ching Liu, G. Zick
Presents a novel scene-adaptive MPEG encoding algorithm using parameters encoded in P-pictures. In the MPEG format, a P-picture can have two types of macroblock: intra-coded macroblocks and forward-predicted macroblocks. The proposed algorithm defines a correlation function based on the number of different types of macroblocks and applies this function to detect scene changes in the video sequence. The frame type is then adaptively arranged according to the detection result. Because of motion compensation, the scene change detection is very reliable. Since all the encoded parameters are available during the encoding process, only minimal additional computation is needed. Therefore, this algorithm is faster and more accurate than those approaches using histogram matching.
{"title":"Scene adaptive MPEG encoding algorithm using the P-picture-based analysis","authors":"Hain-Ching Liu, G. Zick","doi":"10.1109/MMCS.1996.535900","DOIUrl":"https://doi.org/10.1109/MMCS.1996.535900","url":null,"abstract":"Presents a novel scene-adaptive MPEG encoding algorithm using parameters encoded in P-pictures. In the MPEG format, a P-picture can have two types of macroblock: intra-coded macroblocks and forward-predicted macroblocks. The proposed algorithm defines a correlation function based on the number of different types of macroblocks and applies this function to detect scene changes in the video sequence. The frame type is then adaptively arranged according to the detection result. Because of motion compensation, the scene change detection is very reliable. Since all the encoded parameters are available during the encoding process, only minimal additional computation is needed. Therefore, this algorithm is faster and more accurate than those approaches using histogram matching.","PeriodicalId":371043,"journal":{"name":"Proceedings of the Third IEEE International Conference on Multimedia Computing and Systems","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1996-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129530380","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1996-06-17DOI: 10.1109/MMCS.1996.534989
Paul N Pazandak, J. Srivastava
The paper presents an overview of DAMSEL and it's implementation. DAMSEL comprises an embeddable dynamic multimedia specification language, and a software architecture suitable for multi user interactive multimedia environments. The goal of DAMSEL is to explore language constructs and execution environments for next generation interactive multimedia applications. The constructs of DAMSEL include primitives for event driven temporal specification-supporting causation and inhibition. Specifications allow behavioral parameters to be chosen, enabling very powerful temporal relations to be defined. Other constructs support the modification, and analysis of multimedia data, and high level abstractions for connections to user interfaces. Further, DAMSEL constructs support conditional and constraint logics, enabling more complex specifications than currently possible. DAMSEL is being implemented using a java implementation of the CORBA standard.
{"title":"Interactive multi-user multimedia environments on the Internet: an overview of DAMSEL and its implementation","authors":"Paul N Pazandak, J. Srivastava","doi":"10.1109/MMCS.1996.534989","DOIUrl":"https://doi.org/10.1109/MMCS.1996.534989","url":null,"abstract":"The paper presents an overview of DAMSEL and it's implementation. DAMSEL comprises an embeddable dynamic multimedia specification language, and a software architecture suitable for multi user interactive multimedia environments. The goal of DAMSEL is to explore language constructs and execution environments for next generation interactive multimedia applications. The constructs of DAMSEL include primitives for event driven temporal specification-supporting causation and inhibition. Specifications allow behavioral parameters to be chosen, enabling very powerful temporal relations to be defined. Other constructs support the modification, and analysis of multimedia data, and high level abstractions for connections to user interfaces. Further, DAMSEL constructs support conditional and constraint logics, enabling more complex specifications than currently possible. DAMSEL is being implemented using a java implementation of the CORBA standard.","PeriodicalId":371043,"journal":{"name":"Proceedings of the Third IEEE International Conference on Multimedia Computing and Systems","volume":"76 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1996-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132076666","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1996-06-17DOI: 10.1109/MMCS.1996.534955
L. Ngoh, Hongyi Li, H. Pung
Multicast has been well tested on non-ATM networks and proven to be a highly effective mode of network service especially in the area of real-time distributed multimedia applications. Although multicast is also supported in ATM, so far the signalling facilities provided have been rudimentary at best. This has prevented multicasting to be exploited effectively in ATM networks. In this paper, a new scheme in providing ATM based multicast service is described. The proposed scheme does not only provide many-to-many ATM virtual circuit-level multicast, it also supports dynamic Quality-of-Service (QoS) guarantees with heterogeneous receivers similar to those being proposed by the Internet Protocol (IP). One of the major emphases of the design is to provide a transparent set of common network services for applications to operate across both the proposed VC-based and IP multicast services on the same ATM networks. This is achieved in part by adopting the same Internet resource reservation protocol for the proposed ATM multicast service. In this paper, the various components of the multicast service such as multicast routing, QoS negotiation and connection management are presented in detail. Initial experience with a prototype implementation is also discussed.
{"title":"A direct ATM multicast service with quality-of-service guarantees","authors":"L. Ngoh, Hongyi Li, H. Pung","doi":"10.1109/MMCS.1996.534955","DOIUrl":"https://doi.org/10.1109/MMCS.1996.534955","url":null,"abstract":"Multicast has been well tested on non-ATM networks and proven to be a highly effective mode of network service especially in the area of real-time distributed multimedia applications. Although multicast is also supported in ATM, so far the signalling facilities provided have been rudimentary at best. This has prevented multicasting to be exploited effectively in ATM networks. In this paper, a new scheme in providing ATM based multicast service is described. The proposed scheme does not only provide many-to-many ATM virtual circuit-level multicast, it also supports dynamic Quality-of-Service (QoS) guarantees with heterogeneous receivers similar to those being proposed by the Internet Protocol (IP). One of the major emphases of the design is to provide a transparent set of common network services for applications to operate across both the proposed VC-based and IP multicast services on the same ATM networks. This is achieved in part by adopting the same Internet resource reservation protocol for the proposed ATM multicast service. In this paper, the various components of the multicast service such as multicast routing, QoS negotiation and connection management are presented in detail. Initial experience with a prototype implementation is also discussed.","PeriodicalId":371043,"journal":{"name":"Proceedings of the Third IEEE International Conference on Multimedia Computing and Systems","volume":"90 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1996-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114167960","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1996-06-17DOI: 10.1109/MMCS.1996.535014
K. Kaneko, A. Makinouchi, M. Aritsugi
INADA is a new database programming language under development at Kyushu University. INADA can create and manipulate C++ objects stored in a database (persistent object). The key idea of INADA is the introduction of distributed shared virtual memory. We propose multimedia applications using distributed shared virtual memory. We introduce stream heap for time-based media, and global heap for non time-based media. To test our idea we implemented two prototype multimedia applications, white board and telephone using INADA.
{"title":"Multimedia applications using a database programming language-INADA","authors":"K. Kaneko, A. Makinouchi, M. Aritsugi","doi":"10.1109/MMCS.1996.535014","DOIUrl":"https://doi.org/10.1109/MMCS.1996.535014","url":null,"abstract":"INADA is a new database programming language under development at Kyushu University. INADA can create and manipulate C++ objects stored in a database (persistent object). The key idea of INADA is the introduction of distributed shared virtual memory. We propose multimedia applications using distributed shared virtual memory. We introduce stream heap for time-based media, and global heap for non time-based media. To test our idea we implemented two prototype multimedia applications, white board and telephone using INADA.","PeriodicalId":371043,"journal":{"name":"Proceedings of the Third IEEE International Conference on Multimedia Computing and Systems","volume":"142 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1996-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123453589","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 1996-06-17DOI: 10.1109/MMCS.1996.534981
D. Zucker, M. Flynn, R. Lee
Data prefetching is a well known technique for improving cache performance. While several studies have examined prefetch strategies for scientific and commercial applications, no published work has studied the special memory requirements of multimedia applications. This paper presents data for three types of hardware prefetching schemes: stream buffers, stride prediction tables, and a hybrid combination of the two, the stream cache. Use of the stride prediction table is shown to eliminate up to 90% of the misses that would otherwise be incurred in a moderate or large sized cache with no prefetching hardware. The stream cache, proposed for the first time in this paper, has the potential to cut execution times by half with the addition of a relatively small amount of additional hardware.
{"title":"A comparison of hardware prefetching techniques for multimedia benchmarks","authors":"D. Zucker, M. Flynn, R. Lee","doi":"10.1109/MMCS.1996.534981","DOIUrl":"https://doi.org/10.1109/MMCS.1996.534981","url":null,"abstract":"Data prefetching is a well known technique for improving cache performance. While several studies have examined prefetch strategies for scientific and commercial applications, no published work has studied the special memory requirements of multimedia applications. This paper presents data for three types of hardware prefetching schemes: stream buffers, stride prediction tables, and a hybrid combination of the two, the stream cache. Use of the stride prediction table is shown to eliminate up to 90% of the misses that would otherwise be incurred in a moderate or large sized cache with no prefetching hardware. The stream cache, proposed for the first time in this paper, has the potential to cut execution times by half with the addition of a relatively small amount of additional hardware.","PeriodicalId":371043,"journal":{"name":"Proceedings of the Third IEEE International Conference on Multimedia Computing and Systems","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1996-06-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122531516","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}