2005 IEEE International Conference on Multimedia and Expo: Latest Papers

Separation of Voice and Music by Harmonic Structure Stability Analysis
Pub Date : 2005-07-06 DOI: 10.1109/ICME.2005.1521485
Yungang Zhang, Changshui Zhang
Separating voice from music is an interesting but difficult problem, and a solution is useful for other research areas such as audio content analysis. This paper studies the difference between voice and music signals and proposes that harmonic structure stability is the key difference between them. A separation algorithm based on this idea is proposed: it learns the average harmonic structure of the music and then uses it to distinguish voice and music harmonic structures and separate the signals. Experimental results show that the algorithm can separate mixed signals, obtaining both a very high signal-to-noise ratio (SNR) and rather good subjective audio quality.
Citations: 14
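The abstract above only names the idea of an "average harmonic structure" and its stability; it does not give the definitions. The following is a minimal sketch under stated assumptions, not the authors' algorithm: a harmonic structure is taken to be the per-frame harmonic amplitudes normalized to sum to one, and instability is taken to be the mean squared deviation from the average structure. The function names `normalize`, `average_structure`, and `instability` are hypothetical.

```python
def normalize(amplitudes):
    """Scale one frame's harmonic amplitudes so they sum to 1 (assumed definition)."""
    total = sum(amplitudes)
    if total == 0:
        raise ValueError("frame has no harmonic energy")
    return [a / total for a in amplitudes]


def average_structure(frames):
    """Average the normalized harmonic structures over all frames."""
    shapes = [normalize(frame) for frame in frames]
    return [sum(column) / len(shapes) for column in zip(*shapes)]


def instability(frames):
    """Mean squared deviation of each frame's structure from the average.

    Values near zero mean the structure is stable across frames; this measure
    is an illustrative assumption, not taken from the paper.
    """
    shapes = [normalize(frame) for frame in frames]
    mean = average_structure(frames)
    total = sum(
        sum((s - m) ** 2 for s, m in zip(shape, mean)) for shape in shapes
    )
    return total / len(shapes)


# Same relative harmonic amplitudes at different loudness: stable structure.
stable_frames = [[4, 2, 1], [8, 4, 2], [2, 1, 0.5]]
# Relative amplitudes change from frame to frame: unstable structure.
varying_frames = [[4, 2, 1], [1, 4, 2], [2, 1, 4]]
print(instability(stable_frames) < instability(varying_frames))
```

Under these assumptions, a source whose harmonics keep the same relative amplitudes over time scores close to zero regardless of loudness, while a source whose harmonic balance shifts scores higher.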
A Probabilistic Description of Man-Machine Spoken Communication
Pub Date : 2005-07-06 DOI: 10.1109/ICME.2005.1521447
O. Pietquin
Speech-enabled interfaces and spoken dialog systems are mostly based on statistical speech and language processing modules. Their behavior is therefore not deterministic and is hard to predict. This makes it difficult to simulate and optimize the performance of such systems, and to reuse previous work when building new ones. With the aim of partially automating the optimization of such systems, this paper attempts to formalize the description of man-machine spoken communication in the framework of spoken dialog systems. The formalization is based partly on a probabilistic description of the information processing that occurs in each module of a spoken dialog system and partly on stochastic user modeling. Finally, some possible applications of this theoretical framework are proposed.
Citations: 23
Neighborhood issue in single-frame image super-resolution
Pub Date : 2005-07-06 DOI: 10.1109/ICME.2005.1521623
Xu Su, Q. Tian, Q. Xue, N. Sebe, Jingsheng Ma
Super-resolution is the problem of generating one or a set of high-resolution images from one or a sequence of low-resolution frames. Most proposed methods rely on multiple low-resolution images of the same scene; this is called multiple-frame super-resolution. Only a few approaches produce a high-resolution image from a single low-resolution image, with the help of one or a set of training images from scenes of the same or different types. This is referred to as single-frame super-resolution. This article reviews a variety of single-frame super-resolution methods proposed in recent years. It introduces a manifold learning method, locally linear embedding (LLE), and its relation to single-frame super-resolution. A detailed study of a critical issue, the "neighborhood issue", is presented with related experimental results and analysis, and possible future research is outlined.
Citations: 81
Non-linear image enhancement for digital TV applications using Gabor filters
Pub Date : 2005-07-06 DOI: 10.1109/ICME.2005.1521597
Yue Yang, Baoxin Li
We propose a non-linear image enhancement method based on Gabor filters, which allows selective enhancement based on the contrast sensitivity function of the human visual system. We also propose an evaluation method for measuring the performance of the algorithm and for comparing it with existing approaches. The selective enhancement of the proposed approach is especially suitable for digital television applications, where it can improve the perceived visual quality of images whose source lacks sufficient high-frequency content for various reasons, including the interpolation used to convert standard-definition sources into high-definition images.
Citations: 11
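For readers unfamiliar with the building block named in the abstract above, here is a minimal sketch of the real part of a standard 2-D Gabor filter: a Gaussian envelope multiplied by a cosine carrier along a chosen orientation. It illustrates only the textbook filter; the paper's non-linear enhancement step and its use of the contrast sensitivity function are not described in the abstract and are not reproduced here. The function name `gabor_kernel` and its parameter names are assumptions chosen for illustration.

```python
import math


def gabor_kernel(size, wavelength, theta, sigma, gamma=1.0, psi=0.0):
    """Sample the real part of a 2-D Gabor filter on a size x size grid.

    size       -- odd side length of the kernel in pixels
    wavelength -- wavelength of the cosine carrier in pixels
    theta      -- orientation of the carrier in radians
    sigma      -- standard deviation of the Gaussian envelope
    gamma      -- spatial aspect ratio of the envelope
    psi        -- phase offset of the carrier in radians
    """
    if size % 2 == 0:
        raise ValueError("size must be odd so the kernel has a center pixel")
    half = size // 2
    kernel = []
    for y in range(-half, half + 1):
        row = []
        for x in range(-half, half + 1):
            # Rotate the coordinates so the carrier runs along theta.
            xr = x * math.cos(theta) + y * math.sin(theta)
            yr = -x * math.sin(theta) + y * math.cos(theta)
            envelope = math.exp(-(xr ** 2 + (gamma * yr) ** 2) / (2 * sigma ** 2))
            carrier = math.cos(2 * math.pi * xr / wavelength + psi)
            row.append(envelope * carrier)
        kernel.append(row)
    return kernel


kernel = gabor_kernel(size=5, wavelength=4.0, theta=0.0, sigma=2.0)
print(len(kernel), len(kernel[0]))
```

Varying `theta` and `wavelength` gives a bank of filters that respond to different orientations and spatial frequencies, which is what makes frequency-selective processing possible.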
Fuzzy image segmentation using shape information
Pub Date : 2005-07-06 DOI: 10.1109/ICME.2005.1521529
Mohammed Ameer Ali, G. Karmakar, L. Dooley
The results of any clustering algorithm are highly sensitive to the features used, which limits generalization and provides a strong motivation to integrate shape information into the algorithm. Existing fuzzy shape-based clustering algorithms consider only circular and elliptical shape information and consequently do not segment arbitrarily shaped objects well. To address this issue, this paper introduces a new shape-based algorithm, called fuzzy image segmentation using shape information (FISS), which incorporates general shape information. Both qualitative and quantitative analyses demonstrate the superiority of the new FISS algorithm over other well-established shape-based fuzzy clustering algorithms, including Gustafson-Kessel, ring-shaped, circular shell, c-ellipsoidal shells and elliptic ring-shaped clusters.
Citations: 4
Comparative evaluation of Web image search engines for multimedia applications
Pub Date : 2005-07-06 DOI: 10.1109/ICME.2005.1521641
Keon Stevenson, C. Leung
While text-oriented document searching is relatively mature on the Internet, image searching, which requires much more than text matching, lags significantly behind. Image search engines significantly enlarge the range of images accessible to users. This paper provides an understanding of current technologies in image searching on the Internet, and points to future areas of improvement for multimedia applications. We develop a systematic set of image queries to assess the competence and performance of the major image search engines. We find that current technology is only able to deliver an average precision of around 42% and an average recall of around 12%, while the best performers are capable of producing over 70% for precision and around 27% for recall. The reasons for such differences, and mechanisms for search improvement, are also indicated.
Citations: 31
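The precision and recall figures quoted in the abstract above follow the usual retrieval definitions: precision is the share of returned images that are relevant, and recall is the share of relevant images that were returned. The sketch below shows those definitions and a simple per-query average. How the paper judged relevance and averaged across queries is not stated in the abstract, so the averaging scheme and the names `precision_recall` and `average_precision_recall` are assumptions.

```python
def precision_recall(retrieved, relevant):
    """Return (precision, recall) for a single query.

    retrieved -- identifiers of the images the engine returned
    relevant  -- identifiers of all images judged relevant to the query
    """
    retrieved_set = set(retrieved)
    relevant_set = set(relevant)
    hits = len(retrieved_set & relevant_set)
    precision = hits / len(retrieved_set) if retrieved_set else 0.0
    recall = hits / len(relevant_set) if relevant_set else 0.0
    return precision, recall


def average_precision_recall(queries):
    """Average precision and recall over (retrieved, relevant) pairs.

    Each query counts equally; this is an assumed averaging scheme.
    """
    scores = [precision_recall(retrieved, relevant) for retrieved, relevant in queries]
    count = len(scores)
    mean_precision = sum(p for p, _ in scores) / count
    mean_recall = sum(r for _, r in scores) / count
    return mean_precision, mean_recall


# Hypothetical query: 4 images returned, 2 of them among the 5 relevant ones.
p, r = precision_recall(["a", "b", "c", "d"], ["a", "b", "x", "y", "z"])
print(p, r)
```

In the hypothetical query, two of the four returned images are relevant and two of the five relevant images were found, which is the same kind of gap between precision and recall that the abstract reports.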
Analysis of expressing audiences in a cyber-theater
Pub Date : 2005-07-06 DOI: 10.1109/ICME.2005.1521526
Dong-Wan Kang, K. Huang, J. Ohya
This paper studies how audiences should be expressed in a cyber-theater, in which remotely located persons can direct plays as directors, perform as performers and/or watch the performances as audiences through a networked virtual environment. The audience effect has been widely acknowledged in real-world theater: the audience's reaction has a significant effect on the players' acting and on the performance of the play itself. However, few works address audiences in the cyber-theater. This paper studies whether the audience effect also exists in the cyber-theater. By constructing a system in which two actors are shown the avatar of a remotely located audience member, through which that audience member can display emotional actions, we show that interaction between the actors and the audience is effective.
Citations: 0
Content-based block watermarking against cumulative and temporal attack
Pub Date : 2005-07-06 DOI: 10.1109/ICME.2005.1521364
Ju Wang, Jonathan C. L. Liu
This paper presents a block-selection-based video watermarking scheme that is designed to be resilient against two dangerous attacks: cumulative attack and temporal attack. We use content-based block selection to counteract cumulative attack by spreading the locations of marked blocks. The block selection algorithm also leads to a novel frame synchronization method that can effectively re-synchronize suspected video frames to their original positions. Our scheme has low computation overhead and robust detection performance for moderately compressed video.
Citations: 1
A Reversible Watermarking Scheme for JPEG-2000 Compressed Images
Pub Date : 2005-07-06 DOI: 10.1109/ICME.2005.1521362
S. Emmanuel, C. K. Heng, A. Das
In this paper, we present a novel reversible watermarking scheme for authenticating JPEG/JPEG-2000 coded images. Since the watermarking scheme is reversible, the exact original image can be recovered from the watermarked image. The watermarking scheme makes use of finite state machine principles. The proposed scheme is asymmetric, as the watermark extraction key is different from the embedding key. The algorithm is implemented and tested for visual quality, compression overhead, execution time overhead and payload capacity. It is found that the algorithm has high visual quality, high payload capacity, low compression overhead and low execution time overhead.
Citations: 8
Automatic Segmentation of Home Videos
Pub Date : 2005-07-06 DOI: 10.1109/ICME.2005.1521347
Y. Zhai, M. Shah
Temporal video segmentation is one of the fundamental tasks in video processing, understanding and management. In this paper, we present an automatic method for segmenting home videos into temporal logical units. We have developed a statistical framework using the Markov chain Monte Carlo (MCMC) technique. The temporal scene boundaries are detected by maximizing the posterior probability of the model parameters, which comprise the number of scenes and their boundary locations. The proposed method has been demonstrated on several home videos, and high accuracy has been obtained.
Citations: 8