Spherical Harmonics Descriptor for 2D-Image Retrieval
Atul Sajjanhar, Guojun Lu, Dengsheng Zhang
Pub Date: 2005-07-06 | DOI: 10.1109/ICME.2005.1521371
In this paper, spherical harmonics are proposed as shape descriptors for 2D images. We introduce the concept of connectivity; 2D images are decomposed using connectivity, and a 3D model is then constructed. Spherical harmonics are obtained for the 3D models and used as descriptors for the underlying 2D shapes. The difference between two images is computed as the Euclidean distance between their spherical-harmonics descriptors. Experiments are performed to test the effectiveness of spherical harmonics for the retrieval of 2D images. Item S8 of the MPEG-7 still-images content set, a dataset of 3621 still images, is used for the experiments. Experimental results show that the proposed descriptors for 2D images are effective.
{"title":"Spherical Harmonics Descriptor for 2D-Image Retrieval","authors":"Atul Sajjanhar, Guojun Lu, Dengsheng Zhang","doi":"10.1109/ICME.2005.1521371","DOIUrl":"https://doi.org/10.1109/ICME.2005.1521371","url":null,"abstract":"In this paper, spherical harmonics are proposed as shape descriptors for 2D images. We introduce the concept of connectivity; 2D images are decomposed using connectivity, which is followed by 3D model construction. Spherical harmonics are obtained for 3D models and used as descriptors for the underlying 2D shapes. Difference between two images is computed as the Euclidean distance between their spherical harmonics descriptors. Experiments are performed to test the effectiveness of spherical harmonics for retrieval of 2D images. Item S8 within the MPEG-7 still images content set is used for performing experiments; this dataset consists of 3621 still images. Experimental results show that the proposed descriptors for 2D images are effective","PeriodicalId":244360,"journal":{"name":"2005 IEEE International Conference on Multimedia and Expo","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123343917","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Content-Free Image Retrieval Based on Relations Exploited from User Feedbacks
Shingo Uchihashi, T. Kanade
Pub Date: 2005-07-06 | DOI: 10.1109/ICME.2005.1521682
We propose a new "content-free" image retrieval method that attempts to exploit common tendencies in people's interpretation of images, extracted from user feedback. The system simply accumulates records of user feedback and recycles them in the form of collaborative filtering. We discuss various issues in image retrieval, argue for the content-free idea, and present experimental results. The results indicate that the performance of content-free image retrieval improves with the number of accumulated feedback records, outperforming a basic but typical conventional content-based image retrieval system.
{"title":"Content-Free Image Retrieval Based on Relations Exploited from User Feedbacks","authors":"Shingo Uchihashi, T. Kanade","doi":"10.1109/ICME.2005.1521682","DOIUrl":"https://doi.org/10.1109/ICME.2005.1521682","url":null,"abstract":"We propose a new \"content-free\" image retrieval method which attempts to exploit certain common tendencies that exist among people's interpretation of images from user feedbacks. The system simply accumulates records of user feedback and recycles them in the form of collaborative filtering. We discuss various issues of image retrieval, argue for the idea of content-free, and present results of experiment. The results indicate that the performance of content-free image retrieval improves with the number of accumulated feedbacks, outperforming a basic but typical conventional content-based image retrieval system","PeriodicalId":244360,"journal":{"name":"2005 IEEE International Conference on Multimedia and Expo","volume":"51 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116566214","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Enhance speaker segmentation by elaborating utterance detection
Min Yang, Zhaohui Wu, Yingchun Yang
Pub Date: 2005-07-06 | DOI: 10.1109/ICME.2005.1521419
In this paper, we introduce an elaborate utterance detection algorithm to enhance speaker segmentation. A silence detector, a further divider, and an audio-type classifier are employed in this utterance detection stage, making the algorithm adaptive to both silent and noisy environments. Open-set verification tests were carried out on the Hub4-NE broadcast database. The experimental results show that this enhanced segmentation method can provide better information for speaker models.
{"title":"Enhance speaker segmentation by elaborating utterance detection","authors":"Min Yang, Zhaohui Wu, Yingchun Yang","doi":"10.1109/ICME.2005.1521419","DOIUrl":"https://doi.org/10.1109/ICME.2005.1521419","url":null,"abstract":"In this paper, we introduce an elaborate utterance detection algorithm to enhance speaker segmentation. Silence detector, further divider and audio type classifier are employed in this elaborate utterance detection, to make this algorithm adaptive for both silent and noisy environments. Open-set verification testing has taken on the Hub4-NE broadcasts database. The experiment results show that this enhanced segmentation method can provide better information for speaker models.","PeriodicalId":244360,"journal":{"name":"2005 IEEE International Conference on Multimedia and Expo","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126361708","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Data prefetching for smooth navigation of large scale JPEG 2000 images
A. Descampe, J. Ou, P. Chevalier, B. Macq
Pub Date: 2005-07-06 | DOI: 10.1109/ICME.2005.1521571
Remote access to large-scale images is attracting growing interest in fields such as medical imaging and remote sensing. This raises the need for algorithms that guarantee smooth navigation while minimizing the network resources used. In this paper, we present a model that takes advantage of JPEG 2000 scalability combined with a prefetching policy. The model uses the last user action to manage the cache efficiently and to prefetch the data most likely to be used next. Three different network configurations are considered. In each case, comparison with two more classical policies shows the improvement brought by our approach.
{"title":"Data prefetching for smooth navigation of large scale JPEG 2000 images","authors":"A. Descampe, J. Ou, P. Chevalier, B. Macq","doi":"10.1109/ICME.2005.1521571","DOIUrl":"https://doi.org/10.1109/ICME.2005.1521571","url":null,"abstract":"Remote access to large scale images arouses a growing interest in fields such as medical imagery or remote sensing. This raises the need for algorithms guaranteeing navigation smoothness while minimizing the network resources used. In this paper, we present a model taking advantage of the JPEG 2000 scalability combined with a prefetching policy. The model uses the last user action to efficiently manage the cache and to prefetch the most probable data to be used next. Three different network configurations are considered. In each case, comparison with two more classic policies shows the improvement brought by our approach.","PeriodicalId":244360,"journal":{"name":"2005 IEEE International Conference on Multimedia and Expo","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127974115","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Hash-based Counter Scheme for Digital Rights Management
Mikko Löytynoja, T. Seppänen
Pub Date: 2005-07-06 | DOI: 10.1109/ICME.2005.1521375
This paper describes a counter scheme that uses hash functions to count how many times the user is allowed to play protected content in a DRM-enabled player. The proposed basic scheme can be used in scenarios where the user cannot be assumed to have an online connection. We discuss the weaknesses of the basic scheme and present an alternative that increases the security of the counter.
{"title":"Hash-based Counter Scheme for Digital Rights Management","authors":"Mikko Löytynoja, T. Seppänen","doi":"10.1109/ICME.2005.1521375","DOIUrl":"https://doi.org/10.1109/ICME.2005.1521375","url":null,"abstract":"This paper describes a counter scheme that uses hash functions to count how many times the user is allowed to play protected content in a DRM-enabled player. The proposed basic scheme can be used in scenarios where the user cannot be assumed to have online connection. We discuss the weaknesses of the proposed scheme and present alternative to the basic scheme, which increases the security of the counter","PeriodicalId":244360,"journal":{"name":"2005 IEEE International Conference on Multimedia and Expo","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121464621","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Discriminative techniques for keyframe selection
Matthew L. Cooper, J. Foote
Pub Date: 2005-07-06 | DOI: 10.1109/ICME.2005.1521470
A convenient representation of a video segment is a single "keyframe". Keyframes are widely used in applications such as non-linear browsing and video editing. With existing methods of keyframe selection, similar video segments result in very similar keyframes, with the drawback that actual differences between the segments may be obscured. We present methods for keyframe selection based on two criteria: capturing the similarity to the represented segment, and preserving the differences from other segments' keyframes, so that different segments have visually distinct representations. We present two discriminative keyframe selection methods and an example of experimental results.
{"title":"Discriminative techniques for keyframe selection","authors":"Matthew L. Cooper, J. Foote","doi":"10.1109/ICME.2005.1521470","DOIUrl":"https://doi.org/10.1109/ICME.2005.1521470","url":null,"abstract":"A convenient representation of a video segment is a single \"keyframe\". Keyframes are widely used in applications such as non-linear browsing and video editing. With existing methods of keyframe selection, similar video segments result in very similar keyframes, with the drawback that actual differences between the segments may be obscured. We present methods for keyframe selection based on two criteria: capturing the similarity to the represented segment, and preserving the differences from other segment keyframes, so that different segments will have visually distinct representations. We present two discriminative keyframe selection methods, and an example of experimental results.","PeriodicalId":244360,"journal":{"name":"2005 IEEE International Conference on Multimedia and Expo","volume":"84 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121780654","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Maximizing the profit for cache replacement in a transcoding proxy
Hao-Ping Hung, Ming-Syan Chen
Pub Date: 2005-07-06 | DOI: 10.1109/ICME.2005.1521633
Recent technological advances in multimedia communication have ushered in a new era of personal communication. Users can ubiquitously access the Internet via various mobile devices. For mobile devices with lower-bandwidth network connectivity, transcoding can be used to reduce the object size by lowering the quality of a multimedia object. In this paper, we focus on the cache replacement policy in a transcoding proxy, a proxy server responsible for transcoding objects and reducing network traffic. Based on the architecture of prior work, we propose a maximum-profit replacement algorithm, abbreviated as MPR. MPR performs cache replacement according to the contents of the caching candidate set, which is generated using dynamic programming. Experimental results show that the proposed MPR outperforms the prior scheme in terms of cache hit ratio.
{"title":"Maximizing the profit for cache replacement in a transcoding proxy","authors":"Hao-Ping Hung, Ming-Syan Chen","doi":"10.1109/ICME.2005.1521633","DOIUrl":"https://doi.org/10.1109/ICME.2005.1521633","url":null,"abstract":"Recent technology advances in multimedia communication have ushered in a new era of personal communication. Users can ubiquitously access the Internet via various mobile devices. For the mobile devices featured with lower-bandwidth network connectivity, transcoding can be used to reduce the object size by lowering the quality of a multimedia object. In this paper, we focus on the cache replacement policy in a transcoding proxy, which is a proxy server responsible for transcoding the object and reducing the network traffic. Based on the architecture in prior works, we propose a maximum profit replacement algorithm, abbreviated as MPR. MPR performs cache replacement according to the content in the caching candidate set, which is generated by the concept of dynamic programming. Experimental results show that the the proposed MPR outperforms the prior scheme in terms of the cache hit ratio.","PeriodicalId":244360,"journal":{"name":"2005 IEEE International Conference on Multimedia and Expo","volume":"30 2","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"113937785","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Image Authentication Under Geometric Attacks Via Structure Matching
V. Monga, Divyanshu Vats, B. Evans
Pub Date: 2005-07-06 | DOI: 10.1109/ICME.2005.1521402
Surviving geometric attacks in image authentication is considered to be of great importance, because classical watermarking and digital-signature-based schemes are vulnerable to geometric image manipulations, particularly local geometric attacks. In this paper, we present a general framework for image content authentication using salient feature points. We first develop an iterative feature detector based on an explicit model of the human visual system. Then, we compare features from two images using a generalized Hausdorff distance measure. The use of such a distance measure is crucial to the robustness of the scheme: it accounts for feature-detector failure and occlusion, which previously proposed methods do not address. The proposed algorithm withstands standard benchmark (e.g., Stirmark) attacks, including compression, common signal processing operations, global as well as local geometric transformations, and even hard-to-model distortions such as print-and-scan. Content-changing (malicious) manipulations of image data are also accurately detected.
{"title":"Image Authentication Under Geometric Attacks Via Structure Matching","authors":"V. Monga, Divyanshu Vats, B. Evans","doi":"10.1109/ICME.2005.1521402","DOIUrl":"https://doi.org/10.1109/ICME.2005.1521402","url":null,"abstract":"Surviving geometric attacks in image authentication is considered to be of great importance. This is because of the vulnerability of classical watermarking and digital signature based schemes to geometric image manipulations, particularly local geometric attacks. In this paper, we present a general framework for image content authentication using salient feature points. We first develop an iterative feature detector based on an explicit modeling of the human visual system. Then, we compare features from two images by developing a generalized Hausdorff distance measure. The use of such a distance measure is crucial to the robustness of the scheme, and accounts for feature detector failure or occlusion, which previously proposed methods do not address. The proposed algorithm withstands standard benchmark (e.g. Stirmark) attacks including compression, common signal processing operations, global as well as local geometric transformations, and even hard to model distortions such as print and scan. Content changing (malicious) manipulations of image data are also accurately detected","PeriodicalId":244360,"journal":{"name":"2005 IEEE International Conference on Multimedia and Expo","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130274127","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Hidden Markov Model Based Weighted Likelihood Discriminant for Minimum Error Shape Classification
N. Thakoor, Sungyong Jung, Jean X. Gao
Pub Date: 2005-07-06 | DOI: 10.1109/ICME.2005.1521430
The goal of this communication is to present a weighted likelihood discriminant for minimum-error shape classification. Unlike traditional maximum-likelihood (ML) methods, in which classification is carried out based on probabilities from independent individual class models, as is the case for general hidden Markov model (HMM) methods, our proposed method utilizes information from all classes to minimize classification error. The proposed approach uses a hidden Markov model as a curvature-feature-based 2D shape descriptor. In this contribution, we present a generalized probabilistic descent (GPD) method for weighting the curvature likelihoods to obtain a discriminant function with minimum classification error. In contrast with other approaches, a weighted likelihood discriminant function is introduced. We believe this theoretically grounded combination of hidden Markov models and generalized probabilistic descent reduces classification error. We show comparative results, in terms of classification accuracy, obtained with our approach and with classical maximum-likelihood classification on fighter-plane shapes.
{"title":"Hidden Markov Model Based Weighted Likelihood Discriminant for Minimum Error Shape Classification","authors":"N. Thakoor, Sungyong Jung, Jean X. Gao","doi":"10.1109/ICME.2005.1521430","DOIUrl":"https://doi.org/10.1109/ICME.2005.1521430","url":null,"abstract":"The goal of this communication is to present a weighted likelihood discriminant for minimum error shape classification. Different from traditional maximum likelihood (ML) methods in which classification is carried out based on probabilities from independent individual class models as is the case for general hidden Markov model (HMM) methods, our proposed method utilizes information from all classes to minimize classification error. Proposed approach uses a hidden Markov model as a curvature feature based 2D shape descriptor. In this contribution we present a generalized probabilistic descent (GPD) method to weight the curvature likelihoods to achieve a discriminant function with minimum classification error. In contrast with other approaches, a weighted likelihood discriminant function is introduced. We believe that our sound theory based implementation reduces classification error by combining hidden Markov model with generalized probabilistic descent theory. We show comparative results obtained with our approach and classic maximum-likelihood calculation for fighter planes in terms of classification accuracies","PeriodicalId":244360,"journal":{"name":"2005 IEEE International Conference on Multimedia and Expo","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131054580","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Fast camera calibration for the analysis of sport sequences
D. Farin, J. Han, P. D. With
Pub Date: 2005-07-06 | DOI: 10.1109/ICME.2005.1521465
Semantic analysis of sport sequences requires camera calibration to obtain player and ball positions in real-world coordinates. For court sports like tennis, the marker lines on the field can be used to determine the calibration parameters. We propose a real-time calibration algorithm that can be applied to all court sports simply by exchanging the court model. The algorithm is based on (1) a specialized court-line detector, (2) RANSAC-based line parameter estimation, (3) a combinatorial optimization step that localizes the court within the set of detected line segments, and (4) an iterative court-model tracking step. Our results show real-time calibration of, e.g., tennis and soccer sequences, with a computation time of only about 6 ms per frame.
{"title":"Fast camera calibration for the analysis of sport sequences","authors":"D. Farin, J. Han, P. D. With","doi":"10.1109/ICME.2005.1521465","DOIUrl":"https://doi.org/10.1109/ICME.2005.1521465","url":null,"abstract":"Semantic analysis of sport sequences requires camera calibration to obtain player and ball positions in real-world coordinates. For court sports like tennis, the marker lines on the field can be used to determine the calibration parameters. We propose a real-time calibration algorithm that can be applied to all court sports simply by exchanging the court model. The algorithm is based on (1) a specialized court-line detector, (2) a RANSAC-based line parameter estimation, (3) a combinatorial optimization step to localize the court within the set of detected line segments, and (4) an iterative court-model tracking step. Our results show real-time calibration of, e.g., tennis and soccer sequences with a computation time of only about 6 ms per frame.","PeriodicalId":244360,"journal":{"name":"2005 IEEE International Conference on Multimedia and Expo","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130744082","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}