Seventh IEEE International Symposium on Multimedia (ISM'05)最新文献

英文中文

An efficient key management scheme for pervasive computing 一种有效的普适计算密钥管理方案

Seventh IEEE International Symposium on Multimedia (ISM'05)

Pub Date : 2005-12-12 DOI: 10.1109/ISM.2005.30

H. Ge

In this paper we propose a variant of RSA public key scheme, called "hidden exponent RSA". Based on this new scheme, we devised an efficient key distribution/management scheme for secure communication among devices in the context of pervasive computing, with emphasis on the simplicity and efficiency of the protocol. We show the new scheme is secure under the strong RSA assumption.

本文提出了一种RSA公钥方案的变体，称为“隐指数RSA”。在此基础上，我们设计了一个有效的密钥分发/管理方案，用于普适计算环境下设备间的安全通信，强调协议的简单性和高效性。在强RSA假设下证明了新方案的安全性。

引用次数: 1

Global warp metric distance: boosting content-based image retrieval through histograms 全局翘曲度量距离:通过直方图增强基于内容的图像检索

Seventh IEEE International Symposium on Multimedia (ISM'05)

Pub Date : 2005-12-12 DOI: 10.1109/ISM.2005.64

J. C. Felipe, A. Traina, C. Traina

This work presents a new distance function - the global warp metric distance - to compare histograms used as a feature to index image databases in content based image retrieval environments. The metric histogram represents a compact, but efficient alternative to the use of traditional gray level histograms to represent images. The global warp metric distance (GWMD) enhances the comparison between histograms, replacing the rigid bin to bin evaluation by the warp method, which allows a local "adjustment" of one histogram to the other during the distance calculation, introducing a global matching of the curves. Besides this, GWMD applies a set of geometric global features of histograms to determine the final distance. Results on similarity retrieval in medical images demonstrate the superiority of the proposed approach in analyzing image sets that present brightness and contrast disparities: it reduces the amount of both false positive and false negative retrievals. Moreover, these results comply with similarity evaluations performed by domain specialists.

这项工作提出了一个新的距离函数-全局扭曲度量距离-来比较直方图作为一个特征来索引基于内容的图像检索环境中的图像数据库。度量直方图代表了一种紧凑，但有效的替代使用传统的灰度直方图来表示图像。全局warp度量距离(GWMD)增强了直方图之间的比较，用warp方法取代了刚性的bin到bin评估，允许在距离计算过程中局部“调整”一个直方图到另一个直方图，引入曲线的全局匹配。此外，GWMD应用直方图的一组几何全局特征来确定最终距离。医学图像相似性检索的结果表明，该方法在分析存在亮度和对比度差异的图像集方面具有优越性:它减少了假阳性和假阴性检索的数量。此外，这些结果符合领域专家进行的相似性评估。

引用次数: 14

PTT + IMS = PTM - towards community/presence-based IMS multimedia services PTT + IMS = PTM -面向基于社区/在场的IMS多媒体业务

Seventh IEEE International Symposium on Multimedia (ISM'05)

Pub Date : 2005-12-12 DOI: 10.1109/ISM.2005.93

N. Blum, T. Magedanz

The specification of the IP multimedia subsystem as a service delivery architecture for next generation networks and the introduction of push to talk (PTT) as an IMS based service moves VoIP applications for mobile devices already to the market. PTT has gained a strong following in the US market and is on the verge of spreading globally. The open mobile alliance (OMA) currently specifies PTT as an IMS based service to assure interoperability between different operator domains. Most PTT solution vendors think already about extending PTT with other media types then voice, like video communication, file transfer or service subscription for content push services. Thus, push to multimedia (PTM) does not seem to be that far away from market and is well suited as an enabler to provide IMS applications with advanced multimedia communication functionalities. The department for Next Generation Network Integration (NGNI) at Fraunhofer Institute FOKUS has created a PTM application that utilises the IMS architecture. This paper reports about a concept of integrating this PTT/PTM functionality in community based applications to enable already existing groups and communities with new communication features.

IP多媒体子系统作为下一代网络的服务交付体系结构的规范，以及推送通话(PTT)作为基于IMS的服务的引入，已经将用于移动设备的VoIP应用推向了市场。PTT在美国市场已经获得了强大的追随者，并即将向全球扩张。开放移动联盟(OMA)目前将PTT指定为基于IMS的服务，以确保不同运营商域之间的互操作性。大多数PTT解决方案供应商已经在考虑将PTT扩展到语音之外的其他媒体类型，如视频通信、文件传输或内容推送服务的服务订阅。因此，推送到多媒体(PTM)似乎离市场并不遥远，而且非常适合作为一种使能器，为IMS应用程序提供高级多媒体通信功能。弗劳恩霍夫研究所(Fraunhofer Institute focus)的下一代网络集成(NGNI)部门创建了一个利用IMS架构的PTM应用程序。本文报告了在基于社区的应用程序中集成这种PTT/PTM功能的概念，以使已经存在的组和社区具有新的通信特性。

{"title":"PTT + IMS = PTM - towards community/presence-based IMS multimedia services","authors":"N. Blum, T. Magedanz","doi":"10.1109/ISM.2005.93","DOIUrl":"https://doi.org/10.1109/ISM.2005.93","url":null,"abstract":"The specification of the IP multimedia subsystem as a service delivery architecture for next generation networks and the introduction of push to talk (PTT) as an IMS based service moves VoIP applications for mobile devices already to the market. PTT has gained a strong following in the US market and is on the verge of spreading globally. The open mobile alliance (OMA) currently specifies PTT as an IMS based service to assure interoperability between different operator domains. Most PTT solution vendors think already about extending PTT with other media types then voice, like video communication, file transfer or service subscription for content push services. Thus, push to multimedia (PTM) does not seem to be that far away from market and is well suited as an enabler to provide IMS applications with advanced multimedia communication functionalities. The department for Next Generation Network Integration (NGNI) at Fraunhofer Institute FOKUS has created a PTM application that utilises the IMS architecture. This paper reports about a concept of integrating this PTT/PTM functionality in community based applications to enable already existing groups and communities with new communication features.","PeriodicalId":322363,"journal":{"name":"Seventh IEEE International Symposium on Multimedia (ISM'05)","volume":"101 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123190459","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 14

Inter-frame similarity based video transcoding 基于帧间相似性的视频转码

Seventh IEEE International Symposium on Multimedia (ISM'05)

Pub Date : 2005-12-12 DOI: 10.1109/ISM.2005.71

R. Balsree, A. Thawani, S. Gopalan, V. Sridhar

Multimedia services over wireless networks were made popular by the arrival of smart handhelds. These devices bring heterogeneity to the wireless networks and to the content creation as the content cannot be delivered in its original format due to the difference in the capabilities of these handhelds. An intermediate stage of processing like transcoding is carried out before delivering multimedia content. Under certain scenarios, it is required to convert the content attributes such as bit rate, frame rate, etc. while still retaining the content format to cater to devices with varying capabilities. We propose an algorithm that prioritizes frames taking into account inter frame similarity to perform frame dropping. The priority value based frame dropping in turn aids in delivering better quality video. Our frame priority assignment algorithm is based on uniform distribution of dropped frames to minimize jitter and maximizing the distance between two consecutive dropped frames.

随着智能手持设备的出现，无线网络上的多媒体服务变得流行起来。这些设备给无线网络和内容创建带来了异构性，因为由于这些手持设备功能的差异，内容不能以其原始格式交付。在传送多媒体内容之前，要进行像转码这样的中间处理阶段。在某些情况下，需要转换内容属性，如比特率、帧率等，同时仍然保留内容格式，以满足具有不同功能的设备。我们提出了一种考虑帧间相似性的帧优先级算法来执行帧丢弃。基于优先级值的帧丢弃反过来有助于提供更高质量的视频。我们的帧优先级分配算法基于丢帧的均匀分布，以最小化抖动和最大化两个连续丢帧之间的距离。

引用次数: 4

Face-to-face media sharing using wireless mobile devices 使用无线移动设备进行面对面的媒体共享

Seventh IEEE International Symposium on Multimedia (ISM'05)

Pub Date : 2005-12-12 DOI: 10.1109/ISM.2005.57

T. Pering, David H. Nguyen, J. Light, R. Want

Advanced personal wireless mobile devices, such as today's emerging smart phones, are capable computers that have the potential to enable individuals to share personal content, such as photographs, music, and video. Face to face sharing can be a satisfying and even emotional experience, yet it is not well supported by existing digital technologies, which typically isolate media into separate collections or require that they be manually combined into a single collection on a single machine. Federating wireless mobile devices with fixed infrastructure, such as a digital home entertainment center, provides a lightweight, unified, and intuitive way to share media among friends and family. This paper looks at both technological and social issues that surround sharing media using federated devices, considering the relevant emerging technologies, media types, and usage contexts.

先进的个人无线移动设备，如今天新兴的智能手机，是有能力的计算机，有可能使个人分享个人内容，如照片、音乐和视频。面对面的分享可以是一种令人满意的，甚至是情感上的体验，但是现有的数字技术并不能很好地支持它，这些技术通常将媒体隔离到单独的集合中，或者要求它们在一台机器上手动组合成单个集合。将无线移动设备与固定的基础设施(如数字家庭娱乐中心)结合起来，可以提供一种轻量级、统一且直观的方式在朋友和家人之间共享媒体。本文着眼于围绕使用联合设备共享媒体的技术和社会问题，并考虑了相关的新兴技术、媒体类型和使用环境。

引用次数: 21

Audio scene analysis as a control system for hearing aids 音频场景分析作为助听器的控制系统

Seventh IEEE International Symposium on Multimedia (ISM'05)

Pub Date : 2005-12-12 DOI: 10.1109/ISM.2005.36

M. Roch, T. Huang, Jing Liu, R. Hurtig

It is well known that simple amplification cannot help many hearing-impaired listeners. As a consequence of this, numerous signal enhancement algorithms have been proposed for digital hearing aids. Many of these algorithms are only effective in certain environments. The ability to quickly and correctly detect elements of the auditory scene can permit the selection/parameterization of enhancement algorithms from a library of available routines. In this work, the authors examine the real time parameterization of a frequency-domain compression algorithm which preserves formant ratios and thus enhances speech understanding for some individuals with severe sensorineural hearing loss in the 2-3 kHz range. The optimal compression ratio is dependent upon qualities of the acoustical signal. We briefly review the frequency-compression technology and describe a Gaussian mixture model classifier which can dynamically set the frequency compression ratio according to broad acoustic categories which we call cohorts. We discuss the results of a prototype simulator which has been implemented on a general purpose computer.

众所周知，简单的扩音并不能帮助许多听力受损的听众。因此，已经为数字助听器提出了许多信号增强算法。这些算法中的许多只在某些环境中有效。快速和正确地检测听觉场景元素的能力可以允许从可用例程库中选择/参数化增强算法。在这项工作中，作者研究了一种频域压缩算法的实时参数化，该算法保留了形成峰比，从而提高了一些严重感音神经性听力损失患者在2-3 kHz范围内的语音理解能力。最佳压缩比取决于声信号的质量。本文简要介绍了频率压缩技术，并描述了一种高斯混合模型分类器，它可以根据广泛的声学类别动态设置频率压缩比，我们称之为队列。我们讨论了在通用计算机上实现的原型模拟器的结果。

引用次数: 1

eSports: collaborative and synchronous video annotation system in grid computing environment 电子竞技:网格计算环境下的协同同步视频标注系统

Seventh IEEE International Symposium on Multimedia (ISM'05)

Pub Date : 2005-12-12 DOI: 10.1109/ISM.2005.55

Gang Zhai, G. Fox, M. Pierce, Wenjun Wu, Hasan Bulut

We designed eSports - a collaborative and synchronous video annotation platform, which is to be used in Internet scale cross-platform grid computing environment to facilitate computer supported cooperative work (CSCW) in education settings such as distance sport coaching, distance classroom etc. Different from traditional multimedia annotation systems, eSports provides the capabilities to collaboratively and synchronously play and archive real time live video, to take snapshots, to annotate video snapshots using whiteboard and to play back the video annotations synchronized with original video streams. eSports is designed based on the grid based collaboration paradigm $the shared event model using NaradaBrokering, which is a publish/subscribe based distributed message passing and event notification system. In addition to elaborate the design and implementation of eSports, we analyze the potential use cases of eSports under different education settings. We believed that eSports is very useful to improve the online collaborative coaching and education.

我们设计了电子竞技——一个协作和同步的视频注释平台，该平台将用于互联网规模的跨平台网格计算环境，以促进计算机支持的协同工作(CSCW)在教育环境中，如远程体育教练，远程课堂等。与传统的多媒体注释系统不同，eSports提供了协作和同步播放和存档实时直播视频、拍摄快照、使用白板注释视频快照以及播放与原始视频流同步的视频注释的功能。电子竞技是基于网格协作模式设计的，即使用NaradaBrokering的共享事件模型，NaradaBrokering是一个基于发布/订阅的分布式消息传递和事件通知系统。除了详细阐述电子竞技的设计和实现之外，我们还分析了电子竞技在不同教育环境下的潜在用例。我们认为电子竞技对提高在线协作教练和教育非常有用。

引用次数: 33

A model based factorization approach for dense 3D recovery from monocular video 基于模型分解的单目视频密集三维恢复方法

Seventh IEEE International Symposium on Multimedia (ISM'05)

Pub Date : 2005-12-12 DOI: 10.1109/ISM.2005.15

J. Yagnik, K. Ramakrishnan

Feature track matrix factorization based methods have been attractive solutions to the structure-from-motion (Sfm) problem. Group motion of the feature points is analyzed to get the 3D information. It is well known that the factorization formulations give rise to rank deficient system of equations. Even when enough constraints exist, the extracted models are sparse due the unavailability of pixel level tracks. Pixel level tracking of 3D surfaces is a difficult problem, particularly when the surface has very little texture as in a human face. Only sparsely located feature points can be tracked and tracking errors are inevitable along rotating low texture surfaces. However, the 3D models of an object class lie in a subspace of the set of all possible 3D models. We propose a novel solution to the structure-from-motion problem which utilizes the high-resolution 3D obtained from range scanner to compute a basis for this desired subspace. Adding subspace constraints during factorization also facilitates removal of tracking noise which causes distortions outside the subspace. We demonstrate the effectiveness of our formulation by extracting dense 3D structure of a human face and comparing it with a well known structure-from-motion algorithm due to brand.

基于特征轨迹矩阵分解的方法是解决结构-运动(Sfm)问题的有效方法。分析特征点的群运动，得到三维信息。众所周知，因式分解公式会产生缺秩方程组。即使存在足够的约束条件，由于无法获得像素级轨道，提取的模型也是稀疏的。3D表面的像素级跟踪是一个难题，特别是当表面像人脸一样纹理很少的时候。在旋转的低纹理表面上，只能跟踪稀疏的特征点，跟踪误差不可避免。然而，一个对象类的3D模型位于所有可能的3D模型集合的子空间中。我们提出了一种新的运动结构问题的解决方案，利用距离扫描仪获得的高分辨率三维空间来计算该期望子空间的基。在分解过程中加入子空间约束也有助于去除引起子空间外畸变的跟踪噪声。我们通过提取人脸的密集3D结构，并将其与众所周知的基于品牌的运动结构算法进行比较，证明了我们的公式的有效性。

{"title":"A model based factorization approach for dense 3D recovery from monocular video","authors":"J. Yagnik, K. Ramakrishnan","doi":"10.1109/ISM.2005.15","DOIUrl":"https://doi.org/10.1109/ISM.2005.15","url":null,"abstract":"Feature track matrix factorization based methods have been attractive solutions to the structure-from-motion (Sfm) problem. Group motion of the feature points is analyzed to get the 3D information. It is well known that the factorization formulations give rise to rank deficient system of equations. Even when enough constraints exist, the extracted models are sparse due the unavailability of pixel level tracks. Pixel level tracking of 3D surfaces is a difficult problem, particularly when the surface has very little texture as in a human face. Only sparsely located feature points can be tracked and tracking errors are inevitable along rotating low texture surfaces. However, the 3D models of an object class lie in a subspace of the set of all possible 3D models. We propose a novel solution to the structure-from-motion problem which utilizes the high-resolution 3D obtained from range scanner to compute a basis for this desired subspace. Adding subspace constraints during factorization also facilitates removal of tracking noise which causes distortions outside the subspace. We demonstrate the effectiveness of our formulation by extracting dense 3D structure of a human face and comparing it with a well known structure-from-motion algorithm due to brand.","PeriodicalId":322363,"journal":{"name":"Seventh IEEE International Symposium on Multimedia (ISM'05)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-12-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133989171","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 4

Discrete wavelet transform and support vector machine applied to pathological voice signals identification 离散小波变换与支持向量机在病理语音信号识别中的应用

Seventh IEEE International Symposium on Multimedia (ISM'05)

Pub Date : 2005-12-12 DOI: 10.1109/ISM.2005.50

E. Fonseca, R. Guido, Andre C. Silvestre, J. Pereira

An algorithm able to classify pathological and normal voice signals based on Daubechies discrete wavelet transform (DWT-db) and support vector machines (SVM) classifier is presented. DWT-db is used for time-frequency analysis giving quantitative evaluation of signal characteristics to identify pathologies in voice signals, particularly nodules in vocal folds, of subjects with different ages for both male and female. After using a linear prediction coefficients (LPC) filter, the signals mean square values of a particular scale from wavelet analysis are entries to a nonlinear least square support vector machine (LS-SVM) classifier, which leads to an adequate larynx pathology classifier which over 95% of classification accuracy.

提出了一种基于多贝西离散小波变换(DWT-db)和支持向量机(SVM)分类器的病理和正常语音信号分类算法。DWT-db用于时频分析，对信号特征进行定量评估，以识别不同年龄男性和女性受试者的语音信号病理，特别是声带结节。采用线性预测系数(LPC)滤波后，将小波分析得到的特定尺度的信号均方值输入到非线性最小二乘支持向量机(LS-SVM)分类器中，得到的喉部病理分类器的分类准确率达到95%以上。

引用次数: 30

Differencing worm flows and normal flows for automatic generation of worm signatures 区分蠕虫流和正常流，自动生成蠕虫特征

Seventh IEEE International Symposium on Multimedia (ISM'05)

Pub Date : 2005-12-12 DOI: 10.1109/ISM.2005.49

K. Simkhada, H. Tsunoda, Yuji Waizumi, Y. Nemoto

Internet worms pose a serious threat to networks. Most current intrusion detection systems (IDSs) take signature matching approach to detect worms. Given the fact that most signatures are developed manually, generating new signatures for each variant of a worm incurs significant overhead. In this paper, we propose a difference-based scheme which differences worm flows and normal flows to generate robust worm signatures. The proposed scheme is based on two observational facts - worm flows contain several invariant portions in their payloads, and core worm codes do not exist in normal flows. It uses samples of worm flows detected by available means to extract common tokens. It then differences the set of these tokens with those of normal flows and generates signature candidates. By using such signatures within enterprises, out of reach of worm writers, the possibility of being tricked by worm writers can be reduced. We evaluate the proposed scheme using real network traffic traces that contains worms. Experiment results show that the proposed scheme exhibits high detection rate with low false positives.

网络蠕虫对网络构成严重威胁。当前的入侵检测系统大多采用特征匹配的方法来检测蠕虫。由于大多数签名都是手动开发的，因此为每个蠕虫变体生成新的签名会产生很大的开销。本文提出了一种基于差分的方案，通过区分蠕虫流和正常流来生成鲁棒蠕虫签名。该方案基于两个观测事实，即蠕虫流在其有效载荷中包含多个不变部分，以及正常流中不存在核心蠕虫代码。它使用可用手段检测到的蠕虫流样本来提取通用令牌。然后，它将这些令牌集与正常流的令牌集区分开来，并生成候选签名。通过在蠕虫编写者无法触及的企业内部使用这种签名，可以降低被蠕虫编写者欺骗的可能性。我们使用包含蠕虫的真实网络流量轨迹来评估所提出的方案。实验结果表明，该方法具有较高的检测率和较低的误报率。

引用次数: 5

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Seventh IEEE International Symposium on Multimedia (ISM'05)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀