2006 IEEE International Conference on Multimedia and Expo最新文献

英文中文

Acoustic Echo Cancellation in a Channel with Rapidly Varying Gain 快速变增益信道中的声回波消除

2006 IEEE International Conference on Multimedia and Expo

Pub Date : 2006-12-26 DOI: 10.1109/ICME.2006.262596

S. Basu

We present a method for performing acoustic echo cancellation in a channel with rapidly varying gain and thus a rapidly varying channel characteristic. This is a situation in which standard AEC approaches perform poorly. Our method involves learning a scale-free channel characteristic (Htilde). We then apply this to a windowed version of the signal and remove the projection of the transformed signal from the output signal. We also develop a "ramp projection" method that allows for a linear variation in gain within the window. We show results in a telephony application with 3 dB to more than 8 dB of improvement over conventional AEC using the simple projection and an additional 1 dB using the ramp projection

我们提出了一种在具有快速变化的增益和快速变化的信道特性的信道中执行声学回波抵消的方法。在这种情况下，标准AEC方法表现不佳。我们的方法包括学习无标度通道特性(Htilde)。然后我们将其应用于信号的带窗版本，并从输出信号中去除变换后的信号的投影。我们还开发了一种“斜坡投影”方法，允许在窗口内增益的线性变化。我们展示了一个电话应用程序的结果，使用简单投影比传统AEC提高了3 dB到8 dB以上，使用斜坡投影又提高了1 dB

引用次数: 0

A 3D Spatio-Temporal Motion Estimation Algorithm for Video Coding 一种用于视频编码的三维时空运动估计算法

2006 IEEE International Conference on Multimedia and Expo

Pub Date : 2006-07-09 DOI: 10.1109/ICME.2006.262405

G. Lee, Ming-Jiun Wang, He-Yuan Lin, D. W. Su, Bo-Yun Lin

This paper presents a new spatio-temporal motion estimation algorithm for video coding. The algorithm is based on optimization theory and consists of the strategies including 3D spatio-temporal motion vector prediction, modified one-at-a-time search scheme, and multiple update paths. The simulation results indicate our algorithm is better than other recently proposed ones under the same computational budget and is very close to full search. The low-cost feature and regular demand of computational resource make our algorithm suitable for VLSI implementation. The algorithm also makes single chip solution for high-definition coding feasible

提出了一种新的用于视频编码的时空运动估计算法。该算法基于优化理论，包括三维时空运动矢量预测、改进的一次搜索方案和多路径更新策略。仿真结果表明，在相同的计算预算下，我们的算法优于最近提出的其他算法，并且非常接近于完全搜索。该算法的低成本特点和对计算资源的规律性需求使其适合于VLSI的实现。该算法还使高清编码的单芯片解决方案成为可能

引用次数: 2

Robust Watermarking in the Wigner Domain Wigner域的鲁棒水印

2006 IEEE International Conference on Multimedia and Expo

Pub Date : 2006-07-09 DOI: 10.1109/ICME.2006.262841

M. Al-khassaweneh, Selin Aviyente

In this paper, a new watermarking scheme in the joint time-frequency domain is introduced. Wigner distribution is used to transform an image into the spatial-spectral domain. The proposed method selects the time-frequency cells to be watermarked based on the particular image's energy distribution in the joint domain. This approach ensures the imperceptibility of the embedded watermark. It is shown that embedding in the time-frequency domain is equivalent to a nonlinear embedding function in the spatial domain. A corresponding watermark detection algorithm is also introduced. The performance of the proposed watermarking algorithm under possible attacks, such as noise, re-sampling, rotation, filtering, and JPEG compression is illustrated

本文提出了一种新的时频联合水印方案。利用维格纳分布将图像变换到空间-光谱域。该方法根据特定图像在联合域中的能量分布选择需要进行水印的时频单元。这种方法保证了嵌入水印的不可感知性。结果表明，时频域的嵌入等价于空间域的非线性嵌入函数。并介绍了相应的水印检测算法。说明了该算法在噪声、重采样、旋转、滤波和JPEG压缩等可能的攻击下的性能

引用次数: 1

Robust Video Transmission Over MIMO-OFDM System using MDC and Space Time Codes MIMO-OFDM系统中使用MDC和空时码的鲁棒视频传输

2006 IEEE International Conference on Multimedia and Expo

Pub Date : 2006-07-09 DOI: 10.1109/ICME.2006.262488

Haifeng Zheng, Congchong Ru, Lun Yu, C. Chen

MIMO-OFDM is a promising technique for the broadband wireless communication system. In this paper, we propose a novel scheme that integrates multiple description coding (MDC), error resilient video coding, and unequal error protection strategy with various space time coding codes for robust video transmission over MIMO-OFDM system. The proposed MDC coder generates multiple bitstreams of equal importance which are very suitable for multiple antennas system. Furthermore, according to the contribution to the reconstructed video quality, we apply unequal error protection strategy using BLAST and STBC space time codes for each video bitstream. Experimental results have demonstrated that the proposed scheme can achieve desired tradeoff between the reconstructed video quality and the transmission efficiency

MIMO-OFDM是一种很有前途的宽带无线通信技术。本文提出了一种将多描述编码(MDC)、纠错视频编码、不等错误保护策略与各种空时编码相结合的新方案，用于MIMO-OFDM系统的鲁棒视频传输。所提出的MDC编码器可产生多个同等重要的比特流，非常适合于多天线系统。此外，根据对重构视频质量的贡献，我们对每个视频比特流采用BLAST和STBC空时码的不等错保护策略。实验结果表明，该方案在重构视频质量和传输效率之间取得了理想的平衡

引用次数: 14

Media Streaming with Conservative Delay on Variable Rate Channels 可变速率信道上具有保守延迟的媒体流

2006 IEEE International Conference on Multimedia and Expo

Pub Date : 2006-07-09 DOI: 10.1109/ICME.2006.262912

D. Jurca, P. Frossard

We address the problem of delay-constrained streaming of multimedia packets over dynamic bandwidth channels. Efficient streaming solutions generally rely on the knowledge of the channel bandwidth, in order to select the media packets to be transmitted, according with their sending time. However, the streaming server usually cannot have a perfect knowledge of the channel bandwidth, and important packets may be lost because of over-estimation. We address the rate prediction mismatch by media scheduling with a conservative delay, which provides a safety margin for the packet delivery, even in the presence of unpredicted bandwidth variations. We formulate an optimization problem whose goal is to find the optimal conservative delay to be used in the scheduling process, given the network model the playback delay imposed by the client. We then propose a simple solution to the scheduling delay estimation, effective in real-time streaming scenarios. Our streaming method proves robust against channel prediction errors, and performs better than other mechanisms based on frame reordering strategies

我们解决了多媒体数据包在动态带宽通道上延迟受限的问题。高效的流解决方案通常依赖于信道带宽的知识，以便根据其发送时间选择要传输的媒体数据包。然而，流服务器通常不能完全了解信道带宽，重要的数据包可能会因为估计过高而丢失。我们通过具有保守延迟的媒体调度来解决速率预测不匹配问题，即使在存在不可预测的带宽变化的情况下，这也为分组交付提供了安全裕度。我们提出了一个优化问题，其目标是在给定客户端施加的播放延迟的网络模型中找到用于调度过程的最优保守延迟。然后，我们提出了一个简单的调度延迟估计的解决方案，在实时流场景中有效。该方法对信道预测误差具有鲁棒性，性能优于其他基于帧重排序策略的机制

引用次数: 7

Fast Inter Mode Decision Based on Hierarchical Homogeneous Detection and Cost Analysis for H.264/AVC Coders 基于层次均匀检测和成本分析的H.264/AVC编码器模式间快速决策

2006 IEEE International Conference on Multimedia and Expo

Pub Date : 2006-07-09 DOI: 10.1109/ICME.2006.262412

Hung-Ming Wang, Ji-Kun Lin, J. Yang

In H.264 advanced video coding (AVC), variable block size motion estimation plays an important role in compression of interframes. In this paper, we propose a fast inter prediction algorithm based on hierarchical homogeneous detection and cost analysis to select the best mode effectively. For each macroblock, we first detect that whether the macroblock is spatial homogeneous or not. For the non spatial homogeneous macroblock, we then perform the 16times16 motion estimation and examine if the 16times16 block is temporal homogeneous or not. Once the homogeneous macroblock is detected in the above process, the best mode will be chosen as 16times16 mode. For the non-homogeneous macroblock, we then execute 8times8 motion estimation and analyze the cost of 8times8 mode and 16times16 mode for deciding the best inter mode should be 16times16 mode or any other mode. The process for searching the best 8times8 block subtype is similar to the process for macroblocks. Finally, the best inter mode is decided by selecting the inter mode with least cost from the candidate modes. Experimental results show that our proposed algorithm can save about 32~54% computation time without introducing any noticeable performance degradation

在H.264高级视频编码(AVC)中，可变块大小的运动估计在帧间压缩中起着重要的作用。本文提出了一种基于分层同构检测和代价分析的快速互预测算法，以有效地选择最佳模式。对于每个宏块，我们首先检测该宏块是否空间同质。对于非空间均匀的宏块，我们执行16times16运动估计，并检查16times16块是否在时间上均匀。一旦在上述进程中检测到同质宏块，则选择最佳模式为16times16模式。对于非同构宏块，我们执行8times8运动估计，并分析8times8模式和16times16模式的开销，以确定最佳的交互模式应该是16times16模式或任何其他模式。搜索最佳8times8块子类型的过程类似于搜索宏块的过程。最后，从候选互连模式中选择代价最小的互连模式来确定最佳互连模式。实验结果表明，该算法可节省32~54%的计算时间，且不会带来明显的性能下降

{"title":"Fast Inter Mode Decision Based on Hierarchical Homogeneous Detection and Cost Analysis for H.264/AVC Coders","authors":"Hung-Ming Wang, Ji-Kun Lin, J. Yang","doi":"10.1109/ICME.2006.262412","DOIUrl":"https://doi.org/10.1109/ICME.2006.262412","url":null,"abstract":"In H.264 advanced video coding (AVC), variable block size motion estimation plays an important role in compression of interframes. In this paper, we propose a fast inter prediction algorithm based on hierarchical homogeneous detection and cost analysis to select the best mode effectively. For each macroblock, we first detect that whether the macroblock is spatial homogeneous or not. For the non spatial homogeneous macroblock, we then perform the 16times16 motion estimation and examine if the 16times16 block is temporal homogeneous or not. Once the homogeneous macroblock is detected in the above process, the best mode will be chosen as 16times16 mode. For the non-homogeneous macroblock, we then execute 8times8 motion estimation and analyze the cost of 8times8 mode and 16times16 mode for deciding the best inter mode should be 16times16 mode or any other mode. The process for searching the best 8times8 block subtype is similar to the process for macroblocks. Finally, the best inter mode is decided by selecting the inter mode with least cost from the candidate modes. Experimental results show that our proposed algorithm can save about 32~54% computation time without introducing any noticeable performance degradation","PeriodicalId":339258,"journal":{"name":"2006 IEEE International Conference on Multimedia and Expo","volume":"110 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124250433","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 12

A Decision Mechanism for Processing Multimodal Services in Future Generation Network 下一代网络中处理多模式服务的决策机制

2006 IEEE International Conference on Multimedia and Expo

Pub Date : 2006-07-09 DOI: 10.1109/ICME.2006.262775

Yang Li, H. Chan

Communication technologies, old or new, are pushing the development of telecommunication industry. Thus, technically performing a multimodal service session (e.g. one end is involved with data while the other end is with multimedia) is no longer a problem. People are gaining interests in managing these multimodal services by considering the choice and preference of users. This research brings in a four-option decision mechanism to intelligently process a multimodal service and may ensure a successful and friendly communication session for two reasons. Firstly, this mechanism provides four extra options for a service that may fail in current communication systems. After three extra tries, the service session will most probably succeed. Secondly, the user make decisions them by setting the rules of how to make proper decisions in advance

新旧通信技术都在推动着电信行业的发展。因此，在技术上执行多模式服务会话(例如，一端处理数据而另一端处理多媒体)不再是问题。通过考虑用户的选择和偏好，人们对管理这些多式联运服务越来越感兴趣。本研究提出了一个四选项决策机制，以智能处理多式联运服务，并可确保成功和友好的沟通会话，原因有二。首先，该机制为在当前通信系统中可能失败的服务提供了四个额外的选项。在额外尝试三次之后，服务会话很可能会成功。其次，用户通过预先设定如何做出正确决策的规则来做出决策

引用次数: 0

Online Mining of Recent Music Query Streams 近期音乐查询流的在线挖掘

2006 IEEE International Conference on Multimedia and Expo

Pub Date : 2006-07-09 DOI: 10.1109/ICME.2006.262948

Hua-Fu Li, Chin-Chuan Ho, M. Shan, Suh-Yin Lee

Mining multimedia data is one of the most important issues in data mining. In this paper, we propose an online one-pass algorithm to mine the set of frequent temporal patterns in online music query streams with a sliding window. An effective bit-sequence representation is used to reduce the processing time and memory needed to slide the windows. Experiments show that the proposed algorithm only needs a half of memory requirement of original music query data, and just scans the data once

多媒体数据挖掘是数据挖掘中的一个重要问题。在本文中，我们提出了一种在线一次性算法来挖掘具有滑动窗口的在线音乐查询流中的频繁时间模式集。使用有效的位序列表示来减少窗口滑动所需的处理时间和内存。实验表明，该算法只需要原始音乐查询数据一半的内存需求，并且只需扫描一次数据

引用次数: 3

Face Recognition using 3D Summation Invariant Features 基于三维求和不变特征的人脸识别

2006 IEEE International Conference on Multimedia and Expo

Pub Date : 2006-07-09 DOI: 10.1109/ICME.2006.262885

Wei-Yang Lin, Kin-Chung Wong, Y. Hu, N. Boston

In this paper, we developed a family of 2D and 3D invariant features with applications to 3D human faces recognition. The main contributions of this paper are: (a) systematically deriving a family of novel features, called summation invariant that are invariant to Euclidean transformation in both 2D and 3D; (b) developing an effective method to apply summation invariant to the 3D face recognition problem. Tested with the 3D data from the face recognition grand challenge v1.0 dataset, the proposed new features exhibit achieves a performance that rivals the best 3D face recognition algorithms reported so far

在本文中，我们开发了一组二维和三维不变特征，并将其应用于三维人脸识别。本文的主要贡献是:(a)系统地导出了一组新的特征，称为和不变量，它们在二维和三维的欧氏变换中都是不变量;(b)开发一种将求和不变量应用于三维人脸识别问题的有效方法。使用来自人脸识别大挑战v1.0数据集的3D数据进行测试，所提出的新特征展示的性能可与迄今为止报道的最佳3D人脸识别算法相媲美

引用次数: 2

Utilizing SSR Indications for Improved Video Communication in Presence of 802.11B Residue Errors 利用SSR标记改进802.11B残留错误下的视频通信

2006 IEEE International Conference on Multimedia and Expo

Pub Date : 2006-07-09 DOI: 10.1109/ICME.2006.262945

Shirish S. Karande, U. Parrikar, K. Misra, H. Radha

Radio hardware used for the reception of 802.11b frames is capable of associating a signal to silence ratio (SSR) with each received frame. If a received frame is corrupted, then these SSR indications can be used to provide robust apriori estimate of the bit error rate in the packet. In many recently proposed cross-layer protocols, for transmission of video over wireless networks, recovery of information from partially corrupted packets has shown significant utility. In this paper, based on experiments with actual 802.11b error traces, we show that the channel state information (CSI) provided by the SSR indications can be used to improve the error recovery performance of an FEC scheme employed in conjunction with a cross-layer protocol. H.264 based simulation are used to establish the efficacy of the proposed work for video applications; specifically for video over 802.11b WLAN

用于接收802.11b帧的无线电硬件能够将每个接收帧的信号与沉默比(SSR)相关联。如果接收到的帧被损坏，那么这些SSR指示可以用来提供数据包中误码率的鲁棒先验估计。在许多最近提出的跨层协议中，为了在无线网络上传输视频，从部分损坏的数据包中恢复信息已经显示出重要的效用。在本文中，基于实际802.11b错误跟踪的实验，我们证明了SSR指示提供的信道状态信息(CSI)可以用来提高与跨层协议结合使用的FEC方案的错误恢复性能。基于H.264的仿真验证了所提工作在视频应用中的有效性;专门用于802.11b WLAN上的视频

引用次数: 7

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2006 IEEE International Conference on Multimedia and Expo

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀