Pub Date: 2008-11-05 | DOI: 10.1109/MMSP.2008.4665176
Arun Kumar, A. Makur
Compression of encrypted data is possible by using distributed source coding. In this paper, we consider encryption followed by lossless compression of gray scale and color images. We propose applying encryption to the prediction errors rather than directly to the images, and use distributed source coding to compress the resulting ciphertexts. Simulation results show that the proposed technique achieves comparable compression gains, with compression ratios ranging from 1.5 to 2.5, despite the encryption.
Title: Distributed source coding based encryption and lossless compression of gray scale and color images
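A minimal sketch of the pipeline this abstract describes: predict first, then encrypt the prediction errors. Everything concrete here is an assumption for illustration only; the toy code uses a left-neighbor predictor and a placeholder XOR key stream, whereas the paper's actual predictor, cipher, and distributed source code are not specified in the abstract.

```python
import numpy as np

def prediction_errors(img):
    """Left-neighbor predictor: residual = pixel - left neighbor (mod 256)."""
    img = img.astype(np.int16)
    pred = np.zeros_like(img)
    pred[:, 1:] = img[:, :-1]          # predict each pixel from its left neighbor
    return ((img - pred) % 256).astype(np.uint8)

def xor_encrypt(data, key_stream):
    """Placeholder stream cipher: XOR residuals with a pseudo-random key."""
    return np.bitwise_xor(data, key_stream)

rng = np.random.default_rng(0)
img = rng.integers(0, 256, size=(4, 8), dtype=np.uint8)
res = prediction_errors(img)
key = rng.integers(0, 256, size=img.shape, dtype=np.uint8)
cipher = xor_encrypt(res, key)          # this is what would be compressed

# Decryption followed by inverse prediction recovers the image losslessly:
dec = np.bitwise_xor(cipher, key).astype(np.int16)
recon = np.zeros_like(dec)
recon[:, 0] = dec[:, 0]
for c in range(1, img.shape[1]):
    recon[:, c] = (recon[:, c - 1] + dec[:, c]) % 256
assert np.array_equal(recon.astype(np.uint8), img)
```

The point of predicting before encrypting is that residuals are far more compressible than raw pixels, and a distributed source coder can still exploit that redundancy after encryption.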
Pub Date: 2008-11-05 | DOI: 10.1109/MMSP.2008.4665120
Liangjun Wang, Xiaolin Wu, Guangming Shi
A new multiple description coding (MDC) approach is proposed based on the theory of compressive sensing (CS). CS theory allows a signal to be reconstructed from a small number of its random measurements if the signal is sparse in some space. An attractive property of CS for MDC applications is that the reconstruction error depends only on the number of transmitted measurements that are received, not on which ones. By treating each CS measurement as a description, we obtain a balanced MDC scheme with fine description granularity and low encoding complexity. Another advantage of the new MDC approach is that all signals can be coded identically but decoded in different spaces for better sparse reconstruction.
Title: A compressive sensing approach of multiple descriptions for network multimedia communication
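The key property claimed above (reconstruction depends only on how many measurements arrive) can be sketched in a few lines: each row of a random measurement matrix plays the role of one description, and a sparse recovery algorithm reconstructs from whichever rows were received. The dimensions, the Gaussian matrix, and the use of Orthogonal Matching Pursuit are all illustrative assumptions, not the paper's actual design.

```python
import numpy as np

def omp(A, y, k):
    """Orthogonal Matching Pursuit: recover a k-sparse x from y = A @ x."""
    residual, support = y.copy(), []
    for _ in range(k):
        j = int(np.argmax(np.abs(A.T @ residual)))   # most correlated column
        if j not in support:
            support.append(j)
        x_s, *_ = np.linalg.lstsq(A[:, support], y, rcond=None)
        residual = y - A[:, support] @ x_s
    x = np.zeros(A.shape[1])
    x[support] = x_s
    return x

rng = np.random.default_rng(1)
n, k = 64, 3
x = np.zeros(n)
x[rng.choice(n, k, replace=False)] = rng.standard_normal(k)

m = 32                                           # number of received descriptions
A = rng.standard_normal((m, n)) / np.sqrt(m)     # each row acts as one description
x_hat = omp(A, A @ x, k)
assert np.linalg.norm(x_hat - x) < 1e-6
```

Dropping any particular rows of `A` (losing particular descriptions) leaves a matrix of the same statistical character, which is why only the count of received measurements matters.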
Pub Date: 2008-11-05 | DOI: 10.1109/MMSP.2008.4665045
K. Müller, A. Smolic, K. Dix, P. Kauff, T. Wiegand
In this paper, a system for video rendering on multiscopic 3D displays is considered in which the data is represented as layered depth video (LDV). This representation consists of one full (central) video with associated per-pixel depth and additional residual layers. Thus, only one full view with additional residual data needs to be transmitted. The LDV data is used at the receiver to generate all intermediate views for the display. The paper presents the LDV layer extraction as well as the view synthesis, using a scene-reliability-driven approach. Here, unreliable image regions are detected and, in contrast to previous approaches, the residual data is enlarged to reduce artifacts in unreliable areas during rendering. To provide maximum data coverage, the residual data remains at its original positions and is not projected towards the central view. The view synthesis process also uses this reliability analysis to provide higher-quality intermediate views than previous approaches. As a final result, high-quality intermediate views for an existing 9-view auto-stereoscopic display are presented, which demonstrate the suitability of the LDV approach for advanced 3D video (3DV) systems.
Title: Reliability-based generation and view synthesis in layered depth video
Pub Date: 2008-11-05 | DOI: 10.1109/MMSP.2008.4665213
J. Zdánský, J. Chaloupka, J. Nouza
In this paper we present a comprehensive platform for automatic processing of Czech TV news programmes. Its audio processing module provides text transcription in the form of metadata containing information about spoken content, speaker identities, pronunciation used, word positions and intonation. The video processing module provides pictures representing individual video scenes and information about detected and, where possible, recognized human faces. The audio and video data are merged into single XML files that are indexed and stored in a searchable database. A simple Web-based search engine can be used to retrieve information from the database, which currently contains more than 1800 hours of transcribed programmes from the Czech CT24 station.
Title: Joint audio-visual processing, representation and indexing of TV news programmes
Pub Date: 2008-11-05 | DOI: 10.1109/MMSP.2008.4665128
D. Cernea, A. Munteanu, J. Cornelis, P. Schelkens
This paper investigates the novel concept of local error control in arbitrary mesh encoding, and proposes a new L-infinite mesh coding approach implementing this concept. In contrast to traditional mesh coding systems that use the mean-square error as distortion measure, the proposed approach employs the L-infinite distortion as target distortion metric. In this context, a novel wavelet-based L-infinite-constrained coding approach for meshes is proposed, which ensures that the maximum local error between the original and decoded meshes is lower than a given upper bound. Additionally, the proposed system achieves scalability in the L-infinite sense, that is, the L-infinite distortion upper bound can be accurately estimated when decoding any layer from the input stream. Moreover, a distortion estimation approach is proposed, expressing the L-infinite distortion in the spatial domain as a statistical estimate of quantization errors produced in the wavelet domain. An instantiation of the proposed L-infinite coding approach is demonstrated for MESHGRID, a scalable 3D object coding system that is part of MPEG-4 AFX. The proposed L-infinite coding approach guarantees that the maximum error is upper-bounded, enables a fast real-time implementation of rate allocation, and preserves all the scalability features and animation capabilities of the employed scalable mesh codec.
Title: Statistical L-infinite distortion estimation in scalable coding of meshes
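The motivation for an L-infinite bound over mean-square error is easy to demonstrate numerically: a single large local error that would be visually objectionable on a mesh barely registers in the MSE but dominates the L-infinite metric. The data below is synthetic and purely illustrative.

```python
import numpy as np

rng = np.random.default_rng(2)
original = rng.standard_normal(1000)
# Small quantization-like noise everywhere, plus one large localized error:
decoded = original + rng.normal(scale=0.01, size=1000)
decoded[500] += 1.0

mse = float(np.mean((original - decoded) ** 2))          # averages the spike away
l_inf = float(np.max(np.abs(original - decoded)))        # exposes the worst case
assert l_inf > 0.9 and mse < 0.01
```

An L-infinite-constrained coder guarantees that `l_inf` stays below a chosen bound at every decoded layer, which an MSE target cannot promise.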
Pub Date: 2008-11-05 | DOI: 10.1109/MMSP.2008.4665187
V. Jones, R. J. I. Veld, T. Tönis, R. Bults, B. Beijnum, I. Widya, M. Vollenbroek-Hutten, H. Hermens
We are investigating the use of body area networks (BANs), wearable sensors and wireless communications for measuring, processing, transmission, interpretation and display of biosignals. The goal is to provide telemonitoring and teletreatment services for patients. The remote health professional can view a multimedia display which includes graphical and numerical representation of patients' biosignals. The addition of feedback control enables teletreatment services; teletreatment can be delivered to the patient via multiple modalities including tactile, text, auditory and visual. We describe the health BAN, a generic mobile health service platform, and two context-aware applications. The epilepsy application illustrates processing and interpretation of multi-source, multimedia BAN data. The chronic pain application illustrates multi-modal feedback and treatment, with patients able to view their own biosignals on their handheld device.
Title: Biosignal and context monitoring: Distributed multimedia applications of Body Area Networks in healthcare
Pub Date: 2008-11-05 | DOI: 10.1109/MMSP.2008.4665166
Duy-Dinh Le, S. Satoh, T. Ngo, D. Duong
Video shot boundary detection is one of the fundamental tasks of video indexing and retrieval applications. Although many methods have been proposed for this task, finding a general and robust shot boundary detection method that can handle the various transition types caused by photo flashes, rapid camera movement and object movement remains challenging. We present a novel approach for detecting video shot boundaries in which we cast shot boundary detection as a text segmentation problem from natural language processing. This is possible by treating each frame as a word, so that shot boundaries correspond to text segment boundaries (e.g. topics); text segmentation approaches from natural language processing can then be applied. Experimental results on various long video sequences demonstrate the effectiveness of our approach.
Title: A text segmentation based approach to video shot boundary detection
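To make the frame-as-word analogy concrete, here is a toy analogue of segment-boundary detection: represent each frame "word" by a gray-level histogram and flag a boundary wherever the cosine similarity between consecutive frames dips. This is only a caricature of the idea; the paper's actual frame representation and its chosen text segmentation algorithm are not given in the abstract.

```python
import numpy as np

def shot_boundaries(frames, threshold=0.8):
    """Flag a boundary where cosine similarity between consecutive
    frame 'words' (here: gray-level histograms) drops below threshold."""
    hists = [np.histogram(f, bins=16, range=(0, 256))[0].astype(float)
             for f in frames]
    bounds = []
    for i in range(1, len(hists)):
        a, b = hists[i - 1], hists[i]
        sim = a @ b / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-9)
        if sim < threshold:
            bounds.append(i)
    return bounds

# Two synthetic "shots": dark frames followed by bright frames.
rng = np.random.default_rng(3)
shot1 = [np.full((8, 8), 40) + rng.integers(0, 5, (8, 8)) for _ in range(5)]
shot2 = [np.full((8, 8), 200) + rng.integers(0, 5, (8, 8)) for _ in range(5)]
assert shot_boundaries(shot1 + shot2) == [5]
```

A real text segmentation method would additionally compare windows of several "words" on each side of a candidate boundary, which is what gives robustness to flashes and fast motion.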
Pub Date: 2008-11-05 | DOI: 10.1109/MMSP.2008.4665105
D. Varodayan, David M. Chen, B. Girod
We consider a new problem in network image coding for multicast. In a multihop mesh network, structured as a directed graph, all nodes decode and display reconstructions of the image (at possibly different qualities). Each node may also perform transcoding before transmitting data downstream in the network. The problem is the design of the coding and transcoding schemes that deliver the best image quality over the network. For a network with a diamond topology, we show that multiple description coding combined with Wyner-Ziv transcoding is often superior to other methods. We argue further that the benefits are magnified for larger networks containing one or more diamond subnets. Our image coding experiments demonstrate that multiple description coding with Wyner-Ziv transcoding outperforms single description coding or multiple description coding with conventional transcoding, for both a diamond network and a two-hop mesh network with four branches.
Title: Network image coding for multicast
Pub Date: 2008-11-05 | DOI: 10.1109/MMSP.2008.4665071
Jie Xiang Yang, H. Wu
The H.264/AVC coding standard reduces blocking artifacts by applying a spatial loop filter in both the encoder and the decoder. However, the temporal fluctuation or flickering artifact is still noticeable between intra-coded frames, or between an intra (I) frame and the preceding or subsequent inter-predicted (P) frames. This paper proposes a non-linear temporal filter that reduces the flickering artifact while preserving the image sharpness of the reconstructed video, by using a robust prior model. Performance of the flickering reduction with the proposed filter is evaluated by a temporal metric, the sum of squared differences (SSD), and the traditional measure, the peak signal-to-noise ratio (PSNR).
Title: A non-linear post filtering method for flicker reduction in H.264/AVC coded video sequences
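The SSD metric mentioned above is straightforward: the sum of squared differences between consecutive reconstructed frames, where large values indicate temporal fluctuation. The sketch below shows the bare computation on synthetic frames; the paper's exact per-frame evaluation protocol may differ.

```python
import numpy as np

def flicker_ssd(prev, curr):
    """Sum of squared differences between consecutive reconstructed
    frames; large values signal temporal fluctuation (flicker)."""
    d = prev.astype(np.float64) - curr.astype(np.float64)
    return float(np.sum(d * d))

stable = np.full((4, 4), 100, dtype=np.uint8)
flickered = stable.copy()
flickered[0, 0] = 110              # a 10-level jump in a single pixel
assert flicker_ssd(stable, stable) == 0.0
assert flicker_ssd(stable, flickered) == 100.0
```

A good flicker-reduction filter drives this inter-frame SSD down without lowering the per-frame PSNR against the original video.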
Pub Date: 2008-11-05 | DOI: 10.1109/MMSP.2008.4665050
D. Farin, M. Haller, A. Krutz, T. Sikora
The composition of panoramic images has recently received considerable attention. While panoramic images were first used mainly as a flexible visualization technique, they have also found application in video coding, video enhancement, format conversion, and content analysis. The topic has grown and diverged into many specialized research directions, which makes it difficult to stay in touch with recent developments. This paper gives an overview of the current state of research, including recent developments. Two applications, sprite coding and global-motion estimation, are presented in more detail to provide insight into the system aspects.
Title: Recent developments in panoramic image generation and sprite coding