2013 Visual Communications and Image Processing (VCIP): Latest Articles

New motherwavelet for pattern detection in IR image

2013 Visual Communications and Image Processing (VCIP)

Pub Date : 2013-11-20 DOI: 10.1109/VCIP.2013.6706327

M. Mirzaei, S. Prianto, J. Chardonnet, C. Pere, F. Mérienne

The paper presents a new mother wavelet adapted from a specific pattern. Wavelet multi-resolution analysis uses this wavelet to detect the position of the pattern in an infrared (IR) signal under scale variation and in the presence of noise. The IR signal is extracted from an IR image sequence recorded by an IR camera in a Time of Flight (TOF) sensor configuration. The maximum correlation between the pattern and the signal of interest is used as the criterion to define the mother wavelet. The proposed mother wavelet was tested and verified under scale variation and in the presence of noise. The experimental tests and performance analysis show promising results for both scale variation and noisy signals. An accuracy of 90% is guaranteed for the proposed wavelet under heavy noise (50% of the signal amplitude), and high precision is expected under real conditions.

Citations: 1

Reverse scan for transform skip mode in HEVC codec

2013 Visual Communications and Image Processing (VCIP)

Pub Date : 2013-11-01 DOI: 10.1109/VCIP.2013.6706421

Vijay Bansal, A. Chawla, Mahesh Narain Shukla

High Efficiency Video Coding (HEVC) is being developed by the Joint Collaborative Team on Video Coding (JCT-VC). HEVC has a transform skip mode, which is applicable only to 4×4 transform units (TUs). The transform process is skipped when this mode is selected. Introducing transform skip yields a significant gain in coding efficiency for class F sequences [1][2]. This paper proposes reversing the scan of the residual of 4×4 blocks before variable-length encoding in the transform skip case. With this modification, coding efficiency improves further by 1.6% on average in terms of BD-rate [3] for class F sequences in the all-intra (AI) test configuration. For other GOP structures, namely random access (RA), low delay with B pictures (LB), and low delay with P pictures (LP), the average bit-rate gains are 1.1%, 0.64%, and 0.57%, respectively. The change has a negligible impact on encoding and decoding time.
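
As a rough illustration of the reverse-scan idea in this abstract, the Python sketch below reads a 4×4 residual block in a forward order and then in the reversed order. The up-right diagonal pattern, the function names, and the sample residual values are assumptions made for illustration only; the abstract does not specify the scan pattern or the implementation.

```python
def diagonal_scan_order(n=4):
    """Return the (row, col) positions of an n x n block, one anti-diagonal at a time.

    Assumption: an up-right diagonal pattern starting at the top-left corner.
    """
    order = []
    for d in range(2 * n - 1):
        for row in range(n - 1, -1, -1):
            col = d - row
            if 0 <= col < n:
                order.append((row, col))
    return order


def scan(block, order):
    """Read the block values in the given scan order."""
    return [block[r][c] for r, c in order]


def reverse_scan(block, order):
    """Read the block values in the reversed scan order."""
    return [block[r][c] for r, c in reversed(order)]


# Hypothetical transform-skip residual with larger values toward the bottom-right.
residual = [
    [0, 0, 0, 0],
    [0, 0, 0, 1],
    [0, 0, 2, 3],
    [0, 1, 4, 9],
]

order = diagonal_scan_order(4)
forward = scan(residual, order)
backward = reverse_scan(residual, order)
print(forward)
print(backward)
```

In this made-up block the reversed order visits the bottom-right sample first, so the larger residual values appear at the start of the scanned sequence rather than at the end.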

Citations: 3

Low complexity transform coding for depth maps in 3D video

2013 Visual Communications and Image Processing (VCIP)

Pub Date : 2013-11-01 DOI: 10.1109/VCIP.2013.6706338

F. Jager, Karam Naser

3D video is a new technology that requires transmission of depth data alongside conventional 2D video. The additional depth information allows arbitrary viewpoints to be synthesized at the receiver, both to adapt the perceived depth impression and to drive multi-view autostereoscopic displays. Depth maps typically show different signal characteristics from textured video data: piecewise smooth regions are bounded by sharp edges resembling depth discontinuities. These edges lead to strong ringing artifacts when depth maps are coded with DCT-based transform codecs such as AVC or its successor HEVC. This paper proposes alternative transforms for coding depth maps for 3D video. Replacing the DCT with these transforms reduces ringing artifacts in the reconstructed depth maps and, at the same time, significantly lowers the complexity of the transform stage. For high-quality depth map coding, the proposed alternative transforms can even increase coding efficiency.

Citations: 2

Anchor-supported multi-modality hashing embedding for person re-identification

2013 Visual Communications and Image Processing (VCIP)

Pub Date : 2013-11-01 DOI: 10.1109/VCIP.2013.6706325

Kai Liu, Zhicheng Zhao, Xin Guo, A. Cai

Person re-identification is a challenging problem in multi-camera surveillance systems. Most existing methods focus on metric learning, which aims to match images from different cameras in a common metric space. Boosted hashing projection provides a new way of identifying instances based on pairwise similarity. However, both of these approaches ignore the underlying fact that images captured by two cameras should be treated as belonging to different modalities. To address this drawback, we formulate person re-identification as an Anchor-supported Multi-Modality Hashing Embedding (AMMHE) problem, in which different projections are used to map data from different cameras into a common Hamming space. The data are projected to binary bits using boosted hash projections, so that the weighted Hamming distance of intra-class data pairs is minimized while that of inter-class data pairs is maximized. We also introduce an anchor-supported dimension reduction method to avoid the computational burden of high feature dimensionality. Our approach obtains competitive performance compared with state-of-the-art methods on publicly available benchmarks.
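
The notion of camera-specific projections into a common Hamming space, compared with a weighted Hamming distance, can be sketched in a few lines of Python. This is only a toy illustration: the hand-picked projection vectors, bit weights, feature values, and function names are all hypothetical, whereas the paper learns its projections by boosting, which is not reproduced here.

```python
def hash_bits(features, projections):
    """Map a feature vector to bits: 1 where the dot product with a projection is >= 0."""
    return [
        1 if sum(w * x for w, x in zip(proj, features)) >= 0 else 0
        for proj in projections
    ]


def weighted_hamming(bits_a, bits_b, weights):
    """Sum the weights of the bit positions where the two codes differ."""
    return sum(w for a, b, w in zip(bits_a, bits_b, weights) if a != b)


# Hypothetical per-camera projections (one set per camera) and per-bit weights.
projections_cam_a = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
projections_cam_b = [[0.0, 1.0], [1.0, 0.0], [1.0, 1.0]]
bit_weights = [0.5, 0.3, 0.2]

# Toy feature vectors: one image from camera A, two images from camera B.
code_a = hash_bits([2.0, -1.0], projections_cam_a)
code_b_same = hash_bits([-1.0, 2.0], projections_cam_b)
code_b_other = hash_bits([3.0, 1.0], projections_cam_b)

print(weighted_hamming(code_a, code_b_same, bit_weights))
print(weighted_hamming(code_a, code_b_other, bit_weights))
```

Each camera uses its own projection set, yet both produce codes of the same length, so codes from different cameras can be compared directly.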

Citations: 2

High precision probability estimation for CABAC

2013 Visual Communications and Image Processing (VCIP)

Pub Date : 2013-11-01 DOI: 10.1109/VCIP.2013.6706454

A. Alshin, E. Alshina, Jeonghoon Park

Entropy coding is a key part of all advanced video compression schemes. Context-adaptive binary arithmetic coding (CABAC) is the entropy coding method used in the H.264/MPEG-4 AVC and H.265/HEVC standards. Probability estimation is the key factor in CABAC coding efficiency. This paper presents a high-accuracy probability estimation technique for CABAC, based on multiple estimations using different models. The proposed method is implemented efficiently in integer arithmetic. It provides up to 1.4% BD-rate gain.
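
One way to picture "multiple estimations using different models" in integer arithmetic is to run two exponentially weighted estimators with different adaptation rates and average them. The Python sketch below does that; the 15-bit fixed-point precision, the shift values 4 and 7, and the class name are assumptions chosen for illustration and are not taken from the abstract.

```python
PRECISION_BITS = 15          # assumed fixed-point precision
ONE = 1 << PRECISION_BITS    # fixed-point representation of probability 1.0


class TwoRateEstimator:
    """Average of a fast-adapting and a slow-adapting estimate of P(bin == 1)."""

    def __init__(self, fast_shift=4, slow_shift=7):
        # Hypothetical adaptation rates: a smaller shift adapts faster.
        self.fast_shift = fast_shift
        self.slow_shift = slow_shift
        self.p_fast = ONE >> 1   # both estimates start at probability 0.5
        self.p_slow = ONE >> 1

    def probability(self):
        """Combined estimate: integer mean of the two models."""
        return (self.p_fast + self.p_slow) >> 1

    def update(self, bin_value):
        """Move each estimate toward the observed bin using shifts only."""
        target = ONE if bin_value else 0
        self.p_fast += (target - self.p_fast) >> self.fast_shift
        self.p_slow += (target - self.p_slow) >> self.slow_shift


estimator = TwoRateEstimator()
for bin_value in [1, 1, 0, 1, 1, 1, 0, 1]:
    estimator.update(bin_value)
print(estimator.probability() / ONE)
```

The fast estimate tracks local changes in the bin statistics, while the slow one is steadier on stationary data; averaging the two is a simple compromise that needs only additions and shifts.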

Citations: 20

Soft mobile video broadcast based on side information refining

2013 Visual Communications and Image Processing (VCIP)

Pub Date : 2013-11-01 DOI: 10.1109/VCIP.2013.6706374

Wei Huang, Xiaopeng Fan, Debin Zhao

Video broadcasting is a popular application of wireless networks, whose main challenge is to accommodate different users with different channel conditions. Recently, a novel "D-Cast" approach based on distributed source coding (DSC) was proposed. It avoids error propagation while still achieving high compression efficiency in inter-frame coding by using coset coding and soft broadcast. However, D-Cast is not very efficient because its side information is coarse. In this work, we present a novel soft mobile video broadcast approach based on a side information refinement algorithm (SIR-Cast) to improve the quality of the side information. Moreover, SIR-Cast optimizes the estimate of the quantization step (Qstep) corresponding to the refined side information. As a result, SIR-Cast outperforms D-Cast by about 1 dB to 2 dB in video PSNR while maintaining a graceful degradation behavior similar to that of D-Cast.

Citations: 3

Recognizing human actions based on Sparse Coding with Non-negative and Locality constraints

2013 Visual Communications and Image Processing (VCIP)

Pub Date : 2013-11-01 DOI: 10.1109/VCIP.2013.6706359

Yuanbo Chen, Yanyun Zhao, A. Cai

In this paper, Sparse Coding with Non-negative and Locality constraints (SCNL) is proposed to generate discriminative feature descriptions for human action recognition. The non-negative constraint ensures that every data sample lies in the convex hull of its neighbors. The locality constraint ensures that a data sample is represented only by its related neighboring atoms. The sparsity constraint keeps the number of dictionary atoms involved in the sample representation as small as possible. The SCNL model captures the global subspace structures of data better than classical sparse coding and is more robust to noise than locality-constrained linear coding. Extensive experiments on three prominent human action datasets demonstrate the significant advantages of the proposed SCNL model.

Citations: 2

An analytical study of subpixel-based image down-sampling patterns in frequency domain

2013 Visual Communications and Image Processing (VCIP)

Pub Date : 2013-11-01 DOI: 10.1109/VCIP.2013.6706342

Yonggen Ling, O. Au, Ketan Tang, Jiahao Pang, Jin Zeng, Lu Fang

Subpixel-based image down-sampling is a class of methods that can provide improved apparent resolution of the down-scaled image compared with pixel-based methods. This paper analytically studies the frequency characteristics of all possible subpixel-based down-sampling patterns for RGB vertical stripes. Our proposed algorithm reveals that there are only seven equivalent energy distributions in the luminance frequency spectrum. To achieve higher luminance resolution, we then calculate and choose the optimal down-sampling pattern, together with an anti-aliasing low-pass filter designed for it, so as to maximize the energy of the luminance component within the cut-off shape. Experimental results show that the proposed method provides sharper images than state-of-the-art subpixel-based methods, with little color distortion.

Citations: 4

Surgical motion task performance in a hand eye colocated digital stereo microscope

2013 Visual Communications and Image Processing (VCIP)

Pub Date : 2013-11-01 DOI: 10.1109/VCIP.2013.6706425

J. K. Rappel, A. Lahiri, C. Teo

The effect of hand-eye colocation on performing dexterous fine movements, such as microsurgical manipulation, is studied under a novel digital stereo microscope. Hand motion data is captured under conditions of hand-eye colocation and separation. Both configurations are tested with monoscopic and stereoscopic vision. A set of microsurgical task abstractions is created to reduce the effect of prior expertise. Finally, the captured motion data is analyzed to determine the effect of colocation and stereopsis on surgical motion tasks.

Citations: 0

Power aware HEVC streaming for mobile

2013 Visual Communications and Image Processing (VCIP)

Pub Date : 2013-11-01 DOI: 10.1109/VCIP.2013.6706445

Yuwen He, Markus Künstner, Srinivas Gudumasu, Eun‐Seok Ryu, Yan Ye, Xiaoyu Xiu

Mobile devices, increasingly equipped with high-capability processors and connected to fast wireless networks, have become a major consumer of multimedia content. Limited battery life on mobile devices makes power saving a critical factor in delivering a good user experience. This paper proposes a power-aware streaming system that combines the emerging High Efficiency Video Coding (HEVC) standard and the Dynamic Adaptive Streaming over HTTP (DASH) standard. The proposed system uses power-aware HEVC encoding technologies and client-side power adaptation logic to adaptively control power consumption on the client device. The proposed power-aware HEVC streaming system can improve quality of experience by setting full-length video playback as the client's objective. A demonstration of the proposed power-aware HEVC system is available on the ASUS Transformer Xfinity (TF700T) tablet, which uses an ARM processor.

Citations: 23
