2013 Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing最新文献

英文中文

Research on Image Matching Algorithm Based on Local Invariant Features 基于局部不变特征的图像匹配算法研究

2013 Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing

Pub Date : 2013-10-16 DOI: 10.1109/IIH-MSP.2013.37

Jiaqi Liu, Qiang Wu, Xuwen Li

As an important foundation for image-guided technology, image matching technique is the key technology of modern war. This paper proposes a new algorithm of affine invariant detector and descriptor of local invariant feature points, starting from feature point detection and description point of view, making up the traditional feature point extraction defects of small number and types. Meantime, proposes an improved similarity measure method based on the previously proposed new feature point detection and description algorithm, it improves the matching accuracy and real-time performance. Finally, compares the experiment results of SURF, SIFT and the improved algorithm proposed in this paper, the experimental results shows that the feature points extracted by the improved algorithm has fully affine invariance, and improved the accuracy and speed of image matching algorithm efficiently.

图像匹配技术是现代战争的关键技术，是图像制导技术的重要基础。本文从特征点检测和描述的角度出发，提出了一种新的仿射不变特征点检测和局部不变特征点描述子算法，弥补了传统特征点提取数量少、类型少的缺陷。同时，在原有特征点检测与描述算法的基础上，提出了一种改进的相似度度量方法，提高了匹配精度和实时性。最后，将SURF、SIFT与本文提出的改进算法的实验结果进行对比，实验结果表明，改进算法提取的特征点具有完全仿射不变性，有效地提高了图像匹配算法的精度和速度。

引用次数: 5

Multi-modal Voice Activity Detection by Embedding Image Features into Speech Signal 将图像特征嵌入语音信号的多模态语音活动检测

2013 Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing

Pub Date : 2013-10-16 DOI: 10.1109/IIH-MSP.2013.76

Yohei Abe, A. Ito

Lip movement has a close relationship with speech because the lips move when we talk. The idea behind this work is to extract the lip movement feature from the facial video and embed the movement feature into speech signal using information hiding technique. Using the proposed framework, we can provide advanced speech communication only using the speech signal that includes lip movement features, without increasing the bitrate of the signal. In this paper, we show the basic framework of the method and apply the proposal method to multi-modal voice activity detection (VAD). As a result of detection experiment using the support vector machine, we obtained better performance than the audio-only VAD in a noisy environment. In addition, we investigated how data embedding into speech signal affects sound quality and detection performance.

嘴唇的运动与语言有着密切的关系，因为我们说话的时候嘴唇会动。这项工作的思想是从面部视频中提取唇部运动特征，并利用信息隐藏技术将运动特征嵌入到语音信号中。使用该框架，我们可以在不增加信号比特率的情况下，仅使用包含唇部运动特征的语音信号来提供高级语音通信。本文给出了该方法的基本框架，并将该方法应用于多模态语音活动检测(VAD)。通过支持向量机的检测实验，我们在噪声环境下获得了比纯音频VAD更好的性能。此外，我们还研究了数据嵌入语音信号对声音质量和检测性能的影响。

引用次数: 1

Watermarking Method for Speech Signals Based on Modifications to LSFs 基于lsf修正的语音信号水印方法

2013 Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing

Pub Date : 2013-10-16 DOI: 10.1109/IIH-MSP.2013.79

Shengbei Wang, M. Unoki

We propose a method of speech watermarking based on modifications to line spectral frequencies (LSFs) of original speech. LSFs were derived from each frame with linear prediction (LP) analysis and watermarks were embedded into them by using the quantization index modulation (QIM) of different quantization steps. We took into consideration inaudibility and robustness that were influenced by minor modifications to LSFs. The proposed approach was evaluated with two kinds of experiments with respect to inaudibility and robustness against different speech codecs and general processing. The results from the evaluations revealed that the proposed approach not only had high rate of bit detection while keeping the original sound quality undistorted but also good robustness against general speech processing.

提出了一种基于修改原始语音的线谱频率的语音水印方法。对每一帧图像进行线性预测(LP)分析，得到lsf，并采用不同量化步骤的量化指标调制(QIM)嵌入水印。我们考虑了受lsf轻微修改影响的听不清和健壮性。针对不同语音编解码器和一般处理的不听性和鲁棒性，通过两种实验对该方法进行了评估。结果表明，该方法在保持原始音质不失真的情况下具有较高的比特检测率，并且对一般语音处理具有较好的鲁棒性。

引用次数: 12

Multi-channel Audio Compression Method Based on ITU-T G.719 Codec 基于ITU-T G.719编解码器的多通道音频压缩方法

2013 Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing

Pub Date : 2013-10-16 DOI: 10.1109/IIH-MSP.2013.81

Wen-ling Jiang, Jing Wang, Yi Zhao, Baoguang Liu, Xuan Ji

Through exploiting the human perception of spatial sound, a new approach for compression coding of multi-channel audio signal based on ITU-T G.719 codec is put forward in this paper. Multi-channel input signals are converted to a down-mixed signal plus spatial perceptual parameters by use of down-mix and up-mix step-by-step techniques in frequency domain. The algorithm can significantly reduce the coding rate under the premise of an acceptable sound quality in combination with the G.719 audio codec. The paper presents the implementation of the algorithm and describes in detail the calculation and features of the selected spatial parameters. Finally some experiments are done to evaluate the algorithm from the perspective of the compression ratio, the reconstructed sound quality, and the algorithm complexity.

本文利用人对空间声音的感知，提出了一种基于ITU-T G.719编解码器的多通道音频信号压缩编码新方法。采用频域下混和上混分步技术，将多通道输入信号转换为下混信号加空间感知参数。该算法与G.719音频编解码器结合使用，可以在音质可接受的前提下显著降低编码率。文中给出了该算法的实现，并详细描述了所选空间参数的计算方法和特点。最后通过实验从压缩比、重构音质和算法复杂度等方面对算法进行了评价。

引用次数: 3

Detection of Region Duplication Forgery in Images under Affine Transforms 仿射变换下图像区域复制伪造检测

2013 Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing

Pub Date : 2013-10-16 DOI: 10.1109/IIH-MSP.2013.140

Leida Li, Wei Zhang, Shushang Li, Jeng-Shyang Pan

Region duplication is a common method to produce forgery images, where part of an image is copied and pasted somewhere else in the same image. In order to fit the scene better and leave no visible artifacts, the copied region may be processed by affine transforms before being pasted. Most of the existing methods cannot handle these transforms. This paper presents a method to detect the region-duplication forgery under affine transforms. The image is first filtered and divided into overlapping circular blocks. Then the normalized color histogram (NCH) is extracted as the block feature. Forgery detection is achieved by comparing the NCH features. A new filter is designed to process the initial detection results. The final detection map is obtained after morphological operations. Simulations demonstrate the efficiency of the method.

区域复制是生成伪造图像的常用方法，即复制图像的一部分并将其粘贴到同一图像的其他地方。为了更好地贴合场景，不留下可见的伪影，复制的区域在粘贴前可以进行仿射变换处理。大多数现有方法都不能处理这些转换。提出了一种在仿射变换下检测区域复制伪造的方法。首先对图像进行过滤，并将其划分为重叠的圆形块。然后提取归一化颜色直方图(NCH)作为块特征。伪造检测是通过比较NCH特征来实现的。设计了一种新的滤波器来处理初始检测结果。形态学运算后得到最终的检测图。仿真结果表明了该方法的有效性。

引用次数: 0

A New Moment Based Image Quality Metric 一种新的基于矩量的图像质量度量

2013 Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing

Pub Date : 2013-10-16 DOI: 10.1109/IIH-MSP.2013.139

Leida Li, Hancheng Zhu, Deqiang Cheng, Jeng-Shyang Pan

This paper presents a new full-reference image quality measure using discrete orthogonal moments. The sign of the moment is considered and the relative difference of the moments is obtained by comparing the absolute moment difference (AMD) with the magnitude of the original moment. A new quality function is proposed, which is an exponential function of the relative moment difference (RMD). Simulation results show the efficiency of the method.

提出了一种基于离散正交矩的全参考图像质量度量方法。考虑了矩的符号，通过将绝对矩差(AMD)与原始矩的大小进行比较，得到了矩的相对差值。提出了一种新的质量函数，即相对矩差(RMD)的指数函数。仿真结果表明了该方法的有效性。

引用次数: 0

Towards Estimation of Quality of Watermarked Audio Signal Using Objective Measures 用客观度量方法估计带水印音频信号的质量

2013 Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing

Pub Date : 2013-10-16 DOI: 10.1109/IIH-MSP.2013.78

K. Kondo

This paper compares two objective audio quality assessment measures calculated for three watermark methods with its corresponding subjective quality. The aim was to see if these measures could be used to estimate the subjective audio quality with various watermarks. Samples were watermarked with the LSB substitution, the direct spread-spectrum, and the echo hiding methods. The objective scores were calculated using peaqb, an implementation of the ITU-R BS.1387-1 standard, and PEMO-Q. PEMO-Q showed significantly higher correlation, about 0.90 compared to peaqb. Initial quality estimation tests were also conducted, where regression from objective score to the subjective score of one watermark (e.g. LSB) was estimated, and this regression was used to estimate the subjective score of another watermark method (e.g. spread-spectrum) from its objective score. PEMO-Q showed higher estimation accuracy, with Root Mean Square Error (RMSE) of about 11%.

本文将三种水印方法计算的两种客观音质评价测度与其相应的主观音质进行了比较。目的是看看这些措施是否可以用来估计主观音频质量与各种水印。采用LSB替换、直接扩频和回波隐藏等方法对样本进行了水印处理。使用peaqb (ITU-R BS.1387-1标准的实现)和PEMO-Q计算客观分数。pomo - q与peaqb的相关性为0.90。还进行了初始质量估计测试，其中估计从客观评分到主观评分的一个水印(例如LSB)的回归，并使用该回归从其客观评分估计另一个水印方法(例如扩频)的主观评分。PEMO-Q具有较高的估计精度，均方根误差(RMSE)约为11%。

引用次数: 1

A New Frequency Pre-estimation Aided Carrier Recovery Algorithm for Multimodal Signal System 一种基于频率预估计的多模态信号载波恢复新算法

2013 Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing

Pub Date : 2013-10-16 DOI: 10.1109/IIH-MSP.2013.49

Wang Ranran, Wang Botao, Lu-Xin Yan

Multimedia System has been widely developed. This paper proposes a new kind of fast Fourier transformation (FFT) and a carrier recovery loop for accurate fine tracking. This paper uses the FFT carrier frequency offset to pre-estimate it that corrects the big frequency firstly, based on this, it uses the carrier frequency ring circuit to correct the small frequency offset. Comparing with other methods, its estimation is more accurate.

多媒体系统得到了广泛的发展。本文提出了一种新的快速傅立叶变换(FFT)和载波恢复环，用于精确的精细跟踪。本文采用FFT载波频偏预估先对大频率进行校正，在此基础上采用载波环电路对小频率进行校正。与其他方法相比，其估计精度更高。

引用次数: 1

Digital Rights Management System Based on PKCS#12 基于pkcs# 12的数字版权管理系统

2013 Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing

Pub Date : 2013-10-16 DOI: 10.1109/IIH-MSP.2013.163

Zhi-Chun Li, Chunxiao Zhang

This paper proposes a DRM system based on PKCS#12 to meet the requirement of security and flexibility in digital media application. It designs the system architecture and the security protocol of user registration, certificate issuing, encrypted digital content distribution, authorized license delivery, authentication and decryption, etc. With the security feature of PKCS#12 and the designed protocol, the proposed system can ensure the security of certificate and private key during the storage and transfer. And this system supports participation through different devices, can prevent digital rights from illegal sharing.

为了满足数字媒体应用对安全性和灵活性的要求，本文提出了一种基于pkcs# 12的数字版权管理系统。设计了用户注册、证书颁发、加密数字内容分发、授权许可证发放、认证和解密等系统架构和安全协议。利用pkcs# 12的安全特性和所设计的协议，可以保证证书和私钥在存储和传输过程中的安全。并且该系统支持通过不同的设备参与，可以防止非法共享数字版权。

引用次数: 0

Gain Factors Calibration in 3D Sound Reproduction Using VBAP 用VBAP标定三维声音再现中的增益因子

2013 Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing

Pub Date : 2013-10-16 DOI: 10.1109/IIH-MSP.2013.82

Hu Ruimin, Zhang Maosheng, Yang Yuhong, Wang Xiaochen, Shi Dong, Jiang Lin

Vector-based amplitude panning in three dimensional sound reproduction aims to preserve both sound image direction and distance perception. While in the estimation process, the loudspeakers are supposed to place on a sphere. It is possible that this requirement cannot be met in home environment. An alternative method to estimate gain factors in vector-based amplitude panning is proposed to preserve distance perception in this study. The experiments confirm that listeners do not perceive obvious distance differences when panning and confirm the validation of the proposed method.

在三维声音再现中，基于矢量的幅度平移是为了保持声音图像的方向和距离感知。在估计过程中，扬声器应该放置在一个球体上。在家庭环境中可能无法满足此要求。本文提出了一种基于矢量的幅值平移中增益因子估计的替代方法，以保持距离感知。实验证实了听众在平移时没有感觉到明显的距离差异，验证了所提方法的有效性。

引用次数: 1

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

2013 Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀