首页 > 最新文献

2013 Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing最新文献

英文 中文
Research on Image Matching Algorithm Based on Local Invariant Features 基于局部不变特征的图像匹配算法研究
Jiaqi Liu, Qiang Wu, Xuwen Li
As an important foundation for image-guided technology, image matching technique is the key technology of modern war. This paper proposes a new algorithm of affine invariant detector and descriptor of local invariant feature points, starting from feature point detection and description point of view, making up the traditional feature point extraction defects of small number and types. Meantime, proposes an improved similarity measure method based on the previously proposed new feature point detection and description algorithm, it improves the matching accuracy and real-time performance. Finally, compares the experiment results of SURF, SIFT and the improved algorithm proposed in this paper, the experimental results shows that the feature points extracted by the improved algorithm has fully affine invariance, and improved the accuracy and speed of image matching algorithm efficiently.
图像匹配技术是现代战争的关键技术,是图像制导技术的重要基础。本文从特征点检测和描述的角度出发,提出了一种新的仿射不变特征点检测和局部不变特征点描述子算法,弥补了传统特征点提取数量少、类型少的缺陷。同时,在原有特征点检测与描述算法的基础上,提出了一种改进的相似度度量方法,提高了匹配精度和实时性。最后,将SURF、SIFT与本文提出的改进算法的实验结果进行对比,实验结果表明,改进算法提取的特征点具有完全仿射不变性,有效地提高了图像匹配算法的精度和速度。
{"title":"Research on Image Matching Algorithm Based on Local Invariant Features","authors":"Jiaqi Liu, Qiang Wu, Xuwen Li","doi":"10.1109/IIH-MSP.2013.37","DOIUrl":"https://doi.org/10.1109/IIH-MSP.2013.37","url":null,"abstract":"As an important foundation for image-guided technology, image matching technique is the key technology of modern war. This paper proposes a new algorithm of affine invariant detector and descriptor of local invariant feature points, starting from feature point detection and description point of view, making up the traditional feature point extraction defects of small number and types. Meantime, proposes an improved similarity measure method based on the previously proposed new feature point detection and description algorithm, it improves the matching accuracy and real-time performance. Finally, compares the experiment results of SURF, SIFT and the improved algorithm proposed in this paper, the experimental results shows that the feature points extracted by the improved algorithm has fully affine invariance, and improved the accuracy and speed of image matching algorithm efficiently.","PeriodicalId":105427,"journal":{"name":"2013 Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126055744","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Multi-modal Voice Activity Detection by Embedding Image Features into Speech Signal 将图像特征嵌入语音信号的多模态语音活动检测
Yohei Abe, A. Ito
Lip movement has a close relationship with speech because the lips move when we talk. The idea behind this work is to extract the lip movement feature from the facial video and embed the movement feature into speech signal using information hiding technique. Using the proposed framework, we can provide advanced speech communication only using the speech signal that includes lip movement features, without increasing the bitrate of the signal. In this paper, we show the basic framework of the method and apply the proposal method to multi-modal voice activity detection (VAD). As a result of detection experiment using the support vector machine, we obtained better performance than the audio-only VAD in a noisy environment. In addition, we investigated how data embedding into speech signal affects sound quality and detection performance.
嘴唇的运动与语言有着密切的关系,因为我们说话的时候嘴唇会动。这项工作的思想是从面部视频中提取唇部运动特征,并利用信息隐藏技术将运动特征嵌入到语音信号中。使用该框架,我们可以在不增加信号比特率的情况下,仅使用包含唇部运动特征的语音信号来提供高级语音通信。本文给出了该方法的基本框架,并将该方法应用于多模态语音活动检测(VAD)。通过支持向量机的检测实验,我们在噪声环境下获得了比纯音频VAD更好的性能。此外,我们还研究了数据嵌入语音信号对声音质量和检测性能的影响。
{"title":"Multi-modal Voice Activity Detection by Embedding Image Features into Speech Signal","authors":"Yohei Abe, A. Ito","doi":"10.1109/IIH-MSP.2013.76","DOIUrl":"https://doi.org/10.1109/IIH-MSP.2013.76","url":null,"abstract":"Lip movement has a close relationship with speech because the lips move when we talk. The idea behind this work is to extract the lip movement feature from the facial video and embed the movement feature into speech signal using information hiding technique. Using the proposed framework, we can provide advanced speech communication only using the speech signal that includes lip movement features, without increasing the bitrate of the signal. In this paper, we show the basic framework of the method and apply the proposal method to multi-modal voice activity detection (VAD). As a result of detection experiment using the support vector machine, we obtained better performance than the audio-only VAD in a noisy environment. In addition, we investigated how data embedding into speech signal affects sound quality and detection performance.","PeriodicalId":105427,"journal":{"name":"2013 Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing","volume":"98 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126094819","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Watermarking Method for Speech Signals Based on Modifications to LSFs 基于lsf修正的语音信号水印方法
Shengbei Wang, M. Unoki
We propose a method of speech watermarking based on modifications to line spectral frequencies (LSFs) of original speech. LSFs were derived from each frame with linear prediction (LP) analysis and watermarks were embedded into them by using the quantization index modulation (QIM) of different quantization steps. We took into consideration inaudibility and robustness that were influenced by minor modifications to LSFs. The proposed approach was evaluated with two kinds of experiments with respect to inaudibility and robustness against different speech codecs and general processing. The results from the evaluations revealed that the proposed approach not only had high rate of bit detection while keeping the original sound quality undistorted but also good robustness against general speech processing.
提出了一种基于修改原始语音的线谱频率的语音水印方法。对每一帧图像进行线性预测(LP)分析,得到lsf,并采用不同量化步骤的量化指标调制(QIM)嵌入水印。我们考虑了受lsf轻微修改影响的听不清和健壮性。针对不同语音编解码器和一般处理的不听性和鲁棒性,通过两种实验对该方法进行了评估。结果表明,该方法在保持原始音质不失真的情况下具有较高的比特检测率,并且对一般语音处理具有较好的鲁棒性。
{"title":"Watermarking Method for Speech Signals Based on Modifications to LSFs","authors":"Shengbei Wang, M. Unoki","doi":"10.1109/IIH-MSP.2013.79","DOIUrl":"https://doi.org/10.1109/IIH-MSP.2013.79","url":null,"abstract":"We propose a method of speech watermarking based on modifications to line spectral frequencies (LSFs) of original speech. LSFs were derived from each frame with linear prediction (LP) analysis and watermarks were embedded into them by using the quantization index modulation (QIM) of different quantization steps. We took into consideration inaudibility and robustness that were influenced by minor modifications to LSFs. The proposed approach was evaluated with two kinds of experiments with respect to inaudibility and robustness against different speech codecs and general processing. The results from the evaluations revealed that the proposed approach not only had high rate of bit detection while keeping the original sound quality undistorted but also good robustness against general speech processing.","PeriodicalId":105427,"journal":{"name":"2013 Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing","volume":"131 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124871208","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Multi-channel Audio Compression Method Based on ITU-T G.719 Codec 基于ITU-T G.719编解码器的多通道音频压缩方法
Wen-ling Jiang, Jing Wang, Yi Zhao, Baoguang Liu, Xuan Ji
Through exploiting the human perception of spatial sound, a new approach for compression coding of multi-channel audio signal based on ITU-T G.719 codec is put forward in this paper. Multi-channel input signals are converted to a down-mixed signal plus spatial perceptual parameters by use of down-mix and up-mix step-by-step techniques in frequency domain. The algorithm can significantly reduce the coding rate under the premise of an acceptable sound quality in combination with the G.719 audio codec. The paper presents the implementation of the algorithm and describes in detail the calculation and features of the selected spatial parameters. Finally some experiments are done to evaluate the algorithm from the perspective of the compression ratio, the reconstructed sound quality, and the algorithm complexity.
本文利用人对空间声音的感知,提出了一种基于ITU-T G.719编解码器的多通道音频信号压缩编码新方法。采用频域下混和上混分步技术,将多通道输入信号转换为下混信号加空间感知参数。该算法与G.719音频编解码器结合使用,可以在音质可接受的前提下显著降低编码率。文中给出了该算法的实现,并详细描述了所选空间参数的计算方法和特点。最后通过实验从压缩比、重构音质和算法复杂度等方面对算法进行了评价。
{"title":"Multi-channel Audio Compression Method Based on ITU-T G.719 Codec","authors":"Wen-ling Jiang, Jing Wang, Yi Zhao, Baoguang Liu, Xuan Ji","doi":"10.1109/IIH-MSP.2013.81","DOIUrl":"https://doi.org/10.1109/IIH-MSP.2013.81","url":null,"abstract":"Through exploiting the human perception of spatial sound, a new approach for compression coding of multi-channel audio signal based on ITU-T G.719 codec is put forward in this paper. Multi-channel input signals are converted to a down-mixed signal plus spatial perceptual parameters by use of down-mix and up-mix step-by-step techniques in frequency domain. The algorithm can significantly reduce the coding rate under the premise of an acceptable sound quality in combination with the G.719 audio codec. The paper presents the implementation of the algorithm and describes in detail the calculation and features of the selected spatial parameters. Finally some experiments are done to evaluate the algorithm from the perspective of the compression ratio, the reconstructed sound quality, and the algorithm complexity.","PeriodicalId":105427,"journal":{"name":"2013 Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing","volume":"166 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122585599","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Detection of Region Duplication Forgery in Images under Affine Transforms 仿射变换下图像区域复制伪造检测
Leida Li, Wei Zhang, Shushang Li, Jeng-Shyang Pan
Region duplication is a common method to produce forgery images, where part of an image is copied and pasted somewhere else in the same image. In order to fit the scene better and leave no visible artifacts, the copied region may be processed by affine transforms before being pasted. Most of the existing methods cannot handle these transforms. This paper presents a method to detect the region-duplication forgery under affine transforms. The image is first filtered and divided into overlapping circular blocks. Then the normalized color histogram (NCH) is extracted as the block feature. Forgery detection is achieved by comparing the NCH features. A new filter is designed to process the initial detection results. The final detection map is obtained after morphological operations. Simulations demonstrate the efficiency of the method.
区域复制是生成伪造图像的常用方法,即复制图像的一部分并将其粘贴到同一图像的其他地方。为了更好地贴合场景,不留下可见的伪影,复制的区域在粘贴前可以进行仿射变换处理。大多数现有方法都不能处理这些转换。提出了一种在仿射变换下检测区域复制伪造的方法。首先对图像进行过滤,并将其划分为重叠的圆形块。然后提取归一化颜色直方图(NCH)作为块特征。伪造检测是通过比较NCH特征来实现的。设计了一种新的滤波器来处理初始检测结果。形态学运算后得到最终的检测图。仿真结果表明了该方法的有效性。
{"title":"Detection of Region Duplication Forgery in Images under Affine Transforms","authors":"Leida Li, Wei Zhang, Shushang Li, Jeng-Shyang Pan","doi":"10.1109/IIH-MSP.2013.140","DOIUrl":"https://doi.org/10.1109/IIH-MSP.2013.140","url":null,"abstract":"Region duplication is a common method to produce forgery images, where part of an image is copied and pasted somewhere else in the same image. In order to fit the scene better and leave no visible artifacts, the copied region may be processed by affine transforms before being pasted. Most of the existing methods cannot handle these transforms. This paper presents a method to detect the region-duplication forgery under affine transforms. The image is first filtered and divided into overlapping circular blocks. Then the normalized color histogram (NCH) is extracted as the block feature. Forgery detection is achieved by comparing the NCH features. A new filter is designed to process the initial detection results. The final detection map is obtained after morphological operations. Simulations demonstrate the efficiency of the method.","PeriodicalId":105427,"journal":{"name":"2013 Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130829748","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A New Moment Based Image Quality Metric 一种新的基于矩量的图像质量度量
Leida Li, Hancheng Zhu, Deqiang Cheng, Jeng-Shyang Pan
This paper presents a new full-reference image quality measure using discrete orthogonal moments. The sign of the moment is considered and the relative difference of the moments is obtained by comparing the absolute moment difference (AMD) with the magnitude of the original moment. A new quality function is proposed, which is an exponential function of the relative moment difference (RMD). Simulation results show the efficiency of the method.
提出了一种基于离散正交矩的全参考图像质量度量方法。考虑了矩的符号,通过将绝对矩差(AMD)与原始矩的大小进行比较,得到了矩的相对差值。提出了一种新的质量函数,即相对矩差(RMD)的指数函数。仿真结果表明了该方法的有效性。
{"title":"A New Moment Based Image Quality Metric","authors":"Leida Li, Hancheng Zhu, Deqiang Cheng, Jeng-Shyang Pan","doi":"10.1109/IIH-MSP.2013.139","DOIUrl":"https://doi.org/10.1109/IIH-MSP.2013.139","url":null,"abstract":"This paper presents a new full-reference image quality measure using discrete orthogonal moments. The sign of the moment is considered and the relative difference of the moments is obtained by comparing the absolute moment difference (AMD) with the magnitude of the original moment. A new quality function is proposed, which is an exponential function of the relative moment difference (RMD). Simulation results show the efficiency of the method.","PeriodicalId":105427,"journal":{"name":"2013 Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing","volume":"420 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116855389","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Towards Estimation of Quality of Watermarked Audio Signal Using Objective Measures 用客观度量方法估计带水印音频信号的质量
K. Kondo
This paper compares two objective audio quality assessment measures calculated for three watermark methods with its corresponding subjective quality. The aim was to see if these measures could be used to estimate the subjective audio quality with various watermarks. Samples were watermarked with the LSB substitution, the direct spread-spectrum, and the echo hiding methods. The objective scores were calculated using peaqb, an implementation of the ITU-R BS.1387-1 standard, and PEMO-Q. PEMO-Q showed significantly higher correlation, about 0.90 compared to peaqb. Initial quality estimation tests were also conducted, where regression from objective score to the subjective score of one watermark (e.g. LSB) was estimated, and this regression was used to estimate the subjective score of another watermark method (e.g. spread-spectrum) from its objective score. PEMO-Q showed higher estimation accuracy, with Root Mean Square Error (RMSE) of about 11%.
本文将三种水印方法计算的两种客观音质评价测度与其相应的主观音质进行了比较。目的是看看这些措施是否可以用来估计主观音频质量与各种水印。采用LSB替换、直接扩频和回波隐藏等方法对样本进行了水印处理。使用peaqb (ITU-R BS.1387-1标准的实现)和PEMO-Q计算客观分数。pomo - q与peaqb的相关性为0.90。还进行了初始质量估计测试,其中估计从客观评分到主观评分的一个水印(例如LSB)的回归,并使用该回归从其客观评分估计另一个水印方法(例如扩频)的主观评分。PEMO-Q具有较高的估计精度,均方根误差(RMSE)约为11%。
{"title":"Towards Estimation of Quality of Watermarked Audio Signal Using Objective Measures","authors":"K. Kondo","doi":"10.1109/IIH-MSP.2013.78","DOIUrl":"https://doi.org/10.1109/IIH-MSP.2013.78","url":null,"abstract":"This paper compares two objective audio quality assessment measures calculated for three watermark methods with its corresponding subjective quality. The aim was to see if these measures could be used to estimate the subjective audio quality with various watermarks. Samples were watermarked with the LSB substitution, the direct spread-spectrum, and the echo hiding methods. The objective scores were calculated using peaqb, an implementation of the ITU-R BS.1387-1 standard, and PEMO-Q. PEMO-Q showed significantly higher correlation, about 0.90 compared to peaqb. Initial quality estimation tests were also conducted, where regression from objective score to the subjective score of one watermark (e.g. LSB) was estimated, and this regression was used to estimate the subjective score of another watermark method (e.g. spread-spectrum) from its objective score. PEMO-Q showed higher estimation accuracy, with Root Mean Square Error (RMSE) of about 11%.","PeriodicalId":105427,"journal":{"name":"2013 Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126607975","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A New Frequency Pre-estimation Aided Carrier Recovery Algorithm for Multimodal Signal System 一种基于频率预估计的多模态信号载波恢复新算法
Wang Ranran, Wang Botao, Lu-Xin Yan
Multimedia System has been widely developed. This paper proposes a new kind of fast Fourier transformation (FFT) and a carrier recovery loop for accurate fine tracking. This paper uses the FFT carrier frequency offset to pre-estimate it that corrects the big frequency firstly, based on this, it uses the carrier frequency ring circuit to correct the small frequency offset. Comparing with other methods, its estimation is more accurate.
多媒体系统得到了广泛的发展。本文提出了一种新的快速傅立叶变换(FFT)和载波恢复环,用于精确的精细跟踪。本文采用FFT载波频偏预估先对大频率进行校正,在此基础上采用载波环电路对小频率进行校正。与其他方法相比,其估计精度更高。
{"title":"A New Frequency Pre-estimation Aided Carrier Recovery Algorithm for Multimodal Signal System","authors":"Wang Ranran, Wang Botao, Lu-Xin Yan","doi":"10.1109/IIH-MSP.2013.49","DOIUrl":"https://doi.org/10.1109/IIH-MSP.2013.49","url":null,"abstract":"Multimedia System has been widely developed. This paper proposes a new kind of fast Fourier transformation (FFT) and a carrier recovery loop for accurate fine tracking. This paper uses the FFT carrier frequency offset to pre-estimate it that corrects the big frequency firstly, based on this, it uses the carrier frequency ring circuit to correct the small frequency offset. Comparing with other methods, its estimation is more accurate.","PeriodicalId":105427,"journal":{"name":"2013 Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124187596","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Digital Rights Management System Based on PKCS#12 基于pkcs# 12的数字版权管理系统
Zhi-Chun Li, Chunxiao Zhang
This paper proposes a DRM system based on PKCS#12 to meet the requirement of security and flexibility in digital media application. It designs the system architecture and the security protocol of user registration, certificate issuing, encrypted digital content distribution, authorized license delivery, authentication and decryption, etc. With the security feature of PKCS#12 and the designed protocol, the proposed system can ensure the security of certificate and private key during the storage and transfer. And this system supports participation through different devices, can prevent digital rights from illegal sharing.
为了满足数字媒体应用对安全性和灵活性的要求,本文提出了一种基于pkcs# 12的数字版权管理系统。设计了用户注册、证书颁发、加密数字内容分发、授权许可证发放、认证和解密等系统架构和安全协议。利用pkcs# 12的安全特性和所设计的协议,可以保证证书和私钥在存储和传输过程中的安全。并且该系统支持通过不同的设备参与,可以防止非法共享数字版权。
{"title":"Digital Rights Management System Based on PKCS#12","authors":"Zhi-Chun Li, Chunxiao Zhang","doi":"10.1109/IIH-MSP.2013.163","DOIUrl":"https://doi.org/10.1109/IIH-MSP.2013.163","url":null,"abstract":"This paper proposes a DRM system based on PKCS#12 to meet the requirement of security and flexibility in digital media application. It designs the system architecture and the security protocol of user registration, certificate issuing, encrypted digital content distribution, authorized license delivery, authentication and decryption, etc. With the security feature of PKCS#12 and the designed protocol, the proposed system can ensure the security of certificate and private key during the storage and transfer. And this system supports participation through different devices, can prevent digital rights from illegal sharing.","PeriodicalId":105427,"journal":{"name":"2013 Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing","volume":"267 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116617348","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Gain Factors Calibration in 3D Sound Reproduction Using VBAP 用VBAP标定三维声音再现中的增益因子
Hu Ruimin, Zhang Maosheng, Yang Yuhong, Wang Xiaochen, Shi Dong, Jiang Lin
Vector-based amplitude panning in three dimensional sound reproduction aims to preserve both sound image direction and distance perception. While in the estimation process, the loudspeakers are supposed to place on a sphere. It is possible that this requirement cannot be met in home environment. An alternative method to estimate gain factors in vector-based amplitude panning is proposed to preserve distance perception in this study. The experiments confirm that listeners do not perceive obvious distance differences when panning and confirm the validation of the proposed method.
在三维声音再现中,基于矢量的幅度平移是为了保持声音图像的方向和距离感知。在估计过程中,扬声器应该放置在一个球体上。在家庭环境中可能无法满足此要求。本文提出了一种基于矢量的幅值平移中增益因子估计的替代方法,以保持距离感知。实验证实了听众在平移时没有感觉到明显的距离差异,验证了所提方法的有效性。
{"title":"Gain Factors Calibration in 3D Sound Reproduction Using VBAP","authors":"Hu Ruimin, Zhang Maosheng, Yang Yuhong, Wang Xiaochen, Shi Dong, Jiang Lin","doi":"10.1109/IIH-MSP.2013.82","DOIUrl":"https://doi.org/10.1109/IIH-MSP.2013.82","url":null,"abstract":"Vector-based amplitude panning in three dimensional sound reproduction aims to preserve both sound image direction and distance perception. While in the estimation process, the loudspeakers are supposed to place on a sphere. It is possible that this requirement cannot be met in home environment. An alternative method to estimate gain factors in vector-based amplitude panning is proposed to preserve distance perception in this study. The experiments confirm that listeners do not perceive obvious distance differences when panning and confirm the validation of the proposed method.","PeriodicalId":105427,"journal":{"name":"2013 Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2013-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123085263","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
2013 Ninth International Conference on Intelligent Information Hiding and Multimedia Signal Processing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1