首页 > 最新文献

Proceedings of the Fifth IEEE International Symposium on Signal Processing and Information Technology, 2005.最新文献

英文 中文
A butterfly structure for rate k/n convolutional codes 速率k/n卷积码的蝴蝶结构
Chau-Yun Hsu, T. Kuo
In this paper, we propose a butterfly structure for Viterbi decoder which works for convolutional codes of all rates R = k/n. It provides an efficient way to find the inherent symmetry of trellis branches. By exploiting the symmetry, only a part of branch metric need to be computed and the others can be derived from the computed branches. Consequently, the computational complexity of Viterbi decoder can be reduced significantly with no error performance loss. In the best case, the butterfly structure can reduce the branch metric computation by a factor of 4
在本文中,我们提出了一种适用于所有速率R = k/n的卷积码的Viterbi解码器的蝴蝶结构。它提供了一种有效的方法来发现网格分支的内在对称性。利用这种对称性,只需要计算分支度规的一部分,其他的分支度规可以由计算出来的分支导出。因此,维特比解码器的计算复杂度可以显著降低,且没有错误性能损失。在最好的情况下,蝶形结构可以将分支度量计算减少4倍
{"title":"A butterfly structure for rate k/n convolutional codes","authors":"Chau-Yun Hsu, T. Kuo","doi":"10.1093/ietfec/e89-a.2.630","DOIUrl":"https://doi.org/10.1093/ietfec/e89-a.2.630","url":null,"abstract":"In this paper, we propose a butterfly structure for Viterbi decoder which works for convolutional codes of all rates R = k/n. It provides an efficient way to find the inherent symmetry of trellis branches. By exploiting the symmetry, only a part of branch metric need to be computed and the others can be derived from the computed branches. Consequently, the computational complexity of Viterbi decoder can be reduced significantly with no error performance loss. In the best case, the butterfly structure can reduce the branch metric computation by a factor of 4","PeriodicalId":421826,"journal":{"name":"Proceedings of the Fifth IEEE International Symposium on Signal Processing and Information Technology, 2005.","volume":"83 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2006-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126180627","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Path-based encoding efficiency of wavelet-based compressed imagery 基于小波压缩图像的路径编码效率
A.F. Leider, E. Balster
This paper presents a method for testing the performance of different path-based coders of wavelet transformed imagery. Path-based coders are very popular techniques for compressing wavelet-transformed imagery, but there has been little knowledge formally documented on the most efficient method of path traversal for the coding process. Thus, this paper presents the findings of several performance tests using two path-based coding techniques. These techniques are used to independently code each subband of several wavelet transformed images using different coding paths in each test. The results are documented and give rise to conclusions on how to best traverse each subband in a wavelet-transformed image for most efficient coding
本文提出了一种测试小波变换图像不同路径编码器性能的方法。基于路径的编码器是非常流行的压缩小波变换图像的技术,但是很少有关于编码过程中最有效的路径遍历方法的正式文档。因此,本文介绍了使用两种基于路径的编码技术的几个性能测试的结果。这些技术用于在每个测试中使用不同的编码路径对多个小波变换图像的每个子带进行独立编码。结果被记录下来,并得出关于如何在小波变换图像中最好地遍历每个子带以获得最有效编码的结论
{"title":"Path-based encoding efficiency of wavelet-based compressed imagery","authors":"A.F. Leider, E. Balster","doi":"10.1109/ISSPIT.2005.1577095","DOIUrl":"https://doi.org/10.1109/ISSPIT.2005.1577095","url":null,"abstract":"This paper presents a method for testing the performance of different path-based coders of wavelet transformed imagery. Path-based coders are very popular techniques for compressing wavelet-transformed imagery, but there has been little knowledge formally documented on the most efficient method of path traversal for the coding process. Thus, this paper presents the findings of several performance tests using two path-based coding techniques. These techniques are used to independently code each subband of several wavelet transformed images using different coding paths in each test. The results are documented and give rise to conclusions on how to best traverse each subband in a wavelet-transformed image for most efficient coding","PeriodicalId":421826,"journal":{"name":"Proceedings of the Fifth IEEE International Symposium on Signal Processing and Information Technology, 2005.","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116942897","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Zipf, neural networks and SVM for musical genre classification Zipf,神经网络和支持向量机的音乐类型分类
E. Dellandréa, Hadi Harb, Liming Chen
We present in this paper audio classification schemes that we have experimented in order to perform musical genres classification. This type of classification is a part of a more general domain which is automatic semantic audio classification, the applications of which are more and more numerous in such fields as musical or multimedia databases indexing. Experimental results have shown that the feature set we have developed, based on Zipf laws, associated with a combination of classifiers organized hierarchically according to classes taxonomy allow an efficient classification
在本文中,我们提出了我们已经实验的音频分类方案,以便进行音乐类型分类。这种类型的分类是语义音频自动分类这一更广泛领域的一部分,在音乐或多媒体数据库索引等领域的应用越来越多。实验结果表明,我们基于Zipf定律开发的特征集与根据类分类法分层组织的分类器组合相关联,可以实现有效的分类
{"title":"Zipf, neural networks and SVM for musical genre classification","authors":"E. Dellandréa, Hadi Harb, Liming Chen","doi":"10.1109/ISSPIT.2005.1577070","DOIUrl":"https://doi.org/10.1109/ISSPIT.2005.1577070","url":null,"abstract":"We present in this paper audio classification schemes that we have experimented in order to perform musical genres classification. This type of classification is a part of a more general domain which is automatic semantic audio classification, the applications of which are more and more numerous in such fields as musical or multimedia databases indexing. Experimental results have shown that the feature set we have developed, based on Zipf laws, associated with a combination of classifiers organized hierarchically according to classes taxonomy allow an efficient classification","PeriodicalId":421826,"journal":{"name":"Proceedings of the Fifth IEEE International Symposium on Signal Processing and Information Technology, 2005.","volume":"69 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127121402","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
A hybrid level set approach for efficient and reliable image segmentation 一种高效可靠的混合水平集图像分割方法
Seongjai Kim
This article is concerned with a level set segmentation algorithm which hybridizes gradient-based methods and the Mumford-Shah (gradient-free) method, for an efficient and reliable segmentation. We introduce a new strategy for the complementary functions uplusmn , which is computed such that the difference between their average and the given image are able to introduce a reliable driving force for the evolution of the level set function. An effective method of background subtraction is suggested in order to improve reliability of the new model. An incomplete (linearized) alternating direction implicit (ADI) method is applied for an efficient time-stepping procedure. For a fast convergence, we also suggest effective initialization strategies for the level set function. The resulting algorithm has proved to locate the desired edges satisfactorily in 2-4 ADI iterations
本文研究了一种水平集分割算法,该算法将基于梯度的方法与Mumford-Shah(无梯度)方法相结合,以达到高效可靠的分割效果。我们对互补函数upusmn引入了一种新的策略,该策略的计算使得它们的平均值与给定图像之间的差能够为水平集函数的演化引入可靠的驱动力。为了提高新模型的可靠性,提出了一种有效的背景减法。采用不完全(线性化)交替方向隐式(ADI)方法求解时间步进问题。为了快速收敛,我们还提出了有效的水平集函数初始化策略。结果表明,该算法在2-4次ADI迭代中可以令人满意地定位到所需的边缘
{"title":"A hybrid level set approach for efficient and reliable image segmentation","authors":"Seongjai Kim","doi":"10.1109/ISSPIT.2005.1577191","DOIUrl":"https://doi.org/10.1109/ISSPIT.2005.1577191","url":null,"abstract":"This article is concerned with a level set segmentation algorithm which hybridizes gradient-based methods and the Mumford-Shah (gradient-free) method, for an efficient and reliable segmentation. We introduce a new strategy for the complementary functions uplusmn , which is computed such that the difference between their average and the given image are able to introduce a reliable driving force for the evolution of the level set function. An effective method of background subtraction is suggested in order to improve reliability of the new model. An incomplete (linearized) alternating direction implicit (ADI) method is applied for an efficient time-stepping procedure. For a fast convergence, we also suggest effective initialization strategies for the level set function. The resulting algorithm has proved to locate the desired edges satisfactorily in 2-4 ADI iterations","PeriodicalId":421826,"journal":{"name":"Proceedings of the Fifth IEEE International Symposium on Signal Processing and Information Technology, 2005.","volume":"36 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124940708","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Using Julian set patterns for higher robustness in correlation based watermarking methods
F. Yaghmaee, M. Jamzad
Some of the most important classes of watermark detection methods in image watermarking are correlation-based algorithms. In these methods usually a pseudorandom pattern is embedded in host image. Receiver can regenerate this pattern by having a key that is the seed of a random number generator. After that if the correlation between this pattern and the image that is assumed to have the watermark is higher than a predefined threshold, this means that the watermark exists and vise versa. In this paper we showed the advantage of using the Julian set patterns as a watermark, instead of commonly used pseudorandom noisy pattern. Julian set patterns can be regenerate in receiver with few parameters such as coefficients of its function and an initial point, which can be embedded in the key. Our experiments showed that these patterns not only manipulate fewer numbers of pixels but also increase the robustness of watermark against attacks
在图像水印中,一些最重要的水印检测方法是基于相关性的算法。在这些方法中,通常在宿主图像中嵌入伪随机模式。接收方可以通过拥有作为随机数生成器种子的密钥来重新生成此模式。之后,如果该模式与假定具有水印的图像之间的相关性高于预定义的阈值,则意味着水印存在,反之亦然。在本文中,我们展示了使用朱利安集合模式作为水印的优点,而不是常用的伪随机噪声模式。在接收机中,只需少量参数(函数系数和初始点)即可生成朱利安集模式,初始点可嵌入密钥中。我们的实验表明,这些模式不仅操纵更少的像素,而且提高了水印对攻击的鲁棒性
{"title":"Using Julian set patterns for higher robustness in correlation based watermarking methods","authors":"F. Yaghmaee, M. Jamzad","doi":"10.1109/ISSPIT.2005.1577123","DOIUrl":"https://doi.org/10.1109/ISSPIT.2005.1577123","url":null,"abstract":"Some of the most important classes of watermark detection methods in image watermarking are correlation-based algorithms. In these methods usually a pseudorandom pattern is embedded in host image. Receiver can regenerate this pattern by having a key that is the seed of a random number generator. After that if the correlation between this pattern and the image that is assumed to have the watermark is higher than a predefined threshold, this means that the watermark exists and vise versa. In this paper we showed the advantage of using the Julian set patterns as a watermark, instead of commonly used pseudorandom noisy pattern. Julian set patterns can be regenerate in receiver with few parameters such as coefficients of its function and an initial point, which can be embedded in the key. Our experiments showed that these patterns not only manipulate fewer numbers of pixels but also increase the robustness of watermark against attacks","PeriodicalId":421826,"journal":{"name":"Proceedings of the Fifth IEEE International Symposium on Signal Processing and Information Technology, 2005.","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123540434","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Mode decision optimization issues in H.264 video coding H.264视频编码中的模式决策优化问题
S. Spinsante, Franco Chiaraluce, E. Gambi, Chiara Falasconi
The high compression efficiency ensured by the JVT H.264 video coding standard is mostly based on the adoption of a variable block-size motion estimation and mode decision. However, these advanced features imply a remarkable increase in the encoder complexity. To face the complexity issues, a number of algorithms and techniques have been developed, with the aim to speed up both the processes. In this paper, one of the algorithms proposed for fast mode decision is reviewed and evaluated, with respect to the reference software implementation, and some improvements are suggested, in order to make its behavior adaptable to the video content
JVT H.264视频编码标准保证的高压缩效率主要是基于采用可变块大小的运动估计和模式决策。然而,这些高级特性意味着编码器复杂性的显著增加。为了解决这一复杂问题,人们开发了许多算法和技术,旨在加快这两个过程。本文对其中一种快速模式决策算法进行了回顾和评价,并结合参考软件的实现,提出了一些改进建议,以使其行为适应视频内容
{"title":"Mode decision optimization issues in H.264 video coding","authors":"S. Spinsante, Franco Chiaraluce, E. Gambi, Chiara Falasconi","doi":"10.1109/ISSPIT.2005.1577169","DOIUrl":"https://doi.org/10.1109/ISSPIT.2005.1577169","url":null,"abstract":"The high compression efficiency ensured by the JVT H.264 video coding standard is mostly based on the adoption of a variable block-size motion estimation and mode decision. However, these advanced features imply a remarkable increase in the encoder complexity. To face the complexity issues, a number of algorithms and techniques have been developed, with the aim to speed up both the processes. In this paper, one of the algorithms proposed for fast mode decision is reviewed and evaluated, with respect to the reference software implementation, and some improvements are suggested, in order to make its behavior adaptable to the video content","PeriodicalId":421826,"journal":{"name":"Proceedings of the Fifth IEEE International Symposium on Signal Processing and Information Technology, 2005.","volume":"54 5","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114044783","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Speech synthesis from surface electromyogram signal 基于表面肌电信号的语音合成
Y. Lam, M. Mak, P. Leong
This paper presents a methodology that uses surface electromyogram (SEMG) signals recorded from the cheek and chin to synthesize speech. Simultaneously recorded speech and SEMG signals are blocked into frames and transformed into features. Linear predictive coding (LPC) and short-time Fourier transform coefficients are chosen as speech and SEMG features respectively. A neural network is applied to convert SEMG features into speech features on a frame-by-frame basis. The converted speech features are used to reconstruct the original speech. Feature selection, conversion methodology and experimental results are discussed. The results show that phoneme-based feature extraction and frame-based feature conversion could be applied to SEMG-based continuous speech synthesis
本文提出了一种利用脸颊和下巴的表面肌电信号合成语音的方法。同时记录的语音和表面肌电信号被阻塞成帧并转换成特征。分别选择线性预测编码(LPC)和短时傅立叶变换系数作为语音特征和表面肌电信号特征。利用神经网络将表面肌电信号特征逐帧转换为语音特征。将转换后的语音特征用于重建原始语音。讨论了特征选择、转换方法和实验结果。结果表明,基于音素的特征提取和基于帧的特征转换可以应用于基于表面肌电信号的连续语音合成
{"title":"Speech synthesis from surface electromyogram signal","authors":"Y. Lam, M. Mak, P. Leong","doi":"10.1109/ISSPIT.2005.1577192","DOIUrl":"https://doi.org/10.1109/ISSPIT.2005.1577192","url":null,"abstract":"This paper presents a methodology that uses surface electromyogram (SEMG) signals recorded from the cheek and chin to synthesize speech. Simultaneously recorded speech and SEMG signals are blocked into frames and transformed into features. Linear predictive coding (LPC) and short-time Fourier transform coefficients are chosen as speech and SEMG features respectively. A neural network is applied to convert SEMG features into speech features on a frame-by-frame basis. The converted speech features are used to reconstruct the original speech. Feature selection, conversion methodology and experimental results are discussed. The results show that phoneme-based feature extraction and frame-based feature conversion could be applied to SEMG-based continuous speech synthesis","PeriodicalId":421826,"journal":{"name":"Proceedings of the Fifth IEEE International Symposium on Signal Processing and Information Technology, 2005.","volume":"287 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114179020","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Global automatic thresholding with edge information and moving average on histogram 基于直方图边缘信息和移动平均的全局自动阈值分割
Yu-Kumg Chen, Yi-Fan Chang
Optical character recognition occupies a very important field in digital image processing. It is used extensively in daily life. If the given image does not have a bimodal intensity histogram, it would cause segmenting mistake easily for the previous bi-level algorithms. In order to solve this problem, a new algorithm is proposed in this paper. The proposed algorithm uses the theory of moving average on the histogram of the fuzzy image, and then derives the better histogram. Since use only one thresholding value cannot solve this problem completely, the edge information and the window processing are introduced in this paper for advanced thresholding. Thus, a more refine bi-level image is derived and it will result in the improvement of optical character recognition. Experiments are carried out for some samples with shading to demonstrate the computational advantage of the proposed method
光学字符识别在数字图像处理中占有非常重要的地位。它在日常生活中被广泛使用。如果给定的图像没有双峰强度直方图,那么以前的双级算法很容易产生分割错误。为了解决这一问题,本文提出了一种新的算法。该算法利用移动平均理论对模糊图像的直方图进行分析,得到较好的直方图。由于仅使用一个阈值并不能完全解决这一问题,本文引入了边缘信息和窗口处理来进行高级阈值处理。从而得到更精细的双电平图像,从而提高光学字符识别的精度。对一些带有阴影的样本进行了实验,验证了该方法的计算优势
{"title":"Global automatic thresholding with edge information and moving average on histogram","authors":"Yu-Kumg Chen, Yi-Fan Chang","doi":"10.1109/ISSPIT.2005.1577189","DOIUrl":"https://doi.org/10.1109/ISSPIT.2005.1577189","url":null,"abstract":"Optical character recognition occupies a very important field in digital image processing. It is used extensively in daily life. If the given image does not have a bimodal intensity histogram, it would cause segmenting mistake easily for the previous bi-level algorithms. In order to solve this problem, a new algorithm is proposed in this paper. The proposed algorithm uses the theory of moving average on the histogram of the fuzzy image, and then derives the better histogram. Since use only one thresholding value cannot solve this problem completely, the edge information and the window processing are introduced in this paper for advanced thresholding. Thus, a more refine bi-level image is derived and it will result in the improvement of optical character recognition. Experiments are carried out for some samples with shading to demonstrate the computational advantage of the proposed method","PeriodicalId":421826,"journal":{"name":"Proceedings of the Fifth IEEE International Symposium on Signal Processing and Information Technology, 2005.","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122245687","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A new micro-mobility scheme using core node in all-IP-based wireless systems 一种基于全ip无线系统的核心节点微移动新方案
Sungho Kim, Sunshin An
This paper proposes a new micro-mobility scheme using core node. This node manages routing information for a mobile host and not to generate unnecessary messages at each base station and wired node. This scheme can prevent control traffics from all networks and reduce packet loss and handover latency because these messages can be sent to only the core node. We have conducted distribution of traffic as messages which created at old BS and then disappeared at core node for maintenance of routing table. We have evaluated the performance of our micro-mobility scheme through a series of simulations using the network simulator 2 (ns-2)
本文提出了一种新的基于核心节点的微移动方案。该节点为移动主机管理路由信息,避免在每个基站和有线节点上产生不必要的消息。该方案可以防止来自所有网络的控制流量,减少丢包和切换延迟,因为这些消息只发送到核心节点。为了维护路由表,我们将流量作为消息进行分发,这些消息在旧的BS上产生,然后在核心节点上消失。我们通过使用网络模拟器2 (ns-2)的一系列模拟来评估我们的微移动方案的性能。
{"title":"A new micro-mobility scheme using core node in all-IP-based wireless systems","authors":"Sungho Kim, Sunshin An","doi":"10.1109/ISSPIT.2005.1577091","DOIUrl":"https://doi.org/10.1109/ISSPIT.2005.1577091","url":null,"abstract":"This paper proposes a new micro-mobility scheme using core node. This node manages routing information for a mobile host and not to generate unnecessary messages at each base station and wired node. This scheme can prevent control traffics from all networks and reduce packet loss and handover latency because these messages can be sent to only the core node. We have conducted distribution of traffic as messages which created at old BS and then disappeared at core node for maintenance of routing table. We have evaluated the performance of our micro-mobility scheme through a series of simulations using the network simulator 2 (ns-2)","PeriodicalId":421826,"journal":{"name":"Proceedings of the Fifth IEEE International Symposium on Signal Processing and Information Technology, 2005.","volume":"28 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130228263","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Time-frequency approach for analysis and synthesis of particular emotional voice 特定情绪声音的时频分析与合成方法
M. Kobayashi, S. Wada
In this paper, the analysis and synthesis methods of particular emotional voice for man-machine natural interface is developed. First, the emotional voice (neutral, anger, sadness, joy, dislike) is analyzed using time-frequency representation of speech and similarity analysis. Then, based on the result of emotional analysis, a voice with neutral emotion is transformed to synthesize the particular emotional voice using time-frequency modifications. In the simulations, five types of emotion are analyzed using 50 samples of speech signals. The satisfactory average discrimination rate is achieved in the similarity analysis. Further, the synthesized emotional voice is subjectively evaluated. It is confirmed that the emotional voice is naturally generated by the proposed time-frequency based approach
本文研究了面向人机自然界面的特定情感语音的分析与合成方法。首先,使用语音时频表示和相似度分析对情绪语音(中性、愤怒、悲伤、喜悦、厌恶)进行分析。然后,在情绪分析结果的基础上,对具有中性情绪的语音进行时频变换,合成特定的情绪语音。在模拟中,使用50个语音信号样本分析了五种类型的情绪。在相似度分析中获得了令人满意的平均识别率。进一步,对合成的情感声音进行主观评价。结果表明,基于时频的方法能够自然地产生情感语音
{"title":"Time-frequency approach for analysis and synthesis of particular emotional voice","authors":"M. Kobayashi, S. Wada","doi":"10.1109/ISSPIT.2005.1577069","DOIUrl":"https://doi.org/10.1109/ISSPIT.2005.1577069","url":null,"abstract":"In this paper, the analysis and synthesis methods of particular emotional voice for man-machine natural interface is developed. First, the emotional voice (neutral, anger, sadness, joy, dislike) is analyzed using time-frequency representation of speech and similarity analysis. Then, based on the result of emotional analysis, a voice with neutral emotion is transformed to synthesize the particular emotional voice using time-frequency modifications. In the simulations, five types of emotion are analyzed using 50 samples of speech signals. The satisfactory average discrimination rate is achieved in the similarity analysis. Further, the synthesized emotional voice is subjectively evaluated. It is confirmed that the emotional voice is naturally generated by the proposed time-frequency based approach","PeriodicalId":421826,"journal":{"name":"Proceedings of the Fifth IEEE International Symposium on Signal Processing and Information Technology, 2005.","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2005-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129404017","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
Proceedings of the Fifth IEEE International Symposium on Signal Processing and Information Technology, 2005.
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1