首页 > 最新文献

2015 IEEE International Conference on Consumer Electronics - Taiwan最新文献

英文 中文
Data arrangement and dimensional compression using Vivaldi for similarity search on structured peer-to-peer network 基于Vivaldi的结构化点对点网络相似性搜索数据整理与维数压缩
Pub Date : 2015-06-06 DOI: 10.1109/ICCE-TW.2015.7216959
Yoshihiro Sugaya, K. Motoyama, S. Omachi
Peer-to-peer system is a promising solution to manage a large amount of data, but similarity search on peer-to-peer network with a restricted small number of messages is a challenging problem. Existing methods that can perform similarity search work only with low-dimensional data. We propose a method to transform the very high-dimensional data into low-dimensional vectors in order to perform similarity search.
点对点系统是管理大量数据的一种很有前途的解决方案,但在消息数量有限的点对点网络上进行相似度搜索是一个具有挑战性的问题。现有的相似度搜索方法只适用于低维数据。提出了一种将非常高维的数据转换为低维的向量来进行相似度搜索的方法。
{"title":"Data arrangement and dimensional compression using Vivaldi for similarity search on structured peer-to-peer network","authors":"Yoshihiro Sugaya, K. Motoyama, S. Omachi","doi":"10.1109/ICCE-TW.2015.7216959","DOIUrl":"https://doi.org/10.1109/ICCE-TW.2015.7216959","url":null,"abstract":"Peer-to-peer system is a promising solution to manage a large amount of data, but similarity search on peer-to-peer network with a restricted small number of messages is a challenging problem. Existing methods that can perform similarity search work only with low-dimensional data. We propose a method to transform the very high-dimensional data into low-dimensional vectors in order to perform similarity search.","PeriodicalId":340402,"journal":{"name":"2015 IEEE International Conference on Consumer Electronics - Taiwan","volume":"6 35","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"120927891","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A deep neural network based approach to mandarin consonant/vowel separation 基于深度神经网络的普通话辅音/元音分离方法
Pub Date : 2015-06-06 DOI: 10.1109/ICCE-TW.2015.7216923
Yen-Teh Liu, Yu Tsao, Ronald Y. Chang
In this paper, we study the problem of Mandarin consonant/vowel separation which is an integral part of many Mandarin speech applications. We propose a deep neural network (DNN) based approach and compare its performance with the support vector machine (SVM) method. Our results demonstrate an improved separation performance yielded by the proposed method, especially on consonant identification.
在本文中,我们研究了普通话的声母分离问题,这是许多普通话语音应用中不可缺少的一部分。我们提出了一种基于深度神经网络(DNN)的方法,并将其性能与支持向量机(SVM)方法进行了比较。我们的结果表明,该方法提高了分离性能,特别是在辅音识别方面。
{"title":"A deep neural network based approach to mandarin consonant/vowel separation","authors":"Yen-Teh Liu, Yu Tsao, Ronald Y. Chang","doi":"10.1109/ICCE-TW.2015.7216923","DOIUrl":"https://doi.org/10.1109/ICCE-TW.2015.7216923","url":null,"abstract":"In this paper, we study the problem of Mandarin consonant/vowel separation which is an integral part of many Mandarin speech applications. We propose a deep neural network (DNN) based approach and compare its performance with the support vector machine (SVM) method. Our results demonstrate an improved separation performance yielded by the proposed method, especially on consonant identification.","PeriodicalId":340402,"journal":{"name":"2015 IEEE International Conference on Consumer Electronics - Taiwan","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127057726","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Single image rain removal based on part-based model 基于零件模型的单幅图像雨水去除
Pub Date : 2015-06-06 DOI: 10.1109/ICCE-TW.2015.7216999
C. Yeh, Pin-Hsian Liu, Cheng-En Yu, Chih-Yang Lin
There are many outdoor vision applications such as surveillance and navigation. One of the challenges is rain removal, especially the rain removal from a single image. In this paper, a single rain image is divided into the high frequency part and the low frequency part by the Gaussian filter. Non-negative matrix factorization (NMF) is used to remove the rain streaks in the low frequency part. Then, Canny edge detection is applied to deal with the rain in the high frequency and the block copy method is employed to preserve the image quality. After that, we applied a rain dictionary to further divide the high frequency into rain and non-rain parts. The experimental results show that the proposed method is better than the state-of-the-art methods, especially in the high frequency part.
有许多户外视觉应用,如监视和导航。其中一个挑战是去除雨水,特别是从单个图像中去除雨水。本文采用高斯滤波方法将单幅降雨图像分为高频部分和低频部分。采用非负矩阵分解(NMF)去除低频部分的雨纹。然后,采用Canny边缘检测处理高频雨,采用分块复制方法保持图像质量;之后,我们使用雨字典将高频进一步划分为雨和非雨部分。实验结果表明,该方法优于现有的方法,特别是在高频部分。
{"title":"Single image rain removal based on part-based model","authors":"C. Yeh, Pin-Hsian Liu, Cheng-En Yu, Chih-Yang Lin","doi":"10.1109/ICCE-TW.2015.7216999","DOIUrl":"https://doi.org/10.1109/ICCE-TW.2015.7216999","url":null,"abstract":"There are many outdoor vision applications such as surveillance and navigation. One of the challenges is rain removal, especially the rain removal from a single image. In this paper, a single rain image is divided into the high frequency part and the low frequency part by the Gaussian filter. Non-negative matrix factorization (NMF) is used to remove the rain streaks in the low frequency part. Then, Canny edge detection is applied to deal with the rain in the high frequency and the block copy method is employed to preserve the image quality. After that, we applied a rain dictionary to further divide the high frequency into rain and non-rain parts. The experimental results show that the proposed method is better than the state-of-the-art methods, especially in the high frequency part.","PeriodicalId":340402,"journal":{"name":"2015 IEEE International Conference on Consumer Electronics - Taiwan","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125493001","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Ultra-low-power voice trigger for wearable devices 用于可穿戴设备的超低功耗语音触发器
Pub Date : 2015-06-06 DOI: 10.1109/ICCE-TW.2015.7217039
Do-Hyung Kim, Seok-hwan Jo, K. Kwon, Yeonbok Lee, Seung-Won Lee, Young-Hwan Park, Sukjin Kim, Jaehyun Kim, Shihwa Lee
We introduce an ultra-low-power digital signal processor (DSP) solution for wearable applications with high performance. It employs three-issue VLIW architecture with the major low-power techniques and implemented with 95K gates in Samsung 28LPP process and runs up to 200MHz. The experimental results demonstrate that a voice trigger application can operate at 6.1MHz under 0.15mW power consumption.
我们推出了一款超低功耗数字信号处理器(DSP)解决方案,适用于高性能可穿戴应用。它采用三期VLIW架构和主要低功耗技术,并在三星28LPP工艺中实现95K门,运行频率高达200MHz。实验结果表明,在0.15mW的功耗下,语音触发应用可以工作在6.1MHz。
{"title":"Ultra-low-power voice trigger for wearable devices","authors":"Do-Hyung Kim, Seok-hwan Jo, K. Kwon, Yeonbok Lee, Seung-Won Lee, Young-Hwan Park, Sukjin Kim, Jaehyun Kim, Shihwa Lee","doi":"10.1109/ICCE-TW.2015.7217039","DOIUrl":"https://doi.org/10.1109/ICCE-TW.2015.7217039","url":null,"abstract":"We introduce an ultra-low-power digital signal processor (DSP) solution for wearable applications with high performance. It employs three-issue VLIW architecture with the major low-power techniques and implemented with 95K gates in Samsung 28LPP process and runs up to 200MHz. The experimental results demonstrate that a voice trigger application can operate at 6.1MHz under 0.15mW power consumption.","PeriodicalId":340402,"journal":{"name":"2015 IEEE International Conference on Consumer Electronics - Taiwan","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127544497","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Vision-based crowded pedestrian detection 基于视觉的拥挤行人检测
Pub Date : 2015-06-06 DOI: 10.1109/ICCE-TW.2015.7216929
Shih-Shinh Huang, Chun-Yuan Chen
Pedestrian detection and counting is an important topic in developing an intelligent surveillance system. In this work, we propose a vision-based system for detecting pedestrians in an image. Be robust to crowded scenes and adapt to incomplete foreground from background subtraction algorithm, expectation maximization (EM) algorithm is applied to impose the constraint of body part for achieving successful detection. A well-known dataset called CAVIAR is used to validate the effectiveness of the proposed method.
行人检测与计数是智能监控系统开发中的一个重要课题。在这项工作中,我们提出了一个基于视觉的系统来检测图像中的行人。为了对拥挤场景的鲁棒性和适应背景减除算法对前景不完全的影响,采用期望最大化算法对人体部位进行约束,实现成功的检测。一个名为CAVIAR的知名数据集被用来验证所提出方法的有效性。
{"title":"Vision-based crowded pedestrian detection","authors":"Shih-Shinh Huang, Chun-Yuan Chen","doi":"10.1109/ICCE-TW.2015.7216929","DOIUrl":"https://doi.org/10.1109/ICCE-TW.2015.7216929","url":null,"abstract":"Pedestrian detection and counting is an important topic in developing an intelligent surveillance system. In this work, we propose a vision-based system for detecting pedestrians in an image. Be robust to crowded scenes and adapt to incomplete foreground from background subtraction algorithm, expectation maximization (EM) algorithm is applied to impose the constraint of body part for achieving successful detection. A well-known dataset called CAVIAR is used to validate the effectiveness of the proposed method.","PeriodicalId":340402,"journal":{"name":"2015 IEEE International Conference on Consumer Electronics - Taiwan","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127608143","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Design of a 0.6-V 0.2-mW CMOS MEMS accelerometer 设计一种0.6 v 0.2 mw CMOS MEMS加速度计
Pub Date : 2015-06-06 DOI: 10.1109/ICCE-TW.2015.7216989
Po-Chang Wu, Bin-Da Liu, C. Yeh, S. Tseng, H. Tsai, Y. Juang
This paper presents a low-voltage low-power monolithic complementary metal-oxide-semiconductor (CMOS) micro-electromechanical-system (MEMS) accelerometer design. This design utilizes low-voltage design techniques without using low-threshold devices or internal supply voltage boosting. The accelerometer, designed in the 0.18-μm CMOS MEMS process, contains the micro-mechanical structure, readout circuits, and a 16-bit delta-sigma analog-to-digital converter (ΔΣ ADC). It occupies an area of only 0.8 × 1 mm2 and draws 0.33 mA of current from a 0.6-V supply. The simulated sensitivity is 3000 LSB/g and the nonlinearity is 0.78% within the ±6 g sensing range.
提出了一种低压低功耗单片互补金属氧化物半导体(CMOS)微机电系统(MEMS)加速度计的设计方案。本设计采用低压设计技术,不使用低阈值器件或内部电源电压升压。该加速度计采用0.18 μm CMOS MEMS工艺设计,包含微机械结构、读出电路和16位delta-sigma模数转换器(ΔΣ ADC)。它占地面积仅为0.8 × 1 mm2,从0.6 v电源中吸取0.33 mA的电流。模拟灵敏度为3000 LSB/g,在±6g的传感范围内非线性为0.78%。
{"title":"Design of a 0.6-V 0.2-mW CMOS MEMS accelerometer","authors":"Po-Chang Wu, Bin-Da Liu, C. Yeh, S. Tseng, H. Tsai, Y. Juang","doi":"10.1109/ICCE-TW.2015.7216989","DOIUrl":"https://doi.org/10.1109/ICCE-TW.2015.7216989","url":null,"abstract":"This paper presents a low-voltage low-power monolithic complementary metal-oxide-semiconductor (CMOS) micro-electromechanical-system (MEMS) accelerometer design. This design utilizes low-voltage design techniques without using low-threshold devices or internal supply voltage boosting. The accelerometer, designed in the 0.18-μm CMOS MEMS process, contains the micro-mechanical structure, readout circuits, and a 16-bit delta-sigma analog-to-digital converter (ΔΣ ADC). It occupies an area of only 0.8 × 1 mm2 and draws 0.33 mA of current from a 0.6-V supply. The simulated sensitivity is 3000 LSB/g and the nonlinearity is 0.78% within the ±6 g sensing range.","PeriodicalId":340402,"journal":{"name":"2015 IEEE International Conference on Consumer Electronics - Taiwan","volume":"421 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132555052","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
An efficient FPGA architecture for hardware realization of hexagonal based motion estimation algorithm 基于六边形的运动估计算法硬件实现的高效FPGA架构
Pub Date : 2015-06-06 DOI: 10.1109/ICCE-TW.2015.7216977
M. Muzammil, I. Ali, M. Sharif, K. A. Khalil
Motion Estimation (ME) is the most critical and complex part of any video codec system. The different algorithms and their architectures are proposed for ME process. In this paper, we have proposed an efficient architecture for Hexagon Based Search (HexBS) algorithm and implemented on XC4VSX25 Virtex4 FPGA. Simulation results show that the proposed architecture is capable of calculating the Motion Vectors (MVs) of 1280×720 High Definition (HD) videos with the best case throughput of 70 frames/sec. Moreover, the power and frequency requirements are 215mW and 127.27 MHz respectively for the proposed architecture with minimum hardware resources. Hence the proposed architecture is suitable for the real-time HD video applications.
运动估计是视频编解码系统中最关键、最复杂的部分。针对ME过程提出了不同的算法及其体系结构。本文提出了一种高效的Hexagon Based Search (HexBS)算法架构,并在XC4VSX25 Virtex4 FPGA上实现。仿真结果表明,该架构能够计算1280×720高清视频的运动矢量(mv),最佳吞吐量为70帧/秒。在硬件资源最少的情况下,该架构的功耗和频率要求分别为215mW和127.27 MHz。因此,所提出的体系结构适用于实时高清视频应用。
{"title":"An efficient FPGA architecture for hardware realization of hexagonal based motion estimation algorithm","authors":"M. Muzammil, I. Ali, M. Sharif, K. A. Khalil","doi":"10.1109/ICCE-TW.2015.7216977","DOIUrl":"https://doi.org/10.1109/ICCE-TW.2015.7216977","url":null,"abstract":"Motion Estimation (ME) is the most critical and complex part of any video codec system. The different algorithms and their architectures are proposed for ME process. In this paper, we have proposed an efficient architecture for Hexagon Based Search (HexBS) algorithm and implemented on XC4VSX25 Virtex4 FPGA. Simulation results show that the proposed architecture is capable of calculating the Motion Vectors (MVs) of 1280×720 High Definition (HD) videos with the best case throughput of 70 frames/sec. Moreover, the power and frequency requirements are 215mW and 127.27 MHz respectively for the proposed architecture with minimum hardware resources. Hence the proposed architecture is suitable for the real-time HD video applications.","PeriodicalId":340402,"journal":{"name":"2015 IEEE International Conference on Consumer Electronics - Taiwan","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132723969","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A comparison study on SVD-based features in different transforms for image splicing detection 基于奇异值分解的不同变换特征在图像拼接检测中的比较研究
Pub Date : 2015-06-06 DOI: 10.1109/ICCE-TW.2015.7216815
Z. Moghaddasi, H. Jalab, R. M. Noor
Digital image forgery is becoming easier to perform because of the rapid developments of various manipulation tools. Between the various image forgery techniques, image splicing is considered as one the most prevalent technique. In this paper, a low dimensional singular value decomposition (SVD) based feature extraction method applied in steganalysis is proposed as an image splicing detection method. The SVD-based features are applied in different spatial and frequency domains to make a comprehensive comparison between these various transforms. Support vector machine is used to distinguish between authentic and spliced images. The results are encouraging and show that the detection accuracy of 77.60% is achieved for the DCT transform with only 25 dimensional feature vector.
由于各种操纵工具的快速发展,数字图像伪造变得越来越容易。在各种图像伪造技术中,图像拼接被认为是最常用的一种技术。本文提出了一种应用于隐写分析的基于低维奇异值分解(SVD)的特征提取方法作为图像拼接检测方法。将基于奇异值分解的特征应用于不同的空间域和频域,对各种变换进行综合比较。支持向量机用于区分真实图像和拼接图像。结果表明,仅使用25维特征向量进行DCT变换,检测准确率达到77.60%。
{"title":"A comparison study on SVD-based features in different transforms for image splicing detection","authors":"Z. Moghaddasi, H. Jalab, R. M. Noor","doi":"10.1109/ICCE-TW.2015.7216815","DOIUrl":"https://doi.org/10.1109/ICCE-TW.2015.7216815","url":null,"abstract":"Digital image forgery is becoming easier to perform because of the rapid developments of various manipulation tools. Between the various image forgery techniques, image splicing is considered as one the most prevalent technique. In this paper, a low dimensional singular value decomposition (SVD) based feature extraction method applied in steganalysis is proposed as an image splicing detection method. The SVD-based features are applied in different spatial and frequency domains to make a comprehensive comparison between these various transforms. Support vector machine is used to distinguish between authentic and spliced images. The results are encouraging and show that the detection accuracy of 77.60% is achieved for the DCT transform with only 25 dimensional feature vector.","PeriodicalId":340402,"journal":{"name":"2015 IEEE International Conference on Consumer Electronics - Taiwan","volume":"218 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134311806","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Magnitude replacement of real and imaginary modulation spectrum of acoustic spectrograms for noise-robust speech recognition 噪声鲁棒语音识别声学谱图实调制谱和虚调制谱的幅度替换
Pub Date : 2015-06-06 DOI: 10.1109/ICCE-TW.2015.7216925
Hsin-Ju Hsieh, J. Hung
In this paper, a novel method is proposed to enhance the complex-valued acoustic spectrograms of speech signals via replacing the magnitude part of the corresponding modulation spectrum in order to create noise-robust feature representation for recognition. All the evaluation experiments implemented on the Aurora-2 digit database and task show that the presented method performs better than the baseline MFCC and several well-known noise-robust techniques. These results apparently reveal that this novel method alleviates the effect of noise in speech features significantly.
本文提出了一种新的方法,通过替换相应调制谱的幅度部分来增强语音信号的复值声学谱图,从而产生噪声鲁棒的特征表示。在Aurora-2数字数据库和任务上进行的所有评估实验表明,该方法的性能优于基线MFCC和几种知名的抗噪技术。结果表明,该方法明显减轻了语音特征中噪声的影响。
{"title":"Magnitude replacement of real and imaginary modulation spectrum of acoustic spectrograms for noise-robust speech recognition","authors":"Hsin-Ju Hsieh, J. Hung","doi":"10.1109/ICCE-TW.2015.7216925","DOIUrl":"https://doi.org/10.1109/ICCE-TW.2015.7216925","url":null,"abstract":"In this paper, a novel method is proposed to enhance the complex-valued acoustic spectrograms of speech signals via replacing the magnitude part of the corresponding modulation spectrum in order to create noise-robust feature representation for recognition. All the evaluation experiments implemented on the Aurora-2 digit database and task show that the presented method performs better than the baseline MFCC and several well-known noise-robust techniques. These results apparently reveal that this novel method alleviates the effect of noise in speech features significantly.","PeriodicalId":340402,"journal":{"name":"2015 IEEE International Conference on Consumer Electronics - Taiwan","volume":"59 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133251421","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Voice conversion based on empirical conditional distribution in resource-limited scenarios 资源受限场景下基于经验条件分布的语音转换
Pub Date : 2015-06-06 DOI: 10.1109/ICCE-TW.2015.7216839
N. Xu, Yibin Tang, J. Bao, Xiao Yao, A. Jiang, Xiaofeng Liu
In this paper, a computationally efficient voice conversion system has been designed in order to improve the performance in resource-limited scenarios. First, mixtures of Gaussians (MoGs) at fixed locations of Mel frequencies have been used to represent the spectrum of STRAIGHT compactly. Second, the key conditional distributions for prediction are approximated by building histograms of aligned features empirically. Experiments have confirmed that our proposed method can obtain fairly good results compared to the traditional method without huge computational costs.
为了在资源有限的情况下提高语音转换系统的性能,本文设计了一个计算效率高的语音转换系统。首先,在Mel频率的固定位置使用高斯混合(mog)来表示STRAIGHT的紧凑频谱。其次,通过经验构建对齐特征的直方图来近似预测关键条件分布。实验证明,与传统方法相比,我们的方法可以获得相当好的结果,而且计算成本不高。
{"title":"Voice conversion based on empirical conditional distribution in resource-limited scenarios","authors":"N. Xu, Yibin Tang, J. Bao, Xiao Yao, A. Jiang, Xiaofeng Liu","doi":"10.1109/ICCE-TW.2015.7216839","DOIUrl":"https://doi.org/10.1109/ICCE-TW.2015.7216839","url":null,"abstract":"In this paper, a computationally efficient voice conversion system has been designed in order to improve the performance in resource-limited scenarios. First, mixtures of Gaussians (MoGs) at fixed locations of Mel frequencies have been used to represent the spectrum of STRAIGHT compactly. Second, the key conditional distributions for prediction are approximated by building histograms of aligned features empirically. Experiments have confirmed that our proposed method can obtain fairly good results compared to the traditional method without huge computational costs.","PeriodicalId":340402,"journal":{"name":"2015 IEEE International Conference on Consumer Electronics - Taiwan","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2015-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127812025","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
2015 IEEE International Conference on Consumer Electronics - Taiwan
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1