Fusing wavelet and short-term features for speaker identification in noisy environment

2018 International Conference on Intelligent Systems and Computer Vision (ISCV) Pub Date : 2018-04-01 DOI:10.1109/ISACV.2018.8354030

Sara Sekkate, Mohammed Khalil, A. Adib

引用次数: 4

Abstract

Effective Speaker Identification System (SIS) involves extracting features effectively. In this paper, we propose a feature extraction scheme based on wavelet analysis which is used along with short-term features. To overcome the drawbacks of Discrete Wavelet Transform (DWT), we propose to combine Stationary Wavelet Transform (SWT) with Mel-Frequency Cepstral Coefficient (MFCC) features. The combined features were used as inputs to K-nearest neighbors (Knn) classifier. The effectiveness of the proposed method is investigated for closed-set text-independent SIS in clean and noisy environments. The experimental results indicated that the proposed approach can achieve better identification rate performance with feature extraction using SWT rather than DWT.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

融合小波与短时特征的噪声环境下说话人识别

有效的说话人识别系统(SIS)涉及到特征的有效提取。本文提出了一种基于小波分析的特征提取方案，该方案与短期特征结合使用。为了克服离散小波变换(DWT)的缺点，提出将平稳小波变换(SWT)与mel -频率倒谱系数(MFCC)特征相结合。将组合的特征作为k近邻(Knn)分类器的输入。研究了该方法在清洁和噪声环境下的闭集文本无关SIS的有效性。实验结果表明，基于小波变换的特征提取比基于小波变换的特征提取具有更好的识别率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2018 International Conference on Intelligent Systems and Computer Vision (ISCV)

自引率

0.00%

发文量

期刊最新文献

Policy based generic autonomic adapter for a context-aware social-collaborative system Dual-camera 3D head tracking for clinical infant monitoring Integrating web usage mining for an automatic learner profile detection: A learning styles-based approach Deep generative models: Survey Deep neural network dynamic traffic routing system for vehicles