Text-independent speaker identification based on feature transformation to phoneme-independent subspace

2008 11th IEEE International Conference on Communication Technology Pub Date : 2008-12-16 DOI:10.1109/ICCT.2008.4716204

Haoze Lu, H. Okamoto, M. Nishida, Y. Horiuchi, S. Kuroiwa

引用次数: 4

Abstract

In text-independent (TI) speaker identification, the variation of phonetic information strongly affects the performance of speaker identification. If this phonetic information in his/her speech data can be suppressed, a robust TI speaker identification system will be realized by using speech features having less phonetic information. In this paper, we propose a TI speaker identification method that suppresses the phonetic information by a subspace method, under the assumption that a subspace with large variance in the speech feature space is a ldquophoneme-dependent subspacerdquo and a complementary subspace of it is a ldquophoneme-independent subspacerdquo. Principal Component Analysis (PCA) is utilized to construct these subspaces. We carried out GMM-based speaker identification experiments using both a new feature vector of the proposed method and the conventional MFCC. As a result, the proposed method reduced the identification error rate by 21% compared with the conventional MFCC.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于特征转换到音素无关子空间的文本无关说话人识别

在不依赖文本的说话人识别中，语音信息的变化严重影响说话人识别的效果。如果语音数据中的语音信息可以被抑制，则可以利用语音信息较少的语音特征实现鲁棒TI说话人识别系统。本文提出了一种用子空间方法对语音信息进行抑制的TI说话人识别方法，该方法假设语音特征空间中方差较大的子空间是依赖于语音的子空间，其互补子空间是依赖于语音的子空间。利用主成分分析(PCA)构造这些子空间。我们使用该方法的新特征向量和传统的MFCC进行了基于gmm的说话人识别实验。结果表明，与传统的MFCC相比，该方法的识别错误率降低了21%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2008 11th IEEE International Conference on Communication Technology

自引率

0.00%

发文量