Isolated vowel recognition using linear predictive features and neural network classifier fusion

Proceedings of the Fifth International Conference on Information Fusion. FUSION 2002. (IEEE Cat.No.02EX5997) Pub Date : 2002-07-08 DOI:10.1109/ICIF.2002.1021003

引用次数: 7

Abstract

In this work, various linear predictive feature vectors were used to train three different automated neural networks type classifiers for the task of isolated vowel recognition. The features used included linear prediction filter coefficients, reflection coefficients, log area ratios, and the linear predictive cepstrum. The three neural network classifiers used are the multilayer perceptron, radial basis function and the probabilistic neural network. The linear predictive cepstrum of dimension 12 is the best feature especially when training is done on clean speech and testing is done on noisy speech. Three different classifier fusion strategies (linear fusion, majority voting and weighted majority voting) were found to improve the performance. Linear fusion with varying weights is the best method and is most robust to noise.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于线性预测特征和神经网络分类器融合的孤立元音识别

在这项工作中，使用各种线性预测特征向量来训练三种不同的自动神经网络类型分类器来完成孤立元音识别任务。使用的特征包括线性预测滤波器系数、反射系数、对数面积比和线性预测倒谱。使用的三种神经网络分类器是多层感知器、径向基函数和概率神经网络。12维的线性预测倒谱是最好的特征，特别是在对干净语音进行训练和对有噪声语音进行测试时。找到了三种不同的分类器融合策略(线性融合、多数投票和加权多数投票)来提高性能。变权线性融合是最好的方法，对噪声的鲁棒性最强。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Proceedings of the Fifth International Conference on Information Fusion. FUSION 2002. (IEEE Cat.No.02EX5997)

自引率

0.00%

发文量

期刊最新文献

Approximating fuzzy measures by hierarchically decomposable ones Tracking and fusion for wireless sensor networks A dynamic communication model for loosely coupled hybrid tracking systems On platform-based sensor management An improved Bayes fusion algorithm with the Parzen window method