A novel objective function for improved phoneme recognition using time delay neural networks

International 1989 Joint Conference on Neural Networks Pub Date : 1990-06-01 DOI:10.1109/IJCNN.1989.118586

J. Hampshire, A. Waibel

引用次数: 235

Abstract

The authors present single- and multispeaker recognition results for the voiced stop consonants /b, d, g/ using time-delay neural networks (TDNN), a new objective function for training these networks, and a simple arbitration scheme for improved classification accuracy. With these enhancements a median 24% reduction in the number of misclassifications made by TDNNs trained with the traditional backpropagation objective function is achieved. This redundant results in /b, d, g/ recognition rates that consistently exceed 98% for TDNNs trained with individual speakers; it yields a 98.1% recognition rate for a TDNN trained with three male speakers.<>

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

一种利用时滞神经网络改进音素识别的新目标函数

作者介绍了使用时延神经网络(TDNN)对浊音顿音/b, d, g/进行单说话和多说话识别的结果，这是一种新的训练这些网络的目标函数，以及一种简单的仲裁方案，以提高分类精度。通过这些增强，使用传统反向传播目标函数训练的tdnn的误分类次数中位数减少了24%。这种冗余导致/b、d、g/识别率对于单个说话者训练的tdnn始终超过98%;对于由三名男性说话者训练的TDNN，其识别率为98.1%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

International 1989 Joint Conference on Neural Networks

自引率

0.00%

发文量

期刊最新文献

Hybrid distributed/local connectionist architectures A new back-propagation algorithm with coupled neuron A novel objective function for improved phoneme recognition using time delay neural networks Optimization of a digital neuron design Multitarget tracking with an optical neural net using a quadratic energy function