Telephone speech data corpus and performances of speaker independent recognition system using the corpus

Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications. Pub Date: 1994-09-26. DOI: 10.1109/IVTTA.1994.341535

T. Isobe, K. Murakami

Citations: 3

Abstract

The authors first describe the speech data corpus they collected over the telephone from 400 male and 400 female subjects. They then compare the performance of two triphone-model-based speaker-independent recognition systems, both of which use the corpus for model training and testing. One system uses a conventional continuous mixture density HMM (CDHMM); the other uses a CDHMM with a tree structure of 2,064 Gaussian distributions, which needs only one thirtieth of the Gaussian computation of the conventional model. In the results, the tree-structure CDHMM system performed nearly as well, within 3% of the system using the conventional CDHMM. This shows that tree-structure CDHMMs are useful for telephone speech recognition.
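The abstract does not say how the tree is built or searched, so the following is only a minimal sketch of the general idea behind tree-structured Gaussian evaluation, not the authors' method. It assumes 1-D Gaussians, a fixed branching factor of 4, and a greedy top-down search; `Node`, `build_tree`, and `tree_search` are hypothetical names, and the leaf parameters are random placeholders rather than trained values.

```python
import math
import random


def log_gauss(x, mean, var):
    """Log density of a 1-D Gaussian."""
    return -0.5 * (math.log(2 * math.pi * var) + (x - mean) ** 2 / var)


class Node:
    """Tree node holding one Gaussian; an empty child list marks a leaf."""

    def __init__(self, mean, var, children=None):
        self.mean = mean
        self.var = var
        self.children = children or []


def build_tree(leaves, branching=4):
    """Group nodes sorted by mean, level by level, until one root remains."""
    level = sorted(leaves, key=lambda n: n.mean)
    while len(level) > 1:
        parents = []
        for i in range(0, len(level), branching):
            group = level[i:i + branching]
            mean = sum(c.mean for c in group) / len(group)
            # Parent variance: average child variance plus spread of child means.
            var = sum(c.var + (c.mean - mean) ** 2 for c in group) / len(group)
            parents.append(Node(mean, var, group))
        level = parents
    return level[0]


def tree_search(root, x):
    """Greedy descent: score only the current node's children, follow the best."""
    node, evaluations = root, 0
    while node.children:
        scored = [(log_gauss(x, c.mean, c.var), c) for c in node.children]
        evaluations += len(scored)
        node = max(scored, key=lambda s: s[0])[1]
    return node, evaluations


# Placeholder leaf Gaussians (assumption: random, untrained parameters).
random.seed(0)
leaves = [Node(random.uniform(-50, 50), random.uniform(0.5, 2.0))
          for _ in range(2064)]
root = build_tree(leaves)
best, n_eval = tree_search(root, 3.7)
print(n_eval, "of", len(leaves), "Gaussians evaluated")
```

The point of the sketch is that a top-down search scores only a small fraction of the Gaussians for each input frame. The exact fraction depends on the tree shape and search strategy, so this toy does not reproduce the one-thirtieth figure reported in the abstract, and greedy descent may miss the truly best leaf, which is one plausible source of a small accuracy loss.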


