Telephone speech data corpus and performances of speaker independent recognition system using the corpus

T. Isobe, K. Murakami
{"title":"Telephone speech data corpus and performances of speaker independent recognition system using the corpus","authors":"T. Isobe, K. Murakami","doi":"10.1109/IVTTA.1994.341535","DOIUrl":null,"url":null,"abstract":"The authors first describe the speech data corpus they collected from 400 male and 400 female subjects over the phone. They then compare the performances of two types of triphone model based speaker independent recognition systems, in which they used the corpus for training models and testing. One system uses a normal continuous mixture density HMM, and the other uses a CDHMM with a tree structure of 2,064 Gaussian distributions, which needs only one thirtieth of the Gaussian computation of a normal one. As a result, the system with the tree-structure CDHMM performed as well as 3% less than the system using the normal CDHMM. This shows that tree-structure CDHMM are useful for telephone speech recognition.<<ETX>>","PeriodicalId":435907,"journal":{"name":"Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1994-09-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of 2nd IEEE Workshop on Interactive Voice Technology for Telecommunications Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IVTTA.1994.341535","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

The authors first describe the speech data corpus they collected from 400 male and 400 female subjects over the phone. They then compare the performances of two types of triphone model based speaker independent recognition systems, in which they used the corpus for training models and testing. One system uses a normal continuous mixture density HMM, and the other uses a CDHMM with a tree structure of 2,064 Gaussian distributions, which needs only one thirtieth of the Gaussian computation of a normal one. As a result, the system with the tree-structure CDHMM performed as well as 3% less than the system using the normal CDHMM. This shows that tree-structure CDHMM are useful for telephone speech recognition.<>
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
电话语音数据语料库及使用该语料库的说话人独立识别系统的性能
作者首先描述了他们通过电话从400名男性和400名女性受试者中收集的语音数据语料库。然后,他们比较了两种基于三联音模型的独立说话人识别系统的性能,在这两种系统中,他们使用语料库来训练模型和测试。一种系统使用正态连续混合密度HMM,另一种系统使用具有2064个高斯分布的树结构的CDHMM,其所需的高斯计算量仅为正态混合密度HMM的三十分之一。结果表明,使用树状结构CDHMM的系统比使用普通CDHMM的系统性能低3%。这表明树状结构CDHMM在电话语音识别中是有用的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Field trial of a speaker verification service for caller identity verification in the telephone network VoiceDialing-the first speech recognition based telephone service delivered to customer's home A system for field performance assessment of a speech recognition based telephone service Automated call routing in a telecommunications network Automation of operator services: a successful application of speech recognition technology
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1