Speaker-dependent 1000 word recognition using a large scale neural network 'CombNET-II' and dynamic spectral features

T. Kitamura, W. Hui, A. Iwata, N. Suzumura
{"title":"Speaker-dependent 1000 word recognition using a large scale neural network 'CombNET-II' and dynamic spectral features","authors":"T. Kitamura, W. Hui, A. Iwata, N. Suzumura","doi":"10.1109/IJCNN.1991.170560","DOIUrl":null,"url":null,"abstract":"The authors describe speaker-dependent large vocabulary word recognition using a large-scale neural network, CombNET-II, which consists of a four-layered neural network with a comb structure, and dynamic spectral features of speech based on a two-dimensional mel-cepstrum. CombNET-II consists of two types of neural networks. The first part is a stem network which learns by a self-growing algorithm and roughly classifies an input pattern. The second part consists of many branch networks which learn by a backpropagation algorithm and precisely classify the input pattern. A stem network is a vector quantizing network and it reduces the number of category candidates for the branch networks, so that each branch network has only a small number of connections and it is easy to tune up. Experiments on speaker-dependent large-vocabulary word recognition for 1000 Chinese spoken words is described. Experimental results show that the high recognition accuracy of 99.1% is obtained and that CombNET-II is very effective for large vocabulary spoken word recognition.<<ETX>>","PeriodicalId":211135,"journal":{"name":"[Proceedings] 1991 IEEE International Joint Conference on Neural Networks","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"1991-11-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"[Proceedings] 1991 IEEE International Joint Conference on Neural Networks","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IJCNN.1991.170560","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

The authors describe speaker-dependent large vocabulary word recognition using a large-scale neural network, CombNET-II, which consists of a four-layered neural network with a comb structure, and dynamic spectral features of speech based on a two-dimensional mel-cepstrum. CombNET-II consists of two types of neural networks. The first part is a stem network which learns by a self-growing algorithm and roughly classifies an input pattern. The second part consists of many branch networks which learn by a backpropagation algorithm and precisely classify the input pattern. A stem network is a vector quantizing network and it reduces the number of category candidates for the branch networks, so that each branch network has only a small number of connections and it is easy to tune up. Experiments on speaker-dependent large-vocabulary word recognition for 1000 Chinese spoken words is described. Experimental results show that the high recognition accuracy of 99.1% is obtained and that CombNET-II is very effective for large vocabulary spoken word recognition.<>
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
使用大规模神经网络“CombNET-II”和动态光谱特征的讲话者依赖的1000字识别
作者利用CombNET-II大规模神经网络描述了依赖于说话人的大词汇词识别,该网络由梳状结构的四层神经网络和基于二维mel-倒谱的语音动态频谱特征组成。CombNET-II由两类神经网络组成。第一部分是一个通过自生长算法学习并对输入模式进行粗略分类的干网络。第二部分由多个分支网络组成,这些分支网络通过反向传播算法学习并对输入模式进行精确分类。干网络是一种矢量量化网络,它减少了分支网络的候选类别数量,使得每个分支网络只有少量的连接,并且易于调整。描述了基于说话人的1000个汉语口语大词汇词识别实验。实验结果表明,CombNET-II的识别准确率高达99.1%,对大词汇量的口语单词识别非常有效。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Control of a robotic manipulating arm by a neural network simulation of the human cerebral and cerebellar cortical processes Neural network training using homotopy continuation methods A learning scheme of neural networks which improves accuracy and speed of convergence using redundant and diversified network structures The abilities of neural networks to abstract and to use abstractions Backpropagation based on the logarithmic error function and elimination of local minima
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1