Recognition of speaker-independent isolated Persian digits using an enhanced vector quantization algorithm

M. Jamali, Vahid Ghafarinia, M. A. Montazeri
2015 Signal Processing and Intelligent Systems Conference (SPIS), published 2015-12-01. DOI: 10.1109/SPIS.2015.7422333 (https://doi.org/10.1109/SPIS.2015.7422333)
Citations: 1

Abstract

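Vector quantization (VQ) is a fast and simple classification algorithm that has been widely used for the recognition of isolated spoken words. However, this algorithm and most of its improved versions fail to accurately distinguish words with similar vowels. The Persian digits /dow/ and /noh/ (2 and 9, respectively) are a good example of this kind of similarity. In this paper we propose an enhanced vector quantization algorithm that takes into account the determining role of the short consonants at the beginning of words. In this algorithm, an unknown utterance is judged based on the classification results of two sets of codebooks. The first set of codebooks is built from the initial portions of the words, while the other set is built from the whole words. The performance of the proposed algorithm was verified experimentally against other VQ-based algorithms. The overall performance of the proposed algorithm was above that of the others, and for similar words it markedly reduced the number of misclassifications. This improvement was achieved with only a small increase in computational load.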
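The two-codebook decision described in the abstract can be illustrated with a minimal sketch. The abstract does not say how the codebooks are trained or how the two classification results are combined, so everything specific here is an assumption: codebooks are trained with plain k-means using an evenly spaced initialization, the two average distortions are combined as a weighted sum, and the names `train_codebook`, `distortion`, `classify`, `n_initial`, and `weight` are hypothetical. The feature vectors are toy 2-D points, not real speech features.

```python
from typing import Dict, List, Sequence

Vector = Sequence[float]


def sq_dist(a: Vector, b: Vector) -> float:
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def train_codebook(frames: List[Vector], size: int, iters: int = 10) -> List[List[float]]:
    """Build a codebook with plain k-means (an assumed stand-in for the
    paper's training procedure). Initial codewords are evenly spaced frames."""
    step = len(frames) / size
    codebook = [list(frames[int(i * step)]) for i in range(size)]
    for _ in range(iters):
        clusters: List[List[Vector]] = [[] for _ in codebook]
        for f in frames:
            nearest = min(range(size), key=lambda i: sq_dist(f, codebook[i]))
            clusters[nearest].append(f)
        for i, members in enumerate(clusters):
            if members:  # keep the old codeword if its cluster is empty
                codebook[i] = [sum(col) / len(members) for col in zip(*members)]
    return codebook


def distortion(frames: List[Vector], codebook: List[List[float]]) -> float:
    """Average distance from each frame to its nearest codeword."""
    return sum(min(sq_dist(f, c) for c in codebook) for f in frames) / len(frames)


def classify(frames: List[Vector],
             whole_cbs: Dict[str, List[List[float]]],
             initial_cbs: Dict[str, List[List[float]]],
             n_initial: int,
             weight: float = 1.0) -> str:
    """Pick the word whose whole-word and initial-portion codebooks together
    give the lowest distortion. The weighted sum is an assumed combination rule."""
    head = frames[:n_initial]
    scores = {
        word: distortion(frames, whole_cbs[word])
        + weight * distortion(head, initial_cbs[word])
        for word in whole_cbs
    }
    return min(scores, key=scores.get)


# Toy training data: the two words share their "vowel" frames and differ
# only in the first two "consonant" frames.
VOWEL = [[5.0, 5.0], [5.2, 5.0], [5.0, 5.2], [5.2, 5.2]]
TRAIN = {
    "dow": [[0.0, 0.0], [0.2, 0.0]] + VOWEL,
    "noh": [[2.0, 0.0], [2.2, 0.0]] + VOWEL,
}
N_INITIAL = 2
WHOLE_CBS = {w: train_codebook(f, size=2) for w, f in TRAIN.items()}
INITIAL_CBS = {w: train_codebook(f[:N_INITIAL], size=1) for w, f in TRAIN.items()}

if __name__ == "__main__":
    unknown = [[0.1, 0.0], [0.1, 0.0], [5.1, 5.1], [5.1, 5.1]]
    print(classify(unknown, WHOLE_CBS, INITIAL_CBS, N_INITIAL))
```

In this sketch the initial-portion codebooks add one extra, much smaller distortion computation per word, which is consistent in spirit with the small increase in computational load reported in the abstract.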