基于最优码本的阿拉伯语语音识别混合ANN/HMM模型

M. Ettaouil, M. Lazaar, Zakariae En-Naimani
{"title":"基于最优码本的阿拉伯语语音识别混合ANN/HMM模型","authors":"M. Ettaouil, M. Lazaar, Zakariae En-Naimani","doi":"10.1109/SITA.2013.6560806","DOIUrl":null,"url":null,"abstract":"Thanks to Automatic Speech Recognition (ASR), a lot of machines can nowadays emulate human being ability to understand and speak natural language. However, ASR problematic could be as interesting as it is difficult. Its difficulty is precisely due to the complexity of speech processing, which takes into consideration many aspects: acoustic, phonetic, syntactic, etc. Thus, the most commonly used technology, in the context of speech recognition, is based on statistical models. Especially, the Hidden Markov Models that are capable of simultaneously modeling frequency and temporal characteristics of the speech signal. There is also the alternative of using Neuronal Networks. But another interesting framework applied in ASR is indeed the hybrid Artificial Neural Network (ANN) and Hidden Markov Model (HMM) speech recognizer that improves the accuracy of the two models. In the present work, we propose an Arabic digits recognition system based on hybrid Artificial Neural Network and Hidden Markov Model (ANN/HMM). The main innovation in this work is to use an optimal neural network to determine the optimal class, unlike in classical Kohonen approach. The numerical results are encouraging and satisfactory.","PeriodicalId":145244,"journal":{"name":"2013 8th International Conference on Intelligent Systems: Theories and Applications (SITA)","volume":"81 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"15","resultStr":"{\"title\":\"A hybrid ANN/HMM models for arabic speech recognition using optimal codebook\",\"authors\":\"M. Ettaouil, M. Lazaar, Zakariae En-Naimani\",\"doi\":\"10.1109/SITA.2013.6560806\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Thanks to Automatic Speech Recognition (ASR), a lot of machines can nowadays emulate human being ability to understand and speak natural language. However, ASR problematic could be as interesting as it is difficult. Its difficulty is precisely due to the complexity of speech processing, which takes into consideration many aspects: acoustic, phonetic, syntactic, etc. Thus, the most commonly used technology, in the context of speech recognition, is based on statistical models. Especially, the Hidden Markov Models that are capable of simultaneously modeling frequency and temporal characteristics of the speech signal. There is also the alternative of using Neuronal Networks. But another interesting framework applied in ASR is indeed the hybrid Artificial Neural Network (ANN) and Hidden Markov Model (HMM) speech recognizer that improves the accuracy of the two models. In the present work, we propose an Arabic digits recognition system based on hybrid Artificial Neural Network and Hidden Markov Model (ANN/HMM). The main innovation in this work is to use an optimal neural network to determine the optimal class, unlike in classical Kohonen approach. The numerical results are encouraging and satisfactory.\",\"PeriodicalId\":145244,\"journal\":{\"name\":\"2013 8th International Conference on Intelligent Systems: Theories and Applications (SITA)\",\"volume\":\"81 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-05-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"15\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2013 8th International Conference on Intelligent Systems: Theories and Applications (SITA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SITA.2013.6560806\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 8th International Conference on Intelligent Systems: Theories and Applications (SITA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SITA.2013.6560806","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 15

摘要

由于自动语音识别(ASR),许多机器现在可以模仿人类理解和说自然语言的能力。然而,ASR问题既有趣又困难。其难点恰恰在于语音处理的复杂性,涉及到声学、语音、句法等诸多方面。因此,在语音识别的背景下,最常用的技术是基于统计模型的。特别是能够同时对语音信号的频率和时间特征进行建模的隐马尔可夫模型。还有一种选择是使用神经网络。但另一个应用于ASR的有趣框架确实是人工神经网络(ANN)和隐马尔可夫模型(HMM)的混合语音识别器,它提高了这两种模型的准确性。本文提出了一种基于人工神经网络和隐马尔可夫模型(ANN/HMM)的阿拉伯数字识别系统。这项工作的主要创新是使用最优神经网络来确定最优类别,这与经典的Kohonen方法不同。数值计算结果令人满意。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
A hybrid ANN/HMM models for arabic speech recognition using optimal codebook
Thanks to Automatic Speech Recognition (ASR), a lot of machines can nowadays emulate human being ability to understand and speak natural language. However, ASR problematic could be as interesting as it is difficult. Its difficulty is precisely due to the complexity of speech processing, which takes into consideration many aspects: acoustic, phonetic, syntactic, etc. Thus, the most commonly used technology, in the context of speech recognition, is based on statistical models. Especially, the Hidden Markov Models that are capable of simultaneously modeling frequency and temporal characteristics of the speech signal. There is also the alternative of using Neuronal Networks. But another interesting framework applied in ASR is indeed the hybrid Artificial Neural Network (ANN) and Hidden Markov Model (HMM) speech recognizer that improves the accuracy of the two models. In the present work, we propose an Arabic digits recognition system based on hybrid Artificial Neural Network and Hidden Markov Model (ANN/HMM). The main innovation in this work is to use an optimal neural network to determine the optimal class, unlike in classical Kohonen approach. The numerical results are encouraging and satisfactory.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
A rule-based simulation approach to scheduling problem in semiconductor photolithography process Hybrid network selection strategy by using M-AHP/E-TOPSIS for heterogeneous networks A semantic approach for automatic image annotation Result merging for meta-search engine Review of Web services description approaches
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1