Recognition system for nasal, lateral and trill arabic phonemes using neural networks

N. A. Abdul-Kadir, R. Sudirman, N. Mahmood
{"title":"Recognition system for nasal, lateral and trill arabic phonemes using neural networks","authors":"N. A. Abdul-Kadir, R. Sudirman, N. Mahmood","doi":"10.1109/SCORED.2012.6518644","DOIUrl":null,"url":null,"abstract":"There has been limited study and research in Arabic phoneme among Malaysians, hence making references to the work and research difficult. Although there have been significant acoustic and phonetic studies on languages such as English, French and Mandarin, to date there are no guidelines or significant findings on Malay language. In this paper, we monitored and analyzed the performance of multi-layer feed-forward with back-propagation (MLFFBP) and cascade-forward (CF) networks on our phoneme recognition system of Standard Arabic (SA). This study focused on Malaysian children as test subjects. Focused on four chosen phonemes from SA, which composed of nasal, lateral and trill behaviors, i.e. tabulated at four different articulation places. Highest training recognition rate for multi-layer and cascade-layer network are 98.8 % and 95.2 % respectively, while the highest testing recognition rate achieved for both networks is 92.9 %. 10-fold cross validation was used to evaluate system performance. The selected network is cascade layer with 40 and 10 hidden neurons in first hidden layer and second hidden layer respectively. The chosen network was used in the GUI designed for developing recognition system with user feedback.","PeriodicalId":299947,"journal":{"name":"2012 IEEE Student Conference on Research and Development (SCOReD)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE Student Conference on Research and Development (SCOReD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SCORED.2012.6518644","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

Abstract

There has been limited study and research in Arabic phoneme among Malaysians, hence making references to the work and research difficult. Although there have been significant acoustic and phonetic studies on languages such as English, French and Mandarin, to date there are no guidelines or significant findings on Malay language. In this paper, we monitored and analyzed the performance of multi-layer feed-forward with back-propagation (MLFFBP) and cascade-forward (CF) networks on our phoneme recognition system of Standard Arabic (SA). This study focused on Malaysian children as test subjects. Focused on four chosen phonemes from SA, which composed of nasal, lateral and trill behaviors, i.e. tabulated at four different articulation places. Highest training recognition rate for multi-layer and cascade-layer network are 98.8 % and 95.2 % respectively, while the highest testing recognition rate achieved for both networks is 92.9 %. 10-fold cross validation was used to evaluate system performance. The selected network is cascade layer with 40 and 10 hidden neurons in first hidden layer and second hidden layer respectively. The chosen network was used in the GUI designed for developing recognition system with user feedback.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于神经网络的阿拉伯语鼻音、侧音和颤音识别系统
马来西亚人对阿拉伯语音素的研究和研究有限,因此给参考工作和研究带来了困难。虽然对英语、法语和普通话等语言进行了大量的声学和语音研究,但迄今为止,在马来语方面没有指导方针或重大发现。在本文中,我们监测和分析了多层前馈与反向传播(MLFFBP)和级联前向(CF)网络在我们的标准阿拉伯语(SA)音素识别系统中的性能。这项研究的重点是马来西亚儿童作为测试对象。重点从SA中选择四个音素,这些音素由鼻音、侧音和颤音行为组成,即在四个不同的发音位置表列。多层和级联层网络的训练识别率最高分别为98.8%和95.2%,测试识别率最高均为92.9%。采用10倍交叉验证评价系统性能。所选择的网络为级联层,第一隐藏层和第二隐藏层分别有40个和10个隐藏神经元。将所选择的网络用于开发用户反馈识别系统的GUI设计中。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Welcome message Adjacent channel rejection analysis due to channel select filter in RF superheterodyne receiver of WLAN IEEE 802.11a University students stress level and brainwave balancing index: Comparison between early and end of study semester A linear model of quarter car active suspension system using composite nonlinear feedback control Initial strategy for design and fabrication of microgap electrodes
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1