Speech processing algorithm for isolated words recognition

M. Raczyński
{"title":"Speech processing algorithm for isolated words recognition","authors":"M. Raczyński","doi":"10.1109/IIPHDW.2018.8388238","DOIUrl":null,"url":null,"abstract":"Speech processing algorithms have been intensive developed since 70's and today are often implemented in daily-use devices (personal computers, mobile phones, smartphones etc.). Unfortunately, advanced algorithms have relatively high calculation costs thus need efficient (and expensive) implementation hardware. In this paper a simple speech-processing algorithm able to recognize a spoken word from previously created constant words set has been presented. This functionality is useful in many applications e.g. in voice controlled switching devices. The described algorithm has relatively low cost with sufficient efficiency and could be implemented in a simple and cheap hardware platform. The basic idea is based on the signal analysis in time domain, where the envelope of the signal is calculated and compared with previous created pattern stored in memory. The algorithm of pattern set creation is based on piecewise linear approximation. Moreover, user could create a collection of words which have to be recognized. The proposed algorithm was written in MATLAB software and tested ‘offline’ on recorded wave files and ‘online’ with music card. Next step of the research will be the implementation of the algorithm in the low-cost 32-bit ARM core microcontroller. Details of used algorithms, first tests, occurred problems and finally conclusions are presented in the paper.","PeriodicalId":405270,"journal":{"name":"2018 International Interdisciplinary PhD Workshop (IIPhDW)","volume":"33 4","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 International Interdisciplinary PhD Workshop (IIPhDW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IIPHDW.2018.8388238","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

Speech processing algorithms have been intensive developed since 70's and today are often implemented in daily-use devices (personal computers, mobile phones, smartphones etc.). Unfortunately, advanced algorithms have relatively high calculation costs thus need efficient (and expensive) implementation hardware. In this paper a simple speech-processing algorithm able to recognize a spoken word from previously created constant words set has been presented. This functionality is useful in many applications e.g. in voice controlled switching devices. The described algorithm has relatively low cost with sufficient efficiency and could be implemented in a simple and cheap hardware platform. The basic idea is based on the signal analysis in time domain, where the envelope of the signal is calculated and compared with previous created pattern stored in memory. The algorithm of pattern set creation is based on piecewise linear approximation. Moreover, user could create a collection of words which have to be recognized. The proposed algorithm was written in MATLAB software and tested ‘offline’ on recorded wave files and ‘online’ with music card. Next step of the research will be the implementation of the algorithm in the low-cost 32-bit ARM core microcontroller. Details of used algorithms, first tests, occurred problems and finally conclusions are presented in the paper.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
孤立词识别的语音处理算法
自70年代以来,语音处理算法已经得到了广泛的发展,今天经常在日常使用的设备(个人电脑,移动电话,智能手机等)中实现。不幸的是,高级算法的计算成本相对较高,因此需要高效(且昂贵)的实现硬件。本文提出了一种简单的语音处理算法,能够从先前创建的固定词集中识别口语单词。该功能在许多应用中都很有用,例如在语音控制开关设备中。该算法成本相对较低,具有足够的效率,可以在简单廉价的硬件平台上实现。其基本思想是基于时域信号分析,计算信号的包络并与存储在内存中的先前创建的模式进行比较。模式集生成算法基于分段线性逼近。此外,用户可以创建一个必须被识别的单词集合。所提出的算法在MATLAB软件中编写,并在录制的波文件上“离线”和在音乐卡上“在线”进行了测试。下一步的研究将是在低成本的32位ARM核心微控制器上实现该算法。本文详细介绍了所使用的算法、首次测试、出现的问题和最后的结论。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Frequency response modeling of power transformer windings considering the attributes of ferromagnetic core Analysis of the impact of temperature load on the state of stress in a bolted flange connection Energy efficiency analysis of railway turnout heating with a simplified snow model using classical and contactless heating method Air-gap data transmission using screen brightness modulation Universal windows application for the parameters calculation of shields against ionizing radiation
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1