An automatic speaker-speech recognition system for friendly HMI based on binary halved clustering

Chih-Hsiang Peng, Chih-Hung Chou, Ta-Wen Kuan, Po-Chuan Lin, Jhing-Fa Wang, P. Yu
{"title":"An automatic speaker-speech recognition system for friendly HMI based on binary halved clustering","authors":"Chih-Hsiang Peng, Chih-Hung Chou, Ta-Wen Kuan, Po-Chuan Lin, Jhing-Fa Wang, P. Yu","doi":"10.1109/ICOT.2014.6956624","DOIUrl":null,"url":null,"abstract":"This work presents a low-cost and fast-trainable automatic speaker-speech recognition (ASSR) system, by proposed binary halved clustering (BHC) method for human-machine interface (HMI) on an embedded platform, owing to the trait of low cost in ASSR system is essential and affordable for real-world application. In addition, fast-trainable ability can provide fast responding time. The reduction of waiting time makes the proposed HMI to be friendly for users. The speech recognition uses enhanced cross-word reference templates (ECWRTs) for template training type. The novel BHC method uses binary-halved splitting to generate speaker models for low complexity requirement. The regularity of binary halved behavior is beneficial for data scheduling and resource sharing in the embedded ASSR system. Compared with the conventional works, simulation results indicate that the proposed hardware accelerator achieves 28% less cost, 90% less responding time, an ASSR accuracy of 90%. Comparison exhibits that performance of the proposed system is greater than the conventional works, thereby demonstrating the friendly and affordable factor of the proposed HMI.","PeriodicalId":343641,"journal":{"name":"2014 International Conference on Orange Technologies","volume":"BME-26 10","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 International Conference on Orange Technologies","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICOT.2014.6956624","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

This work presents a low-cost and fast-trainable automatic speaker-speech recognition (ASSR) system, by proposed binary halved clustering (BHC) method for human-machine interface (HMI) on an embedded platform, owing to the trait of low cost in ASSR system is essential and affordable for real-world application. In addition, fast-trainable ability can provide fast responding time. The reduction of waiting time makes the proposed HMI to be friendly for users. The speech recognition uses enhanced cross-word reference templates (ECWRTs) for template training type. The novel BHC method uses binary-halved splitting to generate speaker models for low complexity requirement. The regularity of binary halved behavior is beneficial for data scheduling and resource sharing in the embedded ASSR system. Compared with the conventional works, simulation results indicate that the proposed hardware accelerator achieves 28% less cost, 90% less responding time, an ASSR accuracy of 90%. Comparison exhibits that performance of the proposed system is greater than the conventional works, thereby demonstrating the friendly and affordable factor of the proposed HMI.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于二分半聚类的友好人机界面自动说话语音识别系统
本文提出了一种基于嵌入式平台人机界面(HMI)的二元半聚类(BHC)方法,提出了一种低成本、可快速训练的自动说话人语音识别(ASSR)系统,因为ASSR系统的低成本特点对于实际应用是必不可少的。此外,快速训练能力可以提供快速的响应时间。减少等待时间使所提出的人机界面对用户友好。语音识别采用增强型交叉词参考模板(ecwrt)进行模板训练。该方法采用二分分割的方法生成低复杂度的扬声器模型。二进制二分行为的规律性有利于嵌入式ASSR系统的数据调度和资源共享。仿真结果表明,与传统方法相比,所提出的硬件加速器成本降低28%,响应时间缩短90%,ASSR精度达到90%。对比表明,所提出的系统性能优于传统的工作,从而证明了所提出的人机界面的友好和负担得起的因素。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
An automatic speaker-speech recognition system for friendly HMI based on binary halved clustering A fuzzy clustering algorithm via enhanced spatially constraint for brain MR image segmentation A novel saliency detection framework for infrared thermal images A multistep liver segmentation strategy by combining level set based method with texture analysis for CT images An emotional feedback system based on a regulation process model for happiness improvement
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1