On the use of array learners towards Automatic Speech Recognition for dysarthria

2015 IEEE 10th Conference on Industrial Electronics and Applications (ICIEA) Pub Date : 2015-06-15 DOI:10.1109/ICIEA.2015.7334306

Seyed Reza Shahamiri, S. K. Ray

{"title":"On the use of array learners towards Automatic Speech Recognition for dysarthria","authors":"Seyed Reza Shahamiri, S. K. Ray","doi":"10.1109/ICIEA.2015.7334306","DOIUrl":null,"url":null,"abstract":"Providing Automatic Speech Recognition (ASR) systems for dysarthria is a challenging task since the normal and the disabled speech have different attributes; hence, using ASR systems designed and trained for normal speakers is not an effective approach. It is important to craft ASR technologies specifically for the speech disabled. Nonetheless, because of the complexity and variability of dysarthric speech, previous studies failed to achieve adequate performance. In this paper we investigated the applications of array learners towards dysarthric speech recognition. The array was implemented by several neural networks that configured to work in parallel. The proposed approach was verified by using the speech materials of seven dysarthric subjects with speech intelligibility from 2% to 86%. For comparison, the results were compared with a dysarthric ASR based on the legacy single-learner approach as the reference model. It is shown that the array learner-based dysarthric ASR improved the mean word recognition rate of 10.41% over the reference model, and decreased the error rate of 4.84%.","PeriodicalId":270660,"journal":{"name":"2015 IEEE 10th Conference on Industrial Electronics and Applications (ICIEA)","volume":"64 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-06-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 IEEE 10th Conference on Industrial Electronics and Applications (ICIEA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIEA.2015.7334306","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

Abstract

Providing Automatic Speech Recognition (ASR) systems for dysarthria is a challenging task since the normal and the disabled speech have different attributes; hence, using ASR systems designed and trained for normal speakers is not an effective approach. It is important to craft ASR technologies specifically for the speech disabled. Nonetheless, because of the complexity and variability of dysarthric speech, previous studies failed to achieve adequate performance. In this paper we investigated the applications of array learners towards dysarthric speech recognition. The array was implemented by several neural networks that configured to work in parallel. The proposed approach was verified by using the speech materials of seven dysarthric subjects with speech intelligibility from 2% to 86%. For comparison, the results were compared with a dysarthric ASR based on the legacy single-learner approach as the reference model. It is shown that the array learner-based dysarthric ASR improved the mean word recognition rate of 10.41% over the reference model, and decreased the error rate of 4.84%.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

阵列学习器在构音障碍语音自动识别中的应用

为构音障碍提供自动语音识别(ASR)系统是一项具有挑战性的任务，因为正常和残疾的语言具有不同的属性;因此，使用为正常说话者设计和训练的ASR系统并不是有效的方法。专门为语言障碍设计ASR技术是很重要的。然而，由于言语困难的复杂性和可变性，以往的研究未能获得足够的表现。本文研究了阵列学习器在困难语音识别中的应用。该阵列由多个配置为并行工作的神经网络实现。通过对7名言语可理解度从2%到86%的诵读困难被试的言语材料进行验证。为了进行比较，将结果与基于传统的单学习者方法作为参考模型的困难型ASR进行比较。结果表明，与参考模型相比，基于阵列学习器的困窘ASR平均单词识别率提高了10.41%，错误率降低了4.84%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2015 IEEE 10th Conference on Industrial Electronics and Applications (ICIEA)

自引率

0.00%

发文量