Accented Indian english ASR: Some early results

2008 IEEE Spoken Language Technology Workshop Pub Date : 2008-12-01 DOI:10.1109/SLT.2008.4777881

Kaustubh Kulkarni, Sailik Sengupta, V. Ramasubramanian, Josef G. Bauer, G. Stemmer

引用次数: 4

Abstract

The problem of the effect of accent on the performance of Automatic Speech Recognition (ASR) systems is well known. In this paper, we study the effect of accent variability on the performance of the Indian English ASR task. We evaluate the test vocabularies on HMMs trained on (a) Accent specific training data (b) Accent pooled training data which combines all the accent specific training data (c) Accent pooled training data of reduced size matching the size of the accent specific training data. We demonstrate that the accent pooled training set performs the best on phonetically rich isolated word recognition task. But the accent specific HMMs perform better than the reduced accent pooled HMMs, indicating a possible approach of using a first stage accent identification to choose the correct accent trained HMMs for further recognition.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

印度口音英语ASR:一些早期结果

重音对自动语音识别(ASR)系统性能影响的问题是众所周知的。在本文中，我们研究口音变异对印度英语ASR任务表现的影响。我们在(a)特定口音训练数据(b)结合所有特定口音训练数据的口音池训练数据(c)与特定口音训练数据大小匹配的缩小大小的口音池训练数据上训练hmm上评估测试词汇。结果表明，重音池训练集在语音丰富的孤立词识别任务中表现最好。但是，针对特定口音的hmm比简化后的混合口音hmm表现得更好，这表明了一种可能的方法，即使用第一阶段的口音识别来选择正确的经过训练的口音hmm进行进一步识别。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2008 IEEE Spoken Language Technology Workshop

自引率

0.00%

发文量

期刊最新文献

“Who is this” quiz dialogue system and users' evaluation Latent dirichlet language model for speech recognition Modelling user behaviour in the HIS-POMDP dialogue manager A syntactic language model based on incremental CCG parsing Improving word segmentation for Thai speech translation