Feature masking in an embedded Mandarin speech recognition system

2004 International Symposium on Chinese Spoken Language Processing. Pub Date: 2004-12-15. DOI: 10.1109/CHINSL.2004.1409632

Yuezhong Tang, Xia Wang, Yang Cao, Feng Ding

Citations: 2

Abstract

In this paper, we explore a feature component masking scheme for embedded tonal language recognition systems, with the aim of reducing computational complexity while keeping the degradation of recognition accuracy to a minimum. We ran extensive experiments on a Mandarin isolated-word recognition task with a tone-confusable vocabulary. Taking both clean and noisy conditions into account, we found a masking scheme that filters out 31 of the 54 feature components (leaving 23) and still outperforms the 54-component baseline, with dramatically lower computational and memory cost. The results show that feature masking is a promising approach to complexity reduction in embedded tonal language recognition systems. They also support the effectiveness of higher-order cepstral coefficients for tonal language recognition, since most of these coefficients were preserved in the feature masking experiments.
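To make the idea concrete, the following is a minimal sketch of how a component mask could shrink the per-frame scoring cost. It is an illustration under stated assumptions, not the authors' implementation: the abstract does not say which 23 components were kept, so the mask `KEEP` here is hypothetical, and the diagonal-covariance Gaussian scorer is assumed only as a typical acoustic-model building block. The function names `apply_mask` and `diag_gaussian_loglik` are likewise invented for this example.

```python
import math

NUM_COMPONENTS = 54   # baseline feature dimension, from the abstract
NUM_MASKED = 31       # components filtered out, from the abstract

# Hypothetical mask: the abstract does not list the retained indices,
# so we simply keep the first 23 components for illustration.
KEEP = [i < NUM_COMPONENTS - NUM_MASKED for i in range(NUM_COMPONENTS)]


def apply_mask(vector, keep=KEEP):
    """Return only the components whose mask entry is True."""
    return [x for x, k in zip(vector, keep) if k]


def diag_gaussian_loglik(x, mean, var):
    """Log-likelihood of x under a diagonal-covariance Gaussian.

    The cost is linear in the number of components, which is why
    masking components reduces both computation and model storage.
    """
    return -0.5 * sum(
        math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v
        for xi, m, v in zip(x, mean, var)
    )


# Toy frame and model parameters (made-up values, not real data).
frame = [0.1 * i for i in range(NUM_COMPONENTS)]
mean = [0.0] * NUM_COMPONENTS
var = [1.0] * NUM_COMPONENTS

# Masking is applied once to the model offline and to each frame at runtime.
masked_frame = apply_mask(frame)
masked_mean = apply_mask(mean)
masked_var = apply_mask(var)

score = diag_gaussian_loglik(masked_frame, masked_mean, masked_var)
kept = len(masked_frame)                  # 23 components remain
reduction = 1.0 - kept / NUM_COMPONENTS   # fraction of per-frame work removed

print(kept, round(reduction, 3), score)
```

Under these assumptions, the per-Gaussian arithmetic and the stored mean and variance vectors each shrink from 54 to 23 entries, a reduction of roughly 57%. Which components can be dropped without hurting accuracy is exactly what the paper's experiments set out to determine; this sketch does not address that selection.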
