Synthesized stereo-based stochastic mapping with data selection for robust speech recognition

2012 8th International Symposium on Chinese Spoken Language Processing Pub Date : 2012-12-01 DOI:10.1109/ISCSLP.2012.6423542

Jun Du, Qiang Huo

引用次数: 6

Abstract

In this paper, we present a synthesized stereo-based stochastic mapping approach for robust speech recognition. We extend the traditional stereo-based stochastic mapping (SSM) in two main aspects. First, the constraint of stereo-data, which is not practical in real applications, is relaxed by using HMM-based speech synthesis. Then we make feature mapping more focused on those incorrectly recognized samples via a data selection strategy. Experimental results on Aurora3 databases show that our approach can achieve consistently significant improvements of recognition performance in the well-matched (WM) condition among four different European languages.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于数据选择的合成立体随机映射鲁棒语音识别

在本文中，我们提出了一种基于合成立体随机映射的鲁棒语音识别方法。本文主要从两个方面对传统的基于立体的随机映射(SSM)进行了扩展。首先，利用基于hmm的语音合成技术，解决了在实际应用中难以实现的立体数据约束问题;然后，我们通过数据选择策略使特征映射更加集中在那些识别错误的样本上。在Aurora3数据库上的实验结果表明，我们的方法在四种不同的欧洲语言之间的良好匹配(WM)条件下，可以取得一致的显著的识别性能提高。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2012 8th International Symposium on Chinese Spoken Language Processing

自引率

0.00%

发文量