A comparative study of fMPE and RDLT approaches to LVCSR

2012 8th International Symposium on Chinese Spoken Language Processing Pub Date : 2012-12-01 DOI:10.1109/ISCSLP.2012.6423511

Jian Xu, Zhijie Yan, Qiang Huo

引用次数: 1

Abstract

This paper presents a comparative study of two discriminatively trained feature transform approaches, namely feature-space minimum phone error (fMPE) and region-dependent linear transform (RDLT), to large vocabulary continuous speech recognition (LVCSR). Experiments are performed on an LVCSR task of conversational telephone speech transcription using about 2,000 hours training data. Starting from a maximum likelihood (ML) trained GMM-HMM based baseline system, recognition accuracy and run-time efficiency of different variants of the above two methods are evaluated, and a specific RDLT approach is identified and recommended for deployment in LVCSR applications.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

fMPE与RDLT方法在LVCSR中的比较研究

本文对两种判别训练的特征变换方法——特征空间最小电话误差(fMPE)和区域相关线性变换(RDLT)在大词汇量连续语音识别(LVCSR)中的应用进行了比较研究。实验进行了LVCSR任务会话电话语音转录使用约2000小时的训练数据。从基于最大似然(ML)训练的GMM-HMM基线系统出发，评估了上述两种方法的不同变体的识别精度和运行时效率，并确定了一种特定的RDLT方法，并推荐了在LVCSR应用中部署的方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2012 8th International Symposium on Chinese Spoken Language Processing

自引率

0.00%

发文量