An enhanced minimum classification error learning framework for balancing insertion, deletion and substitution errors
Y. Liao, Jia-Jang Tu, Sen-Chia Chang, Chin-Hui Lee
2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU), December 2007
DOI: 10.1109/ASRU.2007.4430178
Citations: 3
Abstract
In continuous speech recognition, substitution, insertion and deletion errors usually not only vary in number but also differ in their impact on optimizing a set of acoustic models. To balance their contributions to the overall error, an enhanced minimum classification error (E-MCE) learning framework is developed. The basic idea is to partition acoustic model optimization into three subtasks, i.e., minimizing substitution errors (MSE), insertion errors (MIE) and deletion errors (MDE), and to select/generate three corresponding sets of competing hypotheses, one for each sub-problem. MSE, MIE and MDE are then executed sequentially to gradually reduce the overall word error rate. Experimental results on continuous Mandarin digit recognition over five data sets collected under various acoustic conditions consistently show the effectiveness of the proposed E-MCE learning framework.
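The abstract only sketches the procedure at a high level. As a rough illustration, not the authors' implementation, the Python sketch below shows two ingredients the description implies: the standard MCE sigmoid loss over a misclassification measure, and the partitioning of competing hypotheses into substitution-, insertion- and deletion-dominated sets so that three MCE-style passes (MSE, then MIE, then MDE) can each target one error type. All function names (mce_loss, edit_error_counts, select_competitors) are hypothetical.

```python
# A minimal sketch of the E-MCE idea described in the abstract, under assumptions:
# (1) the standard MCE sigmoid loss over a misclassification measure, and
# (2) splitting competitors by dominant edit-error type for the three sub-tasks.
# Not the authors' code; all names are hypothetical.
import math

def mce_loss(d, gamma=1.0, theta=0.0):
    """Smoothed 0/1 loss used in standard MCE training: a sigmoid of the
    misclassification measure d (positive d means the competitor wins)."""
    return 1.0 / (1.0 + math.exp(-gamma * d + theta))

def edit_error_counts(ref, hyp):
    """Return (substitutions, insertions, deletions) of a minimum edit-distance
    alignment between a reference word sequence and a hypothesis."""
    m, n = len(ref), len(hyp)
    # dp[i][j] = (total_cost, subs, ins, dels) for aligning ref[:i] with hyp[:j]
    dp = [[None] * (n + 1) for _ in range(m + 1)]
    dp[0][0] = (0, 0, 0, 0)
    for i in range(1, m + 1):
        dp[i][0] = (i, 0, 0, i)            # only deletions
    for j in range(1, n + 1):
        dp[0][j] = (j, 0, j, 0)            # only insertions
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            c, s, ins, dl = dp[i - 1][j - 1]
            sub = 0 if ref[i - 1] == hyp[j - 1] else 1
            best = (c + sub, s + sub, ins, dl)               # match / substitution
            c, s, ins, dl = dp[i][j - 1]
            best = min(best, (c + 1, s, ins + 1, dl))        # insertion
            c, s, ins, dl = dp[i - 1][j]
            best = min(best, (c + 1, s, ins, dl + 1))        # deletion
            dp[i][j] = best
    _, subs, ins, dels = dp[m][n]
    return subs, ins, dels

def select_competitors(ref, hypotheses, error_type):
    """Keep only the competing hypotheses whose dominant error type matches
    error_type ('sub', 'ins' or 'del') -- one set per E-MCE sub-task."""
    selected = []
    for hyp in hypotheses:
        s, i, d = edit_error_counts(ref, hyp)
        dominant = max([("sub", s), ("ins", i), ("del", d)], key=lambda kv: kv[1])[0]
        if dominant == error_type:
            selected.append(hyp)
    return selected

if __name__ == "__main__":
    ref = ["one", "two", "three"]
    hyps = [["one", "too", "three"],          # substitution-dominated
            ["one", "two", "two", "three"],   # insertion-dominated
            ["one", "three"]]                 # deletion-dominated
    for stage in ("sub", "ins", "del"):       # MSE -> MIE -> MDE order from the abstract
        print(stage, select_competitors(ref, hyps, stage))
```

In the full framework, each sequential pass would then re-estimate the acoustic model parameters by minimizing the MCE loss over its own competitor set; the sketch above only covers the competitor partitioning and the loss definition.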