{"title":"Efficient representation and fast look-up of Maximum Entropy language models","authors":"Jia Cui, Stanley F. Chen, Bowen Zhou","doi":"10.1109/ASRU.2011.6163936","DOIUrl":null,"url":null,"abstract":"Word class information has long been proven useful in language modeling (LM). However, the improved performance of class-based LMs over word n-gram models generally comes at the cost of increased decoding complexity and model size. In this paper, we propose a modified version of the Maximum Entropy token-based language model of [1] that matches the performance of the best existing class-based models, but which is as fast for decoding as a word n-gram model. In addition, while it is easy to statically combine word n-gram models built on different corpora into a single word n-gram model for fast decoding, it is unknown how to statically combine class-based LMs effectively. Another contribution of this paper is to propose a novel combination method that retains the gain of class-based LMs over word n-gram models. Experimental results on several spoken language translation tasks show that our model performs significantly better than word n-gram models with comparable decoding speed and only a modest increase in model size.","PeriodicalId":338241,"journal":{"name":"2011 IEEE Workshop on Automatic Speech Recognition & Understanding","volume":"75 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 IEEE Workshop on Automatic Speech Recognition & Understanding","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ASRU.2011.6163936","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Citations: 1
Abstract
Word class information has long been proven useful in language modeling (LM). However, the improved performance of class-based LMs over word n-gram models generally comes at the cost of increased decoding complexity and model size. In this paper, we propose a modified version of the Maximum Entropy token-based language model of [1] that matches the performance of the best existing class-based models, but which is as fast for decoding as a word n-gram model. In addition, while it is easy to statically combine word n-gram models built on different corpora into a single word n-gram model for fast decoding, it is unknown how to statically combine class-based LMs effectively. Another contribution of this paper is to propose a novel combination method that retains the gain of class-based LMs over word n-gram models. Experimental results on several spoken language translation tasks show that our model performs significantly better than word n-gram models with comparable decoding speed and only a modest increase in model size.
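For background on the static combination the abstract contrasts against, the sketch below is an illustrative (not from the paper) example of how word n-gram models built on different corpora can be merged into a single model by linear interpolation, p(w|h) = λ·p1(w|h) + (1−λ)·p2(w|h). The function name and the flat probability tables are assumptions for illustration; a real n-gram merge would also handle back-off weights, which this sketch omits.

```python
# Minimal sketch, assuming each model is a plain {(history, word): prob} table.
# Real word n-gram models (e.g., in ARPA format) also carry back-off weights,
# which a faithful static combination would have to merge as well.

def interpolate_ngram_models(p1, p2, lam=0.5):
    """Merge two {(history, word): prob} tables into one interpolated table."""
    merged = {}
    for key in set(p1) | set(p2):
        # Linear interpolation; a word unseen in one model contributes 0 there.
        merged[key] = lam * p1.get(key, 0.0) + (1.0 - lam) * p2.get(key, 0.0)
    return merged

# Toy bigram tables standing in for models trained on two different corpora.
model_a = {(("the",), "cat"): 0.2, (("the",), "dog"): 0.1}
model_b = {(("the",), "cat"): 0.05, (("the",), "bird"): 0.3}
combined = interpolate_ngram_models(model_a, model_b, lam=0.7)
print(combined[(("the",), "cat")])  # 0.7*0.2 + 0.3*0.05 = 0.155
```

Because the interpolated table is itself just a single word n-gram model, it can be queried at decode time with no extra cost; the paper's point is that no comparably simple static merge is known for class-based LMs.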