Hierarchical Pitman-Yor language models for ASR in meetings

2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU) Pub Date : 2007-12-01 DOI:10.1109/ASRU.2007.4430096

Songfang Huang, S. Renals

引用次数: 35

Abstract

In this paper we investigate the application of a hierarchical Bayesian language model (LM) based on the Pitman-Yor process for automatic speech recognition (ASR) of multiparty meetings. The hierarchical Pitman-Yor language model (HPY-LM) provides a Bayesian interpretation of LM smoothing. An approximation to the HPYLM recovers the exact formulation of the interpolated Kneser-Ney smoothing method in n-gram models. This paper focuses on the application and scalability of HPYLM on a practical large vocabulary ASR system. Experimental results on NIST RT06s evaluation meeting data verify that HPYLM is a competitive and promising language modeling technique, which consistently performs better than interpolated Kneser-Ney and modified Kneser-Ney n-gram LMs in terms of both perplexity and word error rate.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

会议中ASR的分层Pitman-Yor语言模型

本文研究了基于Pitman-Yor过程的层次贝叶斯语言模型(LM)在多人会议自动语音识别(ASR)中的应用。分层Pitman-Yor语言模型(HPY-LM)提供了对LM平滑的贝叶斯解释。对HPYLM的近似恢复了n-gram模型中插值Kneser-Ney平滑方法的精确公式。本文主要研究了HPYLM在一个实际的大词汇量ASR系统中的应用和可扩展性。在NIST RT06s评估会议数据上的实验结果验证了HPYLM是一种有竞争力和前景的语言建模技术，在困惑度和单词错误率方面，HPYLM始终优于内插的Kneser-Ney和修改的Kneser-Ney n-gram LMs。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2007 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU)

自引率

0.00%

发文量