Detection of OOV words by combining acoustic confidence measures with linguistic features

2009 IEEE Workshop on Automatic Speech Recognition & Understanding Pub Date : 2009-12-13 DOI:10.1109/ASRU.2009.5372877

F. Stouten, D. Fohr, I. Illina

引用次数: 5

Abstract

This paper describes the design of an Out-Of-Vocabulary words (OOV) detector. Such a system is assumed to detect segments that correspond to OOV words (words that are not included in the lexicon) in the output of a LVCSR system. The OOV detector uses acoustic confidence measures that are derived from several systems: a word recognizer constrained by a lexicon, a phone recognizer constrained by a grammar and a phone recognizer without constraints. On top of that it also uses some linguistic features. The experimental results on a French broadcast news transcription task showed that for our approach precision equals recall at 35%.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

声学置信度与语言特征相结合的OOV词检测

本文介绍了一种超词汇检测器的设计。假设这样的系统可以检测LVCSR系统输出中与OOV单词(不包括在词典中的单词)对应的片段。OOV检测器使用来自几个系统的声学置信度度量:受词典约束的单词识别器、受语法约束的电话识别器和没有约束的电话识别器。除此之外，它还使用了一些语言特征。在一个法语广播新闻转录任务上的实验结果表明，我们的方法的准确率等于召回率为35%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2009 IEEE Workshop on Automatic Speech Recognition & Understanding

自引率

0.00%

发文量