Search results based N-best hypothesis rescoring with maximum entropy classification

2013 IEEE Workshop on Automatic Speech Recognition and Understanding Pub Date : 2013-12-01 DOI:10.1109/ASRU.2013.6707767

Fuchun Peng, Scott Roy, B. Shahshahani, F. Beaufays

引用次数: 20

Abstract

We propose a simple yet effective method for improving speech recognition by reranking the N-best speech recognition hypotheses using search results. We model N-best reranking as a binary classification problem and select the hypothesis with the highest classification confidence. We use query-specific features extracted from the search results to encode domain knowledge and use it with a maximum entropy classifier to rescore the N-best list. We show that rescoring even only the top 2 hypotheses, we can obtain a significant 3% absolute sentence accuracy (SACC) improvement over a strong baseline on production traffic from an entertainment domain.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于n -最优假设评分的最大熵分类搜索结果

我们提出了一种简单而有效的方法，通过使用搜索结果对n个最佳语音识别假设进行重新排序来改进语音识别。我们将N-best重排序建模为一个二元分类问题，并选择具有最高分类置信度的假设。我们使用从搜索结果中提取的特定于查询的特征对领域知识进行编码，并将其与最大熵分类器一起使用来重新评分N-best列表。我们表明，即使只重新记录前2个假设，我们也可以在娱乐领域的生产流量的强大基线上获得显著的3%的绝对句子准确性(SACC)提高。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2013 IEEE Workshop on Automatic Speech Recognition and Understanding

自引率

0.00%

发文量