Trigram duration modeling in speech recognition

2004 International Symposium on Chinese Spoken Language Processing
Pub Date: 2004-12-15
DOI: 10.1109/CHINSL.2004.1409627

Yun Tang, Wenju Liu, Bo Xu

Citations: 3

Abstract

Rate of speech (ROS) is an important factor in speech recognition. We present a new speech rate measurement method that first normalizes the durations of different acoustic units to a standard duration and then builds a trigram duration model to measure the speech rate of a sentence. We propose two methods based on the standard duration to compensate for the influence of speech rate variation in a data corpus, obtaining an 11% error rate reduction in Mandarin digit string recognition.
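
The abstract gives no implementation details, so the following is only a minimal sketch of the general idea (normalize per-unit durations, then model sequences of three durations), not the authors' method. Everything specific here is an assumption made for illustration: the frame counts, the choice of each unit's corpus mean as its "standard duration", the three duration classes with thresholds 0.8 and 1.2, and the add-one smoothing.

```python
from collections import Counter, defaultdict
from statistics import mean

# Hypothetical duration classes; the paper does not specify a quantization.
CLASSES = ("fast", "normal", "slow")

def unit_means(corpus):
    """Mean duration (in frames) of each acoustic unit across the corpus."""
    by_unit = defaultdict(list)
    for sentence in corpus:
        for unit, frames in sentence:
            by_unit[unit].append(frames)
    return {unit: mean(values) for unit, values in by_unit.items()}

def normalize(sentence, means):
    """Express each duration as a ratio to its unit's mean (assumed standard).

    A ratio of 1.0 means the unit was spoken at its typical duration.
    """
    return [frames / means[unit] for unit, frames in sentence]

def quantize(ratio, low=0.8, high=1.2):
    """Map a normalized duration to a class; thresholds are assumptions."""
    if ratio < low:
        return "fast"
    if ratio > high:
        return "slow"
    return "normal"

def train_trigram(class_sequences):
    """Count trigrams and their two-class histories, padding sentence starts."""
    tri, bi = Counter(), Counter()
    for seq in class_sequences:
        padded = ["<s>", "<s>"] + list(seq)
        for i in range(2, len(padded)):
            tri[tuple(padded[i - 2:i + 1])] += 1
            bi[tuple(padded[i - 2:i])] += 1
    return tri, bi

def trigram_prob(model, history, cls):
    """P(cls | two previous classes) with add-one smoothing."""
    tri, bi = model
    history = tuple(history)
    return (tri[history + (cls,)] + 1) / (bi[history] + len(CLASSES))

# Toy corpus of (unit, duration in frames) pairs; the values are made up.
corpus = [
    [("yi", 10), ("er", 20), ("san", 30)],  # typical rate
    [("yi", 14), ("er", 28), ("san", 42)],  # slow speaker
    [("yi", 6), ("er", 12), ("san", 18)],   # fast speaker
]
means = unit_means(corpus)
sequences = [[quantize(r) for r in normalize(s, means)] for s in corpus]
model = train_trigram(sequences)
```

In this toy setup, a history of two slow units makes a third slow unit the most likely continuation, which is the kind of local rate consistency a trigram over durations can capture.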

Latest papers from this venue

Discriminative transform for confidence estimation in Mandarin speech recognition
A comparative study on various confidence measures in large vocabulary speech recognition
Analysis of paraphrased corpus and lexical-based approach to Chinese paraphrasing
Unseen handset mismatch compensation based on feature/model-space a priori knowledge interpolation for robust speaker recognition
Use of direct modeling in natural language generation for Chinese and English translation