A low complexity cluster model interpolation based on-line adaptation technique for spoken query systems

The 9th International Symposium on Chinese Spoken Language Processing Pub Date : 2014-10-27 DOI:10.1109/ISCSLP.2014.6936573

S. Shahnawazuddin, R. Sinha

{"title":"A low complexity cluster model interpolation based on-line adaptation technique for spoken query systems","authors":"S. Shahnawazuddin, R. Sinha","doi":"10.1109/ISCSLP.2014.6936573","DOIUrl":null,"url":null,"abstract":"The work presented in this paper describes the issues of on-line adaption in context of spoken query systems. In such systems, the available adaptation data is extremely small (≤ 3 seconds). Consequently, adapting such systems becomes extremely challenging. Moreover, since these systems are meant for real-time applications, the employed adaptation technique should not add much latency to the system response. To address these issues, a simple cluster model interpolation based approach for on-line adaptation is presented in this work. The proposed approach employs an OMP based search scheme to select a set of acoustically close models from a set of pre-trained cluster models. The selected cluster models are then linearly interpolated to derive the adapted model parameters. In this work, these interpolation weights are derived from the sparse coefficients in an approximate manner. Such an approximate approach helps in avoiding the iterative ML weight estimation usually employed in existing techniques. The proposed adaptation approach though not optimal, is found to be effective for on-line adaptation. The same has been verified in this work for an LVCSR task and also for an Assamese name recognition system which is a typical example of such query systems.","PeriodicalId":285277,"journal":{"name":"The 9th International Symposium on Chinese Spoken Language Processing","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-10-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The 9th International Symposium on Chinese Spoken Language Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISCSLP.2014.6936573","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

The work presented in this paper describes the issues of on-line adaption in context of spoken query systems. In such systems, the available adaptation data is extremely small (≤ 3 seconds). Consequently, adapting such systems becomes extremely challenging. Moreover, since these systems are meant for real-time applications, the employed adaptation technique should not add much latency to the system response. To address these issues, a simple cluster model interpolation based approach for on-line adaptation is presented in this work. The proposed approach employs an OMP based search scheme to select a set of acoustically close models from a set of pre-trained cluster models. The selected cluster models are then linearly interpolated to derive the adapted model parameters. In this work, these interpolation weights are derived from the sparse coefficients in an approximate manner. Such an approximate approach helps in avoiding the iterative ML weight estimation usually employed in existing techniques. The proposed adaptation approach though not optimal, is found to be effective for on-line adaptation. The same has been verified in this work for an LVCSR task and also for an Assamese name recognition system which is a typical example of such query systems.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于低复杂度聚类模型插值的语音查询在线自适应技术

本文提出的工作描述了语音查询系统背景下的在线适应问题。在这样的系统中，可用的适应数据非常少(≤3秒)。因此，适应这样的系统变得极具挑战性。此外，由于这些系统是用于实时应用程序的，因此所采用的自适应技术不应该给系统响应增加太多延迟。为了解决这些问题，本文提出了一种简单的基于聚类模型插值的在线自适应方法。该方法采用一种基于OMP的搜索方案，从一组预训练的聚类模型中选择一组声学接近模型。然后对所选的聚类模型进行线性插值，以得到自适应的模型参数。在这项工作中，这些插值权值以一种近似的方式从稀疏系数中导出。这种近似方法有助于避免现有技术中通常使用的迭代ML权重估计。所提出的自适应方法虽然不是最优的，但对在线自适应是有效的。在LVCSR任务和阿萨姆语名称识别系统(这是此类查询系统的典型示例)的工作中也验证了这一点。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

The 9th International Symposium on Chinese Spoken Language Processing

自引率

0.00%

发文量

期刊最新文献

The modeling of tongue tip in Standard Chinese using MRI Prosody modeling for Uyghur TTS Research on truncated speech in speaker verification The undulating scale of intonations of exclamatory sentences in Uyghur from the view of experimental phonetics Effects of preceding contexts on the categorical perception of Mandarin tones