{"title":"标量量化在快速HMM计算中的应用","authors":"S. Sagayama, Satoshi Takahashi","doi":"10.1109/ICASSP.1995.479402","DOIUrl":null,"url":null,"abstract":"This paper describes an algorithm for reducing the amount of arithmetic operations in the likelihood computation of continuous mixture HMM (CMHMM) with diagonal covariance matrices while retaining high performance. The key points are the use of the scalar quantization of the input observation vector components and table look-up. These make multiplication, squaring and division operations entirely unnecessary in the whole HMM computation (i.e., output probability calculation and trellis/Viterbi computation). It is experimentally proved in an large-vocabulary isolated word recognition task that scalar quantization into no less than 16 levels does not cause significant degradation in the speech recognition performance. Scalar quantization is also utilized in the computation truncation for unlikely distributions; the total number of distribution likelihood computations can be reduced by 66% with only a slight performance degradation. This \"multiplication-free\" HMM algorithm has high potentiality in speech recognition applications on personal computers.","PeriodicalId":300119,"journal":{"name":"1995 International Conference on Acoustics, Speech, and Signal Processing","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1995-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"22","resultStr":"{\"title\":\"On the use of scalar quantization for fast HMM computation\",\"authors\":\"S. Sagayama, Satoshi Takahashi\",\"doi\":\"10.1109/ICASSP.1995.479402\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper describes an algorithm for reducing the amount of arithmetic operations in the likelihood computation of continuous mixture HMM (CMHMM) with diagonal covariance matrices while retaining high performance. 
The key points are the use of the scalar quantization of the input observation vector components and table look-up. These make multiplication, squaring and division operations entirely unnecessary in the whole HMM computation (i.e., output probability calculation and trellis/Viterbi computation). It is experimentally proved in an large-vocabulary isolated word recognition task that scalar quantization into no less than 16 levels does not cause significant degradation in the speech recognition performance. Scalar quantization is also utilized in the computation truncation for unlikely distributions; the total number of distribution likelihood computations can be reduced by 66% with only a slight performance degradation. This \\\"multiplication-free\\\" HMM algorithm has high potentiality in speech recognition applications on personal computers.\",\"PeriodicalId\":300119,\"journal\":{\"name\":\"1995 International Conference on Acoustics, Speech, and Signal Processing\",\"volume\":\"19 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1995-05-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"22\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"1995 International Conference on Acoustics, Speech, and Signal Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICASSP.1995.479402\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"1995 International Conference on Acoustics, Speech, and Signal 
Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICASSP.1995.479402","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
On the use of scalar quantization for fast HMM computation
This paper describes an algorithm for reducing the number of arithmetic operations in the likelihood computation of continuous-mixture HMMs (CMHMMs) with diagonal covariance matrices while retaining high performance. The key ideas are scalar quantization of the input observation vector components and table look-up, which together make multiplication, squaring, and division entirely unnecessary in the whole HMM computation (i.e., output probability calculation and trellis/Viterbi computation). Experiments on a large-vocabulary isolated-word recognition task show that scalar quantization into 16 or more levels causes no significant degradation in recognition performance. Scalar quantization is also used to truncate the computation for unlikely distributions: the total number of distribution likelihood computations can be reduced by 66% with only a slight performance loss. This "multiplication-free" HMM algorithm has high potential for speech recognition applications on personal computers.
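The core idea can be illustrated with a minimal sketch. For a diagonal-covariance Gaussian, the log-density decomposes into a sum of per-dimension terms; if each observation component is scalar-quantized to one of Q reconstruction levels, those terms can be precomputed into a D×Q table, so that runtime evaluation needs only look-ups and additions. The sketch below is an assumption-based illustration of this principle (function names, the nearest-level quantizer, and the use of NumPy are mine, not from the paper, which targets integer table arithmetic on 1990s hardware):

```python
import numpy as np

def build_tables(means, variances, levels):
    """Precompute per-dimension log-likelihood terms for one Gaussian.

    means, variances : (D,) parameters of a diagonal-covariance Gaussian
    levels           : (D, Q) scalar-quantizer reconstruction levels
    returns          : (D, Q) table; entry [d, q] is the log-density
                       contribution of dimension d at level q
    """
    const = -0.5 * np.log(2.0 * np.pi * variances)        # (D,)
    diff = levels - means[:, None]                        # (D, Q)
    return const[:, None] - diff**2 / (2.0 * variances[:, None])

def quantize(x, levels):
    """Map each component x[d] to the index of its nearest level."""
    return np.argmin(np.abs(levels - x[:, None]), axis=1)

def log_likelihood(table, idx):
    """Runtime evaluation: table look-ups and additions only."""
    return table[np.arange(table.shape[0]), idx].sum()
```

The tables are built once per Gaussian at load time, so per-frame evaluation involves no multiplications, squarings, or divisions. The same quantized indices can also serve the paper's second use of scalar quantization: cheap partial sums let unlikely distributions be truncated early.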