{"title":"A channel-weighting method for speech recognition using wavelet decompositions","authors":"Jyh-Shing Shyuu, Jhing-Fa Wang, Chung-Hsien Wu","doi":"10.1109/APCCAS.1994.514604","DOIUrl":null,"url":null,"abstract":"A decomposition of signal into a set of frequency channels of equal bandwidth on a logarithmic scale, i.e., an analysis of the signal using constant Q filters, using wavelet and multiresolution analysis is used in this paper to derive cepstrum features of different spatial frequency bands. Based on the decompositions, each channel is modeled as a Bayesian subnetwork and each subnetwork is weighted by a weighting algorithm. The distortions for speech recognition between a reference model and the input vectors are then computed by summing the weighted scores of all decomposed channels. The experimental results show that the recognition rate of this method is superior to those non-weighting methods.","PeriodicalId":231368,"journal":{"name":"Proceedings of APCCAS'94 - 1994 Asia Pacific Conference on Circuits and Systems","volume":"52 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1994-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of APCCAS'94 - 1994 Asia Pacific Conference on Circuits and Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/APCCAS.1994.514604","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
A decomposition of signal into a set of frequency channels of equal bandwidth on a logarithmic scale, i.e., an analysis of the signal using constant Q filters, using wavelet and multiresolution analysis is used in this paper to derive cepstrum features of different spatial frequency bands. Based on the decompositions, each channel is modeled as a Bayesian subnetwork and each subnetwork is weighted by a weighting algorithm. The distortions for speech recognition between a reference model and the input vectors are then computed by summing the weighted scores of all decomposed channels. The experimental results show that the recognition rate of this method is superior to those non-weighting methods.