{"title":"基于小波分解的语音识别信道加权方法","authors":"Jyh-Shing Shyuu, Jhing-Fa Wang, Chung-Hsien Wu","doi":"10.1109/APCCAS.1994.514604","DOIUrl":null,"url":null,"abstract":"A decomposition of signal into a set of frequency channels of equal bandwidth on a logarithmic scale, i.e., an analysis of the signal using constant Q filters, using wavelet and multiresolution analysis is used in this paper to derive cepstrum features of different spatial frequency bands. Based on the decompositions, each channel is modeled as a Bayesian subnetwork and each subnetwork is weighted by a weighting algorithm. The distortions for speech recognition between a reference model and the input vectors are then computed by summing the weighted scores of all decomposed channels. The experimental results show that the recognition rate of this method is superior to those non-weighting methods.","PeriodicalId":231368,"journal":{"name":"Proceedings of APCCAS'94 - 1994 Asia Pacific Conference on Circuits and Systems","volume":"52 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1994-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"A channel-weighting method for speech recognition using wavelet decompositions\",\"authors\":\"Jyh-Shing Shyuu, Jhing-Fa Wang, Chung-Hsien Wu\",\"doi\":\"10.1109/APCCAS.1994.514604\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A decomposition of signal into a set of frequency channels of equal bandwidth on a logarithmic scale, i.e., an analysis of the signal using constant Q filters, using wavelet and multiresolution analysis is used in this paper to derive cepstrum features of different spatial frequency bands. Based on the decompositions, each channel is modeled as a Bayesian subnetwork and each subnetwork is weighted by a weighting algorithm. The distortions for speech recognition between a reference model and the input vectors are then computed by summing the weighted scores of all decomposed channels. The experimental results show that the recognition rate of this method is superior to those non-weighting methods.\",\"PeriodicalId\":231368,\"journal\":{\"name\":\"Proceedings of APCCAS'94 - 1994 Asia Pacific Conference on Circuits and Systems\",\"volume\":\"52 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"1994-12-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of APCCAS'94 - 1994 Asia Pacific Conference on Circuits and Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/APCCAS.1994.514604\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of APCCAS'94 - 1994 Asia Pacific Conference on Circuits and Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/APCCAS.1994.514604","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A channel-weighting method for speech recognition using wavelet decompositions
A decomposition of signal into a set of frequency channels of equal bandwidth on a logarithmic scale, i.e., an analysis of the signal using constant Q filters, using wavelet and multiresolution analysis is used in this paper to derive cepstrum features of different spatial frequency bands. Based on the decompositions, each channel is modeled as a Bayesian subnetwork and each subnetwork is weighted by a weighting algorithm. The distortions for speech recognition between a reference model and the input vectors are then computed by summing the weighted scores of all decomposed channels. The experimental results show that the recognition rate of this method is superior to those non-weighting methods.