{"title":"语音信号表示的线性预测与Gabor变换的比较","authors":"S. M. Tahir, A.Z. Sha 'ameri","doi":"10.1109/APCC.2003.1274482","DOIUrl":null,"url":null,"abstract":"Feature extraction from speech representation is one of the processes in speech recognition. Parametric modeling is a dominant approach to model speech signals. Within a localized interval, speech representation is equivalent to a noise driven output from an all-pole system that can be estimated using linear prediction. Besides the characteristics of speech, temporal variability of speech signal model is also due to the computation of linear prediction coefficients. Thus, an alternative representation is proposed based on the Gabor coefficients. In this paper, a comparison is made with the linear prediction coefficients to show the consistency of the parameters that are generated for implementation in the speech recognition system.","PeriodicalId":277507,"journal":{"name":"9th Asia-Pacific Conference on Communications (IEEE Cat. No.03EX732)","volume":"464 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2003-09-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A comparison between speech signal representation using linear prediction and Gabor transform\",\"authors\":\"S. M. Tahir, A.Z. Sha 'ameri\",\"doi\":\"10.1109/APCC.2003.1274482\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Feature extraction from speech representation is one of the processes in speech recognition. Parametric modeling is a dominant approach to model speech signals. Within a localized interval, speech representation is equivalent to a noise driven output from an all-pole system that can be estimated using linear prediction. Besides the characteristics of speech, temporal variability of speech signal model is also due to the computation of linear prediction coefficients. Thus, an alternative representation is proposed based on the Gabor coefficients. In this paper, a comparison is made with the linear prediction coefficients to show the consistency of the parameters that are generated for implementation in the speech recognition system.\",\"PeriodicalId\":277507,\"journal\":{\"name\":\"9th Asia-Pacific Conference on Communications (IEEE Cat. No.03EX732)\",\"volume\":\"464 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2003-09-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"9th Asia-Pacific Conference on Communications (IEEE Cat. No.03EX732)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/APCC.2003.1274482\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"9th Asia-Pacific Conference on Communications (IEEE Cat. No.03EX732)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/APCC.2003.1274482","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A comparison between speech signal representation using linear prediction and Gabor transform
Feature extraction from speech representation is one of the processes in speech recognition. Parametric modeling is a dominant approach to model speech signals. Within a localized interval, speech representation is equivalent to a noise driven output from an all-pole system that can be estimated using linear prediction. Besides the characteristics of speech, temporal variability of speech signal model is also due to the computation of linear prediction coefficients. Thus, an alternative representation is proposed based on the Gabor coefficients. In this paper, a comparison is made with the linear prediction coefficients to show the consistency of the parameters that are generated for implementation in the speech recognition system.