Hsin-Ju Hsieh, Jhih-Hao Jheng, Jung-Shan Lin, J. Hung
{"title":"Linear prediction filtering on cepstral time series for noise-robust speech recognition","authors":"Hsin-Ju Hsieh, Jhih-Hao Jheng, Jung-Shan Lin, J. Hung","doi":"10.1109/ICCE-TW.2016.7521043","DOIUrl":null,"url":null,"abstract":"In this paper, we propose adopting the algorithm of linear prediction coding (LPC) to proceeds the temporal feature streams in speech recognition for noise robustness. Using LPC, an FIR filter can be obtained and applied to the time series of Mel-frequency cepstral coefficients (MFCC), and in general the fast-varying component in the modulation spectrum of MFCC can be alleviated accordingly. We have found that the smoothing of MFCC modulation spectrum helps to reduce the noise effect and enhance noise robustness of MFCC. Experiments conducted on the Aurora-2 connected digit database shows that the proposed LPC-wise method improves the recognition accuracy of MVN- and HEQ-preprocessed MFCC under a wide range of noise-corrupted situations.","PeriodicalId":6620,"journal":{"name":"2016 IEEE International Conference on Consumer Electronics-Taiwan (ICCE-TW)","volume":"27 1","pages":"1-2"},"PeriodicalIF":0.0000,"publicationDate":"2016-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE International Conference on Consumer Electronics-Taiwan (ICCE-TW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCE-TW.2016.7521043","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
In this paper, we propose adopting the algorithm of linear prediction coding (LPC) to proceeds the temporal feature streams in speech recognition for noise robustness. Using LPC, an FIR filter can be obtained and applied to the time series of Mel-frequency cepstral coefficients (MFCC), and in general the fast-varying component in the modulation spectrum of MFCC can be alleviated accordingly. We have found that the smoothing of MFCC modulation spectrum helps to reduce the noise effect and enhance noise robustness of MFCC. Experiments conducted on the Aurora-2 connected digit database shows that the proposed LPC-wise method improves the recognition accuracy of MVN- and HEQ-preprocessed MFCC under a wide range of noise-corrupted situations.