Duc Hoang Ha Nguyen, Xiong Xiao, Chng Eng Siong, Haizhou Li
Published in: 2012 8th International Symposium on Chinese Spoken Language Processing, 2012-12-01. DOI: 10.1109/ISCSLP.2012.6423503 (https://doi.org/10.1109/ISCSLP.2012.6423503)
An analysis of vector Taylor series model compensation for non-stationary noise in speech recognition
In this paper, we investigate a feature conditioning method for vector Taylor series (VTS) based model compensation. VTS is a technique that predicts the noisy acoustic model from a clean acoustic model and a noise model. Most previous studies use a single Gaussian noise model, which cannot model noise statistics well, especially in non-stationary noisy environments. We propose combining feature processing with VTS model compensation to handle non-stationary noise more efficiently. In the feature processing stage, the non-stationary characteristics of the noise are reduced, so the processed features are better suited to VTS model compensation with a single Gaussian noise model. Experimental analysis on the AURORA2 task shows that the proposed method has the potential to improve the performance of the VTS method in non-stationary environments if a good noise estimate is available.
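The abstract does not give the compensation formulas, so the following is only a minimal sketch of how VTS model compensation is commonly formulated, not the authors' implementation. It assumes log-spectral features, additive noise with no channel term, diagonal covariances, and the standard mismatch function y = x + log(1 + exp(n - x)) linearized at the means. The function name `vts_compensate` and all parameter names are hypothetical.

```python
import math

def vts_compensate(mu_x, var_x, mu_n, var_n):
    """Compensate one diagonal clean-speech Gaussian with a single
    Gaussian noise model (assumed log-spectral domain, no channel)."""
    mu_y, var_y = [], []
    for mx, vx, mn, vn in zip(mu_x, var_x, mu_n, var_n):
        # Mismatch function y = x + log(1 + exp(n - x)) at the means.
        mu_y.append(mx + math.log1p(math.exp(mn - mx)))
        # Jacobian dy/dx at the expansion point; dy/dn equals 1 - g.
        g = 1.0 / (1.0 + math.exp(mn - mx))
        # First-order propagation of the speech and noise variances.
        var_y.append(g * g * vx + (1.0 - g) ** 2 * vn)
    return mu_y, var_y

# Example: speech and noise at equal log-energy in one frequency bin.
noisy_mean, noisy_var = vts_compensate([2.0], [1.0], [2.0], [3.0])
```

In this sketch the noise enters only through one mean and one variance per dimension. That is the single Gaussian noise model the abstract describes as a poor fit when the noise statistics change over time.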