{"title":"使用9维噪声估计算法和可控前向三月平均的单通道语音增强","authors":"D. Farrokhi, R. Togneri, A. Zaknich","doi":"10.1109/ICOSP.2008.4697058","DOIUrl":null,"url":null,"abstract":"A post processing technique is proposed to enhance speech in a single channel system. A new noise estimation algorithm is proposed in conjunction with the Controlled Forward March Averaging (CFMA) technique to enhance speech in a single channel non-stationary noisy system. We introduce a 9-Dimensional Noise Estimation (NDNE) algorithm to the Single Channel Speech Estimation (SCSE) system, that updates the estimated noise in 9 frequency sub-bands, by averaging the noisy speech power spectrum using a time and frequency dependent smoothing factor. A signal presence probability factor is calculated by computing the ratio of the noisy speech power spectrum to its local minimum, which is computed by averaging past values of the noisy speech power spectra with a look-ahead factor. The NDNE uses a non-linear thresholding map as oppose to the conventional linear thresholding. This new algorithm produced an average 7% improvement in 0 and -2.5 dB global SNR in speech corrupted with modified Babble noise. Subjective tests confirmed these results.","PeriodicalId":445699,"journal":{"name":"2008 9th International Conference on Signal Processing","volume":"43 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Single channel speech enhancement using a 9 Dimensional Noise Estimation algorithm and Controlled Forward March Averaging\",\"authors\":\"D. Farrokhi, R. Togneri, A. Zaknich\",\"doi\":\"10.1109/ICOSP.2008.4697058\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A post processing technique is proposed to enhance speech in a single channel system. A new noise estimation algorithm is proposed in conjunction with the Controlled Forward March Averaging (CFMA) technique to enhance speech in a single channel non-stationary noisy system. We introduce a 9-Dimensional Noise Estimation (NDNE) algorithm to the Single Channel Speech Estimation (SCSE) system, that updates the estimated noise in 9 frequency sub-bands, by averaging the noisy speech power spectrum using a time and frequency dependent smoothing factor. A signal presence probability factor is calculated by computing the ratio of the noisy speech power spectrum to its local minimum, which is computed by averaging past values of the noisy speech power spectra with a look-ahead factor. The NDNE uses a non-linear thresholding map as oppose to the conventional linear thresholding. This new algorithm produced an average 7% improvement in 0 and -2.5 dB global SNR in speech corrupted with modified Babble noise. Subjective tests confirmed these results.\",\"PeriodicalId\":445699,\"journal\":{\"name\":\"2008 9th International Conference on Signal Processing\",\"volume\":\"43 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2008-12-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2008 9th International Conference on Signal Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICOSP.2008.4697058\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 9th International Conference on Signal Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICOSP.2008.4697058","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Single channel speech enhancement using a 9 Dimensional Noise Estimation algorithm and Controlled Forward March Averaging
A post processing technique is proposed to enhance speech in a single channel system. A new noise estimation algorithm is proposed in conjunction with the Controlled Forward March Averaging (CFMA) technique to enhance speech in a single channel non-stationary noisy system. We introduce a 9-Dimensional Noise Estimation (NDNE) algorithm to the Single Channel Speech Estimation (SCSE) system, that updates the estimated noise in 9 frequency sub-bands, by averaging the noisy speech power spectrum using a time and frequency dependent smoothing factor. A signal presence probability factor is calculated by computing the ratio of the noisy speech power spectrum to its local minimum, which is computed by averaging past values of the noisy speech power spectra with a look-ahead factor. The NDNE uses a non-linear thresholding map as oppose to the conventional linear thresholding. This new algorithm produced an average 7% improvement in 0 and -2.5 dB global SNR in speech corrupted with modified Babble noise. Subjective tests confirmed these results.