Yan Xiong, Qiang Chen, S. Deng, Sheng Liang, Kai Wang, Jun Zhang, Jie Wang
{"title":"Model-Based Post Filter for Microphone Array Speech Enhancement","authors":"Yan Xiong, Qiang Chen, S. Deng, Sheng Liang, Kai Wang, Jun Zhang, Jie Wang","doi":"10.1109/ICDH.2018.00023","DOIUrl":null,"url":null,"abstract":"Generalized sidelobe canceller (GSC) is wildly used in speech enhancement due to its efficient implementation. However, the conventional GSC has some drawbacks when applied to speech enhancement system. First, it is focused on improving the signal-to-noise ratio (SNR) without considering the characteristics of speech so that is not optimal for speech enhancement applications. Second, the adaptive branch in the GSC does not always estimate the noise in the fixed branch output accurately, especially when the SNR is high, the noise is spatially incoherent, or the spatial incoherent noises and spatial coherent interferences coexist. In this paper, we propose a model-based post filter for the sub-band GSC which is a typical form of the microphone array beamformer. An improved noise estimation method is developed to estimate the noise in the fixed branch output of each sub-band GSC from its adaptive branch output. Then the fixed branch output is filtered by an optimal filter which is constructed according to a GMM model trained by clean speeches and an online-estimated noise model. Experimental results show that the proposed method achieves significant improvement over the conventional sub-band GSC and outperforms several speech enhancement methods in different noisy environments.","PeriodicalId":117854,"journal":{"name":"2018 7th International Conference on Digital Home (ICDH)","volume":"248 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 7th International Conference on Digital Home (ICDH)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDH.2018.00023","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Generalized sidelobe canceller (GSC) is wildly used in speech enhancement due to its efficient implementation. However, the conventional GSC has some drawbacks when applied to speech enhancement system. First, it is focused on improving the signal-to-noise ratio (SNR) without considering the characteristics of speech so that is not optimal for speech enhancement applications. Second, the adaptive branch in the GSC does not always estimate the noise in the fixed branch output accurately, especially when the SNR is high, the noise is spatially incoherent, or the spatial incoherent noises and spatial coherent interferences coexist. In this paper, we propose a model-based post filter for the sub-band GSC which is a typical form of the microphone array beamformer. An improved noise estimation method is developed to estimate the noise in the fixed branch output of each sub-band GSC from its adaptive branch output. Then the fixed branch output is filtered by an optimal filter which is constructed according to a GMM model trained by clean speeches and an online-estimated noise model. Experimental results show that the proposed method achieves significant improvement over the conventional sub-band GSC and outperforms several speech enhancement methods in different noisy environments.