{"title":"A generalized subspace approach for enhancing speech corrupted by colored noise","authors":"Y. Hu, P. Loizou","doi":"10.1109/TSA.2003.814458","DOIUrl":null,"url":null,"abstract":"A generalized subspace approach is proposed for enhancement of speech corrupted by colored noise. A nonunitary transform, based on the simultaneous diagonalization of the clean speech and noise covariance matrices, is used to project the noisy signal onto a signal-plus-noise subspace and a noise subspace. The clean signal is estimated by nulling the signal components in the noise subspace and retaining the components in the signal subspace. The applied transform has built-in prewhitening and can therefore be used in general for colored noise. The proposed approach is shown to be a generalization of the approach proposed by Y. Ephraim and H.L. Van Trees (see ibid., vol.3, p.251-66, 1995) for white noise. Two estimators are derived based on the nonunitary transform, one based on time-domain constraints and one based on spectral domain constraints. Objective and subjective measures demonstrate improvements over other subspace-based methods when tested with TIMIT sentences corrupted with speech-shaped noise and multi-talker babble.","PeriodicalId":13155,"journal":{"name":"IEEE Trans. Speech Audio Process.","volume":"46 1","pages":"334-341"},"PeriodicalIF":0.0000,"publicationDate":"2003-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"406","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Trans. Speech Audio Process.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TSA.2003.814458","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 406
Abstract
A generalized subspace approach is proposed for enhancement of speech corrupted by colored noise. A nonunitary transform, based on the simultaneous diagonalization of the clean speech and noise covariance matrices, is used to project the noisy signal onto a signal-plus-noise subspace and a noise subspace. The clean signal is estimated by nulling the signal components in the noise subspace and retaining the components in the signal subspace. The applied transform has built-in prewhitening and can therefore be used in general for colored noise. The proposed approach is shown to be a generalization of the approach proposed by Y. Ephraim and H.L. Van Trees (see ibid., vol.3, p.251-66, 1995) for white noise. Two estimators are derived based on the nonunitary transform, one based on time-domain constraints and one based on spectral domain constraints. Objective and subjective measures demonstrate improvements over other subspace-based methods when tested with TIMIT sentences corrupted with speech-shaped noise and multi-talker babble.
提出了一种广义子空间方法来增强有色噪声干扰下的语音。基于清洁语音和噪声协方差矩阵的同时对角化,采用非酉变换将噪声信号投影到信号加噪声子空间和噪声子空间上。通过消除噪声子空间中的信号分量并保留信号子空间中的分量来估计干净信号。应用的变换有内置的预白,因此可以用于有色噪声。所提出的方法被证明是Y. Ephraim和H.L. Van Trees提出的白噪声方法的推广(同上,第3卷,第251-66页,1995)。推导了基于非酉变换的两个估计量,一个基于时域约束,一个基于谱域约束。在使用被语音形状噪声和多说话者胡言乱语损坏的TIMIT句子进行测试时,客观和主观测量都比其他基于子空间的方法有所改进。