A generalized subspace approach for enhancing speech corrupted by colored noise

IEEE Trans. Speech Audio Process. Pub Date: 2003-07-28. DOI: 10.1109/TSA.2003.814458

Y. Hu, P. Loizou

Citations: 406

Abstract

A generalized subspace approach is proposed for enhancement of speech corrupted by colored noise. A nonunitary transform, based on the simultaneous diagonalization of the clean speech and noise covariance matrices, is used to project the noisy signal onto a signal-plus-noise subspace and a noise subspace. The clean signal is estimated by nulling the signal components in the noise subspace and retaining the components in the signal subspace. The applied transform has built-in prewhitening and can therefore be used in general for colored noise. The proposed approach is shown to be a generalization of the approach proposed by Y. Ephraim and H.L. Van Trees (see ibid., vol.3, p.251-66, 1995) for white noise. Two estimators are derived based on the nonunitary transform, one based on time-domain constraints and one based on spectral domain constraints. Objective and subjective measures demonstrate improvements over other subspace-based methods when tested with TIMIT sentences corrupted with speech-shaped noise and multi-talker babble.
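The simultaneous diagonalization described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. The function names, the Cholesky-based whitening route, the estimate of the clean covariance as `R_y - R_d`, and the Wiener-like gain `lam / (lam + mu)` with a fixed multiplier `mu` are all assumptions made for illustration; the abstract only states that a nonunitary transform diagonalizes both covariance matrices and that noise-subspace components are nulled.

```python
import numpy as np


def simultaneous_diagonalization(R_x, R_d):
    """Return (V, lam) with V.T @ R_d @ V = I and V.T @ R_x @ V = diag(lam).

    R_x: clean-speech covariance (symmetric, may be rank deficient).
    R_d: noise covariance (symmetric positive definite).
    """
    L = np.linalg.cholesky(R_d)          # R_d = L @ L.T
    L_inv = np.linalg.inv(L)
    S = L_inv @ R_x @ L_inv.T            # speech covariance after prewhitening
    S = 0.5 * (S + S.T)                  # enforce symmetry against round-off
    lam, U = np.linalg.eigh(S)           # eigenvalues in ascending order
    order = np.argsort(lam)[::-1]        # sort descending: signal subspace first
    V = L_inv.T @ U[:, order]            # nonunitary unless R_d is a scaled identity
    return V, lam[order]


def subspace_estimator(R_y, R_d, mu=1.0):
    """Build a linear estimator H so that x_hat = H @ y.

    Assumes speech and noise are uncorrelated, so R_x = R_y - R_d.
    The gain per component is an illustrative choice (mu > 0 is a
    hypothetical fixed trade-off parameter), not taken from the abstract.
    """
    if mu <= 0:
        raise ValueError("mu must be positive in this sketch")
    V, lam = simultaneous_diagonalization(R_y - R_d, R_d)
    lam_pos = np.maximum(lam, 0.0)       # nonpositive eigenvalues: noise subspace
    gains = lam_pos / (lam_pos + mu)     # zero gain nulls the noise subspace
    return np.linalg.inv(V).T @ np.diag(gains) @ V.T
```

When `R_d` is a scaled identity (white noise), `V` is an orthogonal matrix up to a scale factor and `H` becomes symmetric, which is consistent with the abstract's statement that the approach generalizes the white-noise method. In practice `R_y` and `R_d` would be estimated from noisy frames and noise-only frames respectively, and `H` applied frame by frame.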




Source journal

IEEE Trans. Speech Audio Process.

Self-citation rate

0.00%

Number of articles published