Speech enhancement using orthogonal matching pursuit algorithm

2014 International Conference on Orange Technologies Pub Date : 2014-11-20 DOI:10.1109/ICOT.2014.6956609

Hongwu Yang, Dongliang Hao, Hongying Sun, Yitong Liu

引用次数: 4

Abstract

This paper proposes a novel approach for speech enhancement based on compressed sensing (CS) theory. Each frame of noisy speech signal is sparsified firstly by using discrete cosine transform (DCT). Then we divide each frame into the noisy sub-frame and the clear sub-frame with a soft thresholding method to obtain the threholded DCT coefficients of the noisy sub-frames. After that, the partial Hadamard ensemble is used as a sensing matrix to achieve compressive measurement of the DCT coefficients of noisy sub-frame. Finally, We use the orthogonal matching pursuit in order to recover the de-noised speech signal from noisy sub-frame. Both objective and subjective experiments are employed to compare the proposed approach with the subspace method and the spectral subtraction method. Experimental results shows that proposed method outperforms other methods with the highest PESQ, ABX and MOS score for Gaussian white noise and most kinds of colour noise.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

语音增强的正交匹配追踪算法

提出了一种基于压缩感知(CS)理论的语音增强方法。首先利用离散余弦变换(DCT)对含噪语音信号进行稀疏化处理。然后用软阈值法将每帧图像分成有噪子帧和无噪子帧，得到有噪子帧的阈值DCT系数。然后，利用部分Hadamard系综作为感知矩阵，实现对噪声子帧DCT系数的压缩测量。最后，利用正交匹配追踪从噪声子帧中恢复去噪语音信号。通过客观实验和主观实验，将该方法与子空间法和谱减法进行了比较。实验结果表明，该方法对高斯白噪声和大多数彩色噪声具有最高的PESQ、ABX和MOS分数，优于其他方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2014 International Conference on Orange Technologies

自引率

0.00%

发文量