Speech enhancement using orthogonal matching pursuit algorithm
Hongwu Yang, Dongliang Hao, Hongying Sun, Yitong Liu
2014 International Conference on Orange Technologies, published 2014-11-20
DOI: 10.1109/ICOT.2014.6956609 (https://doi.org/10.1109/ICOT.2014.6956609)
Citations: 4
Abstract
This paper proposes a novel approach to speech enhancement based on compressed sensing (CS) theory. Each frame of the noisy speech signal is first sparsified using the discrete cosine transform (DCT). Each frame is then divided into a noisy sub-frame and a clean sub-frame with a soft thresholding method, which yields the thresholded DCT coefficients of the noisy sub-frames. Next, a partial Hadamard ensemble is used as the sensing matrix to obtain compressive measurements of the DCT coefficients of the noisy sub-frame. Finally, orthogonal matching pursuit (OMP) is used to recover the de-noised speech signal from the noisy sub-frame. Both objective and subjective experiments are used to compare the proposed approach with the subspace method and the spectral subtraction method. Experimental results show that the proposed method outperforms the other methods, achieving the highest PESQ, ABX, and MOS scores for Gaussian white noise and most kinds of colored noise.
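The processing chain described in the abstract (DCT, soft thresholding, partial Hadamard measurement, OMP recovery) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract does not give the rule for splitting a frame into noisy and clean sub-frames, so the sketch applies every step to a whole frame. All function names (`dct_matrix`, `soft_threshold`, `partial_hadamard`, `omp`, `enhance_frame`) and the values chosen for the threshold `tau`, the measurement count `m`, and the sparsity `k` are assumptions made for illustration. The frame length is assumed to be a power of two so that a Sylvester Hadamard matrix exists.

```python
import numpy as np

def dct_matrix(n):
    """Orthonormal DCT-II matrix; row k is the k-th cosine basis vector."""
    k = np.arange(n)[:, None]
    t = np.arange(n)[None, :]
    d = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * t + 1) * k / (2 * n))
    d[0, :] /= np.sqrt(2.0)
    return d

def soft_threshold(c, tau):
    """Shrink coefficients toward zero by tau; magnitudes below tau become 0."""
    return np.sign(c) * np.maximum(np.abs(c) - tau, 0.0)

def hadamard(n):
    """Hadamard matrix built by repeated Kronecker products (n a power of two)."""
    assert n > 0 and (n & (n - 1)) == 0, "n must be a power of two"
    h = np.array([[1.0]])
    while h.shape[0] < n:
        h = np.kron(h, np.array([[1.0, 1.0], [1.0, -1.0]]))
    return h

def partial_hadamard(m, n, rng):
    """Keep m randomly chosen rows of the n x n Hadamard matrix.

    Dividing by sqrt(m) gives every column unit Euclidean norm.
    """
    rows = rng.choice(n, size=m, replace=False)
    return hadamard(n)[rows] / np.sqrt(m)

def omp(a, y, k, tol=1e-10):
    """Orthogonal matching pursuit: greedy k-sparse solution of y = a @ x."""
    n = a.shape[1]
    residual = y.copy()
    support = []
    coef = np.zeros(0)
    for _ in range(k):
        if np.linalg.norm(residual) <= tol:
            break
        corr = np.abs(a.T @ residual)
        if support:
            corr[support] = 0.0  # never pick the same column twice
        support.append(int(np.argmax(corr)))
        # Least-squares fit on the current support, then update the residual.
        coef, *_ = np.linalg.lstsq(a[:, support], y, rcond=None)
        residual = y - a[:, support] @ coef
    x = np.zeros(n)
    if support:
        x[support] = coef
    return x

def enhance_frame(frame, m, k, tau, rng):
    """Whole-frame sketch: DCT -> soft threshold -> measure -> OMP -> inverse DCT."""
    n = frame.shape[0]
    d = dct_matrix(n)
    coeffs = soft_threshold(d @ frame, tau)   # sparsify in the DCT domain
    a = partial_hadamard(m, n, rng)           # sensing matrix (m < n)
    y = a @ coeffs                            # compressive measurements
    recovered = omp(a, y, k)                  # k-sparse coefficient estimate
    return d.T @ recovered                    # back to the time domain

# Illustrative use on a synthetic frame (a single DCT basis function plus noise).
rng = np.random.default_rng(0)
t = np.arange(64)
clean = np.cos(np.pi * (2 * t + 1) * 5 / 128)
noisy = clean + 0.1 * rng.standard_normal(64)
enhanced = enhance_frame(noisy, m=32, k=4, tau=0.3, rng=rng)
```

In this sketch the de-noising effect comes from two places: soft thresholding zeroes the small DCT coefficients, and OMP keeps at most `k` coefficients when reconstructing from the measurements. How well recovery works for `m < n` depends on the rows drawn and on how sparse the thresholded coefficients are, so the parameter values above should be treated as placeholders rather than tuned settings.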