{"title":"Speech Enhancement Exploiting Probabilistic Approach Using Maximum A Posterior","authors":"Xizhong Shen, Su Chenying","doi":"10.1109/ICDSP.2018.8631802","DOIUrl":null,"url":null,"abstract":"We examine spectral subtraction with both amplitude and phase spectra for improved speech enhancement performance by the method of maximum a posterior. Spectral subtraction is a very valid and direct denoising algorithm, but it has a vital problem, i.e., it may generate 'musical noise'. An adaptive harmonic model is utilized. Maximum a posterior is considered to derive the phase estimator, which is extra applied to amplitude spectral subtraction. Different from others, the extra parameters in our algorithm are considered as random variables, and the main extra parameters are amplitude and phase. The phase of the speech signal is assumed to have von Mises circular distribution, and the amplitude is to have normal distribution. The assumptions are applied to Bayesian theory, and we derived the update formulae of the parameters of the speech model, that is, phase estimator and amplitude estimator. Thus, we obtained the phase and amplitude of each harmonic. Simulation results show the further improvement of spectral subtraction.","PeriodicalId":218806,"journal":{"name":"2018 IEEE 23rd International Conference on Digital Signal Processing (DSP)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE 23rd International Conference on Digital Signal Processing (DSP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDSP.2018.8631802","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
We examine spectral subtraction with both amplitude and phase spectra for improved speech enhancement performance by the method of maximum a posterior. Spectral subtraction is a very valid and direct denoising algorithm, but it has a vital problem, i.e., it may generate 'musical noise'. An adaptive harmonic model is utilized. Maximum a posterior is considered to derive the phase estimator, which is extra applied to amplitude spectral subtraction. Different from others, the extra parameters in our algorithm are considered as random variables, and the main extra parameters are amplitude and phase. The phase of the speech signal is assumed to have von Mises circular distribution, and the amplitude is to have normal distribution. The assumptions are applied to Bayesian theory, and we derived the update formulae of the parameters of the speech model, that is, phase estimator and amplitude estimator. Thus, we obtained the phase and amplitude of each harmonic. Simulation results show the further improvement of spectral subtraction.