{"title":"基于估计理论的语音增强方法","authors":"Mirishkar Sai Ganesh, M. Karthik, B. Patnaik","doi":"10.1109/ICSCN.2017.8085702","DOIUrl":null,"url":null,"abstract":"This contribution presents an efficient technique for the speech enhancement of a signal using statistical estimators which are based on squared magnitude spectra's. In any speech enhancement systems, an estimate of power spectral density is required. As conventional methods for noise elimination fails due to the non-stationary properties of the speech signal, in this context, minimum mean square error (MMSE) and maximum a posterior (MAP) estimators are derived based on Gaussian statistical model. The acquisition function which is obtained in the MAP estimator is same as the acquisition function used in the ideal binary masking. As a binary masking depends on the signal-to-noise ratio (SNR), if the SNR value exceeds 0 dB then the value assumes to be 1 otherwise 0. The results accomplished using the proposed estimator embarked with better enhancement of the speech signal than the standard minimum mean square error spectral power estimator, with low residual noise and low speech distortion.","PeriodicalId":383458,"journal":{"name":"2017 Fourth International Conference on Signal Processing, Communication and Networking (ICSCN)","volume":"238 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An estimation theory-based approach for speech enhancement\",\"authors\":\"Mirishkar Sai Ganesh, M. Karthik, B. Patnaik\",\"doi\":\"10.1109/ICSCN.2017.8085702\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This contribution presents an efficient technique for the speech enhancement of a signal using statistical estimators which are based on squared magnitude spectra's. In any speech enhancement systems, an estimate of power spectral density is required. As conventional methods for noise elimination fails due to the non-stationary properties of the speech signal, in this context, minimum mean square error (MMSE) and maximum a posterior (MAP) estimators are derived based on Gaussian statistical model. The acquisition function which is obtained in the MAP estimator is same as the acquisition function used in the ideal binary masking. As a binary masking depends on the signal-to-noise ratio (SNR), if the SNR value exceeds 0 dB then the value assumes to be 1 otherwise 0. The results accomplished using the proposed estimator embarked with better enhancement of the speech signal than the standard minimum mean square error spectral power estimator, with low residual noise and low speech distortion.\",\"PeriodicalId\":383458,\"journal\":{\"name\":\"2017 Fourth International Conference on Signal Processing, Communication and Networking (ICSCN)\",\"volume\":\"238 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 Fourth International Conference on Signal Processing, Communication and Networking (ICSCN)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICSCN.2017.8085702\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 Fourth International Conference on Signal Processing, Communication and Networking (ICSCN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSCN.2017.8085702","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
An estimation theory-based approach for speech enhancement
This contribution presents an efficient technique for the speech enhancement of a signal using statistical estimators which are based on squared magnitude spectra's. In any speech enhancement systems, an estimate of power spectral density is required. As conventional methods for noise elimination fails due to the non-stationary properties of the speech signal, in this context, minimum mean square error (MMSE) and maximum a posterior (MAP) estimators are derived based on Gaussian statistical model. The acquisition function which is obtained in the MAP estimator is same as the acquisition function used in the ideal binary masking. As a binary masking depends on the signal-to-noise ratio (SNR), if the SNR value exceeds 0 dB then the value assumes to be 1 otherwise 0. The results accomplished using the proposed estimator embarked with better enhancement of the speech signal than the standard minimum mean square error spectral power estimator, with low residual noise and low speech distortion.