Correction of distortion mask speech based on parameter estimation of AR model
Wang Guang-yan, Zhao Chen-Yu, Xue Xiaozhen, Zhang Jing, Zhao Xiao-qun
2016 International Conference on Audio, Language and Image Processing (ICALIP), published 2016-07-01
DOI: 10.1109/ICALIP.2016.7846621
Citations: 1
Abstract
Speech production is commonly modeled as an all-pole autoregressive (AR) process. Distortion occurs when normal speech is disturbed or interfered with. In this paper, we propose a new signal model, excited by a non-white noise signal, to represent the transfer function of a closed oxygen mask. Using linear predictive coding (LPC) to find the parameters of the all-pole signal model from the recorded distorted signal, we obtain a prediction model that agrees with the theoretical estimate of the AR model. Consequently, we can design the transfer function of an inverse filter from the transfer function of the estimated model. The inverse filter is connected in series with the distortion filter in order to correct the distorted speech recorded while wearing the mask. Comparing the waveforms, normalized spectra, and spectrograms of the normal speech, the distorted speech, and the corrected speech indicates the feasibility and effectiveness of the proposed method.
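The pipeline described in the abstract (estimate all-pole parameters with LPC, then place the inverse filter in series with the distortion) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes the autocorrelation method with the Levinson-Durbin recursion, a second-order model, and a white-noise stand-in for the undistorted signal, whereas the paper uses a non-white excitation. All function names and the coefficients in `a_true` are hypothetical and chosen only for illustration.

```python
import random


def autocorr(x, max_lag):
    """Biased autocorrelation r[0..max_lag] of the sequence x."""
    n = len(x)
    return [sum(x[i] * x[i + k] for i in range(n - k)) for k in range(max_lag + 1)]


def levinson_durbin(r, order):
    """Solve the LPC normal equations.

    Returns (a, err) where a = [1, a1, ..., ap] defines the prediction-error
    filter A(z) = 1 + a1 z^-1 + ... + ap z^-p and err is the residual energy.
    """
    a = [1.0] + [0.0] * order
    err = r[0]
    for i in range(1, order + 1):
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        k = -acc / err  # reflection coefficient for this stage
        new_a = a[:]
        for j in range(1, i):
            new_a[j] = a[j] + k * a[i - j]
        new_a[i] = k
        a = new_a
        err *= 1.0 - k * k
    return a, err


def all_pole_filter(a, x):
    """Apply H(z) = 1 / A(z): y[n] = x[n] - sum_{k>=1} a[k] * y[n-k]."""
    y = []
    for n, xn in enumerate(x):
        acc = xn
        for k in range(1, len(a)):
            if n - k >= 0:
                acc -= a[k] * y[n - k]
        y.append(acc)
    return y


def inverse_filter(a, y):
    """Apply the FIR inverse A(z): e[n] = sum_{k>=0} a[k] * y[n-k]."""
    return [
        sum(a[k] * y[n - k] for k in range(len(a)) if n - k >= 0)
        for n in range(len(y))
    ]


# Hypothetical stable all-pole "mask" model (poles at radius ~0.71).
a_true = [1.0, -0.9, 0.5]

rng = random.Random(0)
clean = [rng.gauss(0.0, 1.0) for _ in range(4000)]  # white-noise stand-in signal
distorted = all_pole_filter(a_true, clean)          # simulated mask distortion

# Estimate the model from the distorted signal only, then correct it.
a_hat, residual_energy = levinson_durbin(autocorr(distorted, 2), 2)
corrected = inverse_filter(a_hat, distorted)

print("estimated coefficients:", a_hat)
```

With the exact coefficients the inverse filter cancels the all-pole filter sample by sample; with estimated coefficients the correction is only as good as the LPC fit. For real speech the excitation is not white, so the estimated model would also absorb part of the speech spectrum, which is presumably why the paper introduces a non-white excitation model.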