Correction of distortion mask speech based on parameter estimation of AR model

2016 International Conference on Audio, Language and Image Processing (ICALIP) Pub Date : 2016-07-01 DOI:10.1109/ICALIP.2016.7846621

Wang Guang-yan, Zhao Chen-Yu, Xue Xiaozhen, Zhang Jing, Zhao Xiao-qun

引用次数: 1

Abstract

The generation model of speech signal has been regarded as an all-pole AR model. Distortion will happen when normal speech is disturbed or interfered. In this paper, we proposed a new signal model excited by the non-white noise signal to represent transfer function of a closed oxygen mask. Using LPC method to find the parameters of the all-pole signal model from the practical distortion signal, the prediction model is in accordance with the theoretical estimated of AR model Consequently, we can design the transfer function of the inverse filter with respect to the transfer function of the estimated model. The inverse filter is in series connection with the distortion filter in order to correct the distortion speech recorded by wearing the mask. By comparing the waveforms, normalized spectrums and spectrograms among the normal speech, the distortion speech, and the corrected speech using the proposed method, the experiment results indicate the feasibility and availability of the proposed method.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于AR模型参数估计的失真掩模语音校正

语音信号的生成模型被认为是一个全极AR模型。当正常的言语受到干扰或干扰时，就会发生畸变。本文提出了一种新的由非白噪声信号激励的信号模型来表示封闭式氧气面罩的传递函数。利用LPC方法从实际失真信号中求出全极信号模型的参数，使预测模型与AR模型的理论估计一致，从而可以根据估计模型的传递函数设计逆滤波器的传递函数。反滤波器与失真滤波器串联，以校正戴口罩录下的失真语音。通过对正常语音、失真语音和校正语音的波形、归一化频谱和频谱图的比较，实验结果表明了所提方法的可行性和有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2016 International Conference on Audio, Language and Image Processing (ICALIP)

自引率

0.00%

发文量