Internet speech denoising method based on IGAN algorithm

J. Comput. Methods Sci. Eng. Pub Date : 2023-06-15 DOI:10.3233/jcm-226798

Sanchuan Luo

{"title":"Internet speech denoising method based on IGAN algorithm","authors":"Sanchuan Luo","doi":"10.3233/jcm-226798","DOIUrl":null,"url":null,"abstract":"At present, to settle the question of excessive noise in the speech signal during the call of mobile devices in China, the research proposes that the Wiener filter and the generative adversarial network are combined into the IGAN algorithm. Firstly, the Wiener filter regularization algorithm is introduced to construct the preprocessing model of the speech signal; then the preprocessing model is fused with the generative adversarial network algorithm to construct the denoising model. Finally, the performance analysis and simulation experiments of the application effect of the model are carried out. The results show that in the experiment comparing IGAN with five traditional algorithms, when the SNR ratio is increased to 17.5 dB, the MOS and PESQ scores under the IGAN method can reach 4.9 and 3.5 respectively, and the DNN effect is second only to IGAN. Other algorithms perform poorly. Then compare the number of iterations and the loss value between the two. When the network voice signal begins to converge, the loss value corresponding to DNN is 1.132; while the loss value of IGAN is about 0.573, it can be found that the loss value of IGAN has dropped by half, which shows that IGAN Build the model with a smaller loss value. And IGAN tends to converge when iteratively is performed for about 200 times, and the average peak SNR can reach up to 33.85 dB, an increase of nearly 1.02 dB, and the effect is remarkable. This all shows that the IGAN algorithm has the best denoising performance for network speech signals, improves the denoising efficiency, and is conducive to obtaining a denoising signal with a higher fit with the clean signal, so that mobile devices can better serve the people.","PeriodicalId":14668,"journal":{"name":"J. Comput. Methods Sci. Eng.","volume":"21 1","pages":"1929-1940"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"J. Comput. Methods Sci. Eng.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3233/jcm-226798","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

At present, to settle the question of excessive noise in the speech signal during the call of mobile devices in China, the research proposes that the Wiener filter and the generative adversarial network are combined into the IGAN algorithm. Firstly, the Wiener filter regularization algorithm is introduced to construct the preprocessing model of the speech signal; then the preprocessing model is fused with the generative adversarial network algorithm to construct the denoising model. Finally, the performance analysis and simulation experiments of the application effect of the model are carried out. The results show that in the experiment comparing IGAN with five traditional algorithms, when the SNR ratio is increased to 17.5 dB, the MOS and PESQ scores under the IGAN method can reach 4.9 and 3.5 respectively, and the DNN effect is second only to IGAN. Other algorithms perform poorly. Then compare the number of iterations and the loss value between the two. When the network voice signal begins to converge, the loss value corresponding to DNN is 1.132; while the loss value of IGAN is about 0.573, it can be found that the loss value of IGAN has dropped by half, which shows that IGAN Build the model with a smaller loss value. And IGAN tends to converge when iteratively is performed for about 200 times, and the average peak SNR can reach up to 33.85 dB, an increase of nearly 1.02 dB, and the effect is remarkable. This all shows that the IGAN algorithm has the best denoising performance for network speech signals, improves the denoising efficiency, and is conducive to obtaining a denoising signal with a higher fit with the clean signal, so that mobile devices can better serve the people.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于IGAN算法的互联网语音去噪方法

目前，针对国内移动设备通话过程中语音信号噪声过大的问题，研究提出将维纳滤波器和生成对抗网络结合到IGAN算法中。首先，引入维纳滤波正则化算法构建语音信号预处理模型;然后将预处理模型与生成式对抗网络算法相融合，构建去噪模型。最后，对模型的应用效果进行了性能分析和仿真实验。结果表明，在IGAN与5种传统算法的对比实验中，当信噪比提高到17.5 dB时，IGAN方法下的MOS和PESQ得分分别可以达到4.9和3.5,DNN效果仅次于IGAN。其他算法表现不佳。然后比较两者之间的迭代次数和损失值。当网络语音信号开始收敛时，DNN对应的损失值为1.132;而IGAN的loss值约为0.573，可以发现IGAN的loss值下降了一半，说明IGAN构建的是loss值较小的模型。迭代200次左右，IGAN趋于收敛，平均峰值信噪比可达33.85 dB，提高近1.02 dB，效果显著。这都说明IGAN算法对网络语音信号具有最佳的去噪性能，提高了去噪效率，有利于得到与干净信号更贴合的去噪信号，使移动设备更好地为人们服务。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

J. Comput. Methods Sci. Eng.

自引率

0.00%

发文量