Internet speech denoising method based on IGAN algorithm

Sanchuan Luo
J. Comput. Methods Sci. Eng., volume 21 1, pages 1929-1940. Published 2023-06-15. DOI: 10.3233/jcm-226798 (https://doi.org/10.3233/jcm-226798)

Abstract

To address excessive noise in speech signals during calls on mobile devices in China, this study combines the Wiener filter with a generative adversarial network (GAN) to form the IGAN algorithm. First, a Wiener filter regularization algorithm is introduced to construct a preprocessing model for the speech signal; the preprocessing model is then fused with the GAN to construct the denoising model. Finally, performance analysis and simulation experiments are carried out to evaluate the model in application. In the experiment comparing IGAN with five traditional algorithms, when the SNR is increased to 17.5 dB, the MOS and PESQ scores of IGAN reach 4.9 and 3.5 respectively; DNN is second only to IGAN, and the other algorithms perform poorly. The number of iterations and the loss values of IGAN and DNN are then compared. When the network speech signal begins to converge, the loss value for DNN is 1.132, while that for IGAN is about 0.573, roughly half, which shows that IGAN builds a model with a smaller loss. IGAN tends to converge after about 200 iterations, and its average peak SNR reaches up to 33.85 dB, an increase of nearly 1.02 dB. These results indicate that the IGAN algorithm has the best denoising performance for network speech signals among the compared methods, improves denoising efficiency, and helps obtain a denoised signal that fits the clean signal more closely, so that mobile devices can better serve users.
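As a rough illustration of the Wiener-filter preprocessing idea, the sketch below computes a textbook per-bin Wiener gain from power spectra and then checks the reported loss comparison arithmetically. It is not the paper's regularized variant or its GAN, neither of which the abstract specifies; the function names `wiener_gain` and `relative_reduction`, the `floor` parameter, and the example power values are assumptions made for illustration only.

```python
def wiener_gain(noisy_power, noise_power, floor=0.0):
    """Textbook Wiener gain per frequency bin (assumed form, not the paper's).

    The speech power is estimated by power subtraction, clipped at zero,
    and the gain is speech_power / noisy_power, never below `floor`.
    """
    gains = []
    for p_y, p_n in zip(noisy_power, noise_power):
        if p_y <= 0.0:
            # No energy in this bin: fall back to the floor value.
            gains.append(floor)
        else:
            speech = max(p_y - p_n, 0.0)
            gains.append(max(speech / p_y, floor))
    return gains


def relative_reduction(baseline, improved):
    """Fractional reduction of `improved` relative to `baseline`."""
    return (baseline - improved) / baseline


# Hypothetical power values for three frequency bins.
print(wiener_gain([4.0, 1.0, 0.5], [1.0, 1.0, 1.0]))  # [0.75, 0.0, 0.0]

# Loss values reported in the abstract: DNN 1.132, IGAN about 0.573.
print(f"{relative_reduction(1.132, 0.573):.1%}")  # 49.4%
```

Bins dominated by noise receive a gain near zero, while bins dominated by speech pass almost unchanged. The second result is consistent with the abstract's statement that the IGAN loss is about half the DNN loss.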