Zhaozhao Zhang, Zhongqi Liu, Xiao Chen, Kangle Sun
{"title":"Research on underground speech enhancement technology based on generative adversarial network","authors":"Zhaozhao Zhang, Zhongqi Liu, Xiao Chen, Kangle Sun","doi":"10.1109/ICSPCC55723.2022.9984342","DOIUrl":null,"url":null,"abstract":"Because of the problems such as speech interaction and speech input difficulty caused by high noise intensity and unclear sources in underground mine, this paper proposes a speech enhancement system based on a Time-domain Generative Adversarial network, which is used in the front end of underground communication or speech recognition to improve the quality of speech information transmission and improve work efficiency. Aiming at the problems of ignoring the feature information between channels and training instability when extracting speech information in a time domain generative adversarial network, this paper introduces channel attention and Relativistic Average Generative Adversarial Network to optimize. The experimental results show that compared with other models, the proposed model can more effectively remove the downhole noise.","PeriodicalId":346917,"journal":{"name":"2022 IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC)","volume":"104 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSPCC55723.2022.9984342","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Because of the problems such as speech interaction and speech input difficulty caused by high noise intensity and unclear sources in underground mine, this paper proposes a speech enhancement system based on a Time-domain Generative Adversarial network, which is used in the front end of underground communication or speech recognition to improve the quality of speech information transmission and improve work efficiency. Aiming at the problems of ignoring the feature information between channels and training instability when extracting speech information in a time domain generative adversarial network, this paper introduces channel attention and Relativistic Average Generative Adversarial Network to optimize. The experimental results show that compared with other models, the proposed model can more effectively remove the downhole noise.