{"title":"基于sftrls的语音增强方法,利用CNN确定噪声类型和最佳遗忘因子","authors":"De-You Tang, Guoqiang Chen","doi":"10.1109/PRML52754.2021.9520741","DOIUrl":null,"url":null,"abstract":"This paper presents a speech enhancement method combining the convolutional neural network (CNN) and SFTRLS, SFTRLS-CNN, which consists of two tiers of CNN to customize parameters for the SFTRLS algorithm. The first CNN identifies noise type, and the second CNN matches the best forgetting factor. The experimental results show that the noise recognition rate of SFTRLS-CNN goes up to 99.97% and displays better performance than the k-nearest neighbor (KNN) and the support vector machine (SVM). The accuracy ratio of matching the best forgetting factor for the SFTRLS is up to 99.40%. The improvement of the perceptual evaluation of speech quality (PESQ) is 23%, and the decrease of log-spectral distortion (LSD) is 4% on average. SFTRLS-CNN also improves the SNR of all speeches significantly.","PeriodicalId":429603,"journal":{"name":"2021 IEEE 2nd International Conference on Pattern Recognition and Machine Learning (PRML)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"SFTRLS-Based Speech Enhancement Method Using CNN to Determine the Noise Type and the Optimal Forgetting Factor\",\"authors\":\"De-You Tang, Guoqiang Chen\",\"doi\":\"10.1109/PRML52754.2021.9520741\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents a speech enhancement method combining the convolutional neural network (CNN) and SFTRLS, SFTRLS-CNN, which consists of two tiers of CNN to customize parameters for the SFTRLS algorithm. The first CNN identifies noise type, and the second CNN matches the best forgetting factor. The experimental results show that the noise recognition rate of SFTRLS-CNN goes up to 99.97% and displays better performance than the k-nearest neighbor (KNN) and the support vector machine (SVM). The accuracy ratio of matching the best forgetting factor for the SFTRLS is up to 99.40%. The improvement of the perceptual evaluation of speech quality (PESQ) is 23%, and the decrease of log-spectral distortion (LSD) is 4% on average. SFTRLS-CNN also improves the SNR of all speeches significantly.\",\"PeriodicalId\":429603,\"journal\":{\"name\":\"2021 IEEE 2nd International Conference on Pattern Recognition and Machine Learning (PRML)\",\"volume\":\"20 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-07-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 IEEE 2nd International Conference on Pattern Recognition and Machine Learning (PRML)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/PRML52754.2021.9520741\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 2nd International Conference on Pattern Recognition and Machine Learning (PRML)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PRML52754.2021.9520741","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
SFTRLS-Based Speech Enhancement Method Using CNN to Determine the Noise Type and the Optimal Forgetting Factor
This paper presents a speech enhancement method combining the convolutional neural network (CNN) and SFTRLS, SFTRLS-CNN, which consists of two tiers of CNN to customize parameters for the SFTRLS algorithm. The first CNN identifies noise type, and the second CNN matches the best forgetting factor. The experimental results show that the noise recognition rate of SFTRLS-CNN goes up to 99.97% and displays better performance than the k-nearest neighbor (KNN) and the support vector machine (SVM). The accuracy ratio of matching the best forgetting factor for the SFTRLS is up to 99.40%. The improvement of the perceptual evaluation of speech quality (PESQ) is 23%, and the decrease of log-spectral distortion (LSD) is 4% on average. SFTRLS-CNN also improves the SNR of all speeches significantly.