SFTRLS-Based Speech Enhancement Method Using CNN to Determine the Noise Type and the Optimal Forgetting Factor
De-You Tang, Guoqiang Chen
2021 IEEE 2nd International Conference on Pattern Recognition and Machine Learning (PRML), 2021-07-16
DOI: 10.1109/PRML52754.2021.9520741 (https://doi.org/10.1109/PRML52754.2021.9520741)
Citations: 0
Abstract
This paper presents SFTRLS-CNN, a speech enhancement method that combines a convolutional neural network (CNN) with the SFTRLS algorithm. The method uses two tiers of CNNs to customize the parameters of SFTRLS: the first CNN identifies the noise type, and the second CNN matches the best forgetting factor. Experimental results show that the noise recognition rate of SFTRLS-CNN reaches up to 99.97%, outperforming both k-nearest neighbor (KNN) and support vector machine (SVM) classifiers. The accuracy of matching the best forgetting factor for SFTRLS is up to 99.40%. On average, the perceptual evaluation of speech quality (PESQ) score improves by 23% and the log-spectral distortion (LSD) decreases by 4%. SFTRLS-CNN also significantly improves the SNR of all speech samples.
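To illustrate the role of the forgetting factor that the second CNN is said to select, the following is a minimal sketch of a conventional exponentially weighted RLS adaptive filter. It is not the SFTRLS recursion used in the paper, which is a fast, stabilized variant of the same family, and it is not the authors' implementation. The function name `rls_filter` and the parameters `order`, `lam`, and `delta` are illustrative assumptions; `lam` stands in for the forgetting factor that SFTRLS-CNN would supply per noise type.

```python
import numpy as np


def rls_filter(reference, primary, order=4, lam=0.99, delta=100.0):
    """Conventional exponentially weighted RLS (assumed stand-in, not SFTRLS).

    reference: input signal fed to the adaptive filter
    primary:   desired signal the filter output is compared against
    lam:       forgetting factor in (0, 1]; smaller values forget old data faster
    delta:     scale of the initial inverse-correlation matrix (assumption)
    Returns the a priori error signal and the final filter weights.
    """
    reference = np.asarray(reference, dtype=float)
    primary = np.asarray(primary, dtype=float)

    w = np.zeros(order)              # adaptive filter weights
    P = delta * np.eye(order)        # inverse correlation matrix estimate
    error = np.zeros(len(primary))

    # Zero-pad so the tap vector is defined for the first samples.
    padded = np.concatenate([np.zeros(order - 1), reference])

    for n in range(len(primary)):
        # Tap vector [u[n], u[n-1], ..., u[n-order+1]]
        x = padded[n:n + order][::-1]
        Px = P @ x
        k = Px / (lam + x @ Px)          # gain vector
        error[n] = primary[n] - w @ x    # a priori estimation error
        w = w + k * error[n]             # weight update
        # P is symmetric, so x^T P equals (P x)^T.
        P = (P - np.outer(k, Px)) / lam
    return error, w
```

A value of `lam` close to 1 averages over a long history and suits slowly varying noise, while a smaller value tracks changes faster at the cost of noisier estimates. This trade-off is one plausible reason for choosing the forgetting factor per noise type, as the abstract describes.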