S. A. Jerjees, Hala Jassim Mohammed, Hayder S. Radeaf, Basheera M. Mahmmod, S. Abdulhussain
{"title":"Deep Learning-Based Speech Enhancement Algorithm Using Charlier Transform","authors":"S. A. Jerjees, Hala Jassim Mohammed, Hayder S. Radeaf, Basheera M. Mahmmod, S. Abdulhussain","doi":"10.1109/DeSE58274.2023.10099854","DOIUrl":null,"url":null,"abstract":"Machine learning, a part of artificial intelligence, is recently used in speech enhancement algorithms (SE). The primary focus of SE is finding the original speech signal from the distorted one. Specifically, deep learning is used in SE because it handles nonlinear mapping problems for complicated features. In this paper, Charlier polynomials-based discrete transform, simply discrete Charlier transform (DCHT), has been used to get the spectra of the noisy signal using a fully connected neural network. Deep learning effectively acquires the context information of speech signal and gets enhanced speech with good quality and intelligibility properties. The proposed algorithm is tested experimentally through self-comparison to obtain the best speech enhancement models corresponding to the DCHT parameter. The experiment is performed with different values of the DCHT parameter. In addition, the well-known TIMIT database is used for evaluation purposes. Different speech measures are used in the experiment. The realized results show the ability of the trained model based on DCHT to enhance the speech signal and provide good results on specific conditions.","PeriodicalId":346847,"journal":{"name":"2023 15th International Conference on Developments in eSystems Engineering (DeSE)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 15th International Conference on Developments in eSystems Engineering (DeSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DeSE58274.2023.10099854","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Machine learning, a part of artificial intelligence, is recently used in speech enhancement algorithms (SE). The primary focus of SE is finding the original speech signal from the distorted one. Specifically, deep learning is used in SE because it handles nonlinear mapping problems for complicated features. In this paper, Charlier polynomials-based discrete transform, simply discrete Charlier transform (DCHT), has been used to get the spectra of the noisy signal using a fully connected neural network. Deep learning effectively acquires the context information of speech signal and gets enhanced speech with good quality and intelligibility properties. The proposed algorithm is tested experimentally through self-comparison to obtain the best speech enhancement models corresponding to the DCHT parameter. The experiment is performed with different values of the DCHT parameter. In addition, the well-known TIMIT database is used for evaluation purposes. Different speech measures are used in the experiment. The realized results show the ability of the trained model based on DCHT to enhance the speech signal and provide good results on specific conditions.