Lizeng Gong, Shanshan Xie, Yan Zhang, Yanjiao Xiong, Xiaoyan Wang, Jun Yu Li
{"title":"A Robust Feature Extraction Method for Sound Signals Based on Gabor and MFCC","authors":"Lizeng Gong, Shanshan Xie, Yan Zhang, Yanjiao Xiong, Xiaoyan Wang, Jun Yu Li","doi":"10.1109/ICCIS56375.2022.9998146","DOIUrl":null,"url":null,"abstract":"Along with the development of data scale and the high complexity of sound signals, feature extraction and classification methods of sound signals have become a major research hotspot. However, the current sound signal feature extraction methods are difficult to accurately and stably provide a high-precision classification effect for the sound signal due to the complex frequency distribution and the influence of noise. Therefore, a robust feature extraction method for sound signals based on multi-scale and multi-directional Gabor filters and Mel frequency cepstral coefficient (MtWGM) was proposed. This method performs preprocessing on the signal by mixing hard threshold and soft threshold wavelet denoising. The Gabor filter is used to the combined energy spectrum of the framed signal, to achieve the effect of relatively more balanced intra-class features and more prominent inter-class features, and finally improve the noise reduction performance and classification accuracy of sound signals. The experiments are conducted out on three different sound signal datasets. Three classifiers are trained to test the effectiveness of extracted features. The experimental results show that the proposed multiple wavelet Gabor_MFCC (MtWGM) method has obtained better classification accuracy and robustness than that of MFCC.","PeriodicalId":398546,"journal":{"name":"2022 6th International Conference on Communication and Information Systems (ICCIS)","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 6th International Conference on Communication and Information Systems (ICCIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCIS56375.2022.9998146","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Along with the development of data scale and the high complexity of sound signals, feature extraction and classification methods of sound signals have become a major research hotspot. However, the current sound signal feature extraction methods are difficult to accurately and stably provide a high-precision classification effect for the sound signal due to the complex frequency distribution and the influence of noise. Therefore, a robust feature extraction method for sound signals based on multi-scale and multi-directional Gabor filters and Mel frequency cepstral coefficient (MtWGM) was proposed. This method performs preprocessing on the signal by mixing hard threshold and soft threshold wavelet denoising. The Gabor filter is used to the combined energy spectrum of the framed signal, to achieve the effect of relatively more balanced intra-class features and more prominent inter-class features, and finally improve the noise reduction performance and classification accuracy of sound signals. The experiments are conducted out on three different sound signal datasets. Three classifiers are trained to test the effectiveness of extracted features. The experimental results show that the proposed multiple wavelet Gabor_MFCC (MtWGM) method has obtained better classification accuracy and robustness than that of MFCC.