{"title":"On the Impact of Gabor Phase for Spectro-Temporal Feature Extraction in Building an ASR System","authors":"Anirban Dutta, G. Prabhakar, C. V. R. Rao","doi":"10.1109/IEMCON51383.2020.9284872","DOIUrl":null,"url":null,"abstract":"Spectro-temporal features have recently showed its usefulness for various speech recognition tasks. Two Dimensional (2-D) Gabor filters are characterized by local patches of spectro-temporal fields which allows them to extract the spectro-temporal information from speech samples. The state of art spectro-temporal features uses the real part of the Gabor filters without any phase offset value. In this work, we analyzed to see whether the Gabor phase have any relevance in the context of spectro-temporal feature extraction in building the acoustic module of a hybrid Automatic Speech Recognition (ASR) system. Different phase offset values are investigated to see if it carries equivalent or complementary information. The experiments are carried out using TIMIT dataset corrupted with different noises at various SNR values. It is found that a Gabor offset phase of 0 degree and 90 degree is equally important in the Gabor filter design for building a robust ASR system.","PeriodicalId":6871,"journal":{"name":"2020 11th IEEE Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON)","volume":"25 1","pages":"0341-0345"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 11th IEEE Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IEMCON51383.2020.9284872","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Spectro-temporal features have recently showed its usefulness for various speech recognition tasks. Two Dimensional (2-D) Gabor filters are characterized by local patches of spectro-temporal fields which allows them to extract the spectro-temporal information from speech samples. The state of art spectro-temporal features uses the real part of the Gabor filters without any phase offset value. In this work, we analyzed to see whether the Gabor phase have any relevance in the context of spectro-temporal feature extraction in building the acoustic module of a hybrid Automatic Speech Recognition (ASR) system. Different phase offset values are investigated to see if it carries equivalent or complementary information. The experiments are carried out using TIMIT dataset corrupted with different noises at various SNR values. It is found that a Gabor offset phase of 0 degree and 90 degree is equally important in the Gabor filter design for building a robust ASR system.