Mel Filter Bank energy-based Slope feature and its application to speaker recognition
S. Madikeri, H. Murthy
2011 National Conference on Communications (NCC), 2011-03-17
DOI: 10.1109/NCC.2011.5734713 (https://doi.org/10.1109/NCC.2011.5734713)
This paper investigates the use of the Mel Filterbank Slope (MFS) feature for speaker recognition tasks. Compared with conventional Mel Frequency Cepstral Coefficients (MFCC), the MFS feature places greater emphasis on formants. Its effectiveness is evaluated on the NIST 2003 speaker recognition database. Results show a significant gain in performance: speaker identification accuracy improves by 8.9% and speaker verification EER improves by 1.6%, with no additional computational cost. Combining the MFS feature with the delta MFCC feature yields further improvements of 2.7% and 1.2% in the respective tasks. Late fusion of speaker verification systems is shown to give an overall improvement of 3%.
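The abstract does not say how the slope is computed, so the following is only an illustrative sketch and not the authors' method. It assumes the slope is a least-squares fit of log Mel filterbank energies over a small window of neighbouring filters within each frame. The function name `filterbank_slope` and the parameter `half_width` are hypothetical and are not taken from the paper.

```python
import numpy as np


def filterbank_slope(log_energies, half_width=1):
    """Least-squares slope of log filterbank energies along the filter axis.

    Assumption (not from the paper): the slope at each filter is fitted
    over the 2 * half_width + 1 neighbouring filters centred on it.
    log_energies has shape (..., n_filters); the result has shape
    (..., n_filters - 2 * half_width) because edge filters lack a full window.
    """
    log_energies = np.asarray(log_energies, dtype=float)
    offsets = np.arange(-half_width, half_width + 1)
    denom = float(np.sum(offsets ** 2))

    n_filters = log_energies.shape[-1]
    n_out = n_filters - 2 * half_width
    if half_width < 1 or n_out <= 0:
        raise ValueError("half_width must be >= 1 and fit inside the filterbank")

    # Closed-form regression slope for symmetric offsets:
    # sum_k k * E[c + k] / sum_k k^2
    slopes = np.zeros(log_energies.shape[:-1] + (n_out,))
    for k in offsets:
        start = half_width + k
        slopes += k * log_energies[..., start:start + n_out]
    return slopes / denom
```

With this formulation, energies that rise linearly across filters give a constant slope, while a spectral peak such as a formant gives a positive slope on its rising edge and a negative slope on its falling edge. The cost is a handful of multiply-adds per filter.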