{"title":"基于AHS和HMM评分融合的增强说话人识别","authors":"T. Islam, S. Mangayyagari, R. Sankar","doi":"10.1109/SECON.2007.342843","DOIUrl":null,"url":null,"abstract":"Speaker recognition history dates back to some four decades, and yet it has not been reliable enough to be considered as a standalone security system. This paper focuses on the enhancement of speaker recognition through fusion of likelihood scores generated by arithmetic harmonic sphericity (AHS) and hidden Markov model (HMM) techniques. Due to the contrastive nature of AHS and HMM, we have observed a significant performance improvement of 22% and 6% true acceptance rate at 5% false acceptance rate, when this fusion technique was evaluated on two different datasets - YOHO and USF multimodal biometric dataset, respectively. Performance enhancement has been achieved on both the datasets, however performance on YOHO was comparatively higher than that on USF dataset, owing to the fact that USF dataset is a noisy outdoor dataset whereas YOHO is an indoor dataset.","PeriodicalId":423683,"journal":{"name":"Proceedings 2007 IEEE SoutheastCon","volume":"215 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-03-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Enhanced speaker recognition based on score level fusion of AHS and HMM\",\"authors\":\"T. Islam, S. Mangayyagari, R. Sankar\",\"doi\":\"10.1109/SECON.2007.342843\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Speaker recognition history dates back to some four decades, and yet it has not been reliable enough to be considered as a standalone security system. This paper focuses on the enhancement of speaker recognition through fusion of likelihood scores generated by arithmetic harmonic sphericity (AHS) and hidden Markov model (HMM) techniques. Due to the contrastive nature of AHS and HMM, we have observed a significant performance improvement of 22% and 6% true acceptance rate at 5% false acceptance rate, when this fusion technique was evaluated on two different datasets - YOHO and USF multimodal biometric dataset, respectively. Performance enhancement has been achieved on both the datasets, however performance on YOHO was comparatively higher than that on USF dataset, owing to the fact that USF dataset is a noisy outdoor dataset whereas YOHO is an indoor dataset.\",\"PeriodicalId\":423683,\"journal\":{\"name\":\"Proceedings 2007 IEEE SoutheastCon\",\"volume\":\"215 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2007-03-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings 2007 IEEE SoutheastCon\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SECON.2007.342843\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings 2007 IEEE SoutheastCon","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SECON.2007.342843","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Enhanced speaker recognition based on score level fusion of AHS and HMM
Speaker recognition history dates back to some four decades, and yet it has not been reliable enough to be considered as a standalone security system. This paper focuses on the enhancement of speaker recognition through fusion of likelihood scores generated by arithmetic harmonic sphericity (AHS) and hidden Markov model (HMM) techniques. Due to the contrastive nature of AHS and HMM, we have observed a significant performance improvement of 22% and 6% true acceptance rate at 5% false acceptance rate, when this fusion technique was evaluated on two different datasets - YOHO and USF multimodal biometric dataset, respectively. Performance enhancement has been achieved on both the datasets, however performance on YOHO was comparatively higher than that on USF dataset, owing to the fact that USF dataset is a noisy outdoor dataset whereas YOHO is an indoor dataset.