基于AHS和HMM评分融合的增强说话人识别

Proceedings 2007 IEEE SoutheastCon Pub Date : 2007-03-22 DOI:10.1109/SECON.2007.342843

T. Islam, S. Mangayyagari, R. Sankar

{"title":"基于AHS和HMM评分融合的增强说话人识别","authors":"T. Islam, S. Mangayyagari, R. Sankar","doi":"10.1109/SECON.2007.342843","DOIUrl":null,"url":null,"abstract":"Speaker recognition history dates back to some four decades, and yet it has not been reliable enough to be considered as a standalone security system. This paper focuses on the enhancement of speaker recognition through fusion of likelihood scores generated by arithmetic harmonic sphericity (AHS) and hidden Markov model (HMM) techniques. Due to the contrastive nature of AHS and HMM, we have observed a significant performance improvement of 22% and 6% true acceptance rate at 5% false acceptance rate, when this fusion technique was evaluated on two different datasets - YOHO and USF multimodal biometric dataset, respectively. Performance enhancement has been achieved on both the datasets, however performance on YOHO was comparatively higher than that on USF dataset, owing to the fact that USF dataset is a noisy outdoor dataset whereas YOHO is an indoor dataset.","PeriodicalId":423683,"journal":{"name":"Proceedings 2007 IEEE SoutheastCon","volume":"215 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-03-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Enhanced speaker recognition based on score level fusion of AHS and HMM\",\"authors\":\"T. Islam, S. Mangayyagari, R. Sankar\",\"doi\":\"10.1109/SECON.2007.342843\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Speaker recognition history dates back to some four decades, and yet it has not been reliable enough to be considered as a standalone security system. This paper focuses on the enhancement of speaker recognition through fusion of likelihood scores generated by arithmetic harmonic sphericity (AHS) and hidden Markov model (HMM) techniques. Due to the contrastive nature of AHS and HMM, we have observed a significant performance improvement of 22% and 6% true acceptance rate at 5% false acceptance rate, when this fusion technique was evaluated on two different datasets - YOHO and USF multimodal biometric dataset, respectively. Performance enhancement has been achieved on both the datasets, however performance on YOHO was comparatively higher than that on USF dataset, owing to the fact that USF dataset is a noisy outdoor dataset whereas YOHO is an indoor dataset.\",\"PeriodicalId\":423683,\"journal\":{\"name\":\"Proceedings 2007 IEEE SoutheastCon\",\"volume\":\"215 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2007-03-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings 2007 IEEE SoutheastCon\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/SECON.2007.342843\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings 2007 IEEE SoutheastCon","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SECON.2007.342843","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

摘要

说话人识别的历史可以追溯到大约40年前，但它还不够可靠，不能被视为一个独立的安全系统。本文研究了将算术调和球度(AHS)和隐马尔可夫模型(HMM)生成的似然分数融合在一起，增强说话人识别。由于AHS和HMM的对比性质，我们观察到，当这种融合技术分别在YOHO和USF多模态生物特征数据集上进行评估时，在5%的错误接受率下，其性能显著提高了22%和6%的真实接受率。两种数据集的性能都得到了提高，但由于USF数据集是一个有噪声的室外数据集，而YOHO数据集是一个室内数据集，因此YOHO数据集的性能相对高于USF数据集。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Enhanced speaker recognition based on score level fusion of AHS and HMM

Speaker recognition history dates back to some four decades, and yet it has not been reliable enough to be considered as a standalone security system. This paper focuses on the enhancement of speaker recognition through fusion of likelihood scores generated by arithmetic harmonic sphericity (AHS) and hidden Markov model (HMM) techniques. Due to the contrastive nature of AHS and HMM, we have observed a significant performance improvement of 22% and 6% true acceptance rate at 5% false acceptance rate, when this fusion technique was evaluated on two different datasets - YOHO and USF multimodal biometric dataset, respectively. Performance enhancement has been achieved on both the datasets, however performance on YOHO was comparatively higher than that on USF dataset, owing to the fact that USF dataset is a noisy outdoor dataset whereas YOHO is an indoor dataset.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings 2007 IEEE SoutheastCon

自引率

0.00%

发文量