鲁棒说话人识别应用的异常值去除和融合技术

I. Ali, G. Saha
{"title":"鲁棒说话人识别应用的异常值去除和融合技术","authors":"I. Ali, G. Saha","doi":"10.1109/NCC.2012.6176856","DOIUrl":null,"url":null,"abstract":"Outliers in real time speaker recognition can be viewed as a disturbing element and one of the reason of the degradation of the recognition accuracy. In speaker space, outliers may consider as non-intrinsic speaker's information in clean environment or noise information in noisy environment. So detection of outliers purify the speaker space with most speaker specific feature vectors in both clean and noisy environment. There are several methodology to detect outliers but in this paper we use a distance based method to mitigate the effects of outliers and incorporate fusion techniques to improve the recognition accuracy of speaker recognition system. Distances are taken from Minkowski family up to third order and also Mahalanobis distance which is a probabilistic distance. In fusion methodology we use GMM as a single classifier with complementary feature sets, MFCC and IMFCC. In this paper, we fuse the score of MFCC and IMFCC with a equal weight method. This method not only improves the recognition accuracy but simultaneously improve the detection rate of outliers with respect to the base line feature set.","PeriodicalId":178278,"journal":{"name":"2012 National Conference on Communications (NCC)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Outlier removal and fusion techniques for robust speaker recognition applications\",\"authors\":\"I. Ali, G. Saha\",\"doi\":\"10.1109/NCC.2012.6176856\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Outliers in real time speaker recognition can be viewed as a disturbing element and one of the reason of the degradation of the recognition accuracy. In speaker space, outliers may consider as non-intrinsic speaker's information in clean environment or noise information in noisy environment. So detection of outliers purify the speaker space with most speaker specific feature vectors in both clean and noisy environment. There are several methodology to detect outliers but in this paper we use a distance based method to mitigate the effects of outliers and incorporate fusion techniques to improve the recognition accuracy of speaker recognition system. Distances are taken from Minkowski family up to third order and also Mahalanobis distance which is a probabilistic distance. In fusion methodology we use GMM as a single classifier with complementary feature sets, MFCC and IMFCC. In this paper, we fuse the score of MFCC and IMFCC with a equal weight method. This method not only improves the recognition accuracy but simultaneously improve the detection rate of outliers with respect to the base line feature set.\",\"PeriodicalId\":178278,\"journal\":{\"name\":\"2012 National Conference on Communications (NCC)\",\"volume\":\"2 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-04-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 National Conference on Communications (NCC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/NCC.2012.6176856\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 National Conference on Communications (NCC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NCC.2012.6176856","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

在实时说话人识别中,异常值是一个干扰因素,是导致识别精度下降的原因之一。在说话人空间中,在清洁环境中,异常值可以看作是非本然说话人的信息,在嘈杂环境中,异常值可以看作是噪声信息。因此,无论在清洁环境还是噪声环境下,异常点检测都能利用大多数特定于说话人的特征向量来净化说话人空间。有几种方法可以检测异常值,但本文采用基于距离的方法来减轻异常值的影响,并结合融合技术来提高说话人识别系统的识别精度。距离取自闵可夫斯基族到三阶的距离,也取自马氏距离,这是一种概率距离。在融合方法中,我们使用GMM作为具有互补特征集,MFCC和IMFCC的单个分类器。本文采用等权方法对MFCC和IMFCC的分数进行融合。该方法不仅提高了识别精度,而且相对于基线特征集提高了异常点的检出率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Outlier removal and fusion techniques for robust speaker recognition applications
Outliers in real time speaker recognition can be viewed as a disturbing element and one of the reason of the degradation of the recognition accuracy. In speaker space, outliers may consider as non-intrinsic speaker's information in clean environment or noise information in noisy environment. So detection of outliers purify the speaker space with most speaker specific feature vectors in both clean and noisy environment. There are several methodology to detect outliers but in this paper we use a distance based method to mitigate the effects of outliers and incorporate fusion techniques to improve the recognition accuracy of speaker recognition system. Distances are taken from Minkowski family up to third order and also Mahalanobis distance which is a probabilistic distance. In fusion methodology we use GMM as a single classifier with complementary feature sets, MFCC and IMFCC. In this paper, we fuse the score of MFCC and IMFCC with a equal weight method. This method not only improves the recognition accuracy but simultaneously improve the detection rate of outliers with respect to the base line feature set.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Quantized modulation diversity for 64-QAM IITKGP-MLILSC speech database for language identification Strip lined - Truncated ground plane for flat response of miniaturized UWB patch antenna On the underwater wireless network clustering Faster BIC segmentation using local speaker modeling
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1