Environment based threshold for Speaker Identification

Soumen Kanrar
{"title":"Environment based threshold for Speaker Identification","authors":"Soumen Kanrar","doi":"10.1109/EESCO.2015.7253615","DOIUrl":null,"url":null,"abstract":"Speaker Identification process is to identify a particular vocal cord from a set of existing speakers. In the speaker identification processes, the unknown speaker voice sample targets each of the existing speakers in the system and gives a predication. The predication is more than one existing known speaker voice and is very close to the unknown speaker voice. It is a one to many mapping. The mapping function gives a set of predicated values associated with the order pair of speakers. In the order pair, the first coordinate is the unknown speaker and the second coordinates is the existing known speaker from the speaker recognition system. The set of predicated values helps to identify the unknown speaker. The identification process makes a comparison of the unknown speaker model with the models of the existing voice in the system. In this paper, the model is a Gaussian mixture model built by the extraction of the acoustic feature vectors. This paper presents the impact of the decision threshold based on false accepts and false reject for an unknown number of speaker conversion in the speaker identification result. In the simulation, the considered known speaker voices are collected through different channels. In the testing, the GMM voice models of the known speakers are distributed among the numbers of clusters in the test data set.","PeriodicalId":305584,"journal":{"name":"2015 International Conference on Electrical, Electronics, Signals, Communication and Optimization (EESCO)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 International Conference on Electrical, Electronics, Signals, Communication and Optimization (EESCO)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/EESCO.2015.7253615","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Speaker Identification process is to identify a particular vocal cord from a set of existing speakers. In the speaker identification processes, the unknown speaker voice sample targets each of the existing speakers in the system and gives a predication. The predication is more than one existing known speaker voice and is very close to the unknown speaker voice. It is a one to many mapping. The mapping function gives a set of predicated values associated with the order pair of speakers. In the order pair, the first coordinate is the unknown speaker and the second coordinates is the existing known speaker from the speaker recognition system. The set of predicated values helps to identify the unknown speaker. The identification process makes a comparison of the unknown speaker model with the models of the existing voice in the system. In this paper, the model is a Gaussian mixture model built by the extraction of the acoustic feature vectors. This paper presents the impact of the decision threshold based on false accepts and false reject for an unknown number of speaker conversion in the speaker identification result. In the simulation, the considered known speaker voices are collected through different channels. In the testing, the GMM voice models of the known speakers are distributed among the numbers of clusters in the test data set.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于环境的说话人识别阈值
说话人识别过程是从一组现有的说话人中识别一个特定的声带。在说话人识别过程中,未知说话人语音样本针对系统中现有的每个说话人进行预测。预测是一个以上的现有已知的说话人的声音,是非常接近未知的说话人的声音。这是一对多映射。映射函数给出一组与扬声器的顺序对相关联的谓词值。在阶对中,第一个坐标为未知的说话人,第二个坐标为说话人识别系统中已有的已知说话人。谓词值的集合有助于识别未知的说话者。识别过程将未知的说话人模型与系统中已有的语音模型进行比较。本文的模型是通过提取声学特征向量建立的高斯混合模型。本文研究了基于假接受和假拒绝的决策阈值对说话人识别结果中未知数量说话人转换的影响。在仿真中,通过不同的通道收集已知的说话人声音。在测试中,将已知说话人的GMM语音模型分布在测试数据集中的多个聚类中。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
IEEE copyright and consent form IEEE copyright and consent form Neuro-Genetic and CCM based biometrical authentication through side-view of human-face A feasibility analysis of decentralized solar power using RETScreen in Odisha Scanning tool for the detection of images embedded with malicious programs
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1