{"title":"Environment based threshold for Speaker Identification","authors":"Soumen Kanrar","doi":"10.1109/EESCO.2015.7253615","DOIUrl":null,"url":null,"abstract":"Speaker Identification process is to identify a particular vocal cord from a set of existing speakers. In the speaker identification processes, the unknown speaker voice sample targets each of the existing speakers in the system and gives a predication. The predication is more than one existing known speaker voice and is very close to the unknown speaker voice. It is a one to many mapping. The mapping function gives a set of predicated values associated with the order pair of speakers. In the order pair, the first coordinate is the unknown speaker and the second coordinates is the existing known speaker from the speaker recognition system. The set of predicated values helps to identify the unknown speaker. The identification process makes a comparison of the unknown speaker model with the models of the existing voice in the system. In this paper, the model is a Gaussian mixture model built by the extraction of the acoustic feature vectors. This paper presents the impact of the decision threshold based on false accepts and false reject for an unknown number of speaker conversion in the speaker identification result. In the simulation, the considered known speaker voices are collected through different channels. In the testing, the GMM voice models of the known speakers are distributed among the numbers of clusters in the test data set.","PeriodicalId":305584,"journal":{"name":"2015 International Conference on Electrical, Electronics, Signals, Communication and Optimization (EESCO)","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 International Conference on Electrical, Electronics, Signals, Communication and Optimization (EESCO)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/EESCO.2015.7253615","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Speaker Identification process is to identify a particular vocal cord from a set of existing speakers. In the speaker identification processes, the unknown speaker voice sample targets each of the existing speakers in the system and gives a predication. The predication is more than one existing known speaker voice and is very close to the unknown speaker voice. It is a one to many mapping. The mapping function gives a set of predicated values associated with the order pair of speakers. In the order pair, the first coordinate is the unknown speaker and the second coordinates is the existing known speaker from the speaker recognition system. The set of predicated values helps to identify the unknown speaker. The identification process makes a comparison of the unknown speaker model with the models of the existing voice in the system. In this paper, the model is a Gaussian mixture model built by the extraction of the acoustic feature vectors. This paper presents the impact of the decision threshold based on false accepts and false reject for an unknown number of speaker conversion in the speaker identification result. In the simulation, the considered known speaker voices are collected through different channels. In the testing, the GMM voice models of the known speakers are distributed among the numbers of clusters in the test data set.