ar向量使用CMS鲁棒文本独立说话人验证

2002 14th International Conference on Digital Signal Processing Proceedings. DSP 2002 (Cat. No.02TH8628) Pub Date : 2002-11-07 DOI:10.1109/ICDSP.2002.1028276

C. B. D. Lima, Dirceu G. da Silva, A. Alcaim, J. A. Apolinário

{"title":"ar向量使用CMS鲁棒文本独立说话人验证","authors":"C. B. D. Lima, Dirceu G. da Silva, A. Alcaim, J. A. Apolinário","doi":"10.1109/ICDSP.2002.1028276","DOIUrl":null,"url":null,"abstract":"This paper presents the performance of the AR-vector with cepstral mean subtraction (CMS) used to compensate the distortions caused by distinct telephone channels. The speaker recognition performance obtained with the use of CMS is compared with a system without compensation. With 60 s of speech signal used for training and 30 s used for testing, the error rate without channel normalization is around 2.82% against the 1.65% achieved with CMS. For 10 s testing time, the error rate dropped from 5.40% to 3.80% when using CMS. For the lowest testing time (3 s), the error rate of the AR-vector is close to 19% regardless of whether or not the normalization technique is used. Although there is a clear improvement in performance when using CMS, it is not of major significance. This leads to the conclusion that the AR-vector classification system is somewhat robust to channel distortion, especially as the testing time decreases.","PeriodicalId":351073,"journal":{"name":"2002 14th International Conference on Digital Signal Processing Proceedings. DSP 2002 (Cat. No.02TH8628)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2002-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"AR-vector using CMS for robust text independent speaker verification\",\"authors\":\"C. B. D. Lima, Dirceu G. da Silva, A. Alcaim, J. A. Apolinário\",\"doi\":\"10.1109/ICDSP.2002.1028276\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper presents the performance of the AR-vector with cepstral mean subtraction (CMS) used to compensate the distortions caused by distinct telephone channels. The speaker recognition performance obtained with the use of CMS is compared with a system without compensation. With 60 s of speech signal used for training and 30 s used for testing, the error rate without channel normalization is around 2.82% against the 1.65% achieved with CMS. For 10 s testing time, the error rate dropped from 5.40% to 3.80% when using CMS. For the lowest testing time (3 s), the error rate of the AR-vector is close to 19% regardless of whether or not the normalization technique is used. Although there is a clear improvement in performance when using CMS, it is not of major significance. This leads to the conclusion that the AR-vector classification system is somewhat robust to channel distortion, especially as the testing time decreases.\",\"PeriodicalId\":351073,\"journal\":{\"name\":\"2002 14th International Conference on Digital Signal Processing Proceedings. DSP 2002 (Cat. No.02TH8628)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2002-11-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2002 14th International Conference on Digital Signal Processing Proceedings. DSP 2002 (Cat. No.02TH8628)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICDSP.2002.1028276\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2002 14th International Conference on Digital Signal Processing Proceedings. DSP 2002 (Cat. No.02TH8628)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDSP.2002.1028276","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

本文介绍了用倒谱平均减法(CMS)补偿不同电话信道造成的失真的ar向量的性能。将使用CMS的说话人识别性能与无补偿的系统进行了比较。60秒的语音信号用于训练，30秒用于测试，没有信道归一化的错误率约为2.82%，而CMS的错误率为1.65%。在10 s的测试时间内，使用CMS时的错误率从5.40%下降到3.80%。在最低测试时间(3 s)下，无论是否使用归一化技术，ar向量的错误率都接近19%。虽然在使用CMS时性能有明显的提高，但并不具有重大意义。由此得出结论，ar向量分类系统对信道失真具有一定的鲁棒性，特别是随着测试时间的减少。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

AR-vector using CMS for robust text independent speaker verification

This paper presents the performance of the AR-vector with cepstral mean subtraction (CMS) used to compensate the distortions caused by distinct telephone channels. The speaker recognition performance obtained with the use of CMS is compared with a system without compensation. With 60 s of speech signal used for training and 30 s used for testing, the error rate without channel normalization is around 2.82% against the 1.65% achieved with CMS. For 10 s testing time, the error rate dropped from 5.40% to 3.80% when using CMS. For the lowest testing time (3 s), the error rate of the AR-vector is close to 19% regardless of whether or not the normalization technique is used. Although there is a clear improvement in performance when using CMS, it is not of major significance. This leads to the conclusion that the AR-vector classification system is somewhat robust to channel distortion, especially as the testing time decreases.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2002 14th International Conference on Digital Signal Processing Proceedings. DSP 2002 (Cat. No.02TH8628)

自引率

0.00%

发文量