利用多UBMs提高自动估计扬声器年龄的精度

A. Bastanfard, Dariush Amirkhani, M. Hasani
{"title":"利用多UBMs提高自动估计扬声器年龄的精度","authors":"A. Bastanfard, Dariush Amirkhani, M. Hasani","doi":"10.1109/KBEI.2019.8735005","DOIUrl":null,"url":null,"abstract":"In the recent years, many studies have been conducted and many programs and methods have been presented in the field of age estimation. However, the accuracy of the proposed methods remains a challenge in this field. One of the proposed methods for age estimation, which has a higher accuracy than other available methods, is the use of i-vector for automatic age estimation of the sound signal. This method uses one UBM for Gaussian Mixture model. In this paper, by increasing the number of UBMs, the Gaussian Mixture model is optimized and the accuracy of age estimation is improved. Using multiple UBMs with different Gaussian components, for each speaker multiple i-vectors corresponding to each UBM are extracted. Given that, the age of the person is estimated several times and the average of all the estimations is taken as the age of the individual. Also, the results of many experiments in the age estimation, show that PLP features can increase the accuracy of the individuals estimated age. So the second suggestion is to use this features in age estimation. Finally, to enhance the distinction of voice features, the mapping of features was introduced to a new environment; the mapping relationship to this new environment was obtained by training a deep belief network. The proposed algorithm was tested on the NIST 2004, NIST 2005 and NIST 2008 databases. In contrast to single UBM method, the results for the proposed method show a Pearson correlation of 0.8 and a mean absolute error of 5.14, suggesting a %6.67 and %17 improvements in Pearson correlation and mean absolute error, respectively.","PeriodicalId":339990,"journal":{"name":"2019 5th Conference on Knowledge Based Engineering and Innovation (KBEI)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Increasing the Accuracy of Automatic Speaker Age Estimation by Using Multiple UBMs\",\"authors\":\"A. Bastanfard, Dariush Amirkhani, M. Hasani\",\"doi\":\"10.1109/KBEI.2019.8735005\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In the recent years, many studies have been conducted and many programs and methods have been presented in the field of age estimation. However, the accuracy of the proposed methods remains a challenge in this field. One of the proposed methods for age estimation, which has a higher accuracy than other available methods, is the use of i-vector for automatic age estimation of the sound signal. This method uses one UBM for Gaussian Mixture model. In this paper, by increasing the number of UBMs, the Gaussian Mixture model is optimized and the accuracy of age estimation is improved. Using multiple UBMs with different Gaussian components, for each speaker multiple i-vectors corresponding to each UBM are extracted. Given that, the age of the person is estimated several times and the average of all the estimations is taken as the age of the individual. Also, the results of many experiments in the age estimation, show that PLP features can increase the accuracy of the individuals estimated age. So the second suggestion is to use this features in age estimation. Finally, to enhance the distinction of voice features, the mapping of features was introduced to a new environment; the mapping relationship to this new environment was obtained by training a deep belief network. The proposed algorithm was tested on the NIST 2004, NIST 2005 and NIST 2008 databases. In contrast to single UBM method, the results for the proposed method show a Pearson correlation of 0.8 and a mean absolute error of 5.14, suggesting a %6.67 and %17 improvements in Pearson correlation and mean absolute error, respectively.\",\"PeriodicalId\":339990,\"journal\":{\"name\":\"2019 5th Conference on Knowledge Based Engineering and Innovation (KBEI)\",\"volume\":\"37 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 5th Conference on Knowledge Based Engineering and Innovation (KBEI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/KBEI.2019.8735005\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 5th Conference on Knowledge Based Engineering and Innovation (KBEI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/KBEI.2019.8735005","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6

摘要

近年来,在年龄估计领域进行了许多研究,提出了许多方案和方法。然而,所提出的方法的准确性仍然是该领域的一个挑战。本文提出的年龄估计方法之一是使用i向量对声音信号进行自动年龄估计,该方法比其他可用方法具有更高的精度。该方法对高斯混合模型使用一个UBM。本文通过增加UBMs的数量,对高斯混合模型进行了优化,提高了年龄估计的精度。使用多个具有不同高斯分量的UBM,对每个说话人提取每个UBM对应的多个i向量。鉴于此,对人的年龄进行多次估计,并将所有估计的平均值作为个体的年龄。此外,许多年龄估计的实验结果表明,PLP特征可以提高个体估计年龄的准确性。第二个建议是在年龄估计中使用这个特征。最后,为了增强语音特征的区分能力,将特征映射引入到新的环境中;通过训练一个深度信念网络,得到与新环境的映射关系。该算法在NIST 2004、NIST 2005和NIST 2008数据库上进行了测试。与单一UBM方法相比,该方法的Pearson相关系数为0.8,平均绝对误差为5.14,Pearson相关系数和平均绝对误差分别提高了%6.67和%17。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Increasing the Accuracy of Automatic Speaker Age Estimation by Using Multiple UBMs
In the recent years, many studies have been conducted and many programs and methods have been presented in the field of age estimation. However, the accuracy of the proposed methods remains a challenge in this field. One of the proposed methods for age estimation, which has a higher accuracy than other available methods, is the use of i-vector for automatic age estimation of the sound signal. This method uses one UBM for Gaussian Mixture model. In this paper, by increasing the number of UBMs, the Gaussian Mixture model is optimized and the accuracy of age estimation is improved. Using multiple UBMs with different Gaussian components, for each speaker multiple i-vectors corresponding to each UBM are extracted. Given that, the age of the person is estimated several times and the average of all the estimations is taken as the age of the individual. Also, the results of many experiments in the age estimation, show that PLP features can increase the accuracy of the individuals estimated age. So the second suggestion is to use this features in age estimation. Finally, to enhance the distinction of voice features, the mapping of features was introduced to a new environment; the mapping relationship to this new environment was obtained by training a deep belief network. The proposed algorithm was tested on the NIST 2004, NIST 2005 and NIST 2008 databases. In contrast to single UBM method, the results for the proposed method show a Pearson correlation of 0.8 and a mean absolute error of 5.14, suggesting a %6.67 and %17 improvements in Pearson correlation and mean absolute error, respectively.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Profitability Prediction for ATM Transactions Using Artificial Neural Networks: A Data-Driven Analysis Fabrication of UV detector by Schottky Pd/ZnO/Si Contacts Hybrid of genetic algorithm and krill herd for software clustering problem Development of a Hybrid Bayesian Network Model for Hydraulic Simulation of Agricultural Water Distribution and Delivery Using SIFT Descriptors for Face Recognition Based on Neural Network and Kepenekci Approach
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1