Optimized Variable Size Windowing Based Speaker Verification

Sujiya Sreedharan, C. Eswaran
{"title":"Optimized Variable Size Windowing Based Speaker Verification","authors":"Sujiya Sreedharan, C. Eswaran","doi":"10.1145/3277453.3277461","DOIUrl":null,"url":null,"abstract":"In recent years the variances of speech features of speaker verification system were measured by computing covariance matrix parameterized through its eigenvalues and vectors by keeping fixed sliding window size. The computed eigenvectors were weighted with its corresponding magnitude and normalized. Then, the features were extracted and fused using different fusion techniques for recognizing the speaker. However, this approach was not suitable for all types of datasets and some significant feature information may be lost during extraction based on fixed window size. Hence in this article, the variable size sliding window is applied for Speaker Verification system. Initially, the speech signal is considered as input and the FMPM features are extracted using FDLP, MHEC and PNCC including MFCC based on the variable size of a sliding window. Here, the sliding window size is optimized by Modified Grey Wolf Optimization (MGWO) algorithm which is also used for selecting the classifier parameters and most optimal features adaptively. The most optimal features are selected from the extracted FMPM and classified by using GMM classification. Thus, the proposed approach allows continuous adaptation of SV using variable window size and classifier parameters. Finally, the considerable improvements in Speaker Verification are observed through experimental results.","PeriodicalId":186835,"journal":{"name":"Proceedings of the 2018 International Conference on Electronics and Electrical Engineering Technology","volume":"110 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-09-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2018 International Conference on Electronics and Electrical Engineering Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3277453.3277461","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

In recent years the variances of speech features of speaker verification system were measured by computing covariance matrix parameterized through its eigenvalues and vectors by keeping fixed sliding window size. The computed eigenvectors were weighted with its corresponding magnitude and normalized. Then, the features were extracted and fused using different fusion techniques for recognizing the speaker. However, this approach was not suitable for all types of datasets and some significant feature information may be lost during extraction based on fixed window size. Hence in this article, the variable size sliding window is applied for Speaker Verification system. Initially, the speech signal is considered as input and the FMPM features are extracted using FDLP, MHEC and PNCC including MFCC based on the variable size of a sliding window. Here, the sliding window size is optimized by Modified Grey Wolf Optimization (MGWO) algorithm which is also used for selecting the classifier parameters and most optimal features adaptively. The most optimal features are selected from the extracted FMPM and classified by using GMM classification. Thus, the proposed approach allows continuous adaptation of SV using variable window size and classifier parameters. Finally, the considerable improvements in Speaker Verification are observed through experimental results.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
优化的基于可变大小窗口的说话人验证
近年来,说话人验证系统的语音特征方差测量是通过计算协方差矩阵来实现的,协方差矩阵通过特征值和向量参数化,并保持固定的滑动窗口大小。将计算得到的特征向量与其相应的幅度进行加权并归一化。然后,利用不同的融合技术提取特征并进行融合,实现说话人识别。然而,这种方法并不适用于所有类型的数据集,并且在基于固定窗口大小的提取过程中可能会丢失一些重要的特征信息。因此,本文将变大小滑动窗口应用于说话人验证系统。首先,将语音信号作为输入,使用FDLP、MHEC和PNCC(包括基于可变大小的滑动窗口MFCC)提取FMPM特征。其中,滑动窗口大小采用改进灰狼优化算法(MGWO)进行优化,该算法还用于自适应选择分类器参数和最优特征。从提取的FMPM中选择最优的特征,使用GMM分类进行分类。因此,所提出的方法允许使用可变窗口大小和分类器参数连续适应SV。实验结果表明,该方法在说话人验证方面有较大的改进。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Power Quality Comprehensive Evaluation of DC Distribution Network Based on Maximizing Deviation and Fuzzy Matter-Element Model Resource Allocation for System Throughput Maximization Based on Mobile Edge Computing Design of Sparse Cosine-Modulated Filter Banks Using BP Neural Network Inrush Current Suppression of High Voltage Shunt Capacitors Based on Precharge Statistical Shape Model Generation Using K-means Clustering
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1