An Efficient Speaker Identification Approach for Biometric Access Control System

Khushboo Jha, Arun Jain, S. Srivastava
{"title":"An Efficient Speaker Identification Approach for Biometric Access Control System","authors":"Khushboo Jha, Arun Jain, S. Srivastava","doi":"10.1109/RAIT57693.2023.10127101","DOIUrl":null,"url":null,"abstract":"This work proposes an efficient cepstral-frequency domain based acoustic feature as a speaker identification solution for reliable biometric access control system. The Convolutional Neural Network (CNN) trained for this purpose uses the amalgamation of cepstral-frequency domain based acoustic features such as Power Normalized Cepstral Coefficients (PNCC) and Formant as PNCC-F. The PNCC-F with CNN classifier demonstrates an increase in identification efficacy. The speaker identification accuracy in clean, as well as noisy environment, has been used to evaluate the effectiveness of PNCC alone and in tandem with the formant feature. This work has been executed in a Python 3.8.8 environment using the standard database with 43 speakers called VidTIMIT. The efficiency of the PNCC-F feature was further evaluated in a real-time noisy environment by mixing babble, factory, and machine gun noises from NOISEX-92 database to speech samples with 0 to 20 dB of distortion. The proposed PNCC-F feature surpassed the conventional PNCC feature in a clean environment by 2.34%, and outperformed at all SNR levels for all different noises.","PeriodicalId":281845,"journal":{"name":"2023 5th International Conference on Recent Advances in Information Technology (RAIT)","volume":"23 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 5th International Conference on Recent Advances in Information Technology (RAIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RAIT57693.2023.10127101","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

This work proposes an efficient cepstral-frequency domain based acoustic feature as a speaker identification solution for reliable biometric access control system. The Convolutional Neural Network (CNN) trained for this purpose uses the amalgamation of cepstral-frequency domain based acoustic features such as Power Normalized Cepstral Coefficients (PNCC) and Formant as PNCC-F. The PNCC-F with CNN classifier demonstrates an increase in identification efficacy. The speaker identification accuracy in clean, as well as noisy environment, has been used to evaluate the effectiveness of PNCC alone and in tandem with the formant feature. This work has been executed in a Python 3.8.8 environment using the standard database with 43 speakers called VidTIMIT. The efficiency of the PNCC-F feature was further evaluated in a real-time noisy environment by mixing babble, factory, and machine gun noises from NOISEX-92 database to speech samples with 0 to 20 dB of distortion. The proposed PNCC-F feature surpassed the conventional PNCC feature in a clean environment by 2.34%, and outperformed at all SNR levels for all different noises.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
生物识别门禁系统中一种有效的说话人识别方法
本文提出了一种有效的基于倒谱频域的声学特征作为可靠的生物识别门禁系统的说话人识别解决方案。为此目的训练的卷积神经网络(CNN)使用基于倒频谱频域的声学特征的合并,如功率归一化倒频谱系数(PNCC)和形成峰作为PNCC- f。采用CNN分类器的pnc - f在识别效率上有所提高。在清洁和噪声环境下的说话人识别精度,被用来评估PNCC单独和串联形成峰特征的有效性。这项工作是在Python 3.8.8环境中执行的,使用了43个名为VidTIMIT的扬声器的标准数据库。通过将NOISEX-92数据库中的牙牙声、工厂噪声和机枪噪声混合到失真为0 ~ 20 dB的语音样本中,进一步评估了pnc - f特征在实时噪声环境中的效率。提出的PNCC- f特征在清洁环境下比传统的PNCC特征高出2.34%,并且在所有不同噪声的所有信噪比水平下都优于传统的PNCC特征。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Realistic Benchmark Datasets for Team Formation Problem in Social Networks Microarray Data Analysis for Diagnosis of Cancer Diseases by Machine Learning algorithm UAVs-assisted Multi-Hop D2D Communication using Hybrid PTS for disaster management Secrecy Outage Analysis of Energy Harvesting Enabled Two User Cooperative NOMA Outage Analysis of a D2D Network for MIMO-NOMA-based Downlink Transmission
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1