DR-SASV: A deep and reliable spoof aware speech verification system

IF 1 Q4 COMPUTER SCIENCE, INFORMATION SYSTEMS International Journal on Information Technologies and Security Pub Date : 2023-12-01 DOI:10.59035/ffmb8272
Amay Gada, Neel Kothari, R. Karani, Chetashri Badane, Dhruv Gada, Tanish Patwa
{"title":"DR-SASV: A deep and reliable spoof aware speech verification system","authors":"Amay Gada, Neel Kothari, R. Karani, Chetashri Badane, Dhruv Gada, Tanish Patwa","doi":"10.59035/ffmb8272","DOIUrl":null,"url":null,"abstract":"A spoof-aware speaker verification system is an integrated system that is capable of jointly identifying impostor speakers as well as spoofing attacks from target speakers. This type of system largely helps in protecting sensitive data, mitigating fraud, and reducing theft. Research has recently enhanced the effectiveness of countermeasure systems and automatic speaker verification systems separately to produce low Equal Error Rates (EER) for each system. However, work exploring a combination of both is still scarce. This paper proposes an end-to-end solution to address spoof-aware automatic speaker verification (ASV) by introducing a Deep Reliable Spoof-Aware-Speaker-Verification (DR-SASV) system. The proposed system allows the target audio to pass through a “spoof aware” speaker verification model sequentially after applying a convolutional neural network (CNN)-based spoof detection model. The suggested system produces encouraging results after being trained on the ASVSpoof 2019 LA dataset. The spoof detection model gives a validation accuracy of 96%, while the transformer-based speech verification model authenticates users with an error rate of 13.74%. The system surpasses other state-of-the-art models and produces an EER score of 10.32%.","PeriodicalId":42317,"journal":{"name":"International Journal on Information Technologies and Security","volume":"83 9","pages":""},"PeriodicalIF":1.0000,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal on Information Technologies and Security","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.59035/ffmb8272","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

A spoof-aware speaker verification system is an integrated system that is capable of jointly identifying impostor speakers as well as spoofing attacks from target speakers. This type of system largely helps in protecting sensitive data, mitigating fraud, and reducing theft. Research has recently enhanced the effectiveness of countermeasure systems and automatic speaker verification systems separately to produce low Equal Error Rates (EER) for each system. However, work exploring a combination of both is still scarce. This paper proposes an end-to-end solution to address spoof-aware automatic speaker verification (ASV) by introducing a Deep Reliable Spoof-Aware-Speaker-Verification (DR-SASV) system. The proposed system allows the target audio to pass through a “spoof aware” speaker verification model sequentially after applying a convolutional neural network (CNN)-based spoof detection model. The suggested system produces encouraging results after being trained on the ASVSpoof 2019 LA dataset. The spoof detection model gives a validation accuracy of 96%, while the transformer-based speech verification model authenticates users with an error rate of 13.74%. The system surpasses other state-of-the-art models and produces an EER score of 10.32%.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
DR-SASV:深度可靠的防欺骗语音验证系统
欺骗感知说话人验证系统是既能识别冒名顶替说话人,又能抵御目标说话人的欺骗攻击的集成系统。这种类型的系统在很大程度上有助于保护敏感数据、减轻欺诈和减少盗窃。最近的研究分别提高了对抗系统和自动说话人验证系统的有效性,以产生较低的等错误率(EER)。然而,探索两者结合的工作仍然很少。本文提出了一种端到端的解决方案,通过引入深度可靠欺骗感知-说话人验证(DR-SASV)系统来解决欺骗感知自动说话人验证(ASV)问题。该系统采用基于卷积神经网络(CNN)的欺骗检测模型,使目标音频依次通过“欺骗感知”说话人验证模型。在ASVSpoof 2019洛杉矶数据集上进行训练后,建议的系统产生了令人鼓舞的结果。欺骗检测模型的验证准确率为96%,而基于变压器的语音验证模型对用户的验证错误率为13.74%。该系统超越了其他最先进的模型,并产生了10.32%的EER得分。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
66.70%
发文量
0
期刊最新文献
System for automated design of resonant inverters Hybrid algorithms with alternative embedded local search schemes for the p-median problem Ethno-demographic dimensions of national power GIS-based model for the assessment of the smart sustainable development of municipalities of Northeast Bulgaria A new light ensemble deep-learning framework to detect fire
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1