On the vulnerability of speaker verification to realistic voice spoofing

2015 IEEE 7th International Conference on Biometrics Theory, Applications and Systems (BTAS). Publication date: 2015-12-17. DOI: 10.1109/BTAS.2015.7358783

Serife Seda Kucur Ergunay, E. Khoury, Alexandros Lazaridis, S. Marcel

Citations: 115

Abstract

Automatic speaker verification (ASV) systems are subject to various kinds of malicious attacks. Replay, voice conversion and speech synthesis attacks drastically degrade the performance of a standard ASV system by increasing its false acceptance rate. This issue has attracted a high level of interest in the speech research community, which has investigated possible voice spoofing attacks and their countermeasures. However, much less effort has been devoted to creating realistic and diverse spoofing attack databases that help researchers correctly evaluate their countermeasures. Existing studies are incomplete in terms of the types of attacks they cover, and are often difficult to reproduce because public databases are unavailable. In this paper we introduce the voice spoofing dataset of AVspoof, a public audio-visual spoofing database. AVspoof includes ten realistic spoofing threats generated using replay, speech synthesis and voice conversion. In addition, we provide a set of experimental results that show the effect of such attacks on current state-of-the-art ASV systems.
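To make the false acceptance rate mentioned in the abstract concrete, the following minimal Python sketch computes it at a fixed decision threshold for two sets of non-target trials: ordinary (zero-effort) impostors and spoofed trials. All scores, the threshold, and the function and variable names are hypothetical and chosen only for illustration; they are not taken from the paper, from AVspoof, or from any particular ASV system.

```python
def false_acceptance_rate(non_target_scores, threshold):
    """Fraction of non-target trials whose score is at or above the threshold."""
    if not non_target_scores:
        raise ValueError("need at least one non-target score")
    accepted = sum(1 for s in non_target_scores if s >= threshold)
    return accepted / len(non_target_scores)


def false_rejection_rate(genuine_scores, threshold):
    """Fraction of genuine trials whose score falls below the threshold."""
    if not genuine_scores:
        raise ValueError("need at least one genuine score")
    rejected = sum(1 for s in genuine_scores if s < threshold)
    return rejected / len(genuine_scores)


# Made-up verification scores (higher means "more likely the claimed speaker").
genuine = [2.1, 1.8, 2.5, 1.2, 2.9]        # true speaker
zero_effort = [-1.5, -0.7, 0.3, -2.0, 1.3]  # other speakers, no spoofing
spoofed = [1.9, 2.2, 0.8, 2.6, 1.4]        # replay / synthesis / conversion
threshold = 1.0                             # hypothetical operating point

frr = false_rejection_rate(genuine, threshold)
far_zero_effort = false_acceptance_rate(zero_effort, threshold)
far_spoofed = false_acceptance_rate(spoofed, threshold)

print(f"FRR (genuine):           {frr:.2f}")
print(f"FAR (zero-effort):       {far_zero_effort:.2f}")
print(f"FAR (spoofed trials):    {far_spoofed:.2f}")
```

With these toy numbers the threshold rejects most zero-effort impostors but accepts most spoofed trials, which is the kind of degradation the abstract describes. Real evaluations would use scores from an actual ASV system and a threshold fixed on a development set.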
