基于线性倒频系数ResNeWt18的说话人自动验证重播攻击检测

2021 16th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP) Pub Date : 2021-12-21 DOI:10.1109/iSAI-NLP54397.2021.9678164

Anuwat Chaiwongyen, Kanokkarn Pinkeaw, W. Kongprawechnon, Jessada Karnjana, M. Unoki

{"title":"基于线性倒频系数ResNeWt18的说话人自动验证重播攻击检测","authors":"Anuwat Chaiwongyen, Kanokkarn Pinkeaw, W. Kongprawechnon, Jessada Karnjana, M. Unoki","doi":"10.1109/iSAI-NLP54397.2021.9678164","DOIUrl":null,"url":null,"abstract":"This paper proposes, effective method for replay attack detection used in an automatic speaker verification system. The replay attack is of interest because it is the most straightforward and effective attack and is challenging to detect. It is a playback of the recording of the voice of a target speaker. From the literature, no speech features work well with all classifiers, and there is no investigation of using ResNet-based model, called ResNeWt, with linear frequency cepstral coefficient (LFCC). Therefore, a replay attack detection model based on 18-layer ResNeWt that takes LFCCs as the input, was constructed in this paper. The proposes method was tested on a dataset provided by ASVspoof 2019 competition. In terms of the equal error rate (EER), the proposed method is the best in all existing methods, with an EER of 0.29%. The comparison in terms of replay attack detection was also made in detail. The performance of the proposed method in terms of the balanced accuracy, precision, recall, and F1-score was considerably better than existing methods.","PeriodicalId":339826,"journal":{"name":"2021 16th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Replay Attack Detection in Automatic Speaker Verification Based on ResNeWt18 with Linear Frequency Cepstral Coefficients\",\"authors\":\"Anuwat Chaiwongyen, Kanokkarn Pinkeaw, W. Kongprawechnon, Jessada Karnjana, M. Unoki\",\"doi\":\"10.1109/iSAI-NLP54397.2021.9678164\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper proposes, effective method for replay attack detection used in an automatic speaker verification system. The replay attack is of interest because it is the most straightforward and effective attack and is challenging to detect. It is a playback of the recording of the voice of a target speaker. From the literature, no speech features work well with all classifiers, and there is no investigation of using ResNet-based model, called ResNeWt, with linear frequency cepstral coefficient (LFCC). Therefore, a replay attack detection model based on 18-layer ResNeWt that takes LFCCs as the input, was constructed in this paper. The proposes method was tested on a dataset provided by ASVspoof 2019 competition. In terms of the equal error rate (EER), the proposed method is the best in all existing methods, with an EER of 0.29%. The comparison in terms of replay attack detection was also made in detail. The performance of the proposed method in terms of the balanced accuracy, precision, recall, and F1-score was considerably better than existing methods.\",\"PeriodicalId\":339826,\"journal\":{\"name\":\"2021 16th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)\",\"volume\":\"3 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-12-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 16th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/iSAI-NLP54397.2021.9678164\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 16th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/iSAI-NLP54397.2021.9678164","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

本文提出了一种有效的重放攻击检测方法，用于语音自动验证系统。重放攻击之所以令人感兴趣，是因为它是最直接、最有效的攻击，而且很难检测到。它是对目标说话者的声音录音的回放。从文献来看，没有语音特征可以很好地与所有分类器一起工作，并且没有研究使用基于resnet的模型，称为ResNeWt，具有线性频率倒谱系数(LFCC)。因此，本文构建了一个以lfc为输入的基于18层ResNeWt的重放攻击检测模型。该方法在ASVspoof 2019竞赛提供的数据集上进行了测试。在等错误率(EER)方面，该方法是所有现有方法中最好的，EER为0.29%。在重放攻击检测方面也进行了详细的比较。该方法在正确率、精密度、查全率和f1得分的平衡方面均明显优于现有方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Replay Attack Detection in Automatic Speaker Verification Based on ResNeWt18 with Linear Frequency Cepstral Coefficients

This paper proposes, effective method for replay attack detection used in an automatic speaker verification system. The replay attack is of interest because it is the most straightforward and effective attack and is challenging to detect. It is a playback of the recording of the voice of a target speaker. From the literature, no speech features work well with all classifiers, and there is no investigation of using ResNet-based model, called ResNeWt, with linear frequency cepstral coefficient (LFCC). Therefore, a replay attack detection model based on 18-layer ResNeWt that takes LFCCs as the input, was constructed in this paper. The proposes method was tested on a dataset provided by ASVspoof 2019 competition. In terms of the equal error rate (EER), the proposed method is the best in all existing methods, with an EER of 0.29%. The comparison in terms of replay attack detection was also made in detail. The performance of the proposed method in terms of the balanced accuracy, precision, recall, and F1-score was considerably better than existing methods.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2021 16th International Joint Symposium on Artificial Intelligence and Natural Language Processing (iSAI-NLP)

自引率

0.00%

发文量