Replay attack detection based on deformable convolutional neural network and temporal-frequency attention model

IF 2 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Journal of Intelligent Systems Pub Date : 2023-01-01 DOI:10.1515/jisys-2022-0265

Dang-en Xie, Hai Hu, Qiang Xu

引用次数: 0

Abstract

Abstract As an important identity authentication method, speaker verification (SV) has been widely used in many domains, e.g., mobile financials. At the same time, the existing SV systems are insecure under replay spoofing attacks. Toward a more secure and stable SV system, this article proposes a replay attack detection system based on deformable convolutional neural networks (DCNNs) and a time–frequency double-channel attention model. In DCNN, the positions of elements in the convolutional kernel are not fixed. Instead, they are modified by some trainable variable to help the model extract more useful local information from input spectrograms. Meanwhile, a time–frequency domino double-channel attention model is adopted to extract more effective distinctive features to collect valuable information for distinguishing genuine and replay speeches. Experimental results on ASVspoof 2019 dataset show that the proposed model can detect replay attacks accurately.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于可变形卷积神经网络和时频注意模型的重放攻击检测

作为一种重要的身份认证方法，说话人验证(SV)在移动金融等领域得到了广泛应用。同时，现有的SV系统在重放欺骗攻击下是不安全的。为了使SV系统更加安全稳定，本文提出了一种基于可变形卷积神经网络(DCNNs)和时频双通道注意力模型的重放攻击检测系统。在DCNN中，卷积核中元素的位置是不固定的。相反，它们被一些可训练的变量修改，以帮助模型从输入谱图中提取更多有用的局部信息。同时，采用时频骨牌双通道注意模型提取更有效的显著特征，为区分真实演讲和重播演讲收集有价值的信息。在ASVspoof 2019数据集上的实验结果表明，该模型能够准确检测重放攻击。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Journal of Intelligent Systems COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE-

CiteScore

5.90

自引率

3.30%

发文量

审稿时长

51 weeks

期刊介绍： The Journal of Intelligent Systems aims to provide research and review papers, as well as Brief Communications at an interdisciplinary level, with the field of intelligent systems providing the focal point. This field includes areas like artificial intelligence, models and computational theories of human cognition, perception and motivation; brain models, artificial neural nets and neural computing. It covers contributions from the social, human and computer sciences to the analysis and application of information technology.