{"title":"Defence algorithm against adversarial example based on local perturbation DAT-LP","authors":"Jun Tang, Yuchen Huang, Zhi Mou, Shiyu Wang, Yuanyuan Zhang, Bing Guo","doi":"10.1080/10589759.2023.2249581","DOIUrl":null,"url":null,"abstract":"ABSTRACTWith further research into neural networks, their scope of application is becoming increasingly extensive. Among these, more neural network models are used in text classification tasks and have achieved excellent results. However, the crucial issue of derived adversarial examples has dramatically affected the stability and robustness of the neural network model. This issue confines the further expansion of the neural network application, especially in some security-sensitive tasks. Concerning the text classification task, our proposed DAT-LP (Defence with Adversarial Training Based on Local Perturbation) algorithm is designed to address the adversarial example issue, which uses local perturbation to enhance model performance based on adversarial training. Furthermore, SW-CStart (Cold-start Algorithm Based on Sliding Window) algorithm is designed to realise adversarial training in the model’s initialisation stage. The DAT-LP algorithm is evaluated by comparing with three baselines, including baseline models (BiLSTM, TextCNN), Dropout(regularisation method), and ADT (Adversarial Training method), respectively. As it turns out, DAT-LP’s performance is superior and demonstrates the best generalisation ability.KEYWORDS: DAT-LPdefence algorithmmachine learning securityrobustnessadversarial attack Disclosure statementNo potential conflict of interest was reported by the author(s).Additional informationFundingThis work was supported in part by the National Natural Science Foundation of China under Grant No. 62072319; the Sichuan Science and Technology Program under Grant No. 2023YFQ0022, 2022YFG0041, 2022YFG0155 and 2022YFG0157; the Luzhou Science and Technology Innovation R&D Program under Grant No. 2022CDLZ-6.","PeriodicalId":49746,"journal":{"name":"Nondestructive Testing and Evaluation","volume":"213 1","pages":"0"},"PeriodicalIF":3.0000,"publicationDate":"2023-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nondestructive Testing and Evaluation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/10589759.2023.2249581","RegionNum":3,"RegionCategory":"材料科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATERIALS SCIENCE, CHARACTERIZATION & TESTING","Score":null,"Total":0}
引用次数: 0
Abstract
ABSTRACTWith further research into neural networks, their scope of application is becoming increasingly extensive. Among these, more neural network models are used in text classification tasks and have achieved excellent results. However, the crucial issue of derived adversarial examples has dramatically affected the stability and robustness of the neural network model. This issue confines the further expansion of the neural network application, especially in some security-sensitive tasks. Concerning the text classification task, our proposed DAT-LP (Defence with Adversarial Training Based on Local Perturbation) algorithm is designed to address the adversarial example issue, which uses local perturbation to enhance model performance based on adversarial training. Furthermore, SW-CStart (Cold-start Algorithm Based on Sliding Window) algorithm is designed to realise adversarial training in the model’s initialisation stage. The DAT-LP algorithm is evaluated by comparing with three baselines, including baseline models (BiLSTM, TextCNN), Dropout(regularisation method), and ADT (Adversarial Training method), respectively. As it turns out, DAT-LP’s performance is superior and demonstrates the best generalisation ability.KEYWORDS: DAT-LPdefence algorithmmachine learning securityrobustnessadversarial attack Disclosure statementNo potential conflict of interest was reported by the author(s).Additional informationFundingThis work was supported in part by the National Natural Science Foundation of China under Grant No. 62072319; the Sichuan Science and Technology Program under Grant No. 2023YFQ0022, 2022YFG0041, 2022YFG0155 and 2022YFG0157; the Luzhou Science and Technology Innovation R&D Program under Grant No. 2022CDLZ-6.
摘要随着神经网络研究的深入,其应用范围越来越广泛。其中,更多的神经网络模型被用于文本分类任务,并取得了优异的效果。然而,衍生的对抗示例的关键问题极大地影响了神经网络模型的稳定性和鲁棒性。这个问题限制了神经网络应用的进一步扩展,特别是在一些安全敏感的任务中。在文本分类任务方面,我们提出的DAT-LP(基于局部扰动的对抗训练防御)算法旨在解决对抗示例问题,该算法利用局部扰动来增强基于对抗训练的模型性能。设计了SW-CStart (Cold-start Algorithm Based on Sliding Window)算法,在模型初始化阶段实现对抗性训练。通过对比基线模型(BiLSTM、TextCNN)、Dropout(正则化方法)和ADT(对抗训练方法)三个基线来评估DAT-LP算法。结果表明,DAT-LP的性能更优,具有最好的泛化能力。关键词:dat - lpd防御算法机器学习安全性鲁棒性对抗性攻击披露声明作者未报告潜在的利益冲突。本研究得到国家自然科学基金项目资助(No. 62072319);四川省科技计划项目(2023YFQ0022、2022YFG0041、2022YFG0155、2022YFG0157);泸州市科技创新发展计划(2022CDLZ-6)
期刊介绍:
Nondestructive Testing and Evaluation publishes the results of research and development in the underlying theory, novel techniques and applications of nondestructive testing and evaluation in the form of letters, original papers and review articles.
Articles concerning both the investigation of physical processes and the development of mechanical processes and techniques are welcomed. Studies of conventional techniques, including radiography, ultrasound, eddy currents, magnetic properties and magnetic particle inspection, thermal imaging and dye penetrant, will be considered in addition to more advanced approaches using, for example, lasers, squid magnetometers, interferometers, synchrotron and neutron beams and Compton scattering.
Work on the development of conventional and novel transducers is particularly welcomed. In addition, articles are invited on general aspects of nondestructive testing and evaluation in education, training, validation and links with engineering.