Ran Yan;Ruiying Du;Kun He;Jing Chen;Qiao Li;Cong Wu
{"title":"Universal and Efficient Adversarial Training Framework With Membership Inference Resistance","authors":"Ran Yan;Ruiying Du;Kun He;Jing Chen;Qiao Li;Cong Wu","doi":"10.1109/JIOT.2025.3551762","DOIUrl":null,"url":null,"abstract":"Adversarial training is an effective approach to enhance the robustness of machine learning models via adding adversarial examples into the training phase. However, existing adversarial training methods increase the advantage of membership inference attacks, which aim to determine from the model whether an example is in the training dataset. In this article, we propose an adversarial training framework that guarantees both robustness and membership privacy by introducing a tailor-made example called reverse-symmetry example. Moreover, our framework reduces the number of required adversarial examples compared with existing adversarial training methods. We implement our framework using four adversarial training methods on the FMNIST and CIFAR10 datasets and compare its performance with deep learning differential privacy. Our experimental findings demonstrate that our framework mitigates model overfitting and outperforms the original adversarial training with respect to the overall performance of accuracy, robustness, privacy, and runtime.","PeriodicalId":54347,"journal":{"name":"IEEE Internet of Things Journal","volume":"12 12","pages":"18665-18677"},"PeriodicalIF":8.9000,"publicationDate":"2025-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Internet of Things Journal","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10929713/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Adversarial training is an effective approach to enhance the robustness of machine learning models via adding adversarial examples into the training phase. However, existing adversarial training methods increase the advantage of membership inference attacks, which aim to determine from the model whether an example is in the training dataset. In this article, we propose an adversarial training framework that guarantees both robustness and membership privacy by introducing a tailor-made example called reverse-symmetry example. Moreover, our framework reduces the number of required adversarial examples compared with existing adversarial training methods. We implement our framework using four adversarial training methods on the FMNIST and CIFAR10 datasets and compare its performance with deep learning differential privacy. Our experimental findings demonstrate that our framework mitigates model overfitting and outperforms the original adversarial training with respect to the overall performance of accuracy, robustness, privacy, and runtime.
期刊介绍:
The EEE Internet of Things (IoT) Journal publishes articles and review articles covering various aspects of IoT, including IoT system architecture, IoT enabling technologies, IoT communication and networking protocols such as network coding, and IoT services and applications. Topics encompass IoT's impacts on sensor technologies, big data management, and future internet design for applications like smart cities and smart homes. Fields of interest include IoT architecture such as things-centric, data-centric, service-oriented IoT architecture; IoT enabling technologies and systematic integration such as sensor technologies, big sensor data management, and future Internet design for IoT; IoT services, applications, and test-beds such as IoT service middleware, IoT application programming interface (API), IoT application design, and IoT trials/experiments; IoT standardization activities and technology development in different standard development organizations (SDO) such as IEEE, IETF, ITU, 3GPP, ETSI, etc.