{"title":"SVFLDetector: a decentralized client detection method for Byzantine problem in vertical federated learning","authors":"Jiuyun Xu, Yinyue Jiang, Hanfei Fan, Qiqi Wang","doi":"10.1007/s00607-024-01262-5","DOIUrl":null,"url":null,"abstract":"<p>In recent years, with the deepening of cross-industry cooperation, vertical federated learning with multiple overlapping samples and fewer overlapping features has attracted extensive attention. Vertical federated learning increases the challenge of detecting Byzantine clients due to feature heterogeneity, in contrast to horizontal federated learning. Existing methods for detecting Byzantine clients can be divided into statistical-based and detection-based types. The detection-based type breaks the limit on the number of Byzantine clients. To our knowledge, current research in vertical federated learning relies on the assumption of a reliable third-party coordinator and is based on statistical type. In this work, we propose a framework based on a detection type called SVFLDetector to detect Byzantine clients in vertical federated learning. The key ideas of SVFLDetector are: (1) we combine decentralized vertical federated learning with split learning, utilizing their respective advantages and eliminating the impact of a third-party server; (2) according to the heterogeneity of features in vertical federated learning, we use a client detection method which is achieved by grouping through feature encoding and performing cross validation within groups to identify Byzantine clients; (3) we propose a penalty function to reduce the impact of Byzantine clients on model aggregation. Numerical experiments show that our method has strong robustness against various Byzantine attacks.</p>","PeriodicalId":10718,"journal":{"name":"Computing","volume":"2 1","pages":""},"PeriodicalIF":3.3000,"publicationDate":"2024-02-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computing","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s00607-024-01262-5","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
In recent years, with the deepening of cross-industry cooperation, vertical federated learning with multiple overlapping samples and fewer overlapping features has attracted extensive attention. Vertical federated learning increases the challenge of detecting Byzantine clients due to feature heterogeneity, in contrast to horizontal federated learning. Existing methods for detecting Byzantine clients can be divided into statistical-based and detection-based types. The detection-based type breaks the limit on the number of Byzantine clients. To our knowledge, current research in vertical federated learning relies on the assumption of a reliable third-party coordinator and is based on statistical type. In this work, we propose a framework based on a detection type called SVFLDetector to detect Byzantine clients in vertical federated learning. The key ideas of SVFLDetector are: (1) we combine decentralized vertical federated learning with split learning, utilizing their respective advantages and eliminating the impact of a third-party server; (2) according to the heterogeneity of features in vertical federated learning, we use a client detection method which is achieved by grouping through feature encoding and performing cross validation within groups to identify Byzantine clients; (3) we propose a penalty function to reduce the impact of Byzantine clients on model aggregation. Numerical experiments show that our method has strong robustness against various Byzantine attacks.
期刊介绍:
Computing publishes original papers, short communications and surveys on all fields of computing. The contributions should be written in English and may be of theoretical or applied nature, the essential criteria are computational relevance and systematic foundation of results.