An edge-assisted federated contrastive learning method with local intrinsic dimensionality in noisy label environment

Software: Practice and Experience Pub Date : 2023-11-30 DOI:10.1002/spe.3295

Siyuan Wu, Guoming Zhang, Fei Dai, Bowen Liu, Wanchun Dou

{"title":"An edge-assisted federated contrastive learning method with local intrinsic dimensionality in noisy label environment","authors":"Siyuan Wu, Guoming Zhang, Fei Dai, Bowen Liu, Wanchun Dou","doi":"10.1002/spe.3295","DOIUrl":null,"url":null,"abstract":"The advent of federated learning (FL) has presented a viable solution for distributed training in edge environment, while simultaneously ensuring the preservation of privacy. In real-world scenarios, edge devices may be subject to label noise caused by environmental differences, automated weakly supervised annotation, malicious tampering, or even human error. However, the potential of the noisy samples have not been fully leveraged by prior studies on FL aimed at addressing label noise. Rather, they have primarily focused on conventional filtering or correction techniques to alleviate the impact of noisy labels. To tackle this challenge, a method, named <b>DETECTION</b>, is proposed in this article. It aims at effectively detecting noisy clients and mitigating the adverse impact of label noise while preserving data privacy. Specially, a confidence scoring mechanism based on local intrinsic dimensionality (LID) is investigated for distinguishing noisy clients from clean clients. Then, a loss function based on prototype contrastive learning is designed to optimize the local model. To address the varying levels of noise across clients, a LID weighted aggregation strategy (LA) is introduced. Experimental results on three datasets demonstrate the effectiveness of DETECTION in addressing the issue of label noise in FL while maintaining data privacy.","PeriodicalId":21899,"journal":{"name":"Software: Practice and Experience","volume":"34 5","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Software: Practice and Experience","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1002/spe.3295","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

The advent of federated learning (FL) has presented a viable solution for distributed training in edge environment, while simultaneously ensuring the preservation of privacy. In real-world scenarios, edge devices may be subject to label noise caused by environmental differences, automated weakly supervised annotation, malicious tampering, or even human error. However, the potential of the noisy samples have not been fully leveraged by prior studies on FL aimed at addressing label noise. Rather, they have primarily focused on conventional filtering or correction techniques to alleviate the impact of noisy labels. To tackle this challenge, a method, named DETECTION, is proposed in this article. It aims at effectively detecting noisy clients and mitigating the adverse impact of label noise while preserving data privacy. Specially, a confidence scoring mechanism based on local intrinsic dimensionality (LID) is investigated for distinguishing noisy clients from clean clients. Then, a loss function based on prototype contrastive learning is designed to optimize the local model. To address the varying levels of noise across clients, a LID weighted aggregation strategy (LA) is introduced. Experimental results on three datasets demonstrate the effectiveness of DETECTION in addressing the issue of label noise in FL while maintaining data privacy.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

噪声标签环境下具有局部固有维数的边缘辅助联邦对比学习方法

联邦学习(FL)的出现为边缘环境下的分布式训练提供了一种可行的解决方案，同时保证了隐私的保护。在现实场景中，边缘设备可能会受到由环境差异、自动弱监督注释、恶意篡改甚至人为错误引起的标签噪声的影响。然而，之前针对标签噪声的FL研究并未充分利用噪声样本的潜力。相反，他们主要关注于传统的滤波或校正技术，以减轻噪声标签的影响。为了解决这个问题，本文提出了一种名为DETECTION的方法。它旨在有效地检测噪声客户端，减轻标签噪声的不利影响，同时保护数据隐私。特别地，研究了一种基于局部固有维数(LID)的置信度评分机制，用于区分噪声客户端和干净客户端。然后，设计了基于原型对比学习的损失函数对局部模型进行优化。为了解决客户机之间不同程度的噪声，引入了LID加权聚合策略(LA)。在三个数据集上的实验结果证明了DETECTION在解决FL中标签噪声问题的同时保持数据隐私的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Software: Practice and Experience

自引率

0.00%

发文量