自适应:基于异构边缘设备的非iid数据的自适应联邦学习,以减轻偏差并提高训练效率

IF 14.7 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Information Fusion Pub Date : 2025-01-10 DOI:10.1016/j.inffus.2025.102932
Neha Singh, Mainak Adhikari
{"title":"自适应:基于异构边缘设备的非iid数据的自适应联邦学习,以减轻偏差并提高训练效率","authors":"Neha Singh, Mainak Adhikari","doi":"10.1016/j.inffus.2025.102932","DOIUrl":null,"url":null,"abstract":"Federated learning (FL) offers a decentralized and collaborative training solution on resource-constraint Edge Devices (EDs) to improve a global model without sharing raw data. Standard Synchronous FL (SFL) approaches provide significant advantages in terms of data privacy and reduced communication overhead, however, face several challenges including Non-independent and identically distributed (Non-IID) data, the presence of unlabeled data, biased aggregation due to device heterogeneity and effective EDs selection to handle the straggler. To tackle these challenges, we propose a new Self-adaptive Federated Learning (SelfFed) strategy using a masked loss function to handle unlabeled data. This allows EDs to concentrate on labeled data, enhancing training efficiency. Additionally, we integrate a novel quality-dependent aggregation solution to mitigate bias during model updates through aggregation. This solution accurately reflects performance across Non-IID data distributions by incentivizing local EDs using a new Stackelberg game model. The model provides rewards based on their contributions to the global model, thereby keeping the EDs motivated to participate and perform well. Finally, we incorporate a deep reinforcement learning technique into the proposed SelfFed strategy for dynamic ED selection to handle straggler EDs. This technique adapts to changes in device performance and resources over iterations, fostering collaboration and sustained engagement. The performance of the SelfFed strategy is evaluated using a real-time SFL scenario (irrigation control in paddy fields) and three benchmark datasets using a serverless private cloud environment. Comparative results against state-of-the-art approaches reveal that the SelfFed significantly reduces CPU usage by 5%–6% and enhances training efficiency by 4%–8% while achieving 4%–6% higher accuracy. Further, in the real-time scenario, the SelfFed improves CPU usage by 3%–5% and enhances training efficiency by 8%–10% with 5%–7% higher accuracy.","PeriodicalId":50367,"journal":{"name":"Information Fusion","volume":"28 1","pages":""},"PeriodicalIF":14.7000,"publicationDate":"2025-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"SelfFed: Self-adaptive Federated Learning with Non-IID data on Heterogeneous Edge Devices for Bias Mitigation and Enhance Training Efficiency\",\"authors\":\"Neha Singh, Mainak Adhikari\",\"doi\":\"10.1016/j.inffus.2025.102932\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Federated learning (FL) offers a decentralized and collaborative training solution on resource-constraint Edge Devices (EDs) to improve a global model without sharing raw data. Standard Synchronous FL (SFL) approaches provide significant advantages in terms of data privacy and reduced communication overhead, however, face several challenges including Non-independent and identically distributed (Non-IID) data, the presence of unlabeled data, biased aggregation due to device heterogeneity and effective EDs selection to handle the straggler. To tackle these challenges, we propose a new Self-adaptive Federated Learning (SelfFed) strategy using a masked loss function to handle unlabeled data. This allows EDs to concentrate on labeled data, enhancing training efficiency. Additionally, we integrate a novel quality-dependent aggregation solution to mitigate bias during model updates through aggregation. This solution accurately reflects performance across Non-IID data distributions by incentivizing local EDs using a new Stackelberg game model. The model provides rewards based on their contributions to the global model, thereby keeping the EDs motivated to participate and perform well. Finally, we incorporate a deep reinforcement learning technique into the proposed SelfFed strategy for dynamic ED selection to handle straggler EDs. This technique adapts to changes in device performance and resources over iterations, fostering collaboration and sustained engagement. The performance of the SelfFed strategy is evaluated using a real-time SFL scenario (irrigation control in paddy fields) and three benchmark datasets using a serverless private cloud environment. Comparative results against state-of-the-art approaches reveal that the SelfFed significantly reduces CPU usage by 5%–6% and enhances training efficiency by 4%–8% while achieving 4%–6% higher accuracy. Further, in the real-time scenario, the SelfFed improves CPU usage by 3%–5% and enhances training efficiency by 8%–10% with 5%–7% higher accuracy.\",\"PeriodicalId\":50367,\"journal\":{\"name\":\"Information Fusion\",\"volume\":\"28 1\",\"pages\":\"\"},\"PeriodicalIF\":14.7000,\"publicationDate\":\"2025-01-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Information Fusion\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.1016/j.inffus.2025.102932\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Fusion","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1016/j.inffus.2025.102932","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

摘要

联邦学习(FL)在资源受限的边缘设备(ed)上提供了一种分散和协作的训练解决方案,在不共享原始数据的情况下改进全局模型。标准同步FL (SFL)方法在数据隐私和减少通信开销方面提供了显着优势,然而,面临着一些挑战,包括非独立和同分布(Non-IID)数据、未标记数据的存在、由于设备异构而导致的偏差聚合以及有效的ed选择来处理离散者。为了应对这些挑战,我们提出了一种新的自适应联邦学习(Self-adaptive Federated Learning, self - ffed)策略,使用屏蔽损失函数来处理未标记的数据。这使得EDs能够专注于标记数据,提高培训效率。此外,我们集成了一种新的依赖于质量的聚合解决方案,通过聚合来减轻模型更新过程中的偏差。该解决方案通过使用新的Stackelberg游戏模型激励本地ed,准确地反映了非iid数据分布的性能。该模型根据他们对全球模型的贡献提供奖励,从而保持主编参与和表现良好的动力。最后,我们将深度强化学习技术整合到所提出的用于动态ED选择的SelfFed策略中,以处理掉队ED。这种技术适应设备性能和资源在迭代过程中的变化,促进协作和持续参与。使用实时SFL场景(水田灌溉控制)和使用无服务器私有云环境的三个基准数据集来评估SelfFed策略的性能。与最先进方法的对比结果表明,SelfFed显著降低了5%-6%的CPU使用率,提高了4%-8%的训练效率,同时实现了4%-6%的准确率提高。此外,在实时场景中,SelfFed将CPU使用率提高了3%-5%,将训练效率提高了8%-10%,准确率提高了5%-7%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
SelfFed: Self-adaptive Federated Learning with Non-IID data on Heterogeneous Edge Devices for Bias Mitigation and Enhance Training Efficiency
Federated learning (FL) offers a decentralized and collaborative training solution on resource-constraint Edge Devices (EDs) to improve a global model without sharing raw data. Standard Synchronous FL (SFL) approaches provide significant advantages in terms of data privacy and reduced communication overhead, however, face several challenges including Non-independent and identically distributed (Non-IID) data, the presence of unlabeled data, biased aggregation due to device heterogeneity and effective EDs selection to handle the straggler. To tackle these challenges, we propose a new Self-adaptive Federated Learning (SelfFed) strategy using a masked loss function to handle unlabeled data. This allows EDs to concentrate on labeled data, enhancing training efficiency. Additionally, we integrate a novel quality-dependent aggregation solution to mitigate bias during model updates through aggregation. This solution accurately reflects performance across Non-IID data distributions by incentivizing local EDs using a new Stackelberg game model. The model provides rewards based on their contributions to the global model, thereby keeping the EDs motivated to participate and perform well. Finally, we incorporate a deep reinforcement learning technique into the proposed SelfFed strategy for dynamic ED selection to handle straggler EDs. This technique adapts to changes in device performance and resources over iterations, fostering collaboration and sustained engagement. The performance of the SelfFed strategy is evaluated using a real-time SFL scenario (irrigation control in paddy fields) and three benchmark datasets using a serverless private cloud environment. Comparative results against state-of-the-art approaches reveal that the SelfFed significantly reduces CPU usage by 5%–6% and enhances training efficiency by 4%–8% while achieving 4%–6% higher accuracy. Further, in the real-time scenario, the SelfFed improves CPU usage by 3%–5% and enhances training efficiency by 8%–10% with 5%–7% higher accuracy.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Information Fusion
Information Fusion 工程技术-计算机:理论方法
CiteScore
33.20
自引率
4.30%
发文量
161
审稿时长
7.9 months
期刊介绍: Information Fusion serves as a central platform for showcasing advancements in multi-sensor, multi-source, multi-process information fusion, fostering collaboration among diverse disciplines driving its progress. It is the leading outlet for sharing research and development in this field, focusing on architectures, algorithms, and applications. Papers dealing with fundamental theoretical analyses as well as those demonstrating their application to real-world problems will be welcome.
期刊最新文献
Optimizing the environmental design and management of public green spaces: Analyzing urban infrastructure and long-term user experience with a focus on streetlight density in the city of Las Vegas, NV DF-BSFNet: A bilateral synergistic fusion network with novel dynamic flow convolution for robust road extraction KDFuse: A high-level vision task-driven infrared and visible image fusion method based on cross-domain knowledge distillation SelfFed: Self-adaptive Federated Learning with Non-IID data on Heterogeneous Edge Devices for Bias Mitigation and Enhance Training Efficiency DEMO: A Dynamics-Enhanced Learning Model for multi-horizon trajectory prediction in autonomous vehicles
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1