{"title":"A novel staged training strategy leveraging knowledge distillation and model fusion for heterogeneous federated learning","authors":"Debao Wang, Shaopeng Guan, Ruikang Sun","doi":"10.1016/j.jnca.2025.104104","DOIUrl":null,"url":null,"abstract":"<div><div>Client-side data heterogeneity poses a significant challenge in Federated Learning (FL), limiting the effectiveness of global models. To address this, we propose a staged training approach combining Knowledge Distillation and model fusion. First, a regularized KD technique trains a robust teacher model on the server, transferring knowledge to student models to enhance convergence and reduce overfitting. Then, an adaptive parameter assignment mechanism intelligently combines the local and global models, enabling clients to integrate global knowledge with local features for improved accuracy. Experimental results on multiple image classification datasets demonstrate that our approach outperforms existing algorithms in both convergence speed and accuracy, particularly in highly heterogeneous scenarios. It effectively balances the global model’s generalization and local personalization, providing a robust solution for FL.</div></div>","PeriodicalId":54784,"journal":{"name":"Journal of Network and Computer Applications","volume":"236 ","pages":"Article 104104"},"PeriodicalIF":7.7000,"publicationDate":"2025-01-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Network and Computer Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1084804525000013","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
引用次数: 0
Abstract
Client-side data heterogeneity poses a significant challenge in Federated Learning (FL), limiting the effectiveness of global models. To address this, we propose a staged training approach combining Knowledge Distillation and model fusion. First, a regularized KD technique trains a robust teacher model on the server, transferring knowledge to student models to enhance convergence and reduce overfitting. Then, an adaptive parameter assignment mechanism intelligently combines the local and global models, enabling clients to integrate global knowledge with local features for improved accuracy. Experimental results on multiple image classification datasets demonstrate that our approach outperforms existing algorithms in both convergence speed and accuracy, particularly in highly heterogeneous scenarios. It effectively balances the global model’s generalization and local personalization, providing a robust solution for FL.
期刊介绍:
The Journal of Network and Computer Applications welcomes research contributions, surveys, and notes in all areas relating to computer networks and applications thereof. Sample topics include new design techniques, interesting or novel applications, components or standards; computer networks with tools such as WWW; emerging standards for internet protocols; Wireless networks; Mobile Computing; emerging computing models such as cloud computing, grid computing; applications of networked systems for remote collaboration and telemedicine, etc. The journal is abstracted and indexed in Scopus, Engineering Index, Web of Science, Science Citation Index Expanded and INSPEC.