利用聚类联合半监督学习在 HWN 中实现高效数据标记和最佳设备调度

IF 8.3 2区 计算机科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC IEEE Transactions on Communications Pub Date : 2024-12-18 DOI:10.1109/TCOMM.2024.3519538
Moqbel Hamood;Abdullatif Albaseer;Mohamed Abdallah;Ala Al-Fuqaha
{"title":"利用聚类联合半监督学习在 HWN 中实现高效数据标记和最佳设备调度","authors":"Moqbel Hamood;Abdullatif Albaseer;Mohamed Abdallah;Ala Al-Fuqaha","doi":"10.1109/TCOMM.2024.3519538","DOIUrl":null,"url":null,"abstract":"Clustered Federated Multi-task Learning (CFL) has emerged as a promising technique to address statistical challenges, particularly with non-independent and identically distributed (non-IID) data across users. However, existing CFL studies entirely rely on the impractical assumption that devices possess access to accurate ground-truth labels. This assumption becomes specifically problematic in hierarchical wireless networks (HWNs), with vast unlabeled data and dual-level model aggregation, not only leading to slowing down convergence speeds and extending processing times but also resulting in increased resource consumption. To this end, we propose Clustered Federated Semi-Supervised Learning (CFSL), a novel framework tailored for more realistic scenarios in HWNs. We leverage specialized models resulting from device clustering and present two prediction model schemes, the best-performing specialized model and the weighted-averaging ensemble model, to correctly label unlabeled, unseen data. For the best-performing specialized model scheme, a specialized model excelling in label prediction for a specific device is assigned to correctly label the unlabeled data, even when the data originates from other environments, while the weighted-averaging ensemble model combines all specialized models into a unified model, capturing more details from broader data distributions across edge networks. The CFSL also introduces two novel prediction time schemes, split-based and stopping-based, for accurately timing the labeling process, alongside two strategic device selection schemes, greedy and round-robin, upon reaching each cluster’s stopping point. Extensive testing validates CFSL’s superiority over existing models in labeling and testing accuracies and resource efficiency, achieving up to 51% energy savings.","PeriodicalId":13041,"journal":{"name":"IEEE Transactions on Communications","volume":"73 7","pages":"4941-4957"},"PeriodicalIF":8.3000,"publicationDate":"2024-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Efficient Data Labeling and Optimal Device Scheduling in HWNs Using Clustered Federated Semi-Supervised Learning\",\"authors\":\"Moqbel Hamood;Abdullatif Albaseer;Mohamed Abdallah;Ala Al-Fuqaha\",\"doi\":\"10.1109/TCOMM.2024.3519538\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Clustered Federated Multi-task Learning (CFL) has emerged as a promising technique to address statistical challenges, particularly with non-independent and identically distributed (non-IID) data across users. However, existing CFL studies entirely rely on the impractical assumption that devices possess access to accurate ground-truth labels. This assumption becomes specifically problematic in hierarchical wireless networks (HWNs), with vast unlabeled data and dual-level model aggregation, not only leading to slowing down convergence speeds and extending processing times but also resulting in increased resource consumption. To this end, we propose Clustered Federated Semi-Supervised Learning (CFSL), a novel framework tailored for more realistic scenarios in HWNs. We leverage specialized models resulting from device clustering and present two prediction model schemes, the best-performing specialized model and the weighted-averaging ensemble model, to correctly label unlabeled, unseen data. For the best-performing specialized model scheme, a specialized model excelling in label prediction for a specific device is assigned to correctly label the unlabeled data, even when the data originates from other environments, while the weighted-averaging ensemble model combines all specialized models into a unified model, capturing more details from broader data distributions across edge networks. The CFSL also introduces two novel prediction time schemes, split-based and stopping-based, for accurately timing the labeling process, alongside two strategic device selection schemes, greedy and round-robin, upon reaching each cluster’s stopping point. Extensive testing validates CFSL’s superiority over existing models in labeling and testing accuracies and resource efficiency, achieving up to 51% energy savings.\",\"PeriodicalId\":13041,\"journal\":{\"name\":\"IEEE Transactions on Communications\",\"volume\":\"73 7\",\"pages\":\"4941-4957\"},\"PeriodicalIF\":8.3000,\"publicationDate\":\"2024-12-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Communications\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10806860/\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Communications","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10806860/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

摘要

聚类联邦多任务学习(CFL)已经成为解决统计挑战的一种很有前途的技术,特别是对于跨用户的非独立和同分布(non-IID)数据。然而,现有的CFL研究完全依赖于不切实际的假设,即设备能够获得准确的接地真值标签。这种假设在分层无线网络(hwn)中尤其成问题,因为它具有大量未标记的数据和双层模型聚合,不仅会导致收敛速度减慢和处理时间延长,还会导致资源消耗增加。为此,我们提出了聚类联邦半监督学习(CFSL),这是一个为HWNs中更现实的场景量身定制的新框架。我们利用由设备聚类产生的专门模型,并提出了两种预测模型方案,即性能最佳的专门模型和加权平均集成模型,以正确标记未标记的未见过的数据。对于性能最佳的专门模型方案,分配一个擅长特定设备标签预测的专门模型来正确标记未标记的数据,即使数据来自其他环境,而加权平均集成模型将所有专门模型结合到一个统一的模型中,从跨边缘网络的更广泛的数据分布中捕获更多细节。CFSL还引入了两种新的预测时间方案,基于分裂和基于停止,用于准确地计时标记过程,以及两种策略设备选择方案,贪婪和循环,在到达每个集群的停止点时。广泛的测试验证了CFSL在标签和测试准确性和资源效率方面优于现有模型,实现高达51%的节能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Efficient Data Labeling and Optimal Device Scheduling in HWNs Using Clustered Federated Semi-Supervised Learning
Clustered Federated Multi-task Learning (CFL) has emerged as a promising technique to address statistical challenges, particularly with non-independent and identically distributed (non-IID) data across users. However, existing CFL studies entirely rely on the impractical assumption that devices possess access to accurate ground-truth labels. This assumption becomes specifically problematic in hierarchical wireless networks (HWNs), with vast unlabeled data and dual-level model aggregation, not only leading to slowing down convergence speeds and extending processing times but also resulting in increased resource consumption. To this end, we propose Clustered Federated Semi-Supervised Learning (CFSL), a novel framework tailored for more realistic scenarios in HWNs. We leverage specialized models resulting from device clustering and present two prediction model schemes, the best-performing specialized model and the weighted-averaging ensemble model, to correctly label unlabeled, unseen data. For the best-performing specialized model scheme, a specialized model excelling in label prediction for a specific device is assigned to correctly label the unlabeled data, even when the data originates from other environments, while the weighted-averaging ensemble model combines all specialized models into a unified model, capturing more details from broader data distributions across edge networks. The CFSL also introduces two novel prediction time schemes, split-based and stopping-based, for accurately timing the labeling process, alongside two strategic device selection schemes, greedy and round-robin, upon reaching each cluster’s stopping point. Extensive testing validates CFSL’s superiority over existing models in labeling and testing accuracies and resource efficiency, achieving up to 51% energy savings.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
IEEE Transactions on Communications
IEEE Transactions on Communications 工程技术-电信学
CiteScore
16.10
自引率
8.40%
发文量
528
审稿时长
4.1 months
期刊介绍: The IEEE Transactions on Communications is dedicated to publishing high-quality manuscripts that showcase advancements in the state-of-the-art of telecommunications. Our scope encompasses all aspects of telecommunications, including telephone, telegraphy, facsimile, and television, facilitated by electromagnetic propagation methods such as radio, wire, aerial, underground, coaxial, and submarine cables, as well as waveguides, communication satellites, and lasers. We cover telecommunications in various settings, including marine, aeronautical, space, and fixed station services, addressing topics such as repeaters, radio relaying, signal storage, regeneration, error detection and correction, multiplexing, carrier techniques, communication switching systems, data communications, and communication theory. Join us in advancing the field of telecommunications through groundbreaking research and innovation.
期刊最新文献
Adaptive UAV Positioning to Enhance SNR in Air-to-Water Optical Wireless Channels CRB-Constrained Rate Optimization for Movable Antenna-Enabled IRS-Aided ISAC Systems Enhancing Near-field BAN-based Vital-Sign Monitoring via Integrated Sensing, Communication, and Powering Network-Level Performance Analysis for Hybrid sub-6 GHz and mmWave Integrated Sensing and Communications OIRS-assisted VLC Channel Optimization Against UAV Blockage Based on Two-Stage Machine Learning Framework
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1