联合学习中的数据取样：原理、特征和分类

Q1 Social Sciences IEEE Communications Standards Magazine Pub Date : 2023-12-01 DOI:10.1109/MCOMSTD.0004.2200076

Alekha Kumar Mishra, Deepak Puthal

{"title":"联合学习中的数据取样：原理、特征和分类","authors":"Alekha Kumar Mishra, Deepak Puthal","doi":"10.1109/MCOMSTD.0004.2200076","DOIUrl":null,"url":null,"abstract":"Federated learning collects data from various devices, analyzes it locally, aggregates it, and then finds meaningful insights from it. Data sampling works the same way by dividing the larger data set into smaller parts and applying computation to those data sets, which reduces the time taken to do the work. Data sampling in federated learning aims to find the ideal mixture of selecting data sets for training purposes to improve training accuracy while staying within the maximum capability of the device and network. In this article, we present an overview and analysis of recent data sampling techniques for federated learning. The list includes sampling approaches suitable for federated learning environments such as clustering, dynamic sampling, adaptive sampling, probabilistic sampling, and many more. The feature analysis is comprised of a description of the procedure, the criteria, and other relevant parameters for sampling. The efficiency of the sampling technique is analyzed via comparison of claimed accuracy and convergence rate with respect to the used dataset.","PeriodicalId":36719,"journal":{"name":"IEEE Communications Standards Magazine","volume":"799 ","pages":"28-33"},"PeriodicalIF":0.0000,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Data Sampling In Federated Learning: Principles, Features And Taxonomy\",\"authors\":\"Alekha Kumar Mishra, Deepak Puthal\",\"doi\":\"10.1109/MCOMSTD.0004.2200076\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Federated learning collects data from various devices, analyzes it locally, aggregates it, and then finds meaningful insights from it. Data sampling works the same way by dividing the larger data set into smaller parts and applying computation to those data sets, which reduces the time taken to do the work. Data sampling in federated learning aims to find the ideal mixture of selecting data sets for training purposes to improve training accuracy while staying within the maximum capability of the device and network. In this article, we present an overview and analysis of recent data sampling techniques for federated learning. The list includes sampling approaches suitable for federated learning environments such as clustering, dynamic sampling, adaptive sampling, probabilistic sampling, and many more. The feature analysis is comprised of a description of the procedure, the criteria, and other relevant parameters for sampling. The efficiency of the sampling technique is analyzed via comparison of claimed accuracy and convergence rate with respect to the used dataset.\",\"PeriodicalId\":36719,\"journal\":{\"name\":\"IEEE Communications Standards Magazine\",\"volume\":\"799 \",\"pages\":\"28-33\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Communications Standards Magazine\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/MCOMSTD.0004.2200076\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"Social Sciences\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Communications Standards Magazine","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MCOMSTD.0004.2200076","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Social Sciences","Score":null,"Total":0}

引用次数: 0

摘要

联合学习从各种设备中收集数据，在本地对其进行分析、汇总，然后从中发现有意义的见解。数据采样的工作原理与此相同，它将较大的数据集分成较小的部分，并对这些数据集进行计算，从而减少了工作所需的时间。联合学习中的数据采样旨在找到理想的混合数据集选择方法，以提高训练的准确性，同时保持在设备和网络的最大能力范围内。在本文中，我们概述并分析了最近用于联合学习的数据采样技术。其中包括适合联合学习环境的采样方法，如聚类、动态采样、自适应采样、概率采样等。特征分析包括对抽样程序、标准和其他相关参数的描述。通过比较所声称的准确率和收敛率，分析了所使用数据集的采样技术的效率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Data Sampling In Federated Learning: Principles, Features And Taxonomy

Federated learning collects data from various devices, analyzes it locally, aggregates it, and then finds meaningful insights from it. Data sampling works the same way by dividing the larger data set into smaller parts and applying computation to those data sets, which reduces the time taken to do the work. Data sampling in federated learning aims to find the ideal mixture of selecting data sets for training purposes to improve training accuracy while staying within the maximum capability of the device and network. In this article, we present an overview and analysis of recent data sampling techniques for federated learning. The list includes sampling approaches suitable for federated learning environments such as clustering, dynamic sampling, adaptive sampling, probabilistic sampling, and many more. The feature analysis is comprised of a description of the procedure, the criteria, and other relevant parameters for sampling. The efficiency of the sampling technique is analyzed via comparison of claimed accuracy and convergence rate with respect to the used dataset.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊