{"title":"POLARIS:利用客户端选择加速异步联合学习","authors":"Yufei Kang;Baochun Li","doi":"10.1109/TCC.2024.3370688","DOIUrl":null,"url":null,"abstract":"Federated learning has garnered significant research attention as a privacy-preserving learning paradigm. Asynchronous federated learning has been proposed to improve scalability by accommodating slower clients, commonly referred to as stragglers. However, asynchronous federated learning suffers from slow convergence with respect to wall-clock time, due to the existence of data heterogeneity and staleness. Existing strategies struggled to tackle both difficulties for a wide range of deep learning models. To address the problem, we propose \n<italic>Polaris</i>\n, a theoretically sound design and a new take to client selection for asynchronous federated learning. With \n<italic>Polaris</i>\n, we first theoretically investigated the design space of client sampling strategies from a geometric optimization perspective, taking both data heterogeneity and staleness into account. Our design is not only theoretically proven, but also thoroughly tested in our reproducible experimental open-source testbed. Our experimental results demonstrates overwhelming evidence that \n<italic>Polaris</i>\n outperformed existing state-of-the-art client selection strategies by a substantial margin over a wide variety of tasks and datasets, as we train image classification models using \n<monospace>CIFAR-10</monospace>\n, \n<monospace>CIFAR-100</monospace>\n, \n<monospace>CINIC-10</monospace>\n, \n<monospace>Federated EMNIST</monospace>\n, and a language modeling model using the \n<monospace>Tiny Shakespeare</monospace>\n dataset. 
Further, our extensive array of ablation studies have also shown that \n<italic>Polaris</i>\n is both scalable and robust as the size of datasets scale up and data heterogeneity vary.","PeriodicalId":13202,"journal":{"name":"IEEE Transactions on Cloud Computing","volume":"12 2","pages":"446-458"},"PeriodicalIF":5.3000,"publicationDate":"2024-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Polaris: Accelerating Asynchronous Federated Learning With Client Selection\",\"authors\":\"Yufei Kang;Baochun Li\",\"doi\":\"10.1109/TCC.2024.3370688\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Federated learning has garnered significant research attention as a privacy-preserving learning paradigm. Asynchronous federated learning has been proposed to improve scalability by accommodating slower clients, commonly referred to as stragglers. However, asynchronous federated learning suffers from slow convergence with respect to wall-clock time, due to the existence of data heterogeneity and staleness. Existing strategies struggled to tackle both difficulties for a wide range of deep learning models. To address the problem, we propose \\n<italic>Polaris</i>\\n, a theoretically sound design and a new take to client selection for asynchronous federated learning. With \\n<italic>Polaris</i>\\n, we first theoretically investigated the design space of client sampling strategies from a geometric optimization perspective, taking both data heterogeneity and staleness into account. Our design is not only theoretically proven, but also thoroughly tested in our reproducible experimental open-source testbed. 
Our experimental results demonstrates overwhelming evidence that \\n<italic>Polaris</i>\\n outperformed existing state-of-the-art client selection strategies by a substantial margin over a wide variety of tasks and datasets, as we train image classification models using \\n<monospace>CIFAR-10</monospace>\\n, \\n<monospace>CIFAR-100</monospace>\\n, \\n<monospace>CINIC-10</monospace>\\n, \\n<monospace>Federated EMNIST</monospace>\\n, and a language modeling model using the \\n<monospace>Tiny Shakespeare</monospace>\\n dataset. Further, our extensive array of ablation studies have also shown that \\n<italic>Polaris</i>\\n is both scalable and robust as the size of datasets scale up and data heterogeneity vary.\",\"PeriodicalId\":13202,\"journal\":{\"name\":\"IEEE Transactions on Cloud Computing\",\"volume\":\"12 2\",\"pages\":\"446-458\"},\"PeriodicalIF\":5.3000,\"publicationDate\":\"2024-02-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Cloud Computing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10452821/\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Cloud Computing","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10452821/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
Polaris: Accelerating Asynchronous Federated Learning With Client Selection
Federated learning has garnered significant research attention as a privacy-preserving learning paradigm. Asynchronous federated learning has been proposed to improve scalability by accommodating slower clients, commonly referred to as stragglers. However, asynchronous federated learning suffers from slow convergence with respect to wall-clock time, due to data heterogeneity and staleness. Existing strategies struggle to tackle both difficulties across a wide range of deep learning models. To address this problem, we propose Polaris, a theoretically sound design and a new take on client selection for asynchronous federated learning. With Polaris, we first investigate the design space of client sampling strategies from a geometric optimization perspective, taking both data heterogeneity and staleness into account. Our design is not only theoretically proven, but also thoroughly tested in our reproducible open-source experimental testbed. Our experimental results provide overwhelming evidence that Polaris outperforms existing state-of-the-art client selection strategies by a substantial margin across a wide variety of tasks and datasets, as we train image classification models on CIFAR-10, CIFAR-100, CINIC-10, and Federated EMNIST, and a language model on the Tiny Shakespeare dataset. Further, an extensive array of ablation studies shows that Polaris remains scalable and robust as dataset sizes scale up and data heterogeneity varies.
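The abstract describes scoring clients by both data heterogeneity and staleness when sampling them for asynchronous aggregation. As a rough illustration of that general idea (not the Polaris objective itself, which is not specified in this abstract), the sketch below samples clients with probability proportional to a hypothetical score that rewards data diversity and decays with staleness; the `diversity` field and the decay exponent `alpha` are illustrative assumptions, not quantities defined by the paper.

```python
import random

def staleness_weight(staleness, alpha=0.5):
    """Down-weight a client whose last update is stale (hypothetical decay)."""
    return (1.0 + staleness) ** (-alpha)

def select_clients(clients, k, alpha=0.5):
    """Sample k distinct client indices with probability proportional to
    diversity * staleness_weight. Illustrative heuristic only."""
    scores = [c["diversity"] * staleness_weight(c["staleness"], alpha)
              for c in clients]
    pool = list(range(len(clients)))       # candidate indices
    weights = scores[:]                    # sampling weights, pruned as we pick
    chosen = []
    for _ in range(min(k, len(pool))):
        idx = random.choices(pool, weights=weights, k=1)[0]
        pos = pool.index(idx)
        pool.pop(pos)                      # sample without replacement
        weights.pop(pos)
        chosen.append(idx)
    return chosen
```

In an asynchronous setting, a server might recompute these scores each time a straggler reports in, so that very stale clients are still selected occasionally rather than starved entirely.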
Journal introduction:
The IEEE Transactions on Cloud Computing (TCC) is dedicated to the multidisciplinary field of cloud computing. It is committed to the publication of articles that present innovative research ideas, application results, and case studies in cloud computing, focusing on key technical issues related to theory, algorithms, systems, applications, and performance.