A systematic review of federated learning incentive mechanisms and associated security challenges

IF 12.7 1区计算机科学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Computer Science Review Pub Date : 2023-09-13 DOI:10.1016/j.cosrev.2023.100593

Asad Ali , Inaam Ilahi , Adnan Qayyum , Ihab Mohammed , Ala Al-Fuqaha , Junaid Qadir

{"title":"A systematic review of federated learning incentive mechanisms and associated security challenges","authors":"Asad Ali , Inaam Ilahi , Adnan Qayyum , Ihab Mohammed , Ala Al-Fuqaha , Junaid Qadir","doi":"10.1016/j.cosrev.2023.100593","DOIUrl":null,"url":null,"abstract":"<div><p>In response to various privacy risks, researchers and practitioners have been exploring different paradigms that can leverage the increased computational capabilities of consumer devices to train machine learning<span> (ML) models in a distributed fashion without requiring the uploading of the training data from individual devices to central facilities. For this purpose, federated learning (FL) was proposed as a technique that can learn a global machine model at a central master node by the aggregation of models trained locally using private data. However, organizations may be reluctant to train models locally and to share these local ML models due to the required computational resources for model training at their end and due to privacy risks that may result from adversaries inverting these models to infer information about the private training data. Incentive mechanisms have been proposed to motivate end users to participate in collaborative training of ML models (using their local data) in return for certain rewards. However, the design of an optimal incentive mechanism for FL is challenging due to its distributed nature and the fact that the central server has no access to clients’ hyperparameters information and the amount/quality data used for training, which makes the task of determining the reward based on the contribution of individual clients in FL environment difficult. Even though several incentive mechanisms have been proposed for FL, a thorough up-to-date systematic review is missing and this paper fills this gap. To the best of our knowledge, this paper is the first systematic review that comprehensively enlists the design principles required for implementing these incentive mechanisms and then categorizes various incentive mechanisms according to their design principles. In addition, we also provide a comprehensive overview of security challenges associated with incentive-driven FL. Finally, we highlight the limitations and pitfalls of these incentive schemes and elaborate upon open-research issues that require further research attention.</span></p></div>","PeriodicalId":48633,"journal":{"name":"Computer Science Review","volume":"50 ","pages":"Article 100593"},"PeriodicalIF":12.7000,"publicationDate":"2023-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer Science Review","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1574013723000606","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 1

Abstract

In response to various privacy risks, researchers and practitioners have been exploring different paradigms that can leverage the increased computational capabilities of consumer devices to train machine learning (ML) models in a distributed fashion without requiring the uploading of the training data from individual devices to central facilities. For this purpose, federated learning (FL) was proposed as a technique that can learn a global machine model at a central master node by the aggregation of models trained locally using private data. However, organizations may be reluctant to train models locally and to share these local ML models due to the required computational resources for model training at their end and due to privacy risks that may result from adversaries inverting these models to infer information about the private training data. Incentive mechanisms have been proposed to motivate end users to participate in collaborative training of ML models (using their local data) in return for certain rewards. However, the design of an optimal incentive mechanism for FL is challenging due to its distributed nature and the fact that the central server has no access to clients’ hyperparameters information and the amount/quality data used for training, which makes the task of determining the reward based on the contribution of individual clients in FL environment difficult. Even though several incentive mechanisms have been proposed for FL, a thorough up-to-date systematic review is missing and this paper fills this gap. To the best of our knowledge, this paper is the first systematic review that comprehensively enlists the design principles required for implementing these incentive mechanisms and then categorizes various incentive mechanisms according to their design principles. In addition, we also provide a comprehensive overview of security challenges associated with incentive-driven FL. Finally, we highlight the limitations and pitfalls of these incentive schemes and elaborate upon open-research issues that require further research attention.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

联邦学习激励机制和相关安全挑战的系统综述

为了应对各种隐私风险，研究人员和从业者一直在探索不同的范式，这些范式可以利用消费者设备增加的计算能力，以分布式方式训练机器学习（ML）模型，而不需要将训练数据从单个设备上传到中央设施。为此，提出了联邦学习（FL），作为一种可以通过聚合使用私有数据在本地训练的模型来在中央主节点学习全局机器模型的技术。然而，组织可能不愿意在本地训练模型并共享这些本地ML模型，这是因为它们末端的模型训练所需的计算资源，以及由于对手反转这些模型以推断有关私人训练数据的信息可能导致的隐私风险。已经提出了激励机制来激励最终用户参与ML模型的协作训练（使用他们的本地数据），以换取某些奖励。然而，由于FL的分布式性质以及中央服务器无法访问客户端的超参数信息和用于训练的数量/质量数据，因此FL的最佳激励机制的设计具有挑战性，这使得基于单个客户端在FL环境中的贡献来确定奖励的任务变得困难。尽管已经为FL提出了几种激励机制，但缺乏最新的系统综述，本文填补了这一空白。据我们所知，本文是第一篇系统综述，全面列出了实施这些激励机制所需的设计原则，然后根据其设计原则对各种激励机制进行了分类。此外，我们还全面概述了与激励驱动的FL相关的安全挑战。最后，我们强调了这些激励方案的局限性和陷阱，并详细阐述了需要进一步研究关注的开放研究问题。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Computer Science Review Computer Science-General Computer Science

CiteScore

32.70

自引率

0.00%

发文量

审稿时长

51 days

期刊介绍： Computer Science Review, a publication dedicated to research surveys and expository overviews of open problems in computer science, targets a broad audience within the field seeking comprehensive insights into the latest developments. The journal welcomes articles from various fields as long as their content impacts the advancement of computer science. In particular, articles that review the application of well-known Computer Science methods to other areas are in scope only if these articles advance the fundamental understanding of those methods.