{"title":"Federated Low-Rank Adaptation for Large Models Fine-Tuning Over Wireless Networks","authors":"Haofeng Sun;Hui Tian;Wanli Ni;Jingheng Zheng;Dusit Niyato;Ping Zhang","doi":"10.1109/TWC.2024.3497998","DOIUrl":null,"url":null,"abstract":"The emergence of large language models (LLMs) with multi-task generalization capabilities is expected to improve the performance of artificial intelligence (AI)-as-a-service provision in 6G networks. By fine-tuning LLMs, AI services can become more precise and tailored to the demands of different downstream tasks. However, centralized fine-tuning paradigms pose a potential risk to user privacy, and existing distributed fine-tuning methods incur significant wireless transmission burdens due to the large-scale parameter transmission of LLMs. To tackle these challenges, by leveraging the low rank feature in LLM fine-tuning, we propose a wireless over-the-air federated learning (AirFL) based low-rank adaptation (LoRA) framework that integrates LoRA and over-the-air computation (AirComp) to achieve efficient fine-tuning and aggregation. Based on multiple-input multiple-output (MIMO) and orthogonal frequency division multiplexing (OFDM), we design a multi-stream AirComp scheme to fulfill the aggregation requirement of AirFL-LoRA. Furthermore, by deriving an optimality gap, we gain theoretical insights into the joint impact of rank selection and gradient aggregation distortion on the fine-tuning performance of AirFL-LoRA. Next, we formulate a non-convex problem to minimize the optimality gap, which is solved by the proposed backtracking-based alternating algorithm and the manifold optimization algorithm iteratively. Through fine-tuning LLMs for different downstream tasks, experimental results reveal that the AirFL-LoRA framework outperforms the state-of-the-art baselines on both training loss and perplexity, closely approximating the performance of FL with ideal aggregation.","PeriodicalId":13431,"journal":{"name":"IEEE Transactions on Wireless Communications","volume":"24 1","pages":"659-675"},"PeriodicalIF":8.9000,"publicationDate":"2024-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Wireless Communications","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10763424/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
Abstract
The emergence of large language models (LLMs) with multi-task generalization capabilities is expected to improve the performance of artificial intelligence (AI)-as-a-service provision in 6G networks. By fine-tuning LLMs, AI services can become more precise and better tailored to the demands of different downstream tasks. However, centralized fine-tuning paradigms pose a potential risk to user privacy, and existing distributed fine-tuning methods incur significant wireless transmission burdens because the large-scale parameters of LLMs must be exchanged. To tackle these challenges, by leveraging the low-rank structure of LLM fine-tuning updates, we propose an over-the-air federated learning (AirFL)-based low-rank adaptation (LoRA) framework, termed AirFL-LoRA, that integrates LoRA with over-the-air computation (AirComp) to achieve efficient fine-tuning and aggregation. Based on multiple-input multiple-output (MIMO) and orthogonal frequency division multiplexing (OFDM), we design a multi-stream AirComp scheme that fulfills the aggregation requirement of AirFL-LoRA. Furthermore, by deriving an optimality gap, we gain theoretical insights into the joint impact of rank selection and gradient aggregation distortion on the fine-tuning performance of AirFL-LoRA. We then formulate a non-convex problem that minimizes the optimality gap and solve it iteratively with the proposed backtracking-based alternating algorithm and a manifold optimization algorithm. Experimental results on fine-tuning LLMs for different downstream tasks reveal that the AirFL-LoRA framework outperforms state-of-the-art baselines in terms of both training loss and perplexity, closely approaching the performance of federated learning (FL) with ideal aggregation.
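To make the two building blocks concrete, the sketch below illustrates, in plain NumPy rather than the paper's code, how a LoRA update shrinks the uplink payload and how over-the-air aggregation can be modeled as noisy analog superposition. All identifiers (lora_forward, aggregate_over_the_air, the shapes, and the noise level) are illustrative assumptions; the paper's actual MIMO-OFDM multi-stream AirComp design is abstracted here as simple additive receiver noise.

```python
import numpy as np

# Minimal sketch (not the authors' implementation) of the two ideas the
# abstract combines: (i) LoRA, which freezes a pretrained weight W0 and
# learns a low-rank update B @ A with rank r << min(d, k); (ii) over-the-air
# aggregation, modeled here as analog superposition of the clients' LoRA
# factors corrupted by additive noise (a stand-in for AirComp distortion).

rng = np.random.default_rng(0)

d, k, rank = 64, 64, 4            # layer shape and LoRA rank (assumed values)
num_clients = 8
noise_std = 0.01                  # stands in for channel-induced aggregation error

W0 = rng.standard_normal((d, k))  # frozen pretrained weight

def lora_forward(x, W0, A, B):
    """Forward pass with a low-rank adapter: y = x (W0 + B A)."""
    return x @ (W0 + B @ A)

def aggregate_over_the_air(factors, noise_std):
    """The channel sums the clients' analog transmissions; the server
    receives that sum plus receiver noise, then averages."""
    s = sum(factors)
    s = s + noise_std * rng.standard_normal(s.shape)
    return s / len(factors)

# Each client fine-tunes only its low-rank factors (random stand-ins here).
client_As = [0.1 * rng.standard_normal((rank, k)) for _ in range(num_clients)]
client_Bs = [0.1 * rng.standard_normal((d, rank)) for _ in range(num_clients)]

# Only r * (d + k) values per client travel over the air, instead of d * k.
A_avg = aggregate_over_the_air(client_As, noise_std)
B_avg = aggregate_over_the_air(client_Bs, noise_std)

x = rng.standard_normal((1, d))
print(lora_forward(x, W0, A_avg, B_avg).shape)  # (1, 64)
print(f"uplink payload per client: {rank * (d + k)} vs full {d * k} parameters")
```

One design subtlety worth noting: averaging the factors A_i and B_i separately is not the same as averaging the products B_i A_i, so any federated LoRA scheme must account for, or provably tolerate, this mismatch on top of the channel-induced distortion that the paper's optimality gap analysis quantifies.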
Journal Introduction:
The IEEE Transactions on Wireless Communications is a prestigious publication that showcases cutting-edge advancements in wireless communications and welcomes both theoretical and practical contributions. Its scope encompasses a wide range of topics, including modulation and coding, detection and estimation, propagation and channel characterization, and diversity techniques, with an emphasis on the physical- and link-layer communication aspects of network architectures and protocols.
The journal is also open to papers on focused or non-traditional topics tied to specific application areas, including simulation tools and methodologies, orthogonal frequency division multiplexing (OFDM), MIMO systems, and wireless over optical technologies.
Overall, the IEEE Transactions on Wireless Communications serves as a platform for high-quality manuscripts that push the boundaries of wireless communications and contribute to advancements in the field.