DiReDi: Distillation and Reverse Distillation for AIoT Applications

arXiv - CS - Distributed, Parallel, and Cluster Computing Pub Date : 2024-09-12 DOI:arxiv-2409.08308

Chen Sun, Qing Tong, Wenshuang Yang, Wenqi Zhang

{"title":"DiReDi: Distillation and Reverse Distillation for AIoT Applications","authors":"Chen Sun, Qing Tong, Wenshuang Yang, Wenqi Zhang","doi":"arxiv-2409.08308","DOIUrl":null,"url":null,"abstract":"Typically, the significant efficiency can be achieved by deploying different\nedge AI models in various real world scenarios while a few large models manage\nthose edge AI models remotely from cloud servers. However, customizing edge AI\nmodels for each user's specific application or extending current models to new\napplication scenarios remains a challenge. Inappropriate local training or fine\ntuning of edge AI models by users can lead to model malfunction, potentially\nresulting in legal issues for the manufacturer. To address aforementioned\nissues, this paper proposes an innovative framework called \"DiReD\", which\ninvolves knowledge DIstillation & REverse DIstillation. In the initial step, an\nedge AI model is trained with presumed data and a KD process using the cloud AI\nmodel in the upper management cloud server. This edge AI model is then\ndispatched to edge AI devices solely for inference in the user's application\nscenario. When the user needs to update the edge AI model to better fit the\nactual scenario, the reverse distillation (RD) process is employed to extract\nthe knowledge: the difference between user preferences and the manufacturer's\npresumptions from the edge AI model using the user's exclusive data. Only the\nextracted knowledge is reported back to the upper management cloud server to\nupdate the cloud AI model, thus protecting user privacy by not using any\nexclusive data. The updated cloud AI can then update the edge AI model with the\nextended knowledge. Simulation results demonstrate that the proposed \"DiReDi\"\nframework allows the manufacturer to update the user model by learning new\nknowledge from the user's actual scenario with private data. The initial\nredundant knowledge is reduced since the retraining emphasizes user private\ndata.","PeriodicalId":501422,"journal":{"name":"arXiv - CS - Distributed, Parallel, and Cluster Computing","volume":"64 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Distributed, Parallel, and Cluster Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.08308","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Typically, the significant efficiency can be achieved by deploying different edge AI models in various real world scenarios while a few large models manage those edge AI models remotely from cloud servers. However, customizing edge AI models for each user's specific application or extending current models to new application scenarios remains a challenge. Inappropriate local training or fine tuning of edge AI models by users can lead to model malfunction, potentially resulting in legal issues for the manufacturer. To address aforementioned issues, this paper proposes an innovative framework called "DiReD", which involves knowledge DIstillation & REverse DIstillation. In the initial step, an edge AI model is trained with presumed data and a KD process using the cloud AI model in the upper management cloud server. This edge AI model is then dispatched to edge AI devices solely for inference in the user's application scenario. When the user needs to update the edge AI model to better fit the actual scenario, the reverse distillation (RD) process is employed to extract the knowledge: the difference between user preferences and the manufacturer's presumptions from the edge AI model using the user's exclusive data. Only the extracted knowledge is reported back to the upper management cloud server to update the cloud AI model, thus protecting user privacy by not using any exclusive data. The updated cloud AI can then update the edge AI model with the extended knowledge. Simulation results demonstrate that the proposed "DiReDi" framework allows the manufacturer to update the user model by learning new knowledge from the user's actual scenario with private data. The initial redundant knowledge is reduced since the retraining emphasizes user private data.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

DiReDi：面向 AIoT 应用的蒸馏和反向蒸馏

通常情况下，通过在各种现实场景中部署不同的边缘人工智能模型，同时由少数大型模型从云服务器远程管理这些边缘人工智能模型，可以实现显著的效率。然而，为每个用户的特定应用定制边缘人工智能模型或将当前模型扩展到新的应用场景仍然是一项挑战。用户对边缘人工智能模型进行不恰当的本地训练或微调可能会导致模型失灵，从而给制造商带来潜在的法律问题。为解决上述问题，本文提出了一个名为 "DiReD "的创新框架。第一步，利用上层管理云服务器中的云人工智能模型，通过假定数据和 KD 流程训练边缘人工智能模型。然后，该边缘人工智能模型被分配到边缘人工智能设备上，仅用于用户应用场景中的推理。当用户需要更新边缘人工智能模型以更好地适应实际场景时，就会采用反向蒸馏（RD）流程，利用用户的独家数据从边缘人工智能模型中提取知识：用户偏好与制造商假设之间的差异。只有提取的知识才会反馈给上层管理云服务器，用于更新云人工智能模型，从而通过不使用任何独家数据来保护用户隐私。更新后的云人工智能可以利用扩展知识更新边缘人工智能模型。仿真结果表明，所提出的 "DiReDi "框架允许制造商通过从用户的实际场景中学习新知识来更新用户模型。由于再训练强调用户私有数据，因此减少了初始冗余知识。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

arXiv - CS - Distributed, Parallel, and Cluster Computing

自引率

0.00%

发文量

期刊最新文献

Massively parallel CMA-ES with increasing population Communication Lower Bounds and Optimal Algorithms for Symmetric Matrix Computations Energy Efficiency Support for Software Defined Networks: a Serverless Computing Approach CountChain: A Decentralized Oracle Network for Counting Systems Delay Analysis of EIP-4844