{"title":"空中边缘的大型模型:边缘-云模型演化与通信范式","authors":"Shuhang Zhang;Qingyu Liu;Ke Chen;Boya Di;Hongliang Zhang;Wenhan Yang;Dusit Niyato;Zhu Han;H. Vincent Poor","doi":"10.1109/JSAC.2024.3460078","DOIUrl":null,"url":null,"abstract":"The future sixth-generation (6G) of wireless networks is expected to surpass its predecessors by offering ubiquitous coverage through integrated air-ground deployments in both communication and computing domains. In such networks, aerial platforms, such as unmanned aerial vehicles (UAVs), conduct artificial intelligence (AI) computations based on multi-modal data to support diverse applications including surveillance and environment construction. However, these multi-domain inference and content generation tasks require large AI models, demanding powerful computing capabilities and finely tuned inference models trained on rich datasets, thus posing significant challenges for UAVs. To tackle this problem, we propose an integrated air-ground edge-cloud model framework, in which UAVs serve as edge nodes for data collection and small model computation. Through wireless channels, UAVs collaborate with ground cloud servers providing large model computation and model updating for edge UAVs. With limited wireless communication bandwidth, the proposed framework faces the challenge of information exchange scheduling between the edge UAVs and the cloud server. To tackle this, we present joint task allocation, transmission resource allocation, transmission data quantization design, and edge model update design to enhance the inference accuracy of the integrated air-ground edge-cloud model evolution framework by mean average precision (mAP) maximization. A closed-form lower bound on the mAP of the proposed framework is derived based on the mAP of the edge model and mAP of the cloud model, and the solution to the mAP maximization problem is optimized accordingly. Simulations, based on results from vision-based classification experiments, consistently demonstrate that the mAP of the proposed integrated air-ground edge-cloud model evolution framework outperforms both a centralized cloud model framework and a distributed edge model framework across various communication bandwidths and data sizes.","PeriodicalId":73294,"journal":{"name":"IEEE journal on selected areas in communications : a publication of the IEEE Communications Society","volume":"43 1","pages":"21-35"},"PeriodicalIF":0.0000,"publicationDate":"2024-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Large Models for Aerial Edges: An Edge-Cloud Model Evolution and Communication Paradigm\",\"authors\":\"Shuhang Zhang;Qingyu Liu;Ke Chen;Boya Di;Hongliang Zhang;Wenhan Yang;Dusit Niyato;Zhu Han;H. Vincent Poor\",\"doi\":\"10.1109/JSAC.2024.3460078\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The future sixth-generation (6G) of wireless networks is expected to surpass its predecessors by offering ubiquitous coverage through integrated air-ground deployments in both communication and computing domains. In such networks, aerial platforms, such as unmanned aerial vehicles (UAVs), conduct artificial intelligence (AI) computations based on multi-modal data to support diverse applications including surveillance and environment construction. However, these multi-domain inference and content generation tasks require large AI models, demanding powerful computing capabilities and finely tuned inference models trained on rich datasets, thus posing significant challenges for UAVs. To tackle this problem, we propose an integrated air-ground edge-cloud model framework, in which UAVs serve as edge nodes for data collection and small model computation. Through wireless channels, UAVs collaborate with ground cloud servers providing large model computation and model updating for edge UAVs. With limited wireless communication bandwidth, the proposed framework faces the challenge of information exchange scheduling between the edge UAVs and the cloud server. To tackle this, we present joint task allocation, transmission resource allocation, transmission data quantization design, and edge model update design to enhance the inference accuracy of the integrated air-ground edge-cloud model evolution framework by mean average precision (mAP) maximization. A closed-form lower bound on the mAP of the proposed framework is derived based on the mAP of the edge model and mAP of the cloud model, and the solution to the mAP maximization problem is optimized accordingly. Simulations, based on results from vision-based classification experiments, consistently demonstrate that the mAP of the proposed integrated air-ground edge-cloud model evolution framework outperforms both a centralized cloud model framework and a distributed edge model framework across various communication bandwidths and data sizes.\",\"PeriodicalId\":73294,\"journal\":{\"name\":\"IEEE journal on selected areas in communications : a publication of the IEEE Communications Society\",\"volume\":\"43 1\",\"pages\":\"21-35\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE journal on selected areas in communications : a publication of the IEEE Communications Society\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10681129/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE journal on selected areas in communications : a publication of the IEEE Communications Society","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10681129/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Large Models for Aerial Edges: An Edge-Cloud Model Evolution and Communication Paradigm
The future sixth-generation (6G) of wireless networks is expected to surpass its predecessors by offering ubiquitous coverage through integrated air-ground deployments in both communication and computing domains. In such networks, aerial platforms, such as unmanned aerial vehicles (UAVs), conduct artificial intelligence (AI) computations based on multi-modal data to support diverse applications including surveillance and environment construction. However, these multi-domain inference and content generation tasks require large AI models, demanding powerful computing capabilities and finely tuned inference models trained on rich datasets, thus posing significant challenges for UAVs. To tackle this problem, we propose an integrated air-ground edge-cloud model framework, in which UAVs serve as edge nodes for data collection and small model computation. Through wireless channels, UAVs collaborate with ground cloud servers providing large model computation and model updating for edge UAVs. With limited wireless communication bandwidth, the proposed framework faces the challenge of information exchange scheduling between the edge UAVs and the cloud server. To tackle this, we present joint task allocation, transmission resource allocation, transmission data quantization design, and edge model update design to enhance the inference accuracy of the integrated air-ground edge-cloud model evolution framework by mean average precision (mAP) maximization. A closed-form lower bound on the mAP of the proposed framework is derived based on the mAP of the edge model and mAP of the cloud model, and the solution to the mAP maximization problem is optimized accordingly. Simulations, based on results from vision-based classification experiments, consistently demonstrate that the mAP of the proposed integrated air-ground edge-cloud model evolution framework outperforms both a centralized cloud model framework and a distributed edge model framework across various communication bandwidths and data sizes.