以集合为基础增强边缘部署的 CNN 在少镜头场景中的可靠性

Zhen Gao;Shuang Liu;Junbo Zhao;Xiaofei Wang;Yu Wang;Zhu Han
{"title":"以集合为基础增强边缘部署的 CNN 在少镜头场景中的可靠性","authors":"Zhen Gao;Shuang Liu;Junbo Zhao;Xiaofei Wang;Yu Wang;Zhu Han","doi":"10.1109/TMLCN.2024.3435168","DOIUrl":null,"url":null,"abstract":"Convolutional Neural Networks (CNNs) have been applied in wide areas of computer vision, and edge intelligence is expected to provide instant AI service with the support of broadband mobile networks. However, the deployment of CNNs on network edge faces severe challenges. First, edge or embedded devices are usually not reliable, and hardware failures can corrupt the CNN system, which is unacceptable for critical applications, such as autonomous driving and object detection on space platforms. Second, edge or embedded devices are usually resource-limited, and therefore traditional redundancy-based protection methods are not applicable due to huge overhead. Although network pruning is effective to reduce the complexity of CNNs, we cannot have sufficient data for performance recovery in many scenarios due to privacy and security concerns. To enhance the reliability of CNNs on resource-limited devices with the few-shot constraint, we propose to construct an ensemble system with weak base CNNs pruned from the original strong CNN. To improve the ensemble performance with diverse base CNNs, we first propose a novel filter importance evaluation method by combining the amplitude and gradient information of the filter. Since the gradient part is related to the input data, different subsets of data are used for layer sensitivity analysis for different base CNNs, so that the different pruning configurations can be obtained for each base CNN. On this basis, a modified ReLU function is proposed to determine the final pruning rate of each layer in each base CNN. Extensive experiments prove that the proposed solution can effectively improve the reliability of CNNs with much less resource requirement for each edge server.","PeriodicalId":100641,"journal":{"name":"IEEE Transactions on Machine Learning in Communications and Networking","volume":"2 ","pages":"1062-1075"},"PeriodicalIF":0.0000,"publicationDate":"2024-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10614218","citationCount":"0","resultStr":"{\"title\":\"Ensemble-Based Reliability Enhancement for Edge-Deployed CNNs in Few-Shot Scenarios\",\"authors\":\"Zhen Gao;Shuang Liu;Junbo Zhao;Xiaofei Wang;Yu Wang;Zhu Han\",\"doi\":\"10.1109/TMLCN.2024.3435168\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Convolutional Neural Networks (CNNs) have been applied in wide areas of computer vision, and edge intelligence is expected to provide instant AI service with the support of broadband mobile networks. However, the deployment of CNNs on network edge faces severe challenges. First, edge or embedded devices are usually not reliable, and hardware failures can corrupt the CNN system, which is unacceptable for critical applications, such as autonomous driving and object detection on space platforms. Second, edge or embedded devices are usually resource-limited, and therefore traditional redundancy-based protection methods are not applicable due to huge overhead. Although network pruning is effective to reduce the complexity of CNNs, we cannot have sufficient data for performance recovery in many scenarios due to privacy and security concerns. To enhance the reliability of CNNs on resource-limited devices with the few-shot constraint, we propose to construct an ensemble system with weak base CNNs pruned from the original strong CNN. To improve the ensemble performance with diverse base CNNs, we first propose a novel filter importance evaluation method by combining the amplitude and gradient information of the filter. Since the gradient part is related to the input data, different subsets of data are used for layer sensitivity analysis for different base CNNs, so that the different pruning configurations can be obtained for each base CNN. On this basis, a modified ReLU function is proposed to determine the final pruning rate of each layer in each base CNN. Extensive experiments prove that the proposed solution can effectively improve the reliability of CNNs with much less resource requirement for each edge server.\",\"PeriodicalId\":100641,\"journal\":{\"name\":\"IEEE Transactions on Machine Learning in Communications and Networking\",\"volume\":\"2 \",\"pages\":\"1062-1075\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-07-29\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10614218\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Machine Learning in Communications and Networking\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10614218/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Machine Learning in Communications and Networking","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10614218/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

卷积神经网络(CNN)已被广泛应用于计算机视觉领域,而边缘智能有望在宽带移动网络的支持下提供即时的人工智能服务。然而,在网络边缘部署 CNN 面临着严峻的挑战。首先,边缘或嵌入式设备通常不可靠,硬件故障可能会破坏 CNN 系统,这对于自动驾驶和空间平台上的物体检测等关键应用来说是不可接受的。其次,边缘或嵌入式设备通常资源有限,因此传统的基于冗余的保护方法因开销巨大而不适用。虽然网络剪枝能有效降低 CNN 的复杂性,但出于隐私和安全考虑,我们在很多场景下无法获得足够的数据来恢复性能。为了在资源受限的设备上提高少镜头限制下的 CNN 的可靠性,我们建议用从原始强 CNN 中剪枝出来的弱基础 CNN 构建一个集合系统。为了提高不同基础 CNN 的集合性能,我们首先提出了一种新颖的滤波器重要性评估方法,该方法结合了滤波器的振幅和梯度信息。由于梯度部分与输入数据相关,因此对不同的基础 CNN 使用不同的数据子集进行层敏感性分析,从而为每个基础 CNN 获得不同的剪枝配置。在此基础上,提出了一种修改后的 ReLU 函数,用于确定每个基础 CNN 中各层的最终剪枝率。大量实验证明,所提出的解决方案可以有效提高 CNN 的可靠性,同时大大减少对每个边缘服务器的资源需求。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Ensemble-Based Reliability Enhancement for Edge-Deployed CNNs in Few-Shot Scenarios
Convolutional Neural Networks (CNNs) have been applied in wide areas of computer vision, and edge intelligence is expected to provide instant AI service with the support of broadband mobile networks. However, the deployment of CNNs on network edge faces severe challenges. First, edge or embedded devices are usually not reliable, and hardware failures can corrupt the CNN system, which is unacceptable for critical applications, such as autonomous driving and object detection on space platforms. Second, edge or embedded devices are usually resource-limited, and therefore traditional redundancy-based protection methods are not applicable due to huge overhead. Although network pruning is effective to reduce the complexity of CNNs, we cannot have sufficient data for performance recovery in many scenarios due to privacy and security concerns. To enhance the reliability of CNNs on resource-limited devices with the few-shot constraint, we propose to construct an ensemble system with weak base CNNs pruned from the original strong CNN. To improve the ensemble performance with diverse base CNNs, we first propose a novel filter importance evaluation method by combining the amplitude and gradient information of the filter. Since the gradient part is related to the input data, different subsets of data are used for layer sensitivity analysis for different base CNNs, so that the different pruning configurations can be obtained for each base CNN. On this basis, a modified ReLU function is proposed to determine the final pruning rate of each layer in each base CNN. Extensive experiments prove that the proposed solution can effectively improve the reliability of CNNs with much less resource requirement for each edge server.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Asynchronous Real-Time Federated Learning for Anomaly Detection in Microservice Cloud Applications Conditional Denoising Diffusion Probabilistic Models for Data Reconstruction Enhancement in Wireless Communications Multi-Agent Reinforcement Learning With Action Masking for UAV-Enabled Mobile Communications Online Learning for Intelligent Thermal Management of Interference-Coupled and Passively Cooled Base Stations Robust and Lightweight Modeling of IoT Network Behaviors From Raw Traffic Packets
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1