Xudong Sun , Yu Wang , Zhanglin Liu , Shaoxuan Gao , Wenbo He , Chao Tong
{"title":"SlimDL: Deploying ultra-light deep learning model on sweeping robots","authors":"Xudong Sun , Yu Wang , Zhanglin Liu , Shaoxuan Gao , Wenbo He , Chao Tong","doi":"10.1016/j.engappai.2025.110415","DOIUrl":null,"url":null,"abstract":"<div><div>Advanced object detection methods have yielded impressive progress in recent years. However, the computational constraints of edge mobile devices present significant deployment challenges for state-of-the-art algorithms. We propose a deep learning deployment framework with two stages: model adaptation and compression. Our method enhance “You Only Look Once version 5” (YOLOv5) with lightweight modules, which improves detection performance while reducing computational load. Additionally, we present a pruning algorithm, employing adaptive batch normalization and iterative pruning. Our evaluation on “Microsoft Common Objects in Context” (MSCOCO) dataset and custom SweepRobot datasets demonstrates that our method consistently outperforms state-of-the-art approaches. On the SweepRobot dataset, our method doubled YOLOv5’s detection speed on the sweeping robot from 15.69 frames per second (FPS) to 30.77 FPS, maintaining 97.3% performance at 20% of the computational cost. Even on Graphics Processing Unit equipped devices, our method achieved 1.8% and 2.8% higher Average Precision compared to direct scaling and pruning with the original pruning algorithm.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"149 ","pages":"Article 110415"},"PeriodicalIF":8.0000,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625004154","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/3/11 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Advanced object detection methods have yielded impressive progress in recent years. However, the computational constraints of edge mobile devices present significant deployment challenges for state-of-the-art algorithms. We propose a deep learning deployment framework with two stages: model adaptation and compression. Our method enhance “You Only Look Once version 5” (YOLOv5) with lightweight modules, which improves detection performance while reducing computational load. Additionally, we present a pruning algorithm, employing adaptive batch normalization and iterative pruning. Our evaluation on “Microsoft Common Objects in Context” (MSCOCO) dataset and custom SweepRobot datasets demonstrates that our method consistently outperforms state-of-the-art approaches. On the SweepRobot dataset, our method doubled YOLOv5’s detection speed on the sweeping robot from 15.69 frames per second (FPS) to 30.77 FPS, maintaining 97.3% performance at 20% of the computational cost. Even on Graphics Processing Unit equipped devices, our method achieved 1.8% and 2.8% higher Average Precision compared to direct scaling and pruning with the original pruning algorithm.
近年来,先进的目标检测方法取得了令人瞩目的进展。然而,边缘移动设备的计算限制为最先进的算法提出了重大的部署挑战。我们提出了一个深度学习部署框架,分为两个阶段:模型适应和压缩。我们的方法通过轻量级模块增强了“You Only Look Once version 5”(YOLOv5),在降低计算负荷的同时提高了检测性能。此外,我们提出了一种采用自适应批归一化和迭代剪枝的剪枝算法。我们对“Microsoft公共对象上下文”(MSCOCO)数据集和自定义SweepRobot数据集的评估表明,我们的方法始终优于最先进的方法。在SweepRobot数据集上,我们的方法将YOLOv5对扫地机器人的检测速度从15.69帧/秒(FPS)提高到30.77帧/秒,在20%的计算成本下保持了97.3%的性能。即使在配备图形处理单元的设备上,与使用原始修剪算法的直接缩放和修剪相比,我们的方法的平均精度也提高了1.8%和2.8%。
期刊介绍:
Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.