{"title":"用于快速视觉感知立体仓库存储位置状态的轻量级卷积神经网络","authors":"Liangrui Zhang, Xi Zhang, Mingzhou Liu","doi":"10.1007/s10845-024-02397-0","DOIUrl":null,"url":null,"abstract":"<p>Accurate storage location status data is an important input for location assignment in the inbound stage. Traditional Internet of Things (IoT) identification technologies require high costs and are easily affected by warehouse environments. A lightweight convolutional neural network is proposed for perceiving storage status to achieve high stability and low cost of location availability monitoring. Based on the existing You Only Look Once (YOLOv5) algorithm, the Hough transform is used in the pre-processing to implement tilt correction on the image to improve the stability of object localization. Then the feature extraction unit CBlock is designed based on a new depthwise separable convolution in which the convolutional block attention module is embedded, focusing on both channel and spatial information. The backbone network is constructed by stacking these CBlock blocks to compress the computational cost. The improved neck network adds cross-layer information fusion to reduce the information loss caused by sampling and ensure perceptual accuracy. Moreover, the penalty metric is redefined by SIoU, which considers the vector angle of the bounding box regression and improves the convergence speed and accuracy. The experiments show that the proposed model achieves successful results for storage location status perception in stereo warehouse.</p>","PeriodicalId":16193,"journal":{"name":"Journal of Intelligent Manufacturing","volume":"22 1","pages":""},"PeriodicalIF":5.9000,"publicationDate":"2024-05-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Lightweight convolutional neural network for fast visual perception of storage location status in stereo warehouse\",\"authors\":\"Liangrui Zhang, Xi Zhang, Mingzhou Liu\",\"doi\":\"10.1007/s10845-024-02397-0\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Accurate storage location status data is an important input for location assignment in the inbound stage. Traditional Internet of Things (IoT) identification technologies require high costs and are easily affected by warehouse environments. A lightweight convolutional neural network is proposed for perceiving storage status to achieve high stability and low cost of location availability monitoring. Based on the existing You Only Look Once (YOLOv5) algorithm, the Hough transform is used in the pre-processing to implement tilt correction on the image to improve the stability of object localization. Then the feature extraction unit CBlock is designed based on a new depthwise separable convolution in which the convolutional block attention module is embedded, focusing on both channel and spatial information. The backbone network is constructed by stacking these CBlock blocks to compress the computational cost. The improved neck network adds cross-layer information fusion to reduce the information loss caused by sampling and ensure perceptual accuracy. Moreover, the penalty metric is redefined by SIoU, which considers the vector angle of the bounding box regression and improves the convergence speed and accuracy. The experiments show that the proposed model achieves successful results for storage location status perception in stereo warehouse.</p>\",\"PeriodicalId\":16193,\"journal\":{\"name\":\"Journal of Intelligent Manufacturing\",\"volume\":\"22 1\",\"pages\":\"\"},\"PeriodicalIF\":5.9000,\"publicationDate\":\"2024-05-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Intelligent Manufacturing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1007/s10845-024-02397-0\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Intelligent Manufacturing","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1007/s10845-024-02397-0","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Lightweight convolutional neural network for fast visual perception of storage location status in stereo warehouse
Accurate storage location status data is an important input for location assignment in the inbound stage. Traditional Internet of Things (IoT) identification technologies require high costs and are easily affected by warehouse environments. A lightweight convolutional neural network is proposed for perceiving storage status to achieve high stability and low cost of location availability monitoring. Based on the existing You Only Look Once (YOLOv5) algorithm, the Hough transform is used in the pre-processing to implement tilt correction on the image to improve the stability of object localization. Then the feature extraction unit CBlock is designed based on a new depthwise separable convolution in which the convolutional block attention module is embedded, focusing on both channel and spatial information. The backbone network is constructed by stacking these CBlock blocks to compress the computational cost. The improved neck network adds cross-layer information fusion to reduce the information loss caused by sampling and ensure perceptual accuracy. Moreover, the penalty metric is redefined by SIoU, which considers the vector angle of the bounding box regression and improves the convergence speed and accuracy. The experiments show that the proposed model achieves successful results for storage location status perception in stereo warehouse.
期刊介绍:
The Journal of Nonlinear Engineering aims to be a platform for sharing original research results in theoretical, experimental, practical, and applied nonlinear phenomena within engineering. It serves as a forum to exchange ideas and applications of nonlinear problems across various engineering disciplines. Articles are considered for publication if they explore nonlinearities in engineering systems, offering realistic mathematical modeling, utilizing nonlinearity for new designs, stabilizing systems, understanding system behavior through nonlinearity, optimizing systems based on nonlinear interactions, and developing algorithms to harness and leverage nonlinear elements.