An improved lightweight network based on deep learning for grape recognition in unstructured environments

IF 7.7 Q1 AGRICULTURE, MULTIDISCIPLINARY Information Processing in Agriculture Pub Date : 2023-02-20 DOI:10.1016/j.inpa.2023.02.003

Bingpiao Liu, Yunzhi Zhang, Jinhai Wang, Lufeng Luo, Qinghua Lu, Huiling Wei, Wenbo Zhu

{"title":"An improved lightweight network based on deep learning for grape recognition in unstructured environments","authors":"Bingpiao Liu, Yunzhi Zhang, Jinhai Wang, Lufeng Luo, Qinghua Lu, Huiling Wei, Wenbo Zhu","doi":"10.1016/j.inpa.2023.02.003","DOIUrl":null,"url":null,"abstract":"<div><p>In unstructured environments, dense grape fruit growth and the presence of occlusion cause difficult recognition problems, which will seriously affect the performance of grape picking robots. To address these problems, this study improves the YOLOX-Tiny model and proposes a new grape detection model, YOLOX-RA, which can quickly and accurately identify densely growing and occluded grape bunches. The proposed YOLOX-RA model uses a 3 × 3 convolutional layer with a step size of 2 to replace the focal layer to reduce the computational burden. The CBS layer in the ResBlock_Body module of the second, third, and fourth layers of the backbone layer is removed, and the CSPLayer module is replaced by the ResBlock-M module to speed up the detection. An auxiliary network (AlNet) with the remaining network blocks was added after the ResBlock-M module to improve the detection accuracy. Two depth-separable convolutions (DSC) are used in the neck module layer to replace the normal convolution to reduce the computational cost. We evaluated the detection performance of SSD, YOLOv4 SSD, YOLOv4-Tiny, YOLO-Grape, YOLOv5-X, YOLOX-Tiny, and YOLOX-RA on a grape test set. The results show that the YOLOX-RA model has the best detection performance, achieving 88.75 % mAP, a recognition speed of 84.88 FPS, and model size of 17.53 MB. It can accurately detect densely grown and shaded grape bunches, which can effectively improve the performance of the grape picking robot.</p></div>","PeriodicalId":53443,"journal":{"name":"Information Processing in Agriculture","volume":"11 2","pages":"Pages 202-216"},"PeriodicalIF":7.7000,"publicationDate":"2023-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2214317323000136/pdfft?md5=09e056f7c87eb9309cb1cba9644ac80b&pid=1-s2.0-S2214317323000136-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Processing in Agriculture","FirstCategoryId":"1091","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2214317323000136","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 0

Abstract

In unstructured environments, dense grape fruit growth and the presence of occlusion cause difficult recognition problems, which will seriously affect the performance of grape picking robots. To address these problems, this study improves the YOLOX-Tiny model and proposes a new grape detection model, YOLOX-RA, which can quickly and accurately identify densely growing and occluded grape bunches. The proposed YOLOX-RA model uses a 3 × 3 convolutional layer with a step size of 2 to replace the focal layer to reduce the computational burden. The CBS layer in the ResBlock_Body module of the second, third, and fourth layers of the backbone layer is removed, and the CSPLayer module is replaced by the ResBlock-M module to speed up the detection. An auxiliary network (AlNet) with the remaining network blocks was added after the ResBlock-M module to improve the detection accuracy. Two depth-separable convolutions (DSC) are used in the neck module layer to replace the normal convolution to reduce the computational cost. We evaluated the detection performance of SSD, YOLOv4 SSD, YOLOv4-Tiny, YOLO-Grape, YOLOv5-X, YOLOX-Tiny, and YOLOX-RA on a grape test set. The results show that the YOLOX-RA model has the best detection performance, achieving 88.75 % mAP, a recognition speed of 84.88 FPS, and model size of 17.53 MB. It can accurately detect densely grown and shaded grape bunches, which can effectively improve the performance of the grape picking robot.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

一种改进的基于深度学习的轻量级网络用于非结构化环境中的葡萄识别

在非结构化环境中，密集生长的葡萄果实和遮挡物的存在会造成难以识别的问题，严重影响葡萄采摘机器人的性能。针对这些问题，本研究改进了 YOLOX-Tiny 模型，并提出了一种新的葡萄检测模型 YOLOX-RA，该模型可以快速准确地识别生长密集和遮挡的葡萄串。所提出的 YOLOX-RA 模型使用步长为 2 的 3 × 3 卷积层取代焦点层，以减轻计算负担。骨干层第二、三、四层 ResBlock_Body 模块中的 CBS 层被移除，CSPLayer 模块被 ResBlock-M 模块取代，以加快检测速度。在 ResBlock-M 模块之后添加了一个包含其余网络模块的辅助网络（AlNet），以提高检测精度。在颈部模块层使用了两个深度分离卷积（DSC）来替代普通卷积，以降低计算成本。我们在葡萄测试集上评估了 SSD、YOLOv4 SSD、YOLOv4-Tiny、YOLO-Grape、YOLOv5-X、YOLOX-Tiny 和 YOLOX-RA 的检测性能。结果表明，YOLOX-RA 模型的检测性能最好，mAP 达到 88.75%，识别速度为 84.88 FPS，模型大小为 17.53 MB。它能准确检测到生长密集和遮光的葡萄串，从而有效提高葡萄采摘机器人的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Information Processing in Agriculture Agricultural and Biological Sciences-Animal Science and Zoology

CiteScore

21.10

自引率

0.00%

发文量

期刊介绍： Information Processing in Agriculture (IPA) was established in 2013 and it encourages the development towards a science and technology of information processing in agriculture, through the following aims: • Promote the use of knowledge and methods from the information processing technologies in the agriculture; • Illustrate the experiences and publications of the institutes, universities and government, and also the profitable technologies on agriculture; • Provide opportunities and platform for exchanging knowledge, strategies and experiences among the researchers in information processing worldwide; • Promote and encourage interactions among agriculture Scientists, Meteorologists, Biologists (Pathologists/Entomologists) with IT Professionals and other stakeholders to develop and implement methods, techniques, tools, and issues related to information processing technology in agriculture; • Create and promote expert groups for development of agro-meteorological databases, crop and livestock modelling and applications for development of crop performance based decision support system. Topics of interest include, but are not limited to: • Smart Sensor and Wireless Sensor Network • Remote Sensing • Simulation, Optimization, Modeling and Automatic Control • Decision Support Systems, Intelligent Systems and Artificial Intelligence • Computer Vision and Image Processing • Inspection and Traceability for Food Quality • Precision Agriculture and Intelligent Instrument • The Internet of Things and Cloud Computing • Big Data and Data Mining