Yanjun Zhu , Shunshun Sui , Wensheng Du , Xiang Li , Ping Liu
{"title":"Picking point localization method of table grape picking robot based on you only look once version 8 nano","authors":"Yanjun Zhu , Shunshun Sui , Wensheng Du , Xiang Li , Ping Liu","doi":"10.1016/j.engappai.2025.110266","DOIUrl":null,"url":null,"abstract":"<div><div>Automatic localization of the picking point for table grape is the key to achieving intelligent harvesting. Aiming at the problem, the You Only Look Once version 8 nano-Deformable Convolutional Networks-Wise Intersection over Union-Fourth Detection Layer (YOLO v8n-DWF) network was designed for the table grape picking robot to realize the detection of table grapes and the localization of picking points. The Deformable Convolutional Networks (DCN) model was used to enhance the robustness of table grape detection and improve the detection precision of grape stems. To improve the detection precision of table grape and stem and reduce the impact of low-quality data on model generalization ability, Wise-Intersection over Union version 3 (<em>WIou v3</em>) was applied. In addition, aiming at the problem of difficultly identifying due to small target and low pixel of table grape stem, a fourth detection layer for small target detection was added to improve the recognition and detection ability of the network model for table grape stem. Further, a more accurate geometric localization method of picking points was proposed to achieve fast picking of table grapes. Finally, the results showed that the detection precision, recall, mean Average Precision<sub>50</sub> (<em>mAP</em><sub><em>50</em></sub>) and mean Average Precision<sub>50-95</sub> (<em>mAP</em><sub><em>50-95</em></sub>) of the YOLO v8n-DWF model was 97.9%, 95.3%, 97.6% and 85.4%, respectively. In addition, the success rate of the geometric method based on the results of identification for YOLO v8n-DWF was 88.24%, and the average picking success rate of table grapes in field experiments was 87.40%. It can fully meet the requirements of intelligent picking of table grapes.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"146 ","pages":"Article 110266"},"PeriodicalIF":8.0000,"publicationDate":"2025-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625002660","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/15 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Automatic localization of the picking point for table grape is the key to achieving intelligent harvesting. Aiming at the problem, the You Only Look Once version 8 nano-Deformable Convolutional Networks-Wise Intersection over Union-Fourth Detection Layer (YOLO v8n-DWF) network was designed for the table grape picking robot to realize the detection of table grapes and the localization of picking points. The Deformable Convolutional Networks (DCN) model was used to enhance the robustness of table grape detection and improve the detection precision of grape stems. To improve the detection precision of table grape and stem and reduce the impact of low-quality data on model generalization ability, Wise-Intersection over Union version 3 (WIou v3) was applied. In addition, aiming at the problem of difficultly identifying due to small target and low pixel of table grape stem, a fourth detection layer for small target detection was added to improve the recognition and detection ability of the network model for table grape stem. Further, a more accurate geometric localization method of picking points was proposed to achieve fast picking of table grapes. Finally, the results showed that the detection precision, recall, mean Average Precision50 (mAP50) and mean Average Precision50-95 (mAP50-95) of the YOLO v8n-DWF model was 97.9%, 95.3%, 97.6% and 85.4%, respectively. In addition, the success rate of the geometric method based on the results of identification for YOLO v8n-DWF was 88.24%, and the average picking success rate of table grapes in field experiments was 87.40%. It can fully meet the requirements of intelligent picking of table grapes.
鲜食葡萄采摘点的自动定位是实现智能采收的关键。针对这一问题,为鲜食葡萄采摘机器人设计了第8版纳米可变形卷积网络-联合-第四检测层智慧交集(YOLO v8n-DWF)网络,实现了鲜食葡萄的检测和采摘点的定位。采用可变形卷积网络(Deformable Convolutional Networks, DCN)模型增强了鲜食葡萄检测的鲁棒性,提高了葡萄茎的检测精度。为了提高鲜食葡萄和葡萄茎的检测精度,减少低质量数据对模型泛化能力的影响,采用Wise-Intersection over Union version 3 (WIou v3)。此外,针对鲜食葡萄茎目标小、像素低难以识别的问题,增加了小目标检测的第四检测层,提高了网络模型对鲜食葡萄茎的识别和检测能力。在此基础上,提出了一种更精确的采摘点几何定位方法,实现了鲜食葡萄的快速采摘。结果表明,YOLO v8n-DWF模型的检测精度为97.9%,召回率为95.3%,平均平均Precision50 (mAP50)为97.6%,平均平均Precision50-95 (mAP50-95)为85.4%。此外,基于YOLO v8n-DWF鉴定结果的几何方法的成功率为88.24%,田间试验的平均采摘成功率为87.40%。完全可以满足鲜食葡萄智能采摘的要求。
期刊介绍:
Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.