Instance segmentation of partially occluded Medjool-date fruit bunches for robotic thinning

IF 6.6 2区农林科学 Q1 AGRICULTURE, MULTIDISCIPLINARY Precision Agriculture Pub Date : 2023-10-30 DOI:10.1007/s11119-023-10086-w

May Regev, Avital Bechar, Yuval Cohen, Avraham Sadowsky, Sigal Berman

{"title":"Instance segmentation of partially occluded Medjool-date fruit bunches for robotic thinning","authors":"May Regev, Avital Bechar, Yuval Cohen, Avraham Sadowsky, Sigal Berman","doi":"10.1007/s11119-023-10086-w","DOIUrl":null,"url":null,"abstract":"<p>Medjool date thinning automation is essential for reducing Medjool production labor and improving fruit quality. Thinning automation requires motion planning based on feature extraction from a segmented fruit bunch and its components. Previous research with focused bunch images attained high success in bunch component segmentation but less success in establishing correct association between the two components (a rachis and spikelets) that form one bunch. The current study presents an algorithm for improved component segmentation and association in the presence of occlusions based on integrating deep neural networks, traditional methods building on bunch geometry, and active vision. Following segmentation with Mask-R-CNN, segmented component images are converted to binary images with a Savitzky–Golay filter and an adapted Otsu threshold. Bunch orientation is calculated based on lines found in the binary image with the Hough transform. The orientation is used for associating a rachis with spikelets. If a suitable rachis is not found, bunch orientation is used for selecting a better viewpoint. The method was tested with two databases of bunches in an orchard, one with focused and one with non-focused images. In all images, the spikelets were correctly identified [intersection over union (IoU) 0.5: F1 0.9]. The average orientation errors were 18.15° (SD 12.77°) and 16.44° (SD 11.07°), respectively, for the focused and non-focused databases. For correct rachis selection, precision was very high when incorporating orientation, and when additionally incorporating active vision recall (and therefore F1) was high (IoU 0.5: orientation: precision 0.94, recall 0.44, F1 0.60; addition of active vision: precision 0.96, recall 0.61, F1 0.74). The developed method leads to highly accurate identification of fruit bunches and their spikelets and rachis, making it suitable for integration with a thinning automation system.</p>","PeriodicalId":20423,"journal":{"name":"Precision Agriculture","volume":"3 1","pages":""},"PeriodicalIF":6.6000,"publicationDate":"2023-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Precision Agriculture","FirstCategoryId":"97","ListUrlMain":"https://doi.org/10.1007/s11119-023-10086-w","RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 0

Abstract

Medjool date thinning automation is essential for reducing Medjool production labor and improving fruit quality. Thinning automation requires motion planning based on feature extraction from a segmented fruit bunch and its components. Previous research with focused bunch images attained high success in bunch component segmentation but less success in establishing correct association between the two components (a rachis and spikelets) that form one bunch. The current study presents an algorithm for improved component segmentation and association in the presence of occlusions based on integrating deep neural networks, traditional methods building on bunch geometry, and active vision. Following segmentation with Mask-R-CNN, segmented component images are converted to binary images with a Savitzky–Golay filter and an adapted Otsu threshold. Bunch orientation is calculated based on lines found in the binary image with the Hough transform. The orientation is used for associating a rachis with spikelets. If a suitable rachis is not found, bunch orientation is used for selecting a better viewpoint. The method was tested with two databases of bunches in an orchard, one with focused and one with non-focused images. In all images, the spikelets were correctly identified [intersection over union (IoU) 0.5: F1 0.9]. The average orientation errors were 18.15° (SD 12.77°) and 16.44° (SD 11.07°), respectively, for the focused and non-focused databases. For correct rachis selection, precision was very high when incorporating orientation, and when additionally incorporating active vision recall (and therefore F1) was high (IoU 0.5: orientation: precision 0.94, recall 0.44, F1 0.60; addition of active vision: precision 0.96, recall 0.61, F1 0.74). The developed method leads to highly accurate identification of fruit bunches and their spikelets and rachis, making it suitable for integration with a thinning automation system.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

部分遮挡的medjol -date果束的实例分割

枸杞枣间伐自动化是减少枸杞生产劳动，提高果品品质的重要手段。细化自动化需要基于特征提取的运动规划，从一个分割果束及其组件。以往使用聚焦束图像的研究在束成分分割方面取得了较高的成功，但在建立组成束的两个成分(轴和小穗)之间的正确关联方面取得了较低的成功。本研究提出了一种基于深度神经网络、基于束几何的传统方法和主动视觉相结合的改进的遮挡下成分分割和关联算法。使用Mask-R-CNN进行分割后，使用Savitzky-Golay滤波器和自适应Otsu阈值将分割后的分量图像转换为二值图像。束的方向是基于在二值图像中用霍夫变换找到的线来计算的。朝向用于将轴与小穗联系起来。如果没有找到合适的轴，则使用束定向来选择更好的视点。该方法在一个果园的两个数据库中进行了测试，一个是聚焦图像，一个是非聚焦图像。在所有图像中，小穗被正确识别[IoU交叉比0.5:F1 0.9]。聚焦和非聚焦数据库的平均定位误差分别为18.15°(SD 12.77°)和16.44°(SD 11.07°)。对于正确的轴选择，当考虑方向时，精度非常高，当另外考虑主动视觉召回(因此F1)时，精度很高(IoU 0.5:方向:精度0.94，召回率0.44,F1 0.60;增加主动视觉:精度0.96，召回率0.61,F1 0.74)。该方法对果束及其小穗和轴进行了高度精确的鉴定，适合与间伐自动化系统集成。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Precision Agriculture 农林科学-农业综合

CiteScore

12.30

自引率

8.10%

发文量

103

审稿时长

>24 weeks

期刊介绍： Precision Agriculture promotes the most innovative results coming from the research in the field of precision agriculture. It provides an effective forum for disseminating original and fundamental research and experience in the rapidly advancing area of precision farming. There are many topics in the field of precision agriculture; therefore, the topics that are addressed include, but are not limited to: Natural Resources Variability: Soil and landscape variability, digital elevation models, soil mapping, geostatistics, geographic information systems, microclimate, weather forecasting, remote sensing, management units, scale, etc. Managing Variability: Sampling techniques, site-specific nutrient and crop protection chemical recommendation, crop quality, tillage, seed density, seed variety, yield mapping, remote sensing, record keeping systems, data interpretation and use, crops (corn, wheat, sugar beets, potatoes, peanut, cotton, vegetables, etc.), management scale, etc. Engineering Technology: Computers, positioning systems, DGPS, machinery, tillage, planting, nutrient and crop protection implements, manure, irrigation, fertigation, yield monitor and mapping, soil physical and chemical characteristic sensors, weed/pest mapping, etc. Profitability: MEY, net returns, BMPs, optimum recommendations, crop quality, technology cost, sustainability, social impacts, marketing, cooperatives, farm scale, crop type, etc. Environment: Nutrient, crop protection chemicals, sediments, leaching, runoff, practices, field, watershed, on/off farm, artificial drainage, ground water, surface water, etc. Technology Transfer: Skill needs, education, training, outreach, methods, surveys, agri-business, producers, distance education, Internet, simulations models, decision support systems, expert systems, on-farm experimentation, partnerships, quality of rural life, etc.