Artificial Intelligence in Agriculture最新文献_第3页

Comprehensive review on 3D point cloud segmentation in plants 植物三维点云分割技术综述

IF 8.2 Q1 AGRICULTURE, MULTIDISCIPLINARY

Artificial Intelligence in Agriculture

Pub Date : 2025-01-11 DOI: 10.1016/j.aiia.2025.01.006

Hongli Song , Weiliang Wen , Sheng Wu , Xinyu Guo

Segmentation of three-dimensional (3D) point clouds is fundamental in comprehending unstructured structural and morphological data. It plays a critical role in research related to plant phenomics, 3D plant modeling, and functional-structural plant modeling. Although technologies for plant point cloud segmentation (PPCS) have advanced rapidly, there has been a lack of a systematic overview of the development process. This paper presents an overview of the progress made in 3D point cloud segmentation research in plants. It starts by discussing the methods used to acquire point clouds in plants, and analyzes the impact of point cloud resolution and quality on the segmentation task. It then introduces multi-scale point cloud segmentation in plants. The paper summarizes and analyzes traditional methods for PPCS, including the global and local features. This paper discusses the progress of machine learning-based segmentation on plant point clouds through supervised, unsupervised, and integrated approaches. It also summarizes the datasets that for PPCS using deep learning-oriented methods and explains the advantages and disadvantages of deep learning-based methods for projection-based, voxel-based, and point-based approaches respectively. Finally, the development of PPCS is discussed and prospected. Deep learning methods are predicted to become dominant in the field of PPCS, and 3D point cloud segmentation would develop towards more automated with higher resolution and precision.

三维点云的分割是理解非结构化结构和形态数据的基础。它在植物表型组学、植物三维建模和植物功能结构建模等研究中发挥着至关重要的作用。尽管植物点云分割（PPCS）技术发展迅速，但一直缺乏对其发展过程的系统概述。本文综述了植物三维点云分割的研究进展。首先讨论了植物中点云的获取方法，分析了点云分辨率和质量对分割任务的影响。然后引入植物的多尺度点云分割。总结和分析了传统的PPCS方法，包括全局特征和局部特征。本文讨论了基于机器学习的植物点云分割的进展，包括有监督、无监督和集成方法。总结了基于深度学习的PPCS方法的数据集，并分别解释了基于投影、基于体素和基于点的深度学习方法的优缺点。最后，对PPCS的发展进行了讨论和展望。预计深度学习方法将在PPCS领域占据主导地位，3D点云分割将朝着更高分辨率和精度的自动化方向发展。

{"title":"Comprehensive review on 3D point cloud segmentation in plants","authors":"Hongli Song , Weiliang Wen , Sheng Wu , Xinyu Guo","doi":"10.1016/j.aiia.2025.01.006","DOIUrl":"10.1016/j.aiia.2025.01.006","url":null,"abstract":"<div><div>Segmentation of three-dimensional (3D) point clouds is fundamental in comprehending unstructured structural and morphological data. It plays a critical role in research related to plant phenomics, 3D plant modeling, and functional-structural plant modeling. Although technologies for plant point cloud segmentation (PPCS) have advanced rapidly, there has been a lack of a systematic overview of the development process. This paper presents an overview of the progress made in 3D point cloud segmentation research in plants. It starts by discussing the methods used to acquire point clouds in plants, and analyzes the impact of point cloud resolution and quality on the segmentation task. It then introduces multi-scale point cloud segmentation in plants. The paper summarizes and analyzes traditional methods for PPCS, including the global and local features. This paper discusses the progress of machine learning-based segmentation on plant point clouds through supervised, unsupervised, and integrated approaches. It also summarizes the datasets that for PPCS using deep learning-oriented methods and explains the advantages and disadvantages of deep learning-based methods for projection-based, voxel-based, and point-based approaches respectively. Finally, the development of PPCS is discussed and prospected. Deep learning methods are predicted to become dominant in the field of PPCS, and 3D point cloud segmentation would develop towards more automated with higher resolution and precision.</div></div>","PeriodicalId":52814,"journal":{"name":"Artificial Intelligence in Agriculture","volume":"15 2","pages":"Pages 296-315"},"PeriodicalIF":8.2,"publicationDate":"2025-01-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143704964","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

High-throughput phenotyping techniques for forage: Status, bottleneck, and challenges

IF 8.2 Q1 AGRICULTURE, MULTIDISCIPLINARY

Artificial Intelligence in Agriculture

Pub Date : 2025-01-10 DOI: 10.1016/j.aiia.2025.01.003

Tao Cheng , Dongyan Zhang , Gan Zhang , Tianyi Wang , Weibo Ren , Feng Yuan , Yaling Liu , Zhaoming Wang , Chunjiang Zhao

High-throughput phenotyping (HTP) technology is now a significant bottleneck in the efficient selection and breeding of superior forage genetic resources. To better understand the status of forage phenotyping research and identify key directions for development, this review summarizes advances in HTP technology for forage phenotypic analysis over the past ten years. This paper reviews the unique aspects and research priorities in forage phenotypic monitoring, highlights key remote sensing platforms, examines the applications of advanced sensing technology for quantifying phenotypic traits, explores artificial intelligence (AI) algorithms in phenotypic data integration and analysis, and assesses recent progress in phenotypic genomics. The practical applications of HTP technology in forage remain constrained by several challenges. These include establishing uniform data collection standards, designing effective algorithms to handle complex genetic and environmental interactions, deepening the cross-exploration of phenomics-genomics, solving the problem of pathological inversion of forage phenotypic growth monitoring models, and developing low-cost forage phenotypic equipment. Resolving these challenges will unlock the full potential of HTP, enabling precise identification of superior forage traits, accelerating the breeding of superior varieties, and ultimately improving forage yield.

高通量表型（HTP）技术是目前牧草优良遗传资源高效选择和育种的重要瓶颈。为了更好地了解牧草表型研究的现状和确定重点发展方向，本文对近十年来HTP技术用于牧草表型分析的进展进行了综述。本文综述了牧草表型监测的独特方面和研究重点，重点介绍了关键的遥感平台，研究了先进的传感技术在表型性状量化方面的应用，探讨了人工智能（AI）算法在表型数据集成和分析方面的应用，并评估了表型基因组学的最新进展。HTP技术在饲料中的实际应用仍然受到一些挑战的制约。其中包括建立统一的数据收集标准，设计有效的算法来处理复杂的遗传和环境相互作用，深化表型学与基因组学的交叉探索，解决饲料表型生长监测模型的病理反转问题，以及开发低成本的饲料表型设备。解决这些挑战将释放HTP的全部潜力，能够精确识别优质饲料性状，加速优良品种的育种，最终提高饲料产量。

{"title":"High-throughput phenotyping techniques for forage: Status, bottleneck, and challenges","authors":"Tao Cheng , Dongyan Zhang , Gan Zhang , Tianyi Wang , Weibo Ren , Feng Yuan , Yaling Liu , Zhaoming Wang , Chunjiang Zhao","doi":"10.1016/j.aiia.2025.01.003","DOIUrl":"10.1016/j.aiia.2025.01.003","url":null,"abstract":"<div><div>High-throughput phenotyping (HTP) technology is now a significant bottleneck in the efficient selection and breeding of superior forage genetic resources. To better understand the status of forage phenotyping research and identify key directions for development, this review summarizes advances in HTP technology for forage phenotypic analysis over the past ten years. This paper reviews the unique aspects and research priorities in forage phenotypic monitoring, highlights key remote sensing platforms, examines the applications of advanced sensing technology for quantifying phenotypic traits, explores artificial intelligence (AI) algorithms in phenotypic data integration and analysis, and assesses recent progress in phenotypic genomics. The practical applications of HTP technology in forage remain constrained by several challenges. These include establishing uniform data collection standards, designing effective algorithms to handle complex genetic and environmental interactions, deepening the cross-exploration of phenomics-genomics, solving the problem of pathological inversion of forage phenotypic growth monitoring models, and developing low-cost forage phenotypic equipment. Resolving these challenges will unlock the full potential of HTP, enabling precise identification of superior forage traits, accelerating the breeding of superior varieties, and ultimately improving forage yield.</div></div>","PeriodicalId":52814,"journal":{"name":"Artificial Intelligence in Agriculture","volume":"15 1","pages":"Pages 98-115"},"PeriodicalIF":8.2,"publicationDate":"2025-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143097558","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Crop-conditional semantic segmentation for efficient agricultural disease assessment

IF 8.2 Q1 AGRICULTURE, MULTIDISCIPLINARY

Artificial Intelligence in Agriculture

Pub Date : 2025-01-10 DOI: 10.1016/j.aiia.2025.01.002

Artzai Picon , Itziar Eguskiza , Pablo Galan , Laura Gomez-Zamanillo , Javier Romero , Christian Klukas , Arantza Bereciartua-Perez , Mike Scharner , Ramon Navarra-Mestre

In this study, we introduced an innovative crop-conditional semantic segmentation architecture that seamlessly incorporates contextual metadata (crop information). This is achieved by merging the contextual information at a late layer stage, allowing the method to be integrated with any semantic segmentation architecture, including novel ones. To evaluate the effectiveness of this approach, we curated a challenging dataset of over 100,000 images captured in real-field conditions using mobile phones. This dataset includes various disease stages across 21 diseases and seven crops (wheat, barley, corn, rice, rape-seed, vinegrape, and cucumber), with the added complexity of multiple diseases coexisting in a single image. We demonstrate that incorporating contextual multi-crop information significantly enhances the performance of semantic segmentation models for plant disease detection. By leveraging crop-specific metadata, our approach achieves higher accuracy and better generalization across diverse crops (F1 = 0.68, r = 0.75) compared to traditional methods (F1 = 0.24, r = 0.68). Additionally, the adoption of a semi-supervised approach based on pseudo-labeling of single diseased plants, offers significant advantages for plant disease segmentation and quantification (F1 = 0.73, r = 0.95). This method enhances the model's performance by leveraging both labeled and unlabeled data, reducing the dependency on extensive manual annotations, which are often time-consuming and costly.

The deployment of this algorithm holds the potential to revolutionize the digitization of crop protection product testing, ensuring heightened repeatability while minimizing human subjectivity. By addressing the challenges of semantic segmentation and disease quantification, we contribute to more effective and precise phenotyping, ultimately supporting better crop management and protection strategies.

在这项研究中，我们引入了一种创新的作物条件语义分割架构，该架构无缝地结合了上下文元数据（作物信息）。这是通过在后期阶段合并上下文信息来实现的，允许该方法与任何语义分割体系结构集成，包括新的。为了评估这种方法的有效性，我们策划了一个具有挑战性的数据集，其中包括使用手机在实际条件下拍摄的100,000多张图像。该数据集包括21种疾病和7种作物（小麦、大麦、玉米、水稻、油菜籽、葡萄和黄瓜）的不同疾病阶段，并且在单个图像中同时存在多种疾病的复杂性。我们证明，结合上下文多作物信息显着提高了植物病害检测的语义分割模型的性能。通过利用特定作物的元数据，与传统方法（F1 = 0.24, r = 0.68）相比，我们的方法在不同作物之间实现了更高的精度和更好的泛化（F1 = 0.68, r = 0.75）。此外，采用基于单株病株伪标记的半监督方法，对植物病害的分割和定量具有显著优势（F1 = 0.73, r = 0.95）。该方法通过利用标记和未标记的数据来增强模型的性能，减少了对大量手工注释的依赖，而手工注释通常既耗时又昂贵。该算法的部署有可能彻底改变作物保护产品测试的数字化，确保提高可重复性，同时最大限度地减少人类的主观性。通过解决语义分割和疾病量化的挑战，我们有助于更有效和精确的表型分析，最终支持更好的作物管理和保护策略。

{"title":"Crop-conditional semantic segmentation for efficient agricultural disease assessment","authors":"Artzai Picon , Itziar Eguskiza , Pablo Galan , Laura Gomez-Zamanillo , Javier Romero , Christian Klukas , Arantza Bereciartua-Perez , Mike Scharner , Ramon Navarra-Mestre","doi":"10.1016/j.aiia.2025.01.002","DOIUrl":"10.1016/j.aiia.2025.01.002","url":null,"abstract":"<div><div>In this study, we introduced an innovative crop-conditional semantic segmentation architecture that seamlessly incorporates contextual metadata (crop information). This is achieved by merging the contextual information at a late layer stage, allowing the method to be integrated with any semantic segmentation architecture, including novel ones. To evaluate the effectiveness of this approach, we curated a challenging dataset of over 100,000 images captured in real-field conditions using mobile phones. This dataset includes various disease stages across 21 diseases and seven crops (wheat, barley, corn, rice, rape-seed, vinegrape, and cucumber), with the added complexity of multiple diseases coexisting in a single image. We demonstrate that incorporating contextual multi-crop information significantly enhances the performance of semantic segmentation models for plant disease detection. By leveraging crop-specific metadata, our approach achieves higher accuracy and better generalization across diverse crops (F1 = 0.68, <em>r</em> = 0.75) compared to traditional methods (F1 = 0.24, <em>r</em> = 0.68). Additionally, the adoption of a semi-supervised approach based on pseudo-labeling of single diseased plants, offers significant advantages for plant disease segmentation and quantification (F1 = 0.73, <em>r</em> = 0.95). This method enhances the model's performance by leveraging both labeled and unlabeled data, reducing the dependency on extensive manual annotations, which are often time-consuming and costly.</div><div>The deployment of this algorithm holds the potential to revolutionize the digitization of crop protection product testing, ensuring heightened repeatability while minimizing human subjectivity. By addressing the challenges of semantic segmentation and disease quantification, we contribute to more effective and precise phenotyping, ultimately supporting better crop management and protection strategies.</div></div>","PeriodicalId":52814,"journal":{"name":"Artificial Intelligence in Agriculture","volume":"15 1","pages":"Pages 79-87"},"PeriodicalIF":8.2,"publicationDate":"2025-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143097561","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Knowledge-guided temperature correction method for soluble solids content detection of watermelon based on Vis/NIR spectroscopy

IF 8.2 Q1 AGRICULTURE, MULTIDISCIPLINARY

Artificial Intelligence in Agriculture

Pub Date : 2025-01-09 DOI: 10.1016/j.aiia.2025.01.004

Zhizhong Sun , Jie Yang , Yang Yao , Dong Hu , Yibin Ying , Junxian Guo , Lijuan Xie

Visible/near-infrared (Vis/NIR) spectroscopy technology has been extensively utilized for the determination of soluble solids content (SSC) in fruits. Nonetheless, the spectral distortion resulting from temperature variations in the sample leads to a decrease in detection accuracy. To mitigate the influence of temperature fluctuations on the accuracy of SSC detection in fruits, using watermelon as an example, this study presents a knowledge-guided temperature correction method utilizing one-dimensional convolutional neural networks (1D-CNN). This method consists of two stages: the first stage involves utilizing 1D-CNN models and gradient-weighted class activation mapping (Grad-CAM) method to acquire gradient-weighted features correlating with temperature. The second stage involves mapping these features and integrating them with the original Vis/NIR spectrum, and then train and test the partial least squares (PLS) model. This knowledge-guided method can identify wavelength bands with high temperature correlation in the Vis/NIR spectra, offering valuable guidance for spectral data processing. The performance of the PLS model constructed using the 15 °C spectrum guided by this method is superior to that of the global model, and can reduce the root mean square error of the prediction set (RMSEP) to 0.324°Brix, which is 32.5 % lower than the RMSEP of the global model (0.480°Brix). The method proposed in this study has superior temperature correction effects than slope and bias correction, piecewise direct standardization, and external parameter orthogonalization correction methods. The results indicate that the knowledge-guided temperature correction method based on deep learning can significantly enhance the detection accuracy of SSC in watermelon, providing valuable reference for the development of PLS calibration methods.

可见/近红外（Vis/NIR）光谱技术已广泛应用于水果中可溶性固形物含量的测定。尽管如此，样品中温度变化引起的光谱失真导致检测精度降低。为了减轻温度波动对水果中SSC检测精度的影响，本研究以西瓜为例，提出了一种基于一维卷积神经网络（1D-CNN）的知识引导温度校正方法。该方法分为两个阶段：第一阶段利用1D-CNN模型和梯度加权类激活映射（gradient-weighted class activation mapping, Grad-CAM）方法获取与温度相关的梯度加权特征。第二阶段涉及映射这些特征并将它们与原始的Vis/NIR光谱进行整合，然后训练和测试偏最小二乘（PLS）模型。这种知识引导方法可以识别出可见光/近红外光谱中具有高温相关的波段，为光谱数据处理提供了有价值的指导。利用该方法指导的15°C光谱构建的PLS模型的性能优于全局模型，预测集的均方根误差（RMSEP）降至0.324°Brix，比全局模型的RMSEP（0.480°Brix）低32.5%。该方法的温度校正效果优于斜率和偏置校正、分段直接标准化和外部参数正交化校正方法。结果表明，基于深度学习的知识引导温度校正方法可以显著提高西瓜中SSC的检测精度，为PLS校正方法的开发提供了有价值的参考。

{"title":"Knowledge-guided temperature correction method for soluble solids content detection of watermelon based on Vis/NIR spectroscopy","authors":"Zhizhong Sun , Jie Yang , Yang Yao , Dong Hu , Yibin Ying , Junxian Guo , Lijuan Xie","doi":"10.1016/j.aiia.2025.01.004","DOIUrl":"10.1016/j.aiia.2025.01.004","url":null,"abstract":"<div><div>Visible/near-infrared (Vis/NIR) spectroscopy technology has been extensively utilized for the determination of soluble solids content (SSC) in fruits. Nonetheless, the spectral distortion resulting from temperature variations in the sample leads to a decrease in detection accuracy. To mitigate the influence of temperature fluctuations on the accuracy of SSC detection in fruits, using watermelon as an example, this study presents a knowledge-guided temperature correction method utilizing one-dimensional convolutional neural networks (1D-CNN). This method consists of two stages: the first stage involves utilizing 1D-CNN models and gradient-weighted class activation mapping (Grad-CAM) method to acquire gradient-weighted features correlating with temperature. The second stage involves mapping these features and integrating them with the original Vis/NIR spectrum, and then train and test the partial least squares (PLS) model. This knowledge-guided method can identify wavelength bands with high temperature correlation in the Vis/NIR spectra, offering valuable guidance for spectral data processing. The performance of the PLS model constructed using the 15 °C spectrum guided by this method is superior to that of the global model, and can reduce the root mean square error of the prediction set (RMSEP) to 0.324°Brix, which is 32.5 % lower than the RMSEP of the global model (0.480°Brix). The method proposed in this study has superior temperature correction effects than slope and bias correction, piecewise direct standardization, and external parameter orthogonalization correction methods. The results indicate that the knowledge-guided temperature correction method based on deep learning can significantly enhance the detection accuracy of SSC in watermelon, providing valuable reference for the development of PLS calibration methods.</div></div>","PeriodicalId":52814,"journal":{"name":"Artificial Intelligence in Agriculture","volume":"15 1","pages":"Pages 88-97"},"PeriodicalIF":8.2,"publicationDate":"2025-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143097560","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Enhancing citrus surface defects detection: A priori feature guided semantic segmentation model

IF 8.2 Q1 AGRICULTURE, MULTIDISCIPLINARY

Artificial Intelligence in Agriculture

Pub Date : 2025-01-09 DOI: 10.1016/j.aiia.2025.01.005

Xufeng Xu , Tao Xu , Zichao Wei , Zetong Li , Yafei Wang , Xiuqin Rao

The accurate detection of citrus surface defects is of great importance for elevating the product quality and augmenting its market value. However, due to defect diversity and complexity, existing methods focused on parameter and data enhancement have limitations in detection and segmentation. Therefore, this study proposed a citrus surface defect segmentation model guided by prior features, named PrioriFormer. The model extracted texture features, boundary features, and superpixel features that were crucial for defect detection and segmentation, as priori features. A Priori Feature Fusion Module (PFFM) was designed to integrate the priori features, thereby establishing a priori feature branch. Then the priori feature branch was integrated into the baseline model SegFormer, with the objective of enhancing key feature learning capacity of the model. Finally, the effectiveness of the priori features in enhancing the performance of the model was demonstrated through the implementation of specific experiments. The result showed that PrioriFormer achieved an mPA (mean Pixel Accuracy), mIoU (mean Intersection over Union), and Dice Coefficient of 91.0 %, 85.8 %, and 91.0 %, respectively. Compared to other semantic segmentation models, the proposed model has achieved the best performance. The model parameters of PrioriFormer have only increase by 2.7 % in comparison to the baseline model, while the mIoU has improved by 3.3 %, indicating that the improvement of segmentation performance had less dependence on model parameters. Even when trained on few data, PrioriFormer maintained the high segmentation performance, with the reduction of mIoU not exceeding 4.2 %. This demonstrated the strong feature learning ability of the model in scenarios with limited data. Furthermore, validation on external datasets confirmed PrioriFormer's superior performance and adaptability to different tasks. The study found that the proposed PrioriFomer guided by priori features can effectively enhance the accuracy of the citrus surface defect segmentation model, providing technical reference for citrus sorting and quality assessment.

柑橘表面缺陷的准确检测对提高柑橘产品质量和增加柑橘市场价值具有重要意义。然而，由于缺陷的多样性和复杂性，现有的方法侧重于参数和数据增强，在检测和分割方面存在局限性。因此，本研究提出了一种基于先验特征的柑橘表面缺陷分割模型，命名为priority former。该模型提取了对缺陷检测和分割至关重要的纹理特征、边界特征和超像素特征作为先验特征。设计先验特征融合模块（PFFM）对先验特征进行融合，建立先验特征分支。然后将先验特征分支集成到基线模型SegFormer中，以增强模型的关键特征学习能力。最后，通过具体实验的实施，验证了先验特征在增强模型性能方面的有效性。结果表明，该算法的mPA（平均像素精度）、mIoU（平均交叉比并）和Dice系数分别达到91.0%、85.8%和91.0%。与其他语义分割模型相比，该模型取得了最好的性能。与基线模型相比，preferformer的模型参数仅提高了2.7%，而mIoU的模型参数提高了3.3%，说明分割性能的提高对模型参数的依赖性较小。即使在少量数据上进行训练，priority former也保持了较高的分割性能，mIoU的降低不超过4.2%。这证明了该模型在有限数据场景下具有较强的特征学习能力。此外，在外部数据集上的验证证实了priority former优越的性能和对不同任务的适应性。研究发现，本文提出的PrioriFomer以先验特征为指导，可以有效提高柑橘表面缺陷分割模型的准确性，为柑橘分选和质量评价提供技术参考。

{"title":"Enhancing citrus surface defects detection: A priori feature guided semantic segmentation model","authors":"Xufeng Xu , Tao Xu , Zichao Wei , Zetong Li , Yafei Wang , Xiuqin Rao","doi":"10.1016/j.aiia.2025.01.005","DOIUrl":"10.1016/j.aiia.2025.01.005","url":null,"abstract":"<div><div>The accurate detection of citrus surface defects is of great importance for elevating the product quality and augmenting its market value. However, due to defect diversity and complexity, existing methods focused on parameter and data enhancement have limitations in detection and segmentation. Therefore, this study proposed a citrus surface defect segmentation model guided by prior features, named PrioriFormer. The model extracted texture features, boundary features, and superpixel features that were crucial for defect detection and segmentation, as priori features. A Priori Feature Fusion Module (PFFM) was designed to integrate the priori features, thereby establishing a priori feature branch. Then the priori feature branch was integrated into the baseline model SegFormer, with the objective of enhancing key feature learning capacity of the model. Finally, the effectiveness of the priori features in enhancing the performance of the model was demonstrated through the implementation of specific experiments. The result showed that PrioriFormer achieved an mPA (mean Pixel Accuracy), mIoU (mean Intersection over Union), and Dice Coefficient of 91.0 %, 85.8 %, and 91.0 %, respectively. Compared to other semantic segmentation models, the proposed model has achieved the best performance. The model parameters of PrioriFormer have only increase by 2.7 % in comparison to the baseline model, while the mIoU has improved by 3.3 %, indicating that the improvement of segmentation performance had less dependence on model parameters. Even when trained on few data, PrioriFormer maintained the high segmentation performance, with the reduction of mIoU not exceeding 4.2 %. This demonstrated the strong feature learning ability of the model in scenarios with limited data. Furthermore, validation on external datasets confirmed PrioriFormer's superior performance and adaptability to different tasks. The study found that the proposed PrioriFomer guided by priori features can effectively enhance the accuracy of the citrus surface defect segmentation model, providing technical reference for citrus sorting and quality assessment.</div></div>","PeriodicalId":52814,"journal":{"name":"Artificial Intelligence in Agriculture","volume":"15 1","pages":"Pages 67-78"},"PeriodicalIF":8.2,"publicationDate":"2025-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143097562","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

PAB-Mamba-YOLO: VSSM assists in YOLO for aggressive behavior detection among weaned piglets

IF 8.2 Q1 AGRICULTURE, MULTIDISCIPLINARY

Artificial Intelligence in Agriculture

Pub Date : 2025-01-06 DOI: 10.1016/j.aiia.2025.01.001

Xue Xia , Ning Zhang , Zhibin Guan , Xin Chai , Shixin Ma , Xiujuan Chai , Tan Sun

Aggressive behavior among piglets is considered a harmful social contact. Monitoring weaned piglets with intense aggressive behaviors is paramount for pig breeding management. This study introduced a novel hybrid model, PAB-Mamba-YOLO, integrating the principles of Mamba and YOLO for efficient visual detection of weaned piglets' aggressive behaviors, including climbing body, nose hitting, biting tail and biting ear. Within the proposed model, a novel CSPVSS module, which integrated the Cross Stage Partial (CSP) structure with the Visual State Space Model (VSSM), has been developed. This module was adeptly integrated into the Neck part of the network, where it harnessed convolutional capabilities for local feature extraction and leveraged the visual state space to reveal long-distance dependencies. The model exhibited sound performance in detecting aggressive behaviors, with an average precision (AP) of 0.976 for climbing body, 0.994 for nose hitting, 0.977 for biting tail and 0.994 for biting ear. The mean average precision (mAP) of 0.985 reflected the model's overall effectiveness in detecting all classes of aggressive behaviors. The model achieved a detection speed FPS of 69 f/s, with model complexity measured by 7.2 G floating-point operations (GFLOPs) and parameters (Params) of 2.63 million. Comparative experiments with existing prevailing models confirmed the superiority of the proposed model. This work is expected to contribute a glimmer of fresh ideas and inspiration to the research field of precision breeding and behavioral analysis of animals.

小猪之间的攻击性行为被认为是一种有害的社会接触。监测具有强烈攻击行为的断奶仔猪对猪的育种管理至关重要。本研究结合曼巴和YOLO的原理，建立了一种新的杂交模型，即PAB-Mamba-YOLO，用于对断奶仔猪爬身、撞鼻、咬尾、咬耳等攻击行为进行高效的视觉检测。在提出的模型中，开发了一种新的CSPVSS模块，该模块将跨阶段部分（CSP）结构与视觉状态空间模型（VSSM）相结合。该模块被巧妙地集成到网络的颈部部分，在那里它利用卷积功能进行局部特征提取，并利用视觉状态空间来显示远程依赖关系。该模型检测攻击行为的平均精度（AP）分别为0.976、0.994、0.977和0.994。平均平均精度（mAP）为0.985，反映了该模型在检测各类攻击行为方面的总体有效性。该模型的检测速度FPS为69 f/s，通过7.2 G浮点运算（GFLOPs）和参数（Params）测量模型复杂度为263万。与现有主流模型的对比实验证实了所提模型的优越性。该工作有望为动物精密育种和行为分析研究领域提供一丝新的思路和灵感。

{"title":"PAB-Mamba-YOLO: VSSM assists in YOLO for aggressive behavior detection among weaned piglets","authors":"Xue Xia , Ning Zhang , Zhibin Guan , Xin Chai , Shixin Ma , Xiujuan Chai , Tan Sun","doi":"10.1016/j.aiia.2025.01.001","DOIUrl":"10.1016/j.aiia.2025.01.001","url":null,"abstract":"<div><div>Aggressive behavior among piglets is considered a harmful social contact. Monitoring weaned piglets with intense aggressive behaviors is paramount for pig breeding management. This study introduced a novel hybrid model, PAB-Mamba-YOLO, integrating the principles of Mamba and YOLO for efficient visual detection of weaned piglets' aggressive behaviors, including climbing body, nose hitting, biting tail and biting ear. Within the proposed model, a novel CSPVSS module, which integrated the Cross Stage Partial (CSP) structure with the Visual State Space Model (VSSM), has been developed. This module was adeptly integrated into the Neck part of the network, where it harnessed convolutional capabilities for local feature extraction and leveraged the visual state space to reveal long-distance dependencies. The model exhibited sound performance in detecting aggressive behaviors, with an average precision (AP) of 0.976 for climbing body, 0.994 for nose hitting, 0.977 for biting tail and 0.994 for biting ear. The mean average precision (mAP) of 0.985 reflected the model's overall effectiveness in detecting all classes of aggressive behaviors. The model achieved a detection speed FPS of 69 f/s, with model complexity measured by 7.2 G floating-point operations (GFLOPs) and parameters (Params) of 2.63 million. Comparative experiments with existing prevailing models confirmed the superiority of the proposed model. This work is expected to contribute a glimmer of fresh ideas and inspiration to the research field of precision breeding and behavioral analysis of animals.</div></div>","PeriodicalId":52814,"journal":{"name":"Artificial Intelligence in Agriculture","volume":"15 1","pages":"Pages 52-66"},"PeriodicalIF":8.2,"publicationDate":"2025-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143097559","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A bibliometric analysis of research on remote sensing-based monitoring of soil organic matter conducted between 2003 and 2023 2003 - 2023年土壤有机质遥感监测研究的文献计量学分析

IF 8.2 Q1 AGRICULTURE, MULTIDISCIPLINARY

Artificial Intelligence in Agriculture

Pub Date : 2025-01-03 DOI: 10.1016/j.aiia.2024.12.004

Xionghai Chen , Fei Yuan , Syed Tahir Ata-Ul-Karim , Xiaojun Liu , Yongchao Tian , Yan Zhu , Weixing Cao , Qiang Cao

Soil organic matter (SOM) is a key metric for assessing soil quality and crop yield potential. It plays a vital role in maintaining the ecological balance environment and promoting sustainable farming practices. This review examines the evolving trends in remote sensing (RS)-based SOM monitoring by analyzing 739 scholarly publications from the Web of Science database from 2003 to 2023 using a bibliometric approach. The study reveals that research on RS-based SOM monitoring has entered a rapid growth phase since 2018, with China and the United States as the main contributors and an extensive international cooperation network. In model construction, high frequency covariates such as soil pH, precipitation, temperature, and topography significantly improved the prediction accuracy. Data preprocessing methods such as Standard Normal Variables (SNV), Principal Component Analysis (PCA), and Multiple Scattering Correction (MSC) enhanced data consistency. Traditional statistical models are gradually being replaced by nonlinear machine learning and deep learning methods (CNN, XGBoost, andStacking), which are particularly good at handling complex high-dimensional data. Regional spectral libraries (OzSoil and AfSIS) excel in local accuracy, while global spectral libraries (ISRIC and LUCAS) are more suitable for cross-region modeling, and the migration learning technique effectively improves the model generalization ability in low data regions. Integrated models (CNN-LSTM and GAN) have significant advantages in capturing the spatial and temporal dynamics of SOMs, and uncertainty quantification methods (Bayesian inference, Monte Carlo simulation) enhance the reliability of the models in multi-source data and data-scarce scenarios. Future research should focus on further optimization of multi-source data fusion and uncertainty quantification to promote the development and applicability of RS-based SOM monitoring techniques for precision soil management and sustainable agriculture.

土壤有机质（SOM）是评价土壤质量和作物产量潜力的重要指标。它在维持生态平衡环境和促进可持续耕作方式方面发挥着至关重要的作用。本文采用文献计量学方法，分析了2003 - 2023年Web of Science数据库中739篇学术论文，探讨了基于遥感（RS）的SOM监测的发展趋势。研究表明，自2018年以来，基于rs的SOM监测研究进入快速增长阶段，以中美两国为主要贡献者，形成了广泛的国际合作网络。在模型构建中，土壤pH、降水、温度、地形等高频协变量显著提高了预测精度。采用标准正态变量（Standard Normal Variables， SNV）、主成分分析（Principal Component Analysis， PCA）和多重散射校正（Multiple Scattering Correction， MSC）等数据预处理方法增强了数据的一致性。传统的统计模型正逐渐被非线性机器学习和深度学习方法（CNN、XGBoost和stacking）所取代，这些方法特别擅长处理复杂的高维数据。区域谱库（OzSoil和AfSIS）具有较好的局部精度，而全局谱库（ISRIC和LUCAS）更适合跨区域建模，迁移学习技术有效提高了低数据区的模型泛化能力。集成模型（CNN-LSTM和GAN）在捕获SOMs时空动态方面具有显著优势，不确定性量化方法（贝叶斯推理、蒙特卡罗模拟）增强了模型在多源数据和数据稀缺场景下的可靠性。未来的研究应进一步优化多源数据融合和不确定性量化，以促进基于rs的SOM监测技术在土壤精准管理和可持续农业中的发展和应用。

{"title":"A bibliometric analysis of research on remote sensing-based monitoring of soil organic matter conducted between 2003 and 2023","authors":"Xionghai Chen , Fei Yuan , Syed Tahir Ata-Ul-Karim , Xiaojun Liu , Yongchao Tian , Yan Zhu , Weixing Cao , Qiang Cao","doi":"10.1016/j.aiia.2024.12.004","DOIUrl":"10.1016/j.aiia.2024.12.004","url":null,"abstract":"<div><div>Soil organic matter (SOM) is a key metric for assessing soil quality and crop yield potential. It plays a vital role in maintaining the ecological balance environment and promoting sustainable farming practices. This review examines the evolving trends in remote sensing (<em>RS</em>)-based SOM monitoring by analyzing 739 scholarly publications from the Web of Science database from 2003 to 2023 using a bibliometric approach. The study reveals that research on RS-based SOM monitoring has entered a rapid growth phase since 2018, with China and the United States as the main contributors and an extensive international cooperation network. In model construction, high frequency covariates such as soil pH, precipitation, temperature, and topography significantly improved the prediction accuracy. Data preprocessing methods such as Standard Normal Variables (SNV), Principal Component Analysis (PCA), and Multiple Scattering Correction (MSC) enhanced data consistency. Traditional statistical models are gradually being replaced by nonlinear machine learning and deep learning methods (CNN, XGBoost, andStacking), which are particularly good at handling complex high-dimensional data. Regional spectral libraries (OzSoil and AfSIS) excel in local accuracy, while global spectral libraries (ISRIC and LUCAS) are more suitable for cross-region modeling, and the migration learning technique effectively improves the model generalization ability in low data regions. Integrated models (CNN-LSTM and GAN) have significant advantages in capturing the spatial and temporal dynamics of SOMs, and uncertainty quantification methods (Bayesian inference, Monte Carlo simulation) enhance the reliability of the models in multi-source data and data-scarce scenarios. Future research should focus on further optimization of multi-source data fusion and uncertainty quantification to promote the development and applicability of RS-based SOM monitoring techniques for precision soil management and sustainable agriculture.</div></div>","PeriodicalId":52814,"journal":{"name":"Artificial Intelligence in Agriculture","volume":"15 1","pages":"Pages 26-38"},"PeriodicalIF":8.2,"publicationDate":"2025-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143149330","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Normalized difference vegetation index prediction using reservoir computing and pretrained language models

IF 8.2 Q1 AGRICULTURE, MULTIDISCIPLINARY

Artificial Intelligence in Agriculture

Pub Date : 2025-01-02 DOI: 10.1016/j.aiia.2024.12.005

John Olamofe , Ram Ray , Xishuang Dong , Lijun Qian

In this study, we examined plant health prediction through the Normalized Difference Vegetation Index (NDVI) calculated from satellite image derived reflectance values in the near-infrared and red spectra. The problem is formulated as a temporal data prediction problem. Using MODIS/Terra Vegetation Indices 16-Day L3 Global 250 m SIN Grid V061 dataset, we designed and implemented Reservoir Computing (RC) models and transformer-based models including pretrained language model, and compared the prediction performance of these models to traditional machine learning and deep learning methods such as Nonlinear Regression, Decision Tree, Convolutional Neural Network (CNN), Long Short-Term Memory (LSTM) network, and DLinear. It is observed that the DLinear/LSTM model showed exceptional predictive accuracy, while the pretrained RC model significantly enhanced traditional RC model forecasts. Additionally, Frozen Pretrained Transformer (FPT), a pretrained language model, showed superior performance in predicting specific NDVI values (most often peak or lowest NDVI), suggesting its effectiveness in precise temporal predictions. Furthermore, transformer-based models, specifically PatchTST and FPT, demonstrated substantial mean squared error reductions, particularly in limited data scenarios (1 %, 5 %, 15 % and 50 % sample sizes), indicating their robustness in precise NDVI temporal predictions when data is limited. The findings in this study demonstrated the effectiveness of emerging machine learning techniques such as reservoir computing and pretrained language model for remote sensing and their contributions in precision agriculture.

在这项研究中，我们通过卫星图像近红外和红色光谱的反射率值计算的归一化植被指数（NDVI）来检验植物健康预测。该问题被表述为一个时间数据预测问题。利用MODIS/Terra植被指数16天L3全球250 m SIN网格V061数据集，设计并实现了水库计算（RC）模型和基于变压器的预训练语言模型，并将这些模型的预测性能与传统的机器学习和深度学习方法（如非线性回归、决策树、卷积神经网络（CNN）、长短期记忆（LSTM）网络和DLinear）进行了比较。结果表明，DLinear/LSTM模型具有较好的预测精度，而预训练后的RC模型显著提高了传统RC模型的预测精度。此外，Frozen Pretrained Transformer （FPT）是一种预训练语言模型，在预测特定的NDVI值（通常是峰值或最低NDVI）方面表现优异，表明其在精确时间预测方面的有效性。此外，基于变压器的模型，特别是PatchTST和FPT，显示出显著的均方误差降低，特别是在有限的数据场景下（1%、5%、15%和50%的样本量），表明它们在数据有限时精确的NDVI时间预测中的鲁棒性。本研究的发现证明了水库计算和遥感预训练语言模型等新兴机器学习技术的有效性及其在精准农业中的贡献。

{"title":"Normalized difference vegetation index prediction using reservoir computing and pretrained language models","authors":"John Olamofe , Ram Ray , Xishuang Dong , Lijun Qian","doi":"10.1016/j.aiia.2024.12.005","DOIUrl":"10.1016/j.aiia.2024.12.005","url":null,"abstract":"<div><div>In this study, we examined plant health prediction through the Normalized Difference Vegetation Index (NDVI) calculated from satellite image derived reflectance values in the near-infrared and red spectra. The problem is formulated as a temporal data prediction problem. Using MODIS/Terra Vegetation Indices 16-Day L3 Global 250 m SIN Grid V061 dataset, we designed and implemented Reservoir Computing (RC) models and transformer-based models including pretrained language model, and compared the prediction performance of these models to traditional machine learning and deep learning methods such as Nonlinear Regression, Decision Tree, Convolutional Neural Network (CNN), Long Short-Term Memory (LSTM) network, and DLinear. It is observed that the DLinear/LSTM model showed exceptional predictive accuracy, while the pretrained RC model significantly enhanced traditional RC model forecasts. Additionally, Frozen Pretrained Transformer (FPT), a pretrained language model, showed superior performance in predicting specific NDVI values (most often peak or lowest NDVI), suggesting its effectiveness in precise temporal predictions. Furthermore, transformer-based models, specifically PatchTST and FPT, demonstrated substantial mean squared error reductions, particularly in limited data scenarios (1 %, 5 %, 15 % and 50 % sample sizes), indicating their robustness in precise NDVI temporal predictions when data is limited. The findings in this study demonstrated the effectiveness of emerging machine learning techniques such as reservoir computing and pretrained language model for remote sensing and their contributions in precision agriculture.</div></div>","PeriodicalId":52814,"journal":{"name":"Artificial Intelligence in Agriculture","volume":"15 1","pages":"Pages 116-129"},"PeriodicalIF":8.2,"publicationDate":"2025-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143097557","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Development of automatic wheat seeding quantity control system based on Doppler radar speed measurement

IF 8.2 Q1 AGRICULTURE, MULTIDISCIPLINARY

Artificial Intelligence in Agriculture

Pub Date : 2024-12-24 DOI: 10.1016/j.aiia.2024.12.001

Weiwei Wang , Wenbing Shi , Ce Liu , Yuwei Wang , Lu Liu , Liqing Chen

With advancements in agricultural technology, the full mechanization of rice straw wheat planting has been achieved. However, issues such as missed seeding, uneven row spacing, and poor uniformity of row replenishment often arise due to wheel slippage in wheeled wheat seeders. These problems manual replanting after emergence, reducing efficiency and increasing labor costs. To address these challenges, a speed-adaptive wheat seeding control system based on speed radar was developed. This system comprises a pneumatic wheat seeding device, an automatic speed-following control system, a human-machine interface, and a stepper motor. Leveraging an embedded controller, the system dynamically adjusts motor speed based on real-time forward speed to ensure precise seeding. Using fuzzy PID control, the system dynamically adjusts motor speed, achieving row spacing consistency below 3.9 % and seeding stability within 1.3 %, even at varying speeds. This system addresses critical challenges in precision agriculture, enhancing planting efficiency and reducing labor costs. This innovation enhances planting efficiency, reduces labor costs, and ensures adaptability to varying tractor speeds, meeting the precision requirements of wheat planting.

随着农业技术的进步，水稻、秸秆、小麦种植已实现全机械化。轮式小麦播种机由于轮滑，经常出现漏播、行间距不均匀、补行均匀性差等问题。这些问题出现后人工补种，降低了效率，增加了人工成本。为了解决这些问题，研制了一种基于速度雷达的小麦播种速度自适应控制系统。该系统包括气动播种装置、自动调速控制系统、人机界面和步进电机。利用嵌入式控制器，系统根据实时前进速度动态调整电机速度，以确保精确播种。采用模糊PID控制，系统动态调节电机转速，即使在不同转速下，也能实现行距一致性在3.9%以下，播种稳定性在1.3%以内。该系统解决了精准农业的关键挑战，提高了种植效率，降低了劳动力成本。这一创新提高了种植效率，降低了人工成本，保证了对不同拖拉机速度的适应性，满足了小麦种植的精度要求。

{"title":"Development of automatic wheat seeding quantity control system based on Doppler radar speed measurement","authors":"Weiwei Wang , Wenbing Shi , Ce Liu , Yuwei Wang , Lu Liu , Liqing Chen","doi":"10.1016/j.aiia.2024.12.001","DOIUrl":"10.1016/j.aiia.2024.12.001","url":null,"abstract":"<div><div>With advancements in agricultural technology, the full mechanization of rice straw wheat planting has been achieved. However, issues such as missed seeding, uneven row spacing, and poor uniformity of row replenishment often arise due to wheel slippage in wheeled wheat seeders. These problems manual replanting after emergence, reducing efficiency and increasing labor costs. To address these challenges, a speed-adaptive wheat seeding control system based on speed radar was developed. This system comprises a pneumatic wheat seeding device, an automatic speed-following control system, a human-machine interface, and a stepper motor. Leveraging an embedded controller, the system dynamically adjusts motor speed based on real-time forward speed to ensure precise seeding. Using fuzzy PID control, the system dynamically adjusts motor speed, achieving row spacing consistency below 3.9 % and seeding stability within 1.3 %, even at varying speeds. This system addresses critical challenges in precision agriculture, enhancing planting efficiency and reducing labor costs. This innovation enhances planting efficiency, reduces labor costs, and ensures adaptability to varying tractor speeds, meeting the precision requirements of wheat planting.</div></div>","PeriodicalId":52814,"journal":{"name":"Artificial Intelligence in Agriculture","volume":"15 1","pages":"Pages 12-25"},"PeriodicalIF":8.2,"publicationDate":"2024-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143097979","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Robotization of banana de-handing under multi-constraint scenarios: Challenges and future directions

IF 8.2 Q1 AGRICULTURE, MULTIDISCIPLINARY

Artificial Intelligence in Agriculture

Pub Date : 2024-12-22 DOI: 10.1016/j.aiia.2024.12.002

Jie Guo , Zhou Yang , Manoj Karkee , Jieli Duan , Yong He

Banana de-handing is an important part of banana post-harvesting operation. The traditional artificial de-handing model has problems such as labor intensity, inaccurate cutting, uneven cutting surface, unstable catching, and damage of banana fruit, etc. The mapping relationship between the characteristic parameters of the movement posture of the cutter and the influencing factors of the contact stress of banana crown cutting in unstructured environments, and the changing rules of the bumping contact stress of complex multi-shaped banana fruit with the physical property parameters of the cushioning material are the theoretical and technical difficulties that urgently need to be solved in the realization of banana mechanical de-handing. The future research (research on the coupling mechanism of visual cognition-mechanism cutting and low-destructive catching method of full-field continuous de-handing of bananas under multi-constraint scenarios) should: (1) create a database of banana crown, obtain the optimal banana crown recognition model, develop a recognition and locating system of the cutting line of banana crown and obtain its spatial location information; (2) establish the discrete element mechanical model of banana crown and the interaction model between banana crown and the cutter, clarify the stress change and the force wave transmission characteristics of the cutting process, construct the multi-objective optimization equation of the cutting performance, obtain the best combination of cutting parameters, and ascertain the mechanisms of synergistic locating and continuous cutting of banana crown; (3) establish the contact mechanical model of banana fruit drop-bump, parse the bumping characteristics between banana fruit and cushioning material, construct mathematical equations to quantitatively assess damage results, and determine the detract catching method of banana fruit that matches the de-handing mode in multi-constraint scenarios. This study showed that the real-time identification and spatial positioning of fruit, the mechanical properties of crown and the optimization of cutting performance, the damage mechanism of fruit and its loss-reducing harvesting method are the three key breakthroughs in realizing the robotization of de-handing. The current bottleneck problems and future research directions of intelligent banana de-handing were pointed out in this paper, which can provide a theoretical basis for the optimal design of banana de-handing devices and provide technical support for promoting the practical application of intelligent de-handing equipment.

香蕉脱手是香蕉采收后的重要环节。传统的人工去手模式存在劳动强度大、切割不准、切割面不均匀、抓握不稳、香蕉果破损等问题。非结构化环境下切刀运动姿态特征参数与香蕉冠切割接触应力影响因素之间的映射关系，以及复杂多形香蕉果实碰撞接触应力随缓冲材料物理性能参数的变化规律，是实现香蕉机械脱手中急需解决的理论和技术难题。未来的研究（多约束场景下香蕉全场连续脱手视觉认知-机制切割与低破坏性捕获方法耦合机制研究）应：(1)建立香蕉冠数据库，获得最优香蕉冠识别模型，开发香蕉冠切割线识别定位系统，获取其空间定位信息；(2)建立香蕉冠的离散元力学模型及香蕉冠与刀具的相互作用模型，阐明香蕉冠切割过程中的应力变化和力波传递特性，构建香蕉冠切割性能的多目标优化方程，获得最佳切割参数组合，确定香蕉冠协同定位和连续切割的机理；(3)建立香蕉果实跌落碰撞接触力学模型，解析香蕉果实与缓冲材料碰撞特性，构建数学方程定量评估损伤结果，确定多约束场景下匹配脱手模式的香蕉果实抓损方法。研究表明，水果的实时识别与空间定位、果冠力学特性与切分性能优化、水果损伤机理及其减损采收方法是实现脱手机械化的三个关键突破。指出了香蕉智能去手目前存在的瓶颈问题和未来的研究方向，为香蕉智能去手设备的优化设计提供理论依据，为推动智能去手设备的实际应用提供技术支持。

{"title":"Robotization of banana de-handing under multi-constraint scenarios: Challenges and future directions","authors":"Jie Guo , Zhou Yang , Manoj Karkee , Jieli Duan , Yong He","doi":"10.1016/j.aiia.2024.12.002","DOIUrl":"10.1016/j.aiia.2024.12.002","url":null,"abstract":"<div><div>Banana de-handing is an important part of banana post-harvesting operation. The traditional artificial de-handing model has problems such as labor intensity, inaccurate cutting, uneven cutting surface, unstable catching, and damage of banana fruit, etc. The mapping relationship between the characteristic parameters of the movement posture of the cutter and the influencing factors of the contact stress of banana crown cutting in unstructured environments, and the changing rules of the bumping contact stress of complex multi-shaped banana fruit with the physical property parameters of the cushioning material are the theoretical and technical difficulties that urgently need to be solved in the realization of banana mechanical de-handing. The future research (research on the coupling mechanism of visual cognition-mechanism cutting and low-destructive catching method of full-field continuous de-handing of bananas under multi-constraint scenarios) should: (1) create a database of banana crown, obtain the optimal banana crown recognition model, develop a recognition and locating system of the cutting line of banana crown and obtain its spatial location information; (2) establish the discrete element mechanical model of banana crown and the interaction model between banana crown and the cutter, clarify the stress change and the force wave transmission characteristics of the cutting process, construct the multi-objective optimization equation of the cutting performance, obtain the best combination of cutting parameters, and ascertain the mechanisms of synergistic locating and continuous cutting of banana crown; (3) establish the contact mechanical model of banana fruit drop-bump, parse the bumping characteristics between banana fruit and cushioning material, construct mathematical equations to quantitatively assess damage results, and determine the detract catching method of banana fruit that matches the de-handing mode in multi-constraint scenarios. This study showed that the real-time identification and spatial positioning of fruit, the mechanical properties of crown and the optimization of cutting performance, the damage mechanism of fruit and its loss-reducing harvesting method are the three key breakthroughs in realizing the robotization of de-handing. The current bottleneck problems and future research directions of intelligent banana de-handing were pointed out in this paper, which can provide a theoretical basis for the optimal design of banana de-handing devices and provide technical support for promoting the practical application of intelligent de-handing equipment.</div></div>","PeriodicalId":52814,"journal":{"name":"Artificial Intelligence in Agriculture","volume":"15 1","pages":"Pages 1-11"},"PeriodicalIF":8.2,"publicationDate":"2024-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143097983","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0