Pub Date : 2025-01-11DOI: 10.1016/j.aiia.2025.01.006
Hongli Song , Weiliang Wen , Sheng Wu , Xinyu Guo
Segmentation of three-dimensional (3D) point clouds is fundamental in comprehending unstructured structural and morphological data. It plays a critical role in research related to plant phenomics, 3D plant modeling, and functional-structural plant modeling. Although technologies for plant point cloud segmentation (PPCS) have advanced rapidly, there has been a lack of a systematic overview of the development process. This paper presents an overview of the progress made in 3D point cloud segmentation research in plants. It starts by discussing the methods used to acquire point clouds in plants, and analyzes the impact of point cloud resolution and quality on the segmentation task. It then introduces multi-scale point cloud segmentation in plants. The paper summarizes and analyzes traditional methods for PPCS, including the global and local features. This paper discusses the progress of machine learning-based segmentation on plant point clouds through supervised, unsupervised, and integrated approaches. It also summarizes the datasets that for PPCS using deep learning-oriented methods and explains the advantages and disadvantages of deep learning-based methods for projection-based, voxel-based, and point-based approaches respectively. Finally, the development of PPCS is discussed and prospected. Deep learning methods are predicted to become dominant in the field of PPCS, and 3D point cloud segmentation would develop towards more automated with higher resolution and precision.
{"title":"Comprehensive review on 3D point cloud segmentation in plants","authors":"Hongli Song , Weiliang Wen , Sheng Wu , Xinyu Guo","doi":"10.1016/j.aiia.2025.01.006","DOIUrl":"10.1016/j.aiia.2025.01.006","url":null,"abstract":"<div><div>Segmentation of three-dimensional (3D) point clouds is fundamental in comprehending unstructured structural and morphological data. It plays a critical role in research related to plant phenomics, 3D plant modeling, and functional-structural plant modeling. Although technologies for plant point cloud segmentation (PPCS) have advanced rapidly, there has been a lack of a systematic overview of the development process. This paper presents an overview of the progress made in 3D point cloud segmentation research in plants. It starts by discussing the methods used to acquire point clouds in plants, and analyzes the impact of point cloud resolution and quality on the segmentation task. It then introduces multi-scale point cloud segmentation in plants. The paper summarizes and analyzes traditional methods for PPCS, including the global and local features. This paper discusses the progress of machine learning-based segmentation on plant point clouds through supervised, unsupervised, and integrated approaches. It also summarizes the datasets that for PPCS using deep learning-oriented methods and explains the advantages and disadvantages of deep learning-based methods for projection-based, voxel-based, and point-based approaches respectively. Finally, the development of PPCS is discussed and prospected. Deep learning methods are predicted to become dominant in the field of PPCS, and 3D point cloud segmentation would develop towards more automated with higher resolution and precision.</div></div>","PeriodicalId":52814,"journal":{"name":"Artificial Intelligence in Agriculture","volume":"15 2","pages":"Pages 296-315"},"PeriodicalIF":8.2,"publicationDate":"2025-01-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143704964","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-10DOI: 10.1016/j.aiia.2025.01.003
Tao Cheng , Dongyan Zhang , Gan Zhang , Tianyi Wang , Weibo Ren , Feng Yuan , Yaling Liu , Zhaoming Wang , Chunjiang Zhao
High-throughput phenotyping (HTP) technology is now a significant bottleneck in the efficient selection and breeding of superior forage genetic resources. To better understand the status of forage phenotyping research and identify key directions for development, this review summarizes advances in HTP technology for forage phenotypic analysis over the past ten years. This paper reviews the unique aspects and research priorities in forage phenotypic monitoring, highlights key remote sensing platforms, examines the applications of advanced sensing technology for quantifying phenotypic traits, explores artificial intelligence (AI) algorithms in phenotypic data integration and analysis, and assesses recent progress in phenotypic genomics. The practical applications of HTP technology in forage remain constrained by several challenges. These include establishing uniform data collection standards, designing effective algorithms to handle complex genetic and environmental interactions, deepening the cross-exploration of phenomics-genomics, solving the problem of pathological inversion of forage phenotypic growth monitoring models, and developing low-cost forage phenotypic equipment. Resolving these challenges will unlock the full potential of HTP, enabling precise identification of superior forage traits, accelerating the breeding of superior varieties, and ultimately improving forage yield.
{"title":"High-throughput phenotyping techniques for forage: Status, bottleneck, and challenges","authors":"Tao Cheng , Dongyan Zhang , Gan Zhang , Tianyi Wang , Weibo Ren , Feng Yuan , Yaling Liu , Zhaoming Wang , Chunjiang Zhao","doi":"10.1016/j.aiia.2025.01.003","DOIUrl":"10.1016/j.aiia.2025.01.003","url":null,"abstract":"<div><div>High-throughput phenotyping (HTP) technology is now a significant bottleneck in the efficient selection and breeding of superior forage genetic resources. To better understand the status of forage phenotyping research and identify key directions for development, this review summarizes advances in HTP technology for forage phenotypic analysis over the past ten years. This paper reviews the unique aspects and research priorities in forage phenotypic monitoring, highlights key remote sensing platforms, examines the applications of advanced sensing technology for quantifying phenotypic traits, explores artificial intelligence (AI) algorithms in phenotypic data integration and analysis, and assesses recent progress in phenotypic genomics. The practical applications of HTP technology in forage remain constrained by several challenges. These include establishing uniform data collection standards, designing effective algorithms to handle complex genetic and environmental interactions, deepening the cross-exploration of phenomics-genomics, solving the problem of pathological inversion of forage phenotypic growth monitoring models, and developing low-cost forage phenotypic equipment. Resolving these challenges will unlock the full potential of HTP, enabling precise identification of superior forage traits, accelerating the breeding of superior varieties, and ultimately improving forage yield.</div></div>","PeriodicalId":52814,"journal":{"name":"Artificial Intelligence in Agriculture","volume":"15 1","pages":"Pages 98-115"},"PeriodicalIF":8.2,"publicationDate":"2025-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143097558","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-10DOI: 10.1016/j.aiia.2025.01.002
Artzai Picon , Itziar Eguskiza , Pablo Galan , Laura Gomez-Zamanillo , Javier Romero , Christian Klukas , Arantza Bereciartua-Perez , Mike Scharner , Ramon Navarra-Mestre
In this study, we introduced an innovative crop-conditional semantic segmentation architecture that seamlessly incorporates contextual metadata (crop information). This is achieved by merging the contextual information at a late layer stage, allowing the method to be integrated with any semantic segmentation architecture, including novel ones. To evaluate the effectiveness of this approach, we curated a challenging dataset of over 100,000 images captured in real-field conditions using mobile phones. This dataset includes various disease stages across 21 diseases and seven crops (wheat, barley, corn, rice, rape-seed, vinegrape, and cucumber), with the added complexity of multiple diseases coexisting in a single image. We demonstrate that incorporating contextual multi-crop information significantly enhances the performance of semantic segmentation models for plant disease detection. By leveraging crop-specific metadata, our approach achieves higher accuracy and better generalization across diverse crops (F1 = 0.68, r = 0.75) compared to traditional methods (F1 = 0.24, r = 0.68). Additionally, the adoption of a semi-supervised approach based on pseudo-labeling of single diseased plants, offers significant advantages for plant disease segmentation and quantification (F1 = 0.73, r = 0.95). This method enhances the model's performance by leveraging both labeled and unlabeled data, reducing the dependency on extensive manual annotations, which are often time-consuming and costly.
The deployment of this algorithm holds the potential to revolutionize the digitization of crop protection product testing, ensuring heightened repeatability while minimizing human subjectivity. By addressing the challenges of semantic segmentation and disease quantification, we contribute to more effective and precise phenotyping, ultimately supporting better crop management and protection strategies.
在这项研究中,我们引入了一种创新的作物条件语义分割架构,该架构无缝地结合了上下文元数据(作物信息)。这是通过在后期阶段合并上下文信息来实现的,允许该方法与任何语义分割体系结构集成,包括新的。为了评估这种方法的有效性,我们策划了一个具有挑战性的数据集,其中包括使用手机在实际条件下拍摄的100,000多张图像。该数据集包括21种疾病和7种作物(小麦、大麦、玉米、水稻、油菜籽、葡萄和黄瓜)的不同疾病阶段,并且在单个图像中同时存在多种疾病的复杂性。我们证明,结合上下文多作物信息显着提高了植物病害检测的语义分割模型的性能。通过利用特定作物的元数据,与传统方法(F1 = 0.24, r = 0.68)相比,我们的方法在不同作物之间实现了更高的精度和更好的泛化(F1 = 0.68, r = 0.75)。此外,采用基于单株病株伪标记的半监督方法,对植物病害的分割和定量具有显著优势(F1 = 0.73, r = 0.95)。该方法通过利用标记和未标记的数据来增强模型的性能,减少了对大量手工注释的依赖,而手工注释通常既耗时又昂贵。该算法的部署有可能彻底改变作物保护产品测试的数字化,确保提高可重复性,同时最大限度地减少人类的主观性。通过解决语义分割和疾病量化的挑战,我们有助于更有效和精确的表型分析,最终支持更好的作物管理和保护策略。
{"title":"Crop-conditional semantic segmentation for efficient agricultural disease assessment","authors":"Artzai Picon , Itziar Eguskiza , Pablo Galan , Laura Gomez-Zamanillo , Javier Romero , Christian Klukas , Arantza Bereciartua-Perez , Mike Scharner , Ramon Navarra-Mestre","doi":"10.1016/j.aiia.2025.01.002","DOIUrl":"10.1016/j.aiia.2025.01.002","url":null,"abstract":"<div><div>In this study, we introduced an innovative crop-conditional semantic segmentation architecture that seamlessly incorporates contextual metadata (crop information). This is achieved by merging the contextual information at a late layer stage, allowing the method to be integrated with any semantic segmentation architecture, including novel ones. To evaluate the effectiveness of this approach, we curated a challenging dataset of over 100,000 images captured in real-field conditions using mobile phones. This dataset includes various disease stages across 21 diseases and seven crops (wheat, barley, corn, rice, rape-seed, vinegrape, and cucumber), with the added complexity of multiple diseases coexisting in a single image. We demonstrate that incorporating contextual multi-crop information significantly enhances the performance of semantic segmentation models for plant disease detection. By leveraging crop-specific metadata, our approach achieves higher accuracy and better generalization across diverse crops (F1 = 0.68, <em>r</em> = 0.75) compared to traditional methods (F1 = 0.24, <em>r</em> = 0.68). Additionally, the adoption of a semi-supervised approach based on pseudo-labeling of single diseased plants, offers significant advantages for plant disease segmentation and quantification (F1 = 0.73, <em>r</em> = 0.95). This method enhances the model's performance by leveraging both labeled and unlabeled data, reducing the dependency on extensive manual annotations, which are often time-consuming and costly.</div><div>The deployment of this algorithm holds the potential to revolutionize the digitization of crop protection product testing, ensuring heightened repeatability while minimizing human subjectivity. By addressing the challenges of semantic segmentation and disease quantification, we contribute to more effective and precise phenotyping, ultimately supporting better crop management and protection strategies.</div></div>","PeriodicalId":52814,"journal":{"name":"Artificial Intelligence in Agriculture","volume":"15 1","pages":"Pages 79-87"},"PeriodicalIF":8.2,"publicationDate":"2025-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143097561","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-09DOI: 10.1016/j.aiia.2025.01.004
Zhizhong Sun , Jie Yang , Yang Yao , Dong Hu , Yibin Ying , Junxian Guo , Lijuan Xie
Visible/near-infrared (Vis/NIR) spectroscopy technology has been extensively utilized for the determination of soluble solids content (SSC) in fruits. Nonetheless, the spectral distortion resulting from temperature variations in the sample leads to a decrease in detection accuracy. To mitigate the influence of temperature fluctuations on the accuracy of SSC detection in fruits, using watermelon as an example, this study presents a knowledge-guided temperature correction method utilizing one-dimensional convolutional neural networks (1D-CNN). This method consists of two stages: the first stage involves utilizing 1D-CNN models and gradient-weighted class activation mapping (Grad-CAM) method to acquire gradient-weighted features correlating with temperature. The second stage involves mapping these features and integrating them with the original Vis/NIR spectrum, and then train and test the partial least squares (PLS) model. This knowledge-guided method can identify wavelength bands with high temperature correlation in the Vis/NIR spectra, offering valuable guidance for spectral data processing. The performance of the PLS model constructed using the 15 °C spectrum guided by this method is superior to that of the global model, and can reduce the root mean square error of the prediction set (RMSEP) to 0.324°Brix, which is 32.5 % lower than the RMSEP of the global model (0.480°Brix). The method proposed in this study has superior temperature correction effects than slope and bias correction, piecewise direct standardization, and external parameter orthogonalization correction methods. The results indicate that the knowledge-guided temperature correction method based on deep learning can significantly enhance the detection accuracy of SSC in watermelon, providing valuable reference for the development of PLS calibration methods.
可见/近红外(Vis/NIR)光谱技术已广泛应用于水果中可溶性固形物含量的测定。尽管如此,样品中温度变化引起的光谱失真导致检测精度降低。为了减轻温度波动对水果中SSC检测精度的影响,本研究以西瓜为例,提出了一种基于一维卷积神经网络(1D-CNN)的知识引导温度校正方法。该方法分为两个阶段:第一阶段利用1D-CNN模型和梯度加权类激活映射(gradient-weighted class activation mapping, Grad-CAM)方法获取与温度相关的梯度加权特征。第二阶段涉及映射这些特征并将它们与原始的Vis/NIR光谱进行整合,然后训练和测试偏最小二乘(PLS)模型。这种知识引导方法可以识别出可见光/近红外光谱中具有高温相关的波段,为光谱数据处理提供了有价值的指导。利用该方法指导的15°C光谱构建的PLS模型的性能优于全局模型,预测集的均方根误差(RMSEP)降至0.324°Brix,比全局模型的RMSEP(0.480°Brix)低32.5%。该方法的温度校正效果优于斜率和偏置校正、分段直接标准化和外部参数正交化校正方法。结果表明,基于深度学习的知识引导温度校正方法可以显著提高西瓜中SSC的检测精度,为PLS校正方法的开发提供了有价值的参考。
{"title":"Knowledge-guided temperature correction method for soluble solids content detection of watermelon based on Vis/NIR spectroscopy","authors":"Zhizhong Sun , Jie Yang , Yang Yao , Dong Hu , Yibin Ying , Junxian Guo , Lijuan Xie","doi":"10.1016/j.aiia.2025.01.004","DOIUrl":"10.1016/j.aiia.2025.01.004","url":null,"abstract":"<div><div>Visible/near-infrared (Vis/NIR) spectroscopy technology has been extensively utilized for the determination of soluble solids content (SSC) in fruits. Nonetheless, the spectral distortion resulting from temperature variations in the sample leads to a decrease in detection accuracy. To mitigate the influence of temperature fluctuations on the accuracy of SSC detection in fruits, using watermelon as an example, this study presents a knowledge-guided temperature correction method utilizing one-dimensional convolutional neural networks (1D-CNN). This method consists of two stages: the first stage involves utilizing 1D-CNN models and gradient-weighted class activation mapping (Grad-CAM) method to acquire gradient-weighted features correlating with temperature. The second stage involves mapping these features and integrating them with the original Vis/NIR spectrum, and then train and test the partial least squares (PLS) model. This knowledge-guided method can identify wavelength bands with high temperature correlation in the Vis/NIR spectra, offering valuable guidance for spectral data processing. The performance of the PLS model constructed using the 15 °C spectrum guided by this method is superior to that of the global model, and can reduce the root mean square error of the prediction set (RMSEP) to 0.324°Brix, which is 32.5 % lower than the RMSEP of the global model (0.480°Brix). The method proposed in this study has superior temperature correction effects than slope and bias correction, piecewise direct standardization, and external parameter orthogonalization correction methods. The results indicate that the knowledge-guided temperature correction method based on deep learning can significantly enhance the detection accuracy of SSC in watermelon, providing valuable reference for the development of PLS calibration methods.</div></div>","PeriodicalId":52814,"journal":{"name":"Artificial Intelligence in Agriculture","volume":"15 1","pages":"Pages 88-97"},"PeriodicalIF":8.2,"publicationDate":"2025-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143097560","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-09DOI: 10.1016/j.aiia.2025.01.005
Xufeng Xu , Tao Xu , Zichao Wei , Zetong Li , Yafei Wang , Xiuqin Rao
The accurate detection of citrus surface defects is of great importance for elevating the product quality and augmenting its market value. However, due to defect diversity and complexity, existing methods focused on parameter and data enhancement have limitations in detection and segmentation. Therefore, this study proposed a citrus surface defect segmentation model guided by prior features, named PrioriFormer. The model extracted texture features, boundary features, and superpixel features that were crucial for defect detection and segmentation, as priori features. A Priori Feature Fusion Module (PFFM) was designed to integrate the priori features, thereby establishing a priori feature branch. Then the priori feature branch was integrated into the baseline model SegFormer, with the objective of enhancing key feature learning capacity of the model. Finally, the effectiveness of the priori features in enhancing the performance of the model was demonstrated through the implementation of specific experiments. The result showed that PrioriFormer achieved an mPA (mean Pixel Accuracy), mIoU (mean Intersection over Union), and Dice Coefficient of 91.0 %, 85.8 %, and 91.0 %, respectively. Compared to other semantic segmentation models, the proposed model has achieved the best performance. The model parameters of PrioriFormer have only increase by 2.7 % in comparison to the baseline model, while the mIoU has improved by 3.3 %, indicating that the improvement of segmentation performance had less dependence on model parameters. Even when trained on few data, PrioriFormer maintained the high segmentation performance, with the reduction of mIoU not exceeding 4.2 %. This demonstrated the strong feature learning ability of the model in scenarios with limited data. Furthermore, validation on external datasets confirmed PrioriFormer's superior performance and adaptability to different tasks. The study found that the proposed PrioriFomer guided by priori features can effectively enhance the accuracy of the citrus surface defect segmentation model, providing technical reference for citrus sorting and quality assessment.
{"title":"Enhancing citrus surface defects detection: A priori feature guided semantic segmentation model","authors":"Xufeng Xu , Tao Xu , Zichao Wei , Zetong Li , Yafei Wang , Xiuqin Rao","doi":"10.1016/j.aiia.2025.01.005","DOIUrl":"10.1016/j.aiia.2025.01.005","url":null,"abstract":"<div><div>The accurate detection of citrus surface defects is of great importance for elevating the product quality and augmenting its market value. However, due to defect diversity and complexity, existing methods focused on parameter and data enhancement have limitations in detection and segmentation. Therefore, this study proposed a citrus surface defect segmentation model guided by prior features, named PrioriFormer. The model extracted texture features, boundary features, and superpixel features that were crucial for defect detection and segmentation, as priori features. A Priori Feature Fusion Module (PFFM) was designed to integrate the priori features, thereby establishing a priori feature branch. Then the priori feature branch was integrated into the baseline model SegFormer, with the objective of enhancing key feature learning capacity of the model. Finally, the effectiveness of the priori features in enhancing the performance of the model was demonstrated through the implementation of specific experiments. The result showed that PrioriFormer achieved an mPA (mean Pixel Accuracy), mIoU (mean Intersection over Union), and Dice Coefficient of 91.0 %, 85.8 %, and 91.0 %, respectively. Compared to other semantic segmentation models, the proposed model has achieved the best performance. The model parameters of PrioriFormer have only increase by 2.7 % in comparison to the baseline model, while the mIoU has improved by 3.3 %, indicating that the improvement of segmentation performance had less dependence on model parameters. Even when trained on few data, PrioriFormer maintained the high segmentation performance, with the reduction of mIoU not exceeding 4.2 %. This demonstrated the strong feature learning ability of the model in scenarios with limited data. Furthermore, validation on external datasets confirmed PrioriFormer's superior performance and adaptability to different tasks. The study found that the proposed PrioriFomer guided by priori features can effectively enhance the accuracy of the citrus surface defect segmentation model, providing technical reference for citrus sorting and quality assessment.</div></div>","PeriodicalId":52814,"journal":{"name":"Artificial Intelligence in Agriculture","volume":"15 1","pages":"Pages 67-78"},"PeriodicalIF":8.2,"publicationDate":"2025-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143097562","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-06DOI: 10.1016/j.aiia.2025.01.001
Xue Xia , Ning Zhang , Zhibin Guan , Xin Chai , Shixin Ma , Xiujuan Chai , Tan Sun
Aggressive behavior among piglets is considered a harmful social contact. Monitoring weaned piglets with intense aggressive behaviors is paramount for pig breeding management. This study introduced a novel hybrid model, PAB-Mamba-YOLO, integrating the principles of Mamba and YOLO for efficient visual detection of weaned piglets' aggressive behaviors, including climbing body, nose hitting, biting tail and biting ear. Within the proposed model, a novel CSPVSS module, which integrated the Cross Stage Partial (CSP) structure with the Visual State Space Model (VSSM), has been developed. This module was adeptly integrated into the Neck part of the network, where it harnessed convolutional capabilities for local feature extraction and leveraged the visual state space to reveal long-distance dependencies. The model exhibited sound performance in detecting aggressive behaviors, with an average precision (AP) of 0.976 for climbing body, 0.994 for nose hitting, 0.977 for biting tail and 0.994 for biting ear. The mean average precision (mAP) of 0.985 reflected the model's overall effectiveness in detecting all classes of aggressive behaviors. The model achieved a detection speed FPS of 69 f/s, with model complexity measured by 7.2 G floating-point operations (GFLOPs) and parameters (Params) of 2.63 million. Comparative experiments with existing prevailing models confirmed the superiority of the proposed model. This work is expected to contribute a glimmer of fresh ideas and inspiration to the research field of precision breeding and behavioral analysis of animals.
{"title":"PAB-Mamba-YOLO: VSSM assists in YOLO for aggressive behavior detection among weaned piglets","authors":"Xue Xia , Ning Zhang , Zhibin Guan , Xin Chai , Shixin Ma , Xiujuan Chai , Tan Sun","doi":"10.1016/j.aiia.2025.01.001","DOIUrl":"10.1016/j.aiia.2025.01.001","url":null,"abstract":"<div><div>Aggressive behavior among piglets is considered a harmful social contact. Monitoring weaned piglets with intense aggressive behaviors is paramount for pig breeding management. This study introduced a novel hybrid model, PAB-Mamba-YOLO, integrating the principles of Mamba and YOLO for efficient visual detection of weaned piglets' aggressive behaviors, including climbing body, nose hitting, biting tail and biting ear. Within the proposed model, a novel CSPVSS module, which integrated the Cross Stage Partial (CSP) structure with the Visual State Space Model (VSSM), has been developed. This module was adeptly integrated into the Neck part of the network, where it harnessed convolutional capabilities for local feature extraction and leveraged the visual state space to reveal long-distance dependencies. The model exhibited sound performance in detecting aggressive behaviors, with an average precision (AP) of 0.976 for climbing body, 0.994 for nose hitting, 0.977 for biting tail and 0.994 for biting ear. The mean average precision (mAP) of 0.985 reflected the model's overall effectiveness in detecting all classes of aggressive behaviors. The model achieved a detection speed FPS of 69 f/s, with model complexity measured by 7.2 G floating-point operations (GFLOPs) and parameters (Params) of 2.63 million. Comparative experiments with existing prevailing models confirmed the superiority of the proposed model. This work is expected to contribute a glimmer of fresh ideas and inspiration to the research field of precision breeding and behavioral analysis of animals.</div></div>","PeriodicalId":52814,"journal":{"name":"Artificial Intelligence in Agriculture","volume":"15 1","pages":"Pages 52-66"},"PeriodicalIF":8.2,"publicationDate":"2025-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143097559","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-03DOI: 10.1016/j.aiia.2024.12.004
Xionghai Chen , Fei Yuan , Syed Tahir Ata-Ul-Karim , Xiaojun Liu , Yongchao Tian , Yan Zhu , Weixing Cao , Qiang Cao
Soil organic matter (SOM) is a key metric for assessing soil quality and crop yield potential. It plays a vital role in maintaining the ecological balance environment and promoting sustainable farming practices. This review examines the evolving trends in remote sensing (RS)-based SOM monitoring by analyzing 739 scholarly publications from the Web of Science database from 2003 to 2023 using a bibliometric approach. The study reveals that research on RS-based SOM monitoring has entered a rapid growth phase since 2018, with China and the United States as the main contributors and an extensive international cooperation network. In model construction, high frequency covariates such as soil pH, precipitation, temperature, and topography significantly improved the prediction accuracy. Data preprocessing methods such as Standard Normal Variables (SNV), Principal Component Analysis (PCA), and Multiple Scattering Correction (MSC) enhanced data consistency. Traditional statistical models are gradually being replaced by nonlinear machine learning and deep learning methods (CNN, XGBoost, andStacking), which are particularly good at handling complex high-dimensional data. Regional spectral libraries (OzSoil and AfSIS) excel in local accuracy, while global spectral libraries (ISRIC and LUCAS) are more suitable for cross-region modeling, and the migration learning technique effectively improves the model generalization ability in low data regions. Integrated models (CNN-LSTM and GAN) have significant advantages in capturing the spatial and temporal dynamics of SOMs, and uncertainty quantification methods (Bayesian inference, Monte Carlo simulation) enhance the reliability of the models in multi-source data and data-scarce scenarios. Future research should focus on further optimization of multi-source data fusion and uncertainty quantification to promote the development and applicability of RS-based SOM monitoring techniques for precision soil management and sustainable agriculture.
土壤有机质(SOM)是评价土壤质量和作物产量潜力的重要指标。它在维持生态平衡环境和促进可持续耕作方式方面发挥着至关重要的作用。本文采用文献计量学方法,分析了2003 - 2023年Web of Science数据库中739篇学术论文,探讨了基于遥感(RS)的SOM监测的发展趋势。研究表明,自2018年以来,基于rs的SOM监测研究进入快速增长阶段,以中美两国为主要贡献者,形成了广泛的国际合作网络。在模型构建中,土壤pH、降水、温度、地形等高频协变量显著提高了预测精度。采用标准正态变量(Standard Normal Variables, SNV)、主成分分析(Principal Component Analysis, PCA)和多重散射校正(Multiple Scattering Correction, MSC)等数据预处理方法增强了数据的一致性。传统的统计模型正逐渐被非线性机器学习和深度学习方法(CNN、XGBoost和stacking)所取代,这些方法特别擅长处理复杂的高维数据。区域谱库(OzSoil和AfSIS)具有较好的局部精度,而全局谱库(ISRIC和LUCAS)更适合跨区域建模,迁移学习技术有效提高了低数据区的模型泛化能力。集成模型(CNN-LSTM和GAN)在捕获SOMs时空动态方面具有显著优势,不确定性量化方法(贝叶斯推理、蒙特卡罗模拟)增强了模型在多源数据和数据稀缺场景下的可靠性。未来的研究应进一步优化多源数据融合和不确定性量化,以促进基于rs的SOM监测技术在土壤精准管理和可持续农业中的发展和应用。
{"title":"A bibliometric analysis of research on remote sensing-based monitoring of soil organic matter conducted between 2003 and 2023","authors":"Xionghai Chen , Fei Yuan , Syed Tahir Ata-Ul-Karim , Xiaojun Liu , Yongchao Tian , Yan Zhu , Weixing Cao , Qiang Cao","doi":"10.1016/j.aiia.2024.12.004","DOIUrl":"10.1016/j.aiia.2024.12.004","url":null,"abstract":"<div><div>Soil organic matter (SOM) is a key metric for assessing soil quality and crop yield potential. It plays a vital role in maintaining the ecological balance environment and promoting sustainable farming practices. This review examines the evolving trends in remote sensing (<em>RS</em>)-based SOM monitoring by analyzing 739 scholarly publications from the Web of Science database from 2003 to 2023 using a bibliometric approach. The study reveals that research on RS-based SOM monitoring has entered a rapid growth phase since 2018, with China and the United States as the main contributors and an extensive international cooperation network. In model construction, high frequency covariates such as soil pH, precipitation, temperature, and topography significantly improved the prediction accuracy. Data preprocessing methods such as Standard Normal Variables (SNV), Principal Component Analysis (PCA), and Multiple Scattering Correction (MSC) enhanced data consistency. Traditional statistical models are gradually being replaced by nonlinear machine learning and deep learning methods (CNN, XGBoost, andStacking), which are particularly good at handling complex high-dimensional data. Regional spectral libraries (OzSoil and AfSIS) excel in local accuracy, while global spectral libraries (ISRIC and LUCAS) are more suitable for cross-region modeling, and the migration learning technique effectively improves the model generalization ability in low data regions. Integrated models (CNN-LSTM and GAN) have significant advantages in capturing the spatial and temporal dynamics of SOMs, and uncertainty quantification methods (Bayesian inference, Monte Carlo simulation) enhance the reliability of the models in multi-source data and data-scarce scenarios. Future research should focus on further optimization of multi-source data fusion and uncertainty quantification to promote the development and applicability of RS-based SOM monitoring techniques for precision soil management and sustainable agriculture.</div></div>","PeriodicalId":52814,"journal":{"name":"Artificial Intelligence in Agriculture","volume":"15 1","pages":"Pages 26-38"},"PeriodicalIF":8.2,"publicationDate":"2025-01-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143149330","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-01-02DOI: 10.1016/j.aiia.2024.12.005
John Olamofe , Ram Ray , Xishuang Dong , Lijun Qian
In this study, we examined plant health prediction through the Normalized Difference Vegetation Index (NDVI) calculated from satellite image derived reflectance values in the near-infrared and red spectra. The problem is formulated as a temporal data prediction problem. Using MODIS/Terra Vegetation Indices 16-Day L3 Global 250 m SIN Grid V061 dataset, we designed and implemented Reservoir Computing (RC) models and transformer-based models including pretrained language model, and compared the prediction performance of these models to traditional machine learning and deep learning methods such as Nonlinear Regression, Decision Tree, Convolutional Neural Network (CNN), Long Short-Term Memory (LSTM) network, and DLinear. It is observed that the DLinear/LSTM model showed exceptional predictive accuracy, while the pretrained RC model significantly enhanced traditional RC model forecasts. Additionally, Frozen Pretrained Transformer (FPT), a pretrained language model, showed superior performance in predicting specific NDVI values (most often peak or lowest NDVI), suggesting its effectiveness in precise temporal predictions. Furthermore, transformer-based models, specifically PatchTST and FPT, demonstrated substantial mean squared error reductions, particularly in limited data scenarios (1 %, 5 %, 15 % and 50 % sample sizes), indicating their robustness in precise NDVI temporal predictions when data is limited. The findings in this study demonstrated the effectiveness of emerging machine learning techniques such as reservoir computing and pretrained language model for remote sensing and their contributions in precision agriculture.
在这项研究中,我们通过卫星图像近红外和红色光谱的反射率值计算的归一化植被指数(NDVI)来检验植物健康预测。该问题被表述为一个时间数据预测问题。利用MODIS/Terra植被指数16天L3全球250 m SIN网格V061数据集,设计并实现了水库计算(RC)模型和基于变压器的预训练语言模型,并将这些模型的预测性能与传统的机器学习和深度学习方法(如非线性回归、决策树、卷积神经网络(CNN)、长短期记忆(LSTM)网络和DLinear)进行了比较。结果表明,DLinear/LSTM模型具有较好的预测精度,而预训练后的RC模型显著提高了传统RC模型的预测精度。此外,Frozen Pretrained Transformer (FPT)是一种预训练语言模型,在预测特定的NDVI值(通常是峰值或最低NDVI)方面表现优异,表明其在精确时间预测方面的有效性。此外,基于变压器的模型,特别是PatchTST和FPT,显示出显著的均方误差降低,特别是在有限的数据场景下(1%、5%、15%和50%的样本量),表明它们在数据有限时精确的NDVI时间预测中的鲁棒性。本研究的发现证明了水库计算和遥感预训练语言模型等新兴机器学习技术的有效性及其在精准农业中的贡献。
{"title":"Normalized difference vegetation index prediction using reservoir computing and pretrained language models","authors":"John Olamofe , Ram Ray , Xishuang Dong , Lijun Qian","doi":"10.1016/j.aiia.2024.12.005","DOIUrl":"10.1016/j.aiia.2024.12.005","url":null,"abstract":"<div><div>In this study, we examined plant health prediction through the Normalized Difference Vegetation Index (NDVI) calculated from satellite image derived reflectance values in the near-infrared and red spectra. The problem is formulated as a temporal data prediction problem. Using MODIS/Terra Vegetation Indices 16-Day L3 Global 250 m SIN Grid V061 dataset, we designed and implemented Reservoir Computing (RC) models and transformer-based models including pretrained language model, and compared the prediction performance of these models to traditional machine learning and deep learning methods such as Nonlinear Regression, Decision Tree, Convolutional Neural Network (CNN), Long Short-Term Memory (LSTM) network, and DLinear. It is observed that the DLinear/LSTM model showed exceptional predictive accuracy, while the pretrained RC model significantly enhanced traditional RC model forecasts. Additionally, Frozen Pretrained Transformer (FPT), a pretrained language model, showed superior performance in predicting specific NDVI values (most often peak or lowest NDVI), suggesting its effectiveness in precise temporal predictions. Furthermore, transformer-based models, specifically PatchTST and FPT, demonstrated substantial mean squared error reductions, particularly in limited data scenarios (1 %, 5 %, 15 % and 50 % sample sizes), indicating their robustness in precise NDVI temporal predictions when data is limited. The findings in this study demonstrated the effectiveness of emerging machine learning techniques such as reservoir computing and pretrained language model for remote sensing and their contributions in precision agriculture.</div></div>","PeriodicalId":52814,"journal":{"name":"Artificial Intelligence in Agriculture","volume":"15 1","pages":"Pages 116-129"},"PeriodicalIF":8.2,"publicationDate":"2025-01-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143097557","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-12-24DOI: 10.1016/j.aiia.2024.12.001
Weiwei Wang , Wenbing Shi , Ce Liu , Yuwei Wang , Lu Liu , Liqing Chen
With advancements in agricultural technology, the full mechanization of rice straw wheat planting has been achieved. However, issues such as missed seeding, uneven row spacing, and poor uniformity of row replenishment often arise due to wheel slippage in wheeled wheat seeders. These problems manual replanting after emergence, reducing efficiency and increasing labor costs. To address these challenges, a speed-adaptive wheat seeding control system based on speed radar was developed. This system comprises a pneumatic wheat seeding device, an automatic speed-following control system, a human-machine interface, and a stepper motor. Leveraging an embedded controller, the system dynamically adjusts motor speed based on real-time forward speed to ensure precise seeding. Using fuzzy PID control, the system dynamically adjusts motor speed, achieving row spacing consistency below 3.9 % and seeding stability within 1.3 %, even at varying speeds. This system addresses critical challenges in precision agriculture, enhancing planting efficiency and reducing labor costs. This innovation enhances planting efficiency, reduces labor costs, and ensures adaptability to varying tractor speeds, meeting the precision requirements of wheat planting.
{"title":"Development of automatic wheat seeding quantity control system based on Doppler radar speed measurement","authors":"Weiwei Wang , Wenbing Shi , Ce Liu , Yuwei Wang , Lu Liu , Liqing Chen","doi":"10.1016/j.aiia.2024.12.001","DOIUrl":"10.1016/j.aiia.2024.12.001","url":null,"abstract":"<div><div>With advancements in agricultural technology, the full mechanization of rice straw wheat planting has been achieved. However, issues such as missed seeding, uneven row spacing, and poor uniformity of row replenishment often arise due to wheel slippage in wheeled wheat seeders. These problems manual replanting after emergence, reducing efficiency and increasing labor costs. To address these challenges, a speed-adaptive wheat seeding control system based on speed radar was developed. This system comprises a pneumatic wheat seeding device, an automatic speed-following control system, a human-machine interface, and a stepper motor. Leveraging an embedded controller, the system dynamically adjusts motor speed based on real-time forward speed to ensure precise seeding. Using fuzzy PID control, the system dynamically adjusts motor speed, achieving row spacing consistency below 3.9 % and seeding stability within 1.3 %, even at varying speeds. This system addresses critical challenges in precision agriculture, enhancing planting efficiency and reducing labor costs. This innovation enhances planting efficiency, reduces labor costs, and ensures adaptability to varying tractor speeds, meeting the precision requirements of wheat planting.</div></div>","PeriodicalId":52814,"journal":{"name":"Artificial Intelligence in Agriculture","volume":"15 1","pages":"Pages 12-25"},"PeriodicalIF":8.2,"publicationDate":"2024-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143097979","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-12-22DOI: 10.1016/j.aiia.2024.12.002
Jie Guo , Zhou Yang , Manoj Karkee , Jieli Duan , Yong He
Banana de-handing is an important part of banana post-harvesting operation. The traditional artificial de-handing model has problems such as labor intensity, inaccurate cutting, uneven cutting surface, unstable catching, and damage of banana fruit, etc. The mapping relationship between the characteristic parameters of the movement posture of the cutter and the influencing factors of the contact stress of banana crown cutting in unstructured environments, and the changing rules of the bumping contact stress of complex multi-shaped banana fruit with the physical property parameters of the cushioning material are the theoretical and technical difficulties that urgently need to be solved in the realization of banana mechanical de-handing. The future research (research on the coupling mechanism of visual cognition-mechanism cutting and low-destructive catching method of full-field continuous de-handing of bananas under multi-constraint scenarios) should: (1) create a database of banana crown, obtain the optimal banana crown recognition model, develop a recognition and locating system of the cutting line of banana crown and obtain its spatial location information; (2) establish the discrete element mechanical model of banana crown and the interaction model between banana crown and the cutter, clarify the stress change and the force wave transmission characteristics of the cutting process, construct the multi-objective optimization equation of the cutting performance, obtain the best combination of cutting parameters, and ascertain the mechanisms of synergistic locating and continuous cutting of banana crown; (3) establish the contact mechanical model of banana fruit drop-bump, parse the bumping characteristics between banana fruit and cushioning material, construct mathematical equations to quantitatively assess damage results, and determine the detract catching method of banana fruit that matches the de-handing mode in multi-constraint scenarios. This study showed that the real-time identification and spatial positioning of fruit, the mechanical properties of crown and the optimization of cutting performance, the damage mechanism of fruit and its loss-reducing harvesting method are the three key breakthroughs in realizing the robotization of de-handing. The current bottleneck problems and future research directions of intelligent banana de-handing were pointed out in this paper, which can provide a theoretical basis for the optimal design of banana de-handing devices and provide technical support for promoting the practical application of intelligent de-handing equipment.
{"title":"Robotization of banana de-handing under multi-constraint scenarios: Challenges and future directions","authors":"Jie Guo , Zhou Yang , Manoj Karkee , Jieli Duan , Yong He","doi":"10.1016/j.aiia.2024.12.002","DOIUrl":"10.1016/j.aiia.2024.12.002","url":null,"abstract":"<div><div>Banana de-handing is an important part of banana post-harvesting operation. The traditional artificial de-handing model has problems such as labor intensity, inaccurate cutting, uneven cutting surface, unstable catching, and damage of banana fruit, etc. The mapping relationship between the characteristic parameters of the movement posture of the cutter and the influencing factors of the contact stress of banana crown cutting in unstructured environments, and the changing rules of the bumping contact stress of complex multi-shaped banana fruit with the physical property parameters of the cushioning material are the theoretical and technical difficulties that urgently need to be solved in the realization of banana mechanical de-handing. The future research (research on the coupling mechanism of visual cognition-mechanism cutting and low-destructive catching method of full-field continuous de-handing of bananas under multi-constraint scenarios) should: (1) create a database of banana crown, obtain the optimal banana crown recognition model, develop a recognition and locating system of the cutting line of banana crown and obtain its spatial location information; (2) establish the discrete element mechanical model of banana crown and the interaction model between banana crown and the cutter, clarify the stress change and the force wave transmission characteristics of the cutting process, construct the multi-objective optimization equation of the cutting performance, obtain the best combination of cutting parameters, and ascertain the mechanisms of synergistic locating and continuous cutting of banana crown; (3) establish the contact mechanical model of banana fruit drop-bump, parse the bumping characteristics between banana fruit and cushioning material, construct mathematical equations to quantitatively assess damage results, and determine the detract catching method of banana fruit that matches the de-handing mode in multi-constraint scenarios. This study showed that the real-time identification and spatial positioning of fruit, the mechanical properties of crown and the optimization of cutting performance, the damage mechanism of fruit and its loss-reducing harvesting method are the three key breakthroughs in realizing the robotization of de-handing. The current bottleneck problems and future research directions of intelligent banana de-handing were pointed out in this paper, which can provide a theoretical basis for the optimal design of banana de-handing devices and provide technical support for promoting the practical application of intelligent de-handing equipment.</div></div>","PeriodicalId":52814,"journal":{"name":"Artificial Intelligence in Agriculture","volume":"15 1","pages":"Pages 1-11"},"PeriodicalIF":8.2,"publicationDate":"2024-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143097983","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}