T. Singhpoo, K. Saengprachatanarug, S. Wongpichet, J. Posom, Kanda Runapongsa Saikaew
{"title":"Cassava stalk detection for a cassava harvesting robot based on YOLO v4 and Mask R-CNN","authors":"T. Singhpoo, K. Saengprachatanarug, S. Wongpichet, J. Posom, Kanda Runapongsa Saikaew","doi":"10.4081/jae.2023.1301","DOIUrl":null,"url":null,"abstract":"The quality of fresh cassava roots can be increased through the use of precision equipment. As a first step towards developing an automatic cassava root cutting system, this study demonstrates the use of a computer vision system with deep learning for cassava stalk detection. An RGB image of a cassava tree mounted on a cassava-pulling machine was captured, and the YOLO v4 model and two Mask R-CNN models with ResNet 101 and ResNet 50 base architectures were employed to train the weights to predict the position of the cassava stalk. One hundred test images of stalks of various shapes and sizes were used to determine the grasping point and inclination, and the results from manual annotation were compared with the predicted results. Regarding localisation, Mask R-CNN with ResNet 101 gave a significantly higher performance than the other models, with an F1 score and a mean IoU of 0.81 and 0.70, respectively. YOLO v4 showed the highest correlation for the x- and y-coordinates for the prediction of the grasping point, with values for R2 of 0.89 and 0.53, respectively. For inclination prediction, Mask R-CNN with ResNet 101 and Mask R-CNN with ResNet 50 gave the same level of correlation, with values for R2 of 0.50 and 0.61, respectively. These results were acceptable for use as design criteria for developing a cassava rootcutting robot.","PeriodicalId":48507,"journal":{"name":"Journal of Agricultural Engineering","volume":"36 1","pages":""},"PeriodicalIF":2.4000,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Agricultural Engineering","FirstCategoryId":"97","ListUrlMain":"https://doi.org/10.4081/jae.2023.1301","RegionNum":4,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AGRICULTURAL ENGINEERING","Score":null,"Total":0}
引用次数: 0
Abstract
The quality of fresh cassava roots can be increased through the use of precision equipment. As a first step towards developing an automatic cassava root cutting system, this study demonstrates the use of a computer vision system with deep learning for cassava stalk detection. An RGB image of a cassava tree mounted on a cassava-pulling machine was captured, and the YOLO v4 model and two Mask R-CNN models with ResNet 101 and ResNet 50 base architectures were employed to train the weights to predict the position of the cassava stalk. One hundred test images of stalks of various shapes and sizes were used to determine the grasping point and inclination, and the results from manual annotation were compared with the predicted results. Regarding localisation, Mask R-CNN with ResNet 101 gave a significantly higher performance than the other models, with an F1 score and a mean IoU of 0.81 and 0.70, respectively. YOLO v4 showed the highest correlation for the x- and y-coordinates for the prediction of the grasping point, with values for R2 of 0.89 and 0.53, respectively. For inclination prediction, Mask R-CNN with ResNet 101 and Mask R-CNN with ResNet 50 gave the same level of correlation, with values for R2 of 0.50 and 0.61, respectively. These results were acceptable for use as design criteria for developing a cassava rootcutting robot.
期刊介绍:
The Journal of Agricultural Engineering (JAE) is the official journal of the Italian Society of Agricultural Engineering supported by University of Bologna, Italy. The subject matter covers a complete and interdisciplinary range of research in engineering for agriculture and biosystems.