Prabhakar Maheswari , Purushothaman Raja , Manoj Karkee , Mugundhan Raja , Rahmath Ulla Baig , Kiet Tran Trung , Vinh Truong Hoang
{"title":"用于苹果园水果检测和定位的改进型 DeepLabv3+ 架构的性能分析","authors":"Prabhakar Maheswari , Purushothaman Raja , Manoj Karkee , Mugundhan Raja , Rahmath Ulla Baig , Kiet Tran Trung , Vinh Truong Hoang","doi":"10.1016/j.atech.2024.100729","DOIUrl":null,"url":null,"abstract":"<div><div>Deep learning plays an important role in automating various operations in fruit crop production including irrigation, nutrition management, yield estimation and harvesting. Yield estimation is essential in fruit crop production as it helps farmers optimize cultivation, harvesting, logistics and marketing operations. Furthermore, fruit detection and localization is a very important step in the development of an automated fruit harvesting system. Hence, an intelligent system was proposed in this study for apple fruit detection and localization using modified DeepLabv3+, semantic segmentation based architecture. The finetuned customizations (such as modifying the activation function, optimization technique and loss function) were performed in the original architecture of DeepLabv3+ and its performance was analyzed. The modified model was trained with the training dataset of 2600 apple tree images. Images were split into 80 % of training and 20 % of validation. The modified architecture was also compared with the other variants of DeepLabv3+ architectures. After training, the model was tested with the unobserved test dataset of 101 images. The test results demonstrated the Mean Accuracy (<span><math><msub><mi>M</mi><mrow><mi>A</mi><mi>c</mi><mi>c</mi></mrow></msub></math></span>) of 98.58 % and the Mean Intersection over Union (<span><math><msub><mi>M</mi><mrow><mi>I</mi><mi>o</mi><mi>U</mi></mrow></msub></math></span>) of 96.66 % without compromising the inference time (i.e., 15 ms). The proposed model revealed the improved results than the original model which attained a <span><math><msub><mi>M</mi><mrow><mi>A</mi><mi>c</mi><mi>c</mi></mrow></msub></math></span> of 92.12 % and <span><math><msub><mi>M</mi><mrow><mi>I</mi><mi>o</mi><mi>U</mi></mrow></msub></math></span> of 88.94 % for the same dataset with the inference time of 40 ms. To ascertain further, the modified model was compared with other single stage detectors, including Fully Convolutional Network (FCN) and U-Net. FCN attained a <span><math><msub><mi>M</mi><mrow><mi>A</mi><mi>c</mi><mi>c</mi></mrow></msub></math></span>and<span><math><msub><mi>M</mi><mrow><mi>I</mi><mi>o</mi><mi>U</mi></mrow></msub></math></span> of 77.5 % and 77.27 %, respectively whereas U-Net resulted a <span><math><msub><mi>M</mi><mrow><mi>A</mi><mi>c</mi><mi>c</mi></mrow></msub></math></span> and <span><math><msub><mi>M</mi><mrow><mi>I</mi><mi>o</mi><mi>U</mi></mrow></msub></math></span> of 83.95 % and 81.09 %, respectively. Results demonstrated that the modified DeepLabv3+ with ResNet18 is capable of detecting the apple fruits by mitigating the effects of class imbalance which is the major drawback in single stage detectors. Further, better detection and localization of apple fruits can lead to the precise picking by the robotic system.</div></div>","PeriodicalId":74813,"journal":{"name":"Smart agricultural technology","volume":"10 ","pages":"Article 100729"},"PeriodicalIF":6.3000,"publicationDate":"2024-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Performance analysis of modified DeepLabv3+ architecture for fruit detection and localization in apple orchards\",\"authors\":\"Prabhakar Maheswari , Purushothaman Raja , Manoj Karkee , Mugundhan Raja , Rahmath Ulla Baig , Kiet Tran Trung , Vinh Truong Hoang\",\"doi\":\"10.1016/j.atech.2024.100729\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Deep learning plays an important role in automating various operations in fruit crop production including irrigation, nutrition management, yield estimation and harvesting. Yield estimation is essential in fruit crop production as it helps farmers optimize cultivation, harvesting, logistics and marketing operations. Furthermore, fruit detection and localization is a very important step in the development of an automated fruit harvesting system. Hence, an intelligent system was proposed in this study for apple fruit detection and localization using modified DeepLabv3+, semantic segmentation based architecture. The finetuned customizations (such as modifying the activation function, optimization technique and loss function) were performed in the original architecture of DeepLabv3+ and its performance was analyzed. The modified model was trained with the training dataset of 2600 apple tree images. Images were split into 80 % of training and 20 % of validation. The modified architecture was also compared with the other variants of DeepLabv3+ architectures. After training, the model was tested with the unobserved test dataset of 101 images. The test results demonstrated the Mean Accuracy (<span><math><msub><mi>M</mi><mrow><mi>A</mi><mi>c</mi><mi>c</mi></mrow></msub></math></span>) of 98.58 % and the Mean Intersection over Union (<span><math><msub><mi>M</mi><mrow><mi>I</mi><mi>o</mi><mi>U</mi></mrow></msub></math></span>) of 96.66 % without compromising the inference time (i.e., 15 ms). The proposed model revealed the improved results than the original model which attained a <span><math><msub><mi>M</mi><mrow><mi>A</mi><mi>c</mi><mi>c</mi></mrow></msub></math></span> of 92.12 % and <span><math><msub><mi>M</mi><mrow><mi>I</mi><mi>o</mi><mi>U</mi></mrow></msub></math></span> of 88.94 % for the same dataset with the inference time of 40 ms. To ascertain further, the modified model was compared with other single stage detectors, including Fully Convolutional Network (FCN) and U-Net. FCN attained a <span><math><msub><mi>M</mi><mrow><mi>A</mi><mi>c</mi><mi>c</mi></mrow></msub></math></span>and<span><math><msub><mi>M</mi><mrow><mi>I</mi><mi>o</mi><mi>U</mi></mrow></msub></math></span> of 77.5 % and 77.27 %, respectively whereas U-Net resulted a <span><math><msub><mi>M</mi><mrow><mi>A</mi><mi>c</mi><mi>c</mi></mrow></msub></math></span> and <span><math><msub><mi>M</mi><mrow><mi>I</mi><mi>o</mi><mi>U</mi></mrow></msub></math></span> of 83.95 % and 81.09 %, respectively. Results demonstrated that the modified DeepLabv3+ with ResNet18 is capable of detecting the apple fruits by mitigating the effects of class imbalance which is the major drawback in single stage detectors. Further, better detection and localization of apple fruits can lead to the precise picking by the robotic system.</div></div>\",\"PeriodicalId\":74813,\"journal\":{\"name\":\"Smart agricultural technology\",\"volume\":\"10 \",\"pages\":\"Article 100729\"},\"PeriodicalIF\":6.3000,\"publicationDate\":\"2024-12-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Smart agricultural technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2772375524003332\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AGRICULTURAL ENGINEERING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Smart agricultural technology","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772375524003332","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURAL ENGINEERING","Score":null,"Total":0}
Performance analysis of modified DeepLabv3+ architecture for fruit detection and localization in apple orchards
Deep learning plays an important role in automating various operations in fruit crop production including irrigation, nutrition management, yield estimation and harvesting. Yield estimation is essential in fruit crop production as it helps farmers optimize cultivation, harvesting, logistics and marketing operations. Furthermore, fruit detection and localization is a very important step in the development of an automated fruit harvesting system. Hence, an intelligent system was proposed in this study for apple fruit detection and localization using modified DeepLabv3+, semantic segmentation based architecture. The finetuned customizations (such as modifying the activation function, optimization technique and loss function) were performed in the original architecture of DeepLabv3+ and its performance was analyzed. The modified model was trained with the training dataset of 2600 apple tree images. Images were split into 80 % of training and 20 % of validation. The modified architecture was also compared with the other variants of DeepLabv3+ architectures. After training, the model was tested with the unobserved test dataset of 101 images. The test results demonstrated the Mean Accuracy () of 98.58 % and the Mean Intersection over Union () of 96.66 % without compromising the inference time (i.e., 15 ms). The proposed model revealed the improved results than the original model which attained a of 92.12 % and of 88.94 % for the same dataset with the inference time of 40 ms. To ascertain further, the modified model was compared with other single stage detectors, including Fully Convolutional Network (FCN) and U-Net. FCN attained a and of 77.5 % and 77.27 %, respectively whereas U-Net resulted a and of 83.95 % and 81.09 %, respectively. Results demonstrated that the modified DeepLabv3+ with ResNet18 is capable of detecting the apple fruits by mitigating the effects of class imbalance which is the major drawback in single stage detectors. Further, better detection and localization of apple fruits can lead to the precise picking by the robotic system.