{"title":"Application of Efficient Channel Attention and Small-Scale Layer to YOLOv5s for Wheat Ears Detection","authors":"Feijie Dai, Yongan Xue, Linsheng Huang, Wenjiang Huang, Jinling Zhao","doi":"10.1007/s12524-024-01913-2","DOIUrl":null,"url":null,"abstract":"<p>Wheat is a crucial global grain crop that plays a vital role in ensuring food security worldwide. The automatic and accurate counting of wheat ears is essential for assessing wheat yield. However, the detection accuracy is greatly affected by the complex background and small target size. To address these challenges and improve the performance, we propose an enhanced YOLOv5s method. In the backbone, we introduce the efficient channel attention (ECA) to enhance the feature extraction capability of the original C3 module. Additionally, we incorporate a small-scale detection layer in the neck and prediction stages. This modification expands the original three-scale feature detection (20 × 20, 40 × 40, and 80 × 80) to a four-scale feature detection (20 × 20, 40 × 40, 80 × 80, and 160 × 160), thereby enhancing the recognition accuracy of small targets. Experimental results demonstrate that our method achieves an Accuracy (Acc) of 93.97%, which represents a 2.94% improvement over the YOLOv5s. Additionally, our method has a mean absolute error (MAE) of 0.57, a reduction of 0.6 from the YOLOv5s. The Acc of the improved YOLOv5s approaches that of YOLOv7; however, the giga floating-point operations per second (GFLOPs) and inference speed of the enhanced YOLOv5s are significantly lower than those of YOLOv7. Across various phases of the wheat test dataset, the enhanced model demonstrated superior performance. As a result, the enhanced YOLOv5s enhances its suitability for challenging field conditions and offers a dependable technical framework for ear detection and wheat yield estimation.</p>","PeriodicalId":17510,"journal":{"name":"Journal of the Indian Society of Remote Sensing","volume":"31 1","pages":""},"PeriodicalIF":2.2000,"publicationDate":"2024-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the Indian Society of Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1007/s12524-024-01913-2","RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Wheat is a crucial global grain crop that plays a vital role in ensuring food security worldwide. The automatic and accurate counting of wheat ears is essential for assessing wheat yield. However, the detection accuracy is greatly affected by the complex background and small target size. To address these challenges and improve the performance, we propose an enhanced YOLOv5s method. In the backbone, we introduce the efficient channel attention (ECA) to enhance the feature extraction capability of the original C3 module. Additionally, we incorporate a small-scale detection layer in the neck and prediction stages. This modification expands the original three-scale feature detection (20 × 20, 40 × 40, and 80 × 80) to a four-scale feature detection (20 × 20, 40 × 40, 80 × 80, and 160 × 160), thereby enhancing the recognition accuracy of small targets. Experimental results demonstrate that our method achieves an Accuracy (Acc) of 93.97%, which represents a 2.94% improvement over the YOLOv5s. Additionally, our method has a mean absolute error (MAE) of 0.57, a reduction of 0.6 from the YOLOv5s. The Acc of the improved YOLOv5s approaches that of YOLOv7; however, the giga floating-point operations per second (GFLOPs) and inference speed of the enhanced YOLOv5s are significantly lower than those of YOLOv7. Across various phases of the wheat test dataset, the enhanced model demonstrated superior performance. As a result, the enhanced YOLOv5s enhances its suitability for challenging field conditions and offers a dependable technical framework for ear detection and wheat yield estimation.
期刊介绍:
The aims and scope of the Journal of the Indian Society of Remote Sensing are to help towards advancement, dissemination and application of the knowledge of Remote Sensing technology, which is deemed to include photo interpretation, photogrammetry, aerial photography, image processing, and other related technologies in the field of survey, planning and management of natural resources and other areas of application where the technology is considered to be appropriate, to promote interaction among all persons, bodies, institutions (private and/or state-owned) and industries interested in achieving advancement, dissemination and application of the technology, to encourage and undertake research in remote sensing and related technologies and to undertake and execute all acts which shall promote all or any of the aims and objectives of the Indian Society of Remote Sensing.