Lin Jiao;Qihuang Liu;Haiyun Liu;Peng Chen;Rujing Wang;Kang Liu;Shifeng Dong
{"title":"WheatNet: Attentional Path Aggregation Feature Pyramid Network for Precise Detection and Counting of Dense and Arbitrary-Oriented Wheat Spikes","authors":"Lin Jiao;Qihuang Liu;Haiyun Liu;Peng Chen;Rujing Wang;Kang Liu;Shifeng Dong","doi":"10.1109/TAFE.2024.3451489","DOIUrl":null,"url":null,"abstract":"Achieving the precise and real-time detection of wheat spikes play a crucial role in wheat growth monitoring for precision agriculture community. Machine-learning methods are commonly introduced to automatically detect and count the wheat spikes, which need carefully selected hand-crafted feature descriptors, leading to time-consuming and poor performance. The deep learning has become a promising technology for the accurate detection wheat spikes, owing to its powerful ability of feature extraction. However, the obtained wheat spike images from UAV still have serious overlap, dense distribution, various orientations, and large aspect ratios, leading to poor performance of recent wheat spike detection method. To address the demand of precise and fast detection and counting of wheat spike with dense distribution and arbitrary-orientation, a novel deep learning-based method, WheatNet, has been proposed. The attention mechanism has been introduced the process of feature fusing to highlight the important features of wheat spike as well as inhibit the useless information. Additionally, to optimize the parameters of the network, a loss function with soft dynamic label assignment is adopted to reduce the number of low-quality matches, which provides significant performance gains over other wheat spike detectors. Furthermore, to achieve the precise detection of wheat spike with multi-orientations, a large-scale oriented wheat spike dataset has been constructed, named RoWheat, including 900 images and 50419 annotations with dense distribution and various orientation. Experimental studies demonstrate that the proposed WheatNet achieves a recall of 99.7% and mAP of 91.8%, showing its promising performance gain compared to other state-of-the-art methods.","PeriodicalId":100637,"journal":{"name":"IEEE Transactions on AgriFood Electronics","volume":"2 2","pages":"606-616"},"PeriodicalIF":0.0000,"publicationDate":"2024-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on AgriFood Electronics","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10690265/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Achieving the precise and real-time detection of wheat spikes play a crucial role in wheat growth monitoring for precision agriculture community. Machine-learning methods are commonly introduced to automatically detect and count the wheat spikes, which need carefully selected hand-crafted feature descriptors, leading to time-consuming and poor performance. The deep learning has become a promising technology for the accurate detection wheat spikes, owing to its powerful ability of feature extraction. However, the obtained wheat spike images from UAV still have serious overlap, dense distribution, various orientations, and large aspect ratios, leading to poor performance of recent wheat spike detection method. To address the demand of precise and fast detection and counting of wheat spike with dense distribution and arbitrary-orientation, a novel deep learning-based method, WheatNet, has been proposed. The attention mechanism has been introduced the process of feature fusing to highlight the important features of wheat spike as well as inhibit the useless information. Additionally, to optimize the parameters of the network, a loss function with soft dynamic label assignment is adopted to reduce the number of low-quality matches, which provides significant performance gains over other wheat spike detectors. Furthermore, to achieve the precise detection of wheat spike with multi-orientations, a large-scale oriented wheat spike dataset has been constructed, named RoWheat, including 900 images and 50419 annotations with dense distribution and various orientation. Experimental studies demonstrate that the proposed WheatNet achieves a recall of 99.7% and mAP of 91.8%, showing its promising performance gain compared to other state-of-the-art methods.