Pengjun Xiang, Fei Pan, Jun Li, Haibo Pu, Yan Guo, Xiaoyu Zhao, Mengdie Hu, Boda Zhang, Dawei He
{"title":"FFTCA: a Feature Fusion Mechanism Based on Fast Fourier Transform for Rapid Classification of Apple Damage and Real-Time Sorting by Robots","authors":"Pengjun Xiang, Fei Pan, Jun Li, Haibo Pu, Yan Guo, Xiaoyu Zhao, Mengdie Hu, Boda Zhang, Dawei He","doi":"10.1007/s11947-024-03551-5","DOIUrl":null,"url":null,"abstract":"<p>Apples are susceptible to various types of damage during the production process. Such damage not only affects the appearance and edibility of the apples but also can result in the infection of healthy apples, leading to secondary economic losses. Therefore, it is crucial to properly handle damaged apples and re-sort them to enhance their utilization value and optimize resource use. To quickly and accurately identify apple damage and perform sorting in real time, addressing the resource limitations of mobile devices and the difficulty of extracting deep network image features, this study proposes a lightweight real-time apple damage classification network, Fast Fourier Transform Channel Attention (FFTCA)-YOLOv8n-cls. The FFTCA module focuses on the frequency domain feature information of images in deep networks, enhancing the network’s feature extraction capabilities. Additionally, it integrates Convolutional Block Attention Module (CBAM) and Distribution Shifting Convolution to capture channel and spatial information of images in shallow networks and accelerate network inference. Finally, FFTCA-YOLOv8n-cls is compared with typical lightweight classification networks. Experimental results show that this network has better classification accuracy and faster inference speed. Specifically, the FFTCA-YOLOv8n-cls network is only 0.601 MB in size, achieving a classification accuracy of 96.03%, a recall of 96.08%, and an F1-score of 96.05%, demonstrating its feasibility in real-time apple damage sorting. Moreover, this study applies the network to sorting robots, completing backend inference on servers and real-time inference on embedded devices to adapt to different working environments, achieving real-time sorting of damaged apples and validating the network’s application effectiveness.</p>","PeriodicalId":562,"journal":{"name":"Food and Bioprocess Technology","volume":"71 1","pages":""},"PeriodicalIF":5.3000,"publicationDate":"2024-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Food and Bioprocess Technology","FirstCategoryId":"97","ListUrlMain":"https://doi.org/10.1007/s11947-024-03551-5","RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"FOOD SCIENCE & TECHNOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Apples are susceptible to various types of damage during the production process. Such damage not only affects the appearance and edibility of the apples but also can result in the infection of healthy apples, leading to secondary economic losses. Therefore, it is crucial to properly handle damaged apples and re-sort them to enhance their utilization value and optimize resource use. To quickly and accurately identify apple damage and perform sorting in real time, addressing the resource limitations of mobile devices and the difficulty of extracting deep network image features, this study proposes a lightweight real-time apple damage classification network, Fast Fourier Transform Channel Attention (FFTCA)-YOLOv8n-cls. The FFTCA module focuses on the frequency domain feature information of images in deep networks, enhancing the network’s feature extraction capabilities. Additionally, it integrates Convolutional Block Attention Module (CBAM) and Distribution Shifting Convolution to capture channel and spatial information of images in shallow networks and accelerate network inference. Finally, FFTCA-YOLOv8n-cls is compared with typical lightweight classification networks. Experimental results show that this network has better classification accuracy and faster inference speed. Specifically, the FFTCA-YOLOv8n-cls network is only 0.601 MB in size, achieving a classification accuracy of 96.03%, a recall of 96.08%, and an F1-score of 96.05%, demonstrating its feasibility in real-time apple damage sorting. Moreover, this study applies the network to sorting robots, completing backend inference on servers and real-time inference on embedded devices to adapt to different working environments, achieving real-time sorting of damaged apples and validating the network’s application effectiveness.
期刊介绍:
Food and Bioprocess Technology provides an effective and timely platform for cutting-edge high quality original papers in the engineering and science of all types of food processing technologies, from the original food supply source to the consumer’s dinner table. It aims to be a leading international journal for the multidisciplinary agri-food research community.
The journal focuses especially on experimental or theoretical research findings that have the potential for helping the agri-food industry to improve process efficiency, enhance product quality and, extend shelf-life of fresh and processed agri-food products. The editors present critical reviews on new perspectives to established processes, innovative and emerging technologies, and trends and future research in food and bioproducts processing. The journal also publishes short communications for rapidly disseminating preliminary results, letters to the Editor on recent developments and controversy, and book reviews.