Zhijie Zhang , Hehe Xie, Kailiang Zhang, Li Yang, Dongxing Zhang, Tao Cui, Xiantao He
{"title":"多音频特征地图融合西瓜品质检测","authors":"Zhijie Zhang , Hehe Xie, Kailiang Zhang, Li Yang, Dongxing Zhang, Tao Cui, Xiantao He","doi":"10.1016/j.jfoodeng.2024.112452","DOIUrl":null,"url":null,"abstract":"<div><div>Non-destructive detection of the internal quality of watermelons after harvest can significantly reduce losses and waste during the subsequent sales process. However, existing algorithms often struggle with limited generalization and high iteration costs. This study leverages audio feature maps of watermelons and employs deep learning to classify ripeness (ripe or raw) and internal defects (hollow or juicy). A hybrid attention mechanism, DWTR, is proposed to enhance feature extraction by adaptively capturing spatial and channel information. Additionally, re-parameterization branches are introduced to boost model representation without increasing inference overhead. The Rep-MBF model, a multi-branch fusion approach, was developed utilizing Mel spectrograms and short-time fourier transform (STFT) spectrograms of single-tap audio as dual inputs. The Rep-MBF model demonstrated strong performance with an accuracy of 97.81%, precision of 97.49%, recall of 97.35%, and F1-Score of 97.42% on the test set. The model's inference time on a Raspberry Pi 4B (8 GB) edge computing platform was only 16.24 ms, meeting the accuracy and speed demands for watermelon internal quality detection. In real-world detection scenarios, the Rep-MBF model accurately predicted 44 out of 48 watermelon samples, achieving an overall detection success rate of 91.67%, demonstrating excellent performance in practical watermelon detection applications. The Rep-MBF model achieves high-precision, low-latency detection of both watermelon ripeness and internal defects, while also demonstrating excellent robustness. These combined attributes provide strong algorithmic support for the development of portable nondestructive detection devices for watermelon internal quality.</div></div>","PeriodicalId":359,"journal":{"name":"Journal of Food Engineering","volume":"391 ","pages":"Article 112452"},"PeriodicalIF":5.8000,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multi-audio feature maps fusion for watermelon quality detection\",\"authors\":\"Zhijie Zhang , Hehe Xie, Kailiang Zhang, Li Yang, Dongxing Zhang, Tao Cui, Xiantao He\",\"doi\":\"10.1016/j.jfoodeng.2024.112452\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Non-destructive detection of the internal quality of watermelons after harvest can significantly reduce losses and waste during the subsequent sales process. However, existing algorithms often struggle with limited generalization and high iteration costs. This study leverages audio feature maps of watermelons and employs deep learning to classify ripeness (ripe or raw) and internal defects (hollow or juicy). A hybrid attention mechanism, DWTR, is proposed to enhance feature extraction by adaptively capturing spatial and channel information. Additionally, re-parameterization branches are introduced to boost model representation without increasing inference overhead. The Rep-MBF model, a multi-branch fusion approach, was developed utilizing Mel spectrograms and short-time fourier transform (STFT) spectrograms of single-tap audio as dual inputs. The Rep-MBF model demonstrated strong performance with an accuracy of 97.81%, precision of 97.49%, recall of 97.35%, and F1-Score of 97.42% on the test set. The model's inference time on a Raspberry Pi 4B (8 GB) edge computing platform was only 16.24 ms, meeting the accuracy and speed demands for watermelon internal quality detection. In real-world detection scenarios, the Rep-MBF model accurately predicted 44 out of 48 watermelon samples, achieving an overall detection success rate of 91.67%, demonstrating excellent performance in practical watermelon detection applications. The Rep-MBF model achieves high-precision, low-latency detection of both watermelon ripeness and internal defects, while also demonstrating excellent robustness. These combined attributes provide strong algorithmic support for the development of portable nondestructive detection devices for watermelon internal quality.</div></div>\",\"PeriodicalId\":359,\"journal\":{\"name\":\"Journal of Food Engineering\",\"volume\":\"391 \",\"pages\":\"Article 112452\"},\"PeriodicalIF\":5.8000,\"publicationDate\":\"2025-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Food Engineering\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0260877424005181\",\"RegionNum\":2,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/12/19 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, CHEMICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Food Engineering","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0260877424005181","RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/12/19 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"ENGINEERING, CHEMICAL","Score":null,"Total":0}
引用次数: 0
摘要
在西瓜收获后对西瓜内部质量进行无损检测,可以显著减少西瓜在后续销售过程中的损失和浪费。然而,现有的算法往往与有限的泛化和高迭代成本作斗争。这项研究利用西瓜的音频特征图,并采用深度学习对成熟度(成熟或生)和内部缺陷(空心或多汁)进行分类。提出了一种混合注意机制DWTR,通过自适应捕获空间和信道信息来增强特征提取。此外,在不增加推理开销的情况下,引入了重参数化分支来增强模型表示。利用Mel谱图和短时傅立叶变换(STFT)谱图作为双输入,开发了一种多分支融合方法Rep-MBF模型。Rep-MBF模型在测试集上的准确率为97.81%,精密度为97.49%,召回率为97.35%,F1-Score为97.42%。该模型在Raspberry Pi 4B (8gb)边缘计算平台上的推理时间仅为16.24 ms,满足西瓜内部质量检测的精度和速度要求。在实际检测场景中,Rep-MBF模型准确预测了48个西瓜样本中的44个,总体检测成功率为91.67%,在实际西瓜检测应用中表现出优异的性能。Rep-MBF模型实现了西瓜成熟度和内部缺陷的高精度、低延迟检测,同时具有出色的鲁棒性。这些综合属性为西瓜内部品质便携式无损检测设备的开发提供了强有力的算法支持。
Multi-audio feature maps fusion for watermelon quality detection
Non-destructive detection of the internal quality of watermelons after harvest can significantly reduce losses and waste during the subsequent sales process. However, existing algorithms often struggle with limited generalization and high iteration costs. This study leverages audio feature maps of watermelons and employs deep learning to classify ripeness (ripe or raw) and internal defects (hollow or juicy). A hybrid attention mechanism, DWTR, is proposed to enhance feature extraction by adaptively capturing spatial and channel information. Additionally, re-parameterization branches are introduced to boost model representation without increasing inference overhead. The Rep-MBF model, a multi-branch fusion approach, was developed utilizing Mel spectrograms and short-time fourier transform (STFT) spectrograms of single-tap audio as dual inputs. The Rep-MBF model demonstrated strong performance with an accuracy of 97.81%, precision of 97.49%, recall of 97.35%, and F1-Score of 97.42% on the test set. The model's inference time on a Raspberry Pi 4B (8 GB) edge computing platform was only 16.24 ms, meeting the accuracy and speed demands for watermelon internal quality detection. In real-world detection scenarios, the Rep-MBF model accurately predicted 44 out of 48 watermelon samples, achieving an overall detection success rate of 91.67%, demonstrating excellent performance in practical watermelon detection applications. The Rep-MBF model achieves high-precision, low-latency detection of both watermelon ripeness and internal defects, while also demonstrating excellent robustness. These combined attributes provide strong algorithmic support for the development of portable nondestructive detection devices for watermelon internal quality.
期刊介绍:
The journal publishes original research and review papers on any subject at the interface between food and engineering, particularly those of relevance to industry, including:
Engineering properties of foods, food physics and physical chemistry; processing, measurement, control, packaging, storage and distribution; engineering aspects of the design and production of novel foods and of food service and catering; design and operation of food processes, plant and equipment; economics of food engineering, including the economics of alternative processes.
Accounts of food engineering achievements are of particular value.