Yinsheng Zhang , Xudong Yang , Yongbo Cheng , Xiaojun Wu , Xiulan Sun , Ruiqi Hou , Haiyan Wang
{"title":"Fruit freshness detection based on multi-task convolutional neural network","authors":"Yinsheng Zhang , Xudong Yang , Yongbo Cheng , Xiaojun Wu , Xiulan Sun , Ruiqi Hou , Haiyan Wang","doi":"10.1016/j.crfs.2024.100733","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><p>Fruit freshness detection by computer vision is essential for many agricultural applications, e.g., automatic harvesting and supply chain monitoring. This paper proposes to use the multi-task learning (MTL) paradigm to build a deep convolutional neural work for fruit freshness detection.</p></div><div><h3>Results</h3><p>We design an MTL model that optimizes the freshness detection (T<sub>1</sub>) and fruit type classification (T<sub>2</sub>) tasks in parallel. The model uses a shared CNN (convolutional neural network) subnet and two FC (fully connected) task heads. The shared CNN acts as a feature extraction module and feeds the two task heads with common semantic features. Based on an open fruit image dataset, we conducted a comparative study of MTL and single-task learning (STL) paradigms. The STL models use the same CNN subnet with only one specific task head. In the MTL scenario, the T<sub>1</sub> and T<sub>2</sub> mean accuracies on the test set are 93.24% and 88.66%, respectively. Meanwhile, for STL, the two accuracies are 92.50% and 87.22%. Statistical tests report significant differences between MTL and STL on T1 and T2 test accuracies. We further investigated the extracted feature vectors (semantic embeddings) from the two STL models. The vectors have an averaged 0.7 cosine similarity on the entire dataset, with most values lying in the 0.6–0.8 range. This indicates a between-task correlation and justifies the effectiveness of the proposed MTL approach.</p></div><div><h3>Conclusion</h3><p>This study proves that MTL exploits the mutual correlation between two or more relevant tasks and can maximally share their underlying feature extraction process. we envision this approach to be extended to other domains that involve multiple interconnected tasks.</p></div>","PeriodicalId":10939,"journal":{"name":"Current Research in Food Science","volume":"8 ","pages":"Article 100733"},"PeriodicalIF":6.2000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2665927124000595/pdfft?md5=120bd65704dc6d6b245af1a18ca15f6b&pid=1-s2.0-S2665927124000595-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Current Research in Food Science","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2665927124000595","RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"FOOD SCIENCE & TECHNOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background
Fruit freshness detection by computer vision is essential for many agricultural applications, e.g., automatic harvesting and supply chain monitoring. This paper proposes to use the multi-task learning (MTL) paradigm to build a deep convolutional neural work for fruit freshness detection.
Results
We design an MTL model that optimizes the freshness detection (T1) and fruit type classification (T2) tasks in parallel. The model uses a shared CNN (convolutional neural network) subnet and two FC (fully connected) task heads. The shared CNN acts as a feature extraction module and feeds the two task heads with common semantic features. Based on an open fruit image dataset, we conducted a comparative study of MTL and single-task learning (STL) paradigms. The STL models use the same CNN subnet with only one specific task head. In the MTL scenario, the T1 and T2 mean accuracies on the test set are 93.24% and 88.66%, respectively. Meanwhile, for STL, the two accuracies are 92.50% and 87.22%. Statistical tests report significant differences between MTL and STL on T1 and T2 test accuracies. We further investigated the extracted feature vectors (semantic embeddings) from the two STL models. The vectors have an averaged 0.7 cosine similarity on the entire dataset, with most values lying in the 0.6–0.8 range. This indicates a between-task correlation and justifies the effectiveness of the proposed MTL approach.
Conclusion
This study proves that MTL exploits the mutual correlation between two or more relevant tasks and can maximally share their underlying feature extraction process. we envision this approach to be extended to other domains that involve multiple interconnected tasks.
期刊介绍:
Current Research in Food Science is an international peer-reviewed journal dedicated to advancing the breadth of knowledge in the field of food science. It serves as a platform for publishing original research articles and short communications that encompass a wide array of topics, including food chemistry, physics, microbiology, nutrition, nutraceuticals, process and package engineering, materials science, food sustainability, and food security. By covering these diverse areas, the journal aims to provide a comprehensive source of the latest scientific findings and technological advancements that are shaping the future of the food industry. The journal's scope is designed to address the multidisciplinary nature of food science, reflecting its commitment to promoting innovation and ensuring the safety and quality of the food supply.