Exploring a universal model for predicting blueberry soluble solids content based on hyperspectral imaging and transfer learning to address spatial heterogeneity challenge
{"title":"Exploring a universal model for predicting blueberry soluble solids content based on hyperspectral imaging and transfer learning to address spatial heterogeneity challenge","authors":"Guoliang Chen, Mianqing Yang, Guozheng Wang, Jingyuan Dai, Saiwei Yu, Baichao Chen, Dayang Liu","doi":"10.1016/j.saa.2025.125921","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate assessment of soluble solid content (SSC) in blueberries is crucial for quality evaluation. However, in real production lines, blueberries are usually in random placement and the biological heterogeneity of blueberry parts can lead to spectral distortion, which affects the accuracy of SSC prediction models in various placement situations. Therefore, it is crucial to investigate an appropriate modeling method to minimize these negative effects. In this paper, we propose an approach that combines hyperspectral imaging (HSI) technique, residual multilayer perceptron, and transfer learning to build a universal model capable of detecting blueberry SSC in various placement situations. The study acquired SSC values of 1150 blueberry samples and hyperspectral data at different surfaces (stem end, calyx end, and two parts of the equatorial plane), used a residual multilayer perceptron to build a local model, and fine-tuned the model by transfer learning to improve its generalization ability. The results show that the optimized model has significantly improved prediction accuracy on different surfaces, especially the model based on equatorial surface data (enhanced-equator-1) performs well. In the external validation set, the model achieved correlation coefficients of prediction (<em>r<sub>p</sub></em>) of 0.941, 0.924, 0.933, and 0.943; root mean square errors of prediction (RMSEP) of 0.539 %, 0.612 %, 0.571 %, and 0.542 %; and residual predictive deviations (RPD) of 2.91, 2.57, 2.75, and 2.90 on the four surfaces, respectively. This suggests that building a local model by residual multilayer perceptron and fine-tuning the model using the transfer learning method can eliminate the effect of the heterogeneity of blueberry parts on the model to a certain extent, enhance the robustness of the model to biological heterogeneity, and improve the accuracy of the detection of blueberry SSC under different placement situations.</div></div>","PeriodicalId":433,"journal":{"name":"Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy","volume":"334 ","pages":"Article 125921"},"PeriodicalIF":4.3000,"publicationDate":"2025-02-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy","FirstCategoryId":"92","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1386142525002276","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"SPECTROSCOPY","Score":null,"Total":0}
引用次数: 0
Abstract
Accurate assessment of soluble solid content (SSC) in blueberries is crucial for quality evaluation. However, in real production lines, blueberries are usually in random placement and the biological heterogeneity of blueberry parts can lead to spectral distortion, which affects the accuracy of SSC prediction models in various placement situations. Therefore, it is crucial to investigate an appropriate modeling method to minimize these negative effects. In this paper, we propose an approach that combines hyperspectral imaging (HSI) technique, residual multilayer perceptron, and transfer learning to build a universal model capable of detecting blueberry SSC in various placement situations. The study acquired SSC values of 1150 blueberry samples and hyperspectral data at different surfaces (stem end, calyx end, and two parts of the equatorial plane), used a residual multilayer perceptron to build a local model, and fine-tuned the model by transfer learning to improve its generalization ability. The results show that the optimized model has significantly improved prediction accuracy on different surfaces, especially the model based on equatorial surface data (enhanced-equator-1) performs well. In the external validation set, the model achieved correlation coefficients of prediction (rp) of 0.941, 0.924, 0.933, and 0.943; root mean square errors of prediction (RMSEP) of 0.539 %, 0.612 %, 0.571 %, and 0.542 %; and residual predictive deviations (RPD) of 2.91, 2.57, 2.75, and 2.90 on the four surfaces, respectively. This suggests that building a local model by residual multilayer perceptron and fine-tuning the model using the transfer learning method can eliminate the effect of the heterogeneity of blueberry parts on the model to a certain extent, enhance the robustness of the model to biological heterogeneity, and improve the accuracy of the detection of blueberry SSC under different placement situations.
期刊介绍:
Spectrochimica Acta, Part A: Molecular and Biomolecular Spectroscopy (SAA) is an interdisciplinary journal which spans from basic to applied aspects of optical spectroscopy in chemistry, medicine, biology, and materials science.
The journal publishes original scientific papers that feature high-quality spectroscopic data and analysis. From the broad range of optical spectroscopies, the emphasis is on electronic, vibrational or rotational spectra of molecules, rather than on spectroscopy based on magnetic moments.
Criteria for publication in SAA are novelty, uniqueness, and outstanding quality. Routine applications of spectroscopic techniques and computational methods are not appropriate.
Topics of particular interest of Spectrochimica Acta Part A include, but are not limited to:
Spectroscopy and dynamics of bioanalytical, biomedical, environmental, and atmospheric sciences,
Novel experimental techniques or instrumentation for molecular spectroscopy,
Novel theoretical and computational methods,
Novel applications in photochemistry and photobiology,
Novel interpretational approaches as well as advances in data analysis based on electronic or vibrational spectroscopy.