{"title":"Cost-efficient training of hyperspectral deep learning models for the detection of contaminating grains in bulk oats by fluorescent tagging","authors":"Emma Van Puyenbroeck, Wouter Saeys","doi":"10.1016/j.saa.2025.125856","DOIUrl":null,"url":null,"abstract":"<div><div>Computer vision based on instance segmentation deep learning models offers great potential for automating many visual inspection tasks, such as the detection of contaminating grains in bulk oats, a nutrient rich grain which is well-tolerated by people suffering from gluten intolerance. Whereas distinguishing foreign objects is often relatively easy with the naked eye, it is much more difficult to distinguish highly similar products, e.g. different grain species or varieties. The subtle differences between such products may be captured by deep learning models combining the spectral and spatial features that are acquired with spectral cameras, measuring a spectral fingerprint for each pixel in an image. However, the training of supervised hyperspectral deep learning models requires large amounts of labelled data. As manual labelling is a tedious job and may induce labelling errors, we propose an alternative approach involving ‘tagging’ of the targets with fluorescent labels that make the targets ‘light up’ under UV illumination to efficiently generate ground truth segmentation masks. As these fluorescent labels are only visible in the UV range of the spectrum, the spectra in the SWIR range can still be used to discriminate grains from each other, making it a cost-efficient labeling technique for hyperspectral data, where labeled datasets are scarce. The primary objective of this study was to determine whether a hyperspectral deep learning segmentation model to detect uncoated spelt kernels in a bulk of oats could be trained more efficiently by coating the spelt kernels in the training images with a fluorescent paint. To this end, both a classical pixel classifier, as a benchmark model, and a deep learning segmentation model were trained on a bulk mixture of oats contaminated with coated spelt kernels and evaluated on bulk mixtures of oats and non-coated spelt kernels to assess their ability to generalize to uncoated samples. The deep learning model (RMSE = 1.34 %) outperformed the pixel classifier (RMSE = 1.91 %) in predicting the mass percentage of spelt without coating in a bulk mixture of oats, because it was more successful in segmenting the kernel edges. This indicates that the traditional pixel classification analysis could be bypassed in future research by efficiently generating the ground truth labels required for training hyperspectral deep learning models through the use of a fluorescent coating.</div></div>","PeriodicalId":433,"journal":{"name":"Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy","volume":"332 ","pages":"Article 125856"},"PeriodicalIF":4.3000,"publicationDate":"2025-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy","FirstCategoryId":"92","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1386142525001623","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"SPECTROSCOPY","Score":null,"Total":0}
引用次数: 0
Abstract
Computer vision based on instance segmentation deep learning models offers great potential for automating many visual inspection tasks, such as the detection of contaminating grains in bulk oats, a nutrient rich grain which is well-tolerated by people suffering from gluten intolerance. Whereas distinguishing foreign objects is often relatively easy with the naked eye, it is much more difficult to distinguish highly similar products, e.g. different grain species or varieties. The subtle differences between such products may be captured by deep learning models combining the spectral and spatial features that are acquired with spectral cameras, measuring a spectral fingerprint for each pixel in an image. However, the training of supervised hyperspectral deep learning models requires large amounts of labelled data. As manual labelling is a tedious job and may induce labelling errors, we propose an alternative approach involving ‘tagging’ of the targets with fluorescent labels that make the targets ‘light up’ under UV illumination to efficiently generate ground truth segmentation masks. As these fluorescent labels are only visible in the UV range of the spectrum, the spectra in the SWIR range can still be used to discriminate grains from each other, making it a cost-efficient labeling technique for hyperspectral data, where labeled datasets are scarce. The primary objective of this study was to determine whether a hyperspectral deep learning segmentation model to detect uncoated spelt kernels in a bulk of oats could be trained more efficiently by coating the spelt kernels in the training images with a fluorescent paint. To this end, both a classical pixel classifier, as a benchmark model, and a deep learning segmentation model were trained on a bulk mixture of oats contaminated with coated spelt kernels and evaluated on bulk mixtures of oats and non-coated spelt kernels to assess their ability to generalize to uncoated samples. The deep learning model (RMSE = 1.34 %) outperformed the pixel classifier (RMSE = 1.91 %) in predicting the mass percentage of spelt without coating in a bulk mixture of oats, because it was more successful in segmenting the kernel edges. This indicates that the traditional pixel classification analysis could be bypassed in future research by efficiently generating the ground truth labels required for training hyperspectral deep learning models through the use of a fluorescent coating.
期刊介绍:
Spectrochimica Acta, Part A: Molecular and Biomolecular Spectroscopy (SAA) is an interdisciplinary journal which spans from basic to applied aspects of optical spectroscopy in chemistry, medicine, biology, and materials science.
The journal publishes original scientific papers that feature high-quality spectroscopic data and analysis. From the broad range of optical spectroscopies, the emphasis is on electronic, vibrational or rotational spectra of molecules, rather than on spectroscopy based on magnetic moments.
Criteria for publication in SAA are novelty, uniqueness, and outstanding quality. Routine applications of spectroscopic techniques and computational methods are not appropriate.
Topics of particular interest of Spectrochimica Acta Part A include, but are not limited to:
Spectroscopy and dynamics of bioanalytical, biomedical, environmental, and atmospheric sciences,
Novel experimental techniques or instrumentation for molecular spectroscopy,
Novel theoretical and computational methods,
Novel applications in photochemistry and photobiology,
Novel interpretational approaches as well as advances in data analysis based on electronic or vibrational spectroscopy.