Cost-efficient training of hyperspectral deep learning models for the detection of contaminating grains in bulk oats by fluorescent tagging

IF 4.6 2区 化学 Q1 SPECTROSCOPY Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy Pub Date : 2025-05-05 Epub Date: 2025-02-04 DOI:10.1016/j.saa.2025.125856
Emma Van Puyenbroeck, Wouter Saeys
{"title":"Cost-efficient training of hyperspectral deep learning models for the detection of contaminating grains in bulk oats by fluorescent tagging","authors":"Emma Van Puyenbroeck,&nbsp;Wouter Saeys","doi":"10.1016/j.saa.2025.125856","DOIUrl":null,"url":null,"abstract":"<div><div>Computer vision based on instance segmentation deep learning models offers great potential for automating many visual inspection tasks, such as the detection of contaminating grains in bulk oats, a nutrient rich grain which is well-tolerated by people suffering from gluten intolerance. Whereas distinguishing foreign objects is often relatively easy with the naked eye, it is much more difficult to distinguish highly similar products, e.g. different grain species or varieties. The subtle differences between such products may be captured by deep learning models combining the spectral and spatial features that are acquired with spectral cameras, measuring a spectral fingerprint for each pixel in an image. However, the training of supervised hyperspectral deep learning models requires large amounts of labelled data. As manual labelling is a tedious job and may induce labelling errors, we propose an alternative approach involving ‘tagging’ of the targets with fluorescent labels that make the targets ‘light up’ under UV illumination to efficiently generate ground truth segmentation masks. As these fluorescent labels are only visible in the UV range of the spectrum, the spectra in the SWIR range can still be used to discriminate grains from each other, making it a cost-efficient labeling technique for hyperspectral data, where labeled datasets are scarce. The primary objective of this study was to determine whether a hyperspectral deep learning segmentation model to detect uncoated spelt kernels in a bulk of oats could be trained more efficiently by coating the spelt kernels in the training images with a fluorescent paint. To this end, both a classical pixel classifier, as a benchmark model, and a deep learning segmentation model were trained on a bulk mixture of oats contaminated with coated spelt kernels and evaluated on bulk mixtures of oats and non-coated spelt kernels to assess their ability to generalize to uncoated samples. The deep learning model (RMSE = 1.34 %) outperformed the pixel classifier (RMSE = 1.91 %) in predicting the mass percentage of spelt without coating in a bulk mixture of oats, because it was more successful in segmenting the kernel edges. This indicates that the traditional pixel classification analysis could be bypassed in future research by efficiently generating the ground truth labels required for training hyperspectral deep learning models through the use of a fluorescent coating.</div></div>","PeriodicalId":433,"journal":{"name":"Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy","volume":"332 ","pages":"Article 125856"},"PeriodicalIF":4.6000,"publicationDate":"2025-05-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy","FirstCategoryId":"92","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1386142525001623","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/4 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"SPECTROSCOPY","Score":null,"Total":0}
引用次数: 0

Abstract

Computer vision based on instance segmentation deep learning models offers great potential for automating many visual inspection tasks, such as the detection of contaminating grains in bulk oats, a nutrient rich grain which is well-tolerated by people suffering from gluten intolerance. Whereas distinguishing foreign objects is often relatively easy with the naked eye, it is much more difficult to distinguish highly similar products, e.g. different grain species or varieties. The subtle differences between such products may be captured by deep learning models combining the spectral and spatial features that are acquired with spectral cameras, measuring a spectral fingerprint for each pixel in an image. However, the training of supervised hyperspectral deep learning models requires large amounts of labelled data. As manual labelling is a tedious job and may induce labelling errors, we propose an alternative approach involving ‘tagging’ of the targets with fluorescent labels that make the targets ‘light up’ under UV illumination to efficiently generate ground truth segmentation masks. As these fluorescent labels are only visible in the UV range of the spectrum, the spectra in the SWIR range can still be used to discriminate grains from each other, making it a cost-efficient labeling technique for hyperspectral data, where labeled datasets are scarce. The primary objective of this study was to determine whether a hyperspectral deep learning segmentation model to detect uncoated spelt kernels in a bulk of oats could be trained more efficiently by coating the spelt kernels in the training images with a fluorescent paint. To this end, both a classical pixel classifier, as a benchmark model, and a deep learning segmentation model were trained on a bulk mixture of oats contaminated with coated spelt kernels and evaluated on bulk mixtures of oats and non-coated spelt kernels to assess their ability to generalize to uncoated samples. The deep learning model (RMSE = 1.34 %) outperformed the pixel classifier (RMSE = 1.91 %) in predicting the mass percentage of spelt without coating in a bulk mixture of oats, because it was more successful in segmenting the kernel edges. This indicates that the traditional pixel classification analysis could be bypassed in future research by efficiently generating the ground truth labels required for training hyperspectral deep learning models through the use of a fluorescent coating.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
高光谱深度学习模型的成本效益训练,用于荧光标记检测散装燕麦中的污染颗粒
基于实例分割深度学习模型的计算机视觉为自动化许多视觉检测任务提供了巨大的潜力,例如检测散装燕麦中的污染谷物,这是一种营养丰富的谷物,对患有麸质不耐症的人来说耐受良好。虽然用肉眼区分异物通常相对容易,但要区分高度相似的产品,例如不同的谷物品种或品种,就困难得多。这些产品之间的细微差异可以通过深度学习模型来捕捉,该模型结合光谱相机获得的光谱和空间特征,测量图像中每个像素的光谱指纹。然而,有监督的高光谱深度学习模型的训练需要大量的标记数据。由于手动标记是一项繁琐的工作,可能会导致标记错误,我们提出了一种替代方法,包括用荧光标记对目标进行“标记”,使目标在紫外线照射下“亮起”,以有效地生成地面真值分割掩模。由于这些荧光标记仅在光谱的紫外范围内可见,因此在SWIR范围内的光谱仍然可以用于区分彼此的颗粒,使其成为高光谱数据的一种经济高效的标记技术,其中标记数据集很少。本研究的主要目的是确定高光谱深度学习分割模型是否可以通过在训练图像中涂上荧光涂料来更有效地训练用于检测散装燕麦中未涂覆的拼写仁。为此,将经典像素分类器作为基准模型和深度学习分割模型分别在含有涂覆拼写仁的散装混合燕麦上进行训练,并对燕麦和未涂覆拼写仁的散装混合燕麦进行评估,以评估其泛化到未涂覆样本的能力。深度学习模型(RMSE = 1.34%)在预测散装燕麦混合物中未涂层拼写的质量百分比方面优于像素分类器(RMSE = 1.91%),因为它在分割核边缘方面更成功。这表明,通过使用荧光涂层有效地生成训练高光谱深度学习模型所需的地面真值标签,可以在未来的研究中绕过传统的像素分类分析。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
8.40
自引率
11.40%
发文量
1364
审稿时长
40 days
期刊介绍: Spectrochimica Acta, Part A: Molecular and Biomolecular Spectroscopy (SAA) is an interdisciplinary journal which spans from basic to applied aspects of optical spectroscopy in chemistry, medicine, biology, and materials science. The journal publishes original scientific papers that feature high-quality spectroscopic data and analysis. From the broad range of optical spectroscopies, the emphasis is on electronic, vibrational or rotational spectra of molecules, rather than on spectroscopy based on magnetic moments. Criteria for publication in SAA are novelty, uniqueness, and outstanding quality. Routine applications of spectroscopic techniques and computational methods are not appropriate. Topics of particular interest of Spectrochimica Acta Part A include, but are not limited to: Spectroscopy and dynamics of bioanalytical, biomedical, environmental, and atmospheric sciences, Novel experimental techniques or instrumentation for molecular spectroscopy, Novel theoretical and computational methods, Novel applications in photochemistry and photobiology, Novel interpretational approaches as well as advances in data analysis based on electronic or vibrational spectroscopy.
期刊最新文献
A concentric square ring terahertz metamaterial sensor for highly sensitive detection of cytosine methylation A light-promoted activation fluorescein derivative with unexpected structure for sensing hydrogen sulfide Crystal growth, characterization and nonlinear optical investigations of bis (p-nitrobenzoate benzimidazolium p-nitrobenzoic acid) single crystal for optical limiting applications Spectroscopic characterization of a Be2-benzene complex featuring BeBe quasi-triple bond High-precision apple classification and traceability based on enhanced CBAM for near-infrared spectroscopy
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1