{"title":"Fusing spectral and spatial features of hyperspectral reflectance imagery for differentiating between normal and defective blueberries","authors":"Boyang Deng , Yuzhen Lu , Eric Stafne","doi":"10.1016/j.atech.2024.100473","DOIUrl":null,"url":null,"abstract":"<div><p>Effective defect detection of blueberries is important to ensuring supplies of high-quality products to the fresh market. In this study, hyperspectral reflectance imaging with machine learning was evaluated for discriminating between defective and normal blueberries. Fresh blueberries hand-harvested were scanned in the wavelength range of 400–1000 nm. An image analysis pipeline was developed to segment individual blueberries and extract mean spectra and spatial features. Defective blueberries were found to have lower near-infrared reflectance than sound samples, and spectral features produced a better separation between defective and sound samples than the spatial features in the scattering plots of their first two principal components. Nine types of machine learning models were built for classifying defective and sound samples using the spectral and spatial features separately as well as their concatenation. The regularized linear discriminant analysis (RLDA) model trained on the spectral features achieved the best overall accuracy of 95.7 %, as opposed to the best accuracy of 85.3 % based on spatial features, which was obtained by LDA. Simply concatenating spectral and spatial features did not improve over modeling using spectral or spatial features alone. A model ensemble strategy integrating the spectral features-based RLDA and the spatial features-based LDA resulted in a statistically significant improvement in the overall accuracy to 96.6 %. Model-level feature integration offers an effective means for improving the discrimination between defective and normal blueberries. Both the hyperspectral data<span><sup>1</sup></span> and the software programs<span><sup>2</sup></span> of this study are made publicly available.</p></div>","PeriodicalId":74813,"journal":{"name":"Smart agricultural technology","volume":null,"pages":null},"PeriodicalIF":6.3000,"publicationDate":"2024-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772375524000789/pdfft?md5=1c44df39bddb31986999d74b457c155e&pid=1-s2.0-S2772375524000789-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Smart agricultural technology","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772375524000789","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURAL ENGINEERING","Score":null,"Total":0}
引用次数: 0
Abstract
Effective defect detection of blueberries is important to ensuring supplies of high-quality products to the fresh market. In this study, hyperspectral reflectance imaging with machine learning was evaluated for discriminating between defective and normal blueberries. Fresh blueberries hand-harvested were scanned in the wavelength range of 400–1000 nm. An image analysis pipeline was developed to segment individual blueberries and extract mean spectra and spatial features. Defective blueberries were found to have lower near-infrared reflectance than sound samples, and spectral features produced a better separation between defective and sound samples than the spatial features in the scattering plots of their first two principal components. Nine types of machine learning models were built for classifying defective and sound samples using the spectral and spatial features separately as well as their concatenation. The regularized linear discriminant analysis (RLDA) model trained on the spectral features achieved the best overall accuracy of 95.7 %, as opposed to the best accuracy of 85.3 % based on spatial features, which was obtained by LDA. Simply concatenating spectral and spatial features did not improve over modeling using spectral or spatial features alone. A model ensemble strategy integrating the spectral features-based RLDA and the spatial features-based LDA resulted in a statistically significant improvement in the overall accuracy to 96.6 %. Model-level feature integration offers an effective means for improving the discrimination between defective and normal blueberries. Both the hyperspectral data1 and the software programs2 of this study are made publicly available.