Nan Liu , Cuiling Liu , Lanzhen Chen , Jiabin Yu , Xiaorong Sun , Shanzhe Zhang , Jingzhu Wu
{"title":"Research on model transfer strategies based on the fusion of NIR-MIR spectral data","authors":"Nan Liu , Cuiling Liu , Lanzhen Chen , Jiabin Yu , Xiaorong Sun , Shanzhe Zhang , Jingzhu Wu","doi":"10.1016/j.infrared.2024.105545","DOIUrl":null,"url":null,"abstract":"<div><p>This study investigated the impact of different data fusion strategies on the performance of soluble solids content (SSC) prediction models based on near-infrared and mid-infrared spectroscopic techniques. In the data-level fusion approach, we applied standard normal variate and multiplicative scatter correction for pre-processing the NIR and MIR data. For the feature-level fusion, we utilized successive projections algorithm and competitive adaptive reweighted sampling to select informative wavelengths, and then applied direct orthogonal projection (DOP) for model transfer. The study employed a dataset of 150 honey samples to evaluate the impact of different data fusion strategies on model performance. To effectively evaluate model performance, we utilized the coefficient of R<sup>2</sup> and RMSEP as evaluation metrics. By comparing the results of data-level fusion, feature-level fusion and single-spectrum model transfer, the results showed that spectral data fusion improved the model transfer performance compared to the single-spectrum approach, with feature-level fusion exhibiting the most significant advantages. The effective variable selection techniques in feature-level fusion successfully removed a substantial amount of interfering data and significantly reduced noise influence, thereby improving the model accuracy. Specifically, the use of feature-level fusion improved the predictive model’s R<sup>2</sup> from 0.319 to 0.878 and reduced the RMSEP from 1.974 to 0.613°Brix, demonstrating the significant advantages of this approach in enhancing model transfer performance. The research findings provide important reference and theoretical support for future studies in the field of food quality assessment and other near-infrared spectroscopic data applications. This not only validates the effectiveness of the feature-level fusion approach, but also lays the foundation for establishing efficient and reliable predictive models.</p></div>","PeriodicalId":13549,"journal":{"name":"Infrared Physics & Technology","volume":"142 ","pages":"Article 105545"},"PeriodicalIF":3.1000,"publicationDate":"2024-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Infrared Physics & Technology","FirstCategoryId":"101","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1350449524004298","RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"INSTRUMENTS & INSTRUMENTATION","Score":null,"Total":0}
引用次数: 0
Abstract
This study investigated the impact of different data fusion strategies on the performance of soluble solids content (SSC) prediction models based on near-infrared and mid-infrared spectroscopic techniques. In the data-level fusion approach, we applied standard normal variate and multiplicative scatter correction for pre-processing the NIR and MIR data. For the feature-level fusion, we utilized successive projections algorithm and competitive adaptive reweighted sampling to select informative wavelengths, and then applied direct orthogonal projection (DOP) for model transfer. The study employed a dataset of 150 honey samples to evaluate the impact of different data fusion strategies on model performance. To effectively evaluate model performance, we utilized the coefficient of R2 and RMSEP as evaluation metrics. By comparing the results of data-level fusion, feature-level fusion and single-spectrum model transfer, the results showed that spectral data fusion improved the model transfer performance compared to the single-spectrum approach, with feature-level fusion exhibiting the most significant advantages. The effective variable selection techniques in feature-level fusion successfully removed a substantial amount of interfering data and significantly reduced noise influence, thereby improving the model accuracy. Specifically, the use of feature-level fusion improved the predictive model’s R2 from 0.319 to 0.878 and reduced the RMSEP from 1.974 to 0.613°Brix, demonstrating the significant advantages of this approach in enhancing model transfer performance. The research findings provide important reference and theoretical support for future studies in the field of food quality assessment and other near-infrared spectroscopic data applications. This not only validates the effectiveness of the feature-level fusion approach, but also lays the foundation for establishing efficient and reliable predictive models.
期刊介绍:
The Journal covers the entire field of infrared physics and technology: theory, experiment, application, devices and instrumentation. Infrared'' is defined as covering the near, mid and far infrared (terahertz) regions from 0.75um (750nm) to 1mm (300GHz.) Submissions in the 300GHz to 100GHz region may be accepted at the editors discretion if their content is relevant to shorter wavelengths. Submissions must be primarily concerned with and directly relevant to this spectral region.
Its core topics can be summarized as the generation, propagation and detection, of infrared radiation; the associated optics, materials and devices; and its use in all fields of science, industry, engineering and medicine.
Infrared techniques occur in many different fields, notably spectroscopy and interferometry; material characterization and processing; atmospheric physics, astronomy and space research. Scientific aspects include lasers, quantum optics, quantum electronics, image processing and semiconductor physics. Some important applications are medical diagnostics and treatment, industrial inspection and environmental monitoring.