Journal of Chemometrics最新文献

In-Situ Detection of Microplastic Particles on Food Using Hyperspectral Imaging With One-Dimensional Convolutional Neural Network and Artificial Neural Network 基于一维卷积神经网络和人工神经网络的高光谱成像原位检测食品中的微塑料颗粒

IF 2.1 4区化学 Q1 SOCIAL WORK

Journal of Chemometrics

Pub Date : 2026-01-21 DOI: 10.1002/cem.70088

Nikhita Sai Nayani, Ran Yang, Yue Sun, Lihong Yang, Lifeng Zhou, Yiming Feng

Hyperspectral imaging (HSI) has emerged as a promising technique for microplastic detection through analysis of reflectance variations across multiple wavelengths. Traditional approaches have focused primarily on isolated microplastic particles, requiring labor-intensive separation procedures impractical for routine monitoring. The challenge of detecting microplastics directly on food surfaces stems from spectral similarities between microplastics and food matrices, making differentiation difficult using conventional methods. Leveraging recent advances in machine learning, this study explores how artificial neural networks (ANN) and one-dimensional convolutional neural networks (1D-CNN) can identify subtle spectral differences to detect microplastic particles on seafood without isolation. We systematically evaluated model architectures, preprocessing techniques, and hyperparameter configurations to optimize detection performance using hyperspectral data from tilapia samples contaminated with polyethylene microspheres. Our findings demonstrate that 1D-CNN models trained on hyperspectral data without dimensionality reduction significantly outperform other approaches, achieving object-level detection F1 scores of 0.963 for 600-μm particles and 0.950 for 300-μm particles. This detection strategy represents a substantial improvement over traditional methods and highlights the potential of deep learning–based approaches for non-destructive, efficient microplastic detection in food safety applications.

高光谱成像（HSI）已经成为一种很有前途的微塑料检测技术，通过分析多个波长的反射率变化。传统的方法主要集中在分离的微塑料颗粒上，需要劳动密集型的分离程序，无法进行常规监测。直接在食物表面检测微塑料的挑战源于微塑料和食物基质之间的光谱相似性，这使得使用传统方法进行区分变得困难。利用机器学习的最新进展，本研究探索了人工神经网络（ANN）和一维卷积神经网络（1D-CNN）如何识别细微的光谱差异，从而在不隔离的情况下检测海产品上的微塑料颗粒。我们系统地评估了模型架构、预处理技术和超参数配置，以优化聚乙烯微球污染罗非鱼样品的高光谱数据的检测性能。我们的研究结果表明，在未降维的高光谱数据上训练的1D-CNN模型显著优于其他方法，在600-μm粒子和300-μm粒子上的目标级检测F1得分分别为0.963和0.950。这种检测策略代表了对传统方法的实质性改进，并突出了基于深度学习的方法在食品安全应用中用于非破坏性、高效微塑料检测的潜力。

{"title":"In-Situ Detection of Microplastic Particles on Food Using Hyperspectral Imaging With One-Dimensional Convolutional Neural Network and Artificial Neural Network","authors":"Nikhita Sai Nayani, Ran Yang, Yue Sun, Lihong Yang, Lifeng Zhou, Yiming Feng","doi":"10.1002/cem.70088","DOIUrl":"https://doi.org/10.1002/cem.70088","url":null,"abstract":"<div>\u0000 \u0000 <p>Hyperspectral imaging (HSI) has emerged as a promising technique for microplastic detection through analysis of reflectance variations across multiple wavelengths. Traditional approaches have focused primarily on isolated microplastic particles, requiring labor-intensive separation procedures impractical for routine monitoring. The challenge of detecting microplastics directly on food surfaces stems from spectral similarities between microplastics and food matrices, making differentiation difficult using conventional methods. Leveraging recent advances in machine learning, this study explores how artificial neural networks (ANN) and one-dimensional convolutional neural networks (1D-CNN) can identify subtle spectral differences to detect microplastic particles on seafood without isolation. We systematically evaluated model architectures, preprocessing techniques, and hyperparameter configurations to optimize detection performance using hyperspectral data from tilapia samples contaminated with polyethylene microspheres. Our findings demonstrate that 1D-CNN models trained on hyperspectral data without dimensionality reduction significantly outperform other approaches, achieving object-level detection F1 scores of 0.963 for 600-μm particles and 0.950 for 300-μm particles. This detection strategy represents a substantial improvement over traditional methods and highlights the potential of deep learning–based approaches for non-destructive, efficient microplastic detection in food safety applications.</p>\u0000 </div>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"40 2","pages":""},"PeriodicalIF":2.1,"publicationDate":"2026-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146091422","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

XR and Hybrid Data Visualization Spaces for Enhanced Data Analytics 增强数据分析的XR和混合数据可视化空间

IF 2.1 4区化学 Q1 SOCIAL WORK

Journal of Chemometrics

Pub Date : 2026-01-20 DOI: 10.1002/cem.70100

Santiago Lombeyda, S. G. Djorgovski, Ciro Donalek

The growing complexity and information content of the data, together with the need to understand both the complex structures, relationships, and phenomena present in these data spaces, compounded with the emerging need to understand the results produced by AI tools used to analyze the data, requires development of novel, effective data visualization tools. Much of the growing complexity is reflected in the increasing dimensionality of data spaces, where extended reality (XR) naturally emerges as a candidate to help extend our capability for higher dimensional understanding. However, humans often understand lower dimensionality representations more effectively. Still, XR offers an opportunity for a seamless integration of simulated traditional data displays within the three-dimensional virtual data spaces, leading to more intuitive and more effective data analytics. In this paper we present an overview of the benefits of seamlessly integrated two-dimensional and three-dimensional interactive visual representations embedded in XR spaces, and present three case studies that leverage these approaches for more efficient data analytics.

数据日益增长的复杂性和信息内容，以及理解这些数据空间中存在的复杂结构、关系和现象的需求，再加上理解用于分析数据的人工智能工具产生的结果的新需求，需要开发新颖、有效的数据可视化工具。不断增长的复杂性在很大程度上反映在数据空间维度的增加上，扩展现实（XR）自然成为帮助我们扩展更高维度理解能力的候选。然而，人类通常更有效地理解低维度的表示。尽管如此，XR提供了在三维虚拟数据空间中无缝集成模拟传统数据显示的机会，从而实现更直观、更有效的数据分析。在本文中，我们概述了在XR空间中嵌入无缝集成的二维和三维交互式可视化表示的好处，并介绍了利用这些方法进行更有效数据分析的三个案例研究。

{"title":"XR and Hybrid Data Visualization Spaces for Enhanced Data Analytics","authors":"Santiago Lombeyda, S. G. Djorgovski, Ciro Donalek","doi":"10.1002/cem.70100","DOIUrl":"https://doi.org/10.1002/cem.70100","url":null,"abstract":"<div>\u0000 \u0000 <p>The growing complexity and information content of the data, together with the need to understand both the complex structures, relationships, and phenomena present in these data spaces, compounded with the emerging need to understand the results produced by AI tools used to analyze the data, requires development of novel, effective data visualization tools. Much of the growing complexity is reflected in the increasing dimensionality of data spaces, where extended reality (XR) naturally emerges as a candidate to help extend our capability for higher dimensional understanding. However, humans often understand lower dimensionality representations more effectively. Still, XR offers an opportunity for a seamless integration of simulated traditional data displays within the three-dimensional virtual data spaces, leading to more intuitive and more effective data analytics. In this paper we present an overview of the benefits of seamlessly integrated two-dimensional and three-dimensional interactive visual representations embedded in XR spaces, and present three case studies that leverage these approaches for more efficient data analytics.</p>\u0000 </div>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"40 2","pages":""},"PeriodicalIF":2.1,"publicationDate":"2026-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146096573","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Multivariate Optimization of Column Liquid Chromatography for the Separation of Saturated, Aromatic, and Acidic Biomarkers in Petroleum and Rock Extracts 柱液相色谱法分离石油和岩石提取物中饱和、芳香和酸性生物标志物的多元优化

IF 2.1 4区化学 Q1 SOCIAL WORK

Journal of Chemometrics

Pub Date : 2026-01-19 DOI: 10.1002/cem.70102

Alek A. C. de Sousa, Wandreson dos S. Sousa, Alexandre A. de Souza, Pedro H. B. Lima, Henrique A. Lopes, Wilkins O. de Barros, Benedito B. Farias Filho, Sidney G. de Lima

Conventional geochemical biomarkers include saturated, aromatic, and carboxylic acid compounds. Saturated and aromatic biomarkers, however, form the foundation of many geochemical studies because of their abundance and intrinsic significance. Although carboxylic acids are also present in sediments and petroleum, they are often found at much lower concentrations, which complicates their isolation when using a single analytical method. For this reason, the application of a modified Brønsted-based silica gel phase by classical liquid chromatography has the potential to separate and characterize these three classes. However, variations in the factors can compromise effective component separation. A well-designed experiment can optimize the process by addressing multiple methodological aspects. The Brønsted bases tested to modify the silica gel were sodium and potassium hydroxides. The central composite design (CCD) showed that, besides the expected influence of the volume of Eluent 1 (n-hexane), there is a difference between the base types in the separation of saturated and aromatic compounds. The SiO₂/KOH phase appears to be more resistant to the influence of Eluent 1. Therefore, the trend in separation efficiency is K > Na. All two silica gel modifications yield a fraction rich in carboxylic acids. The modification of the silica gel with a Brønsted base had no significant effect on the molecular geochemical parameters in the samples. This separation method saves time and material, making it a potential approach for routine analysis of saturated and aromatic biomarkers and carboxylic acids in molecular organic geochemistry.

传统的地球化学生物标志物包括饱和、芳香和羧酸化合物。饱和生物标记物和芳香生物标记物因其丰富性和内在意义而成为许多地球化学研究的基础。虽然羧酸也存在于沉积物和石油中，但它们的浓度通常要低得多，这使得使用单一分析方法分离它们变得复杂。因此，采用经典液相色谱法对br ønsted基硅胶相进行改性，可以分离和表征这三类化合物。然而，因素的变化会影响有效的组分分离。一个设计良好的实验可以通过处理多个方法学方面来优化过程。用于修饰硅胶的Brønsted碱是氢氧化钠和氢氧化钾。中心复合设计（CCD）表明，除受淋洗液1（正己烷）体积的影响外，碱类型在分离饱和化合物和芳香族化合物时也存在差异。SiO2/KOH相似乎更能抵抗洗脱液1的影响。因此，分离效率的趋势为K >； Na。所有两种硅胶改性都产生富含羧酸的部分。Brønsted碱对硅胶的改性对样品的分子地球化学参数没有显著影响。这种分离方法节省了时间和材料，使其成为分子有机地球化学中饱和和芳香生物标志物和羧酸的常规分析的潜在方法。

{"title":"Multivariate Optimization of Column Liquid Chromatography for the Separation of Saturated, Aromatic, and Acidic Biomarkers in Petroleum and Rock Extracts","authors":"Alek A. C. de Sousa, Wandreson dos S. Sousa, Alexandre A. de Souza, Pedro H. B. Lima, Henrique A. Lopes, Wilkins O. de Barros, Benedito B. Farias Filho, Sidney G. de Lima","doi":"10.1002/cem.70102","DOIUrl":"https://doi.org/10.1002/cem.70102","url":null,"abstract":"<p>Conventional geochemical biomarkers include saturated, aromatic, and carboxylic acid compounds. Saturated and aromatic biomarkers, however, form the foundation of many geochemical studies because of their abundance and intrinsic significance. Although carboxylic acids are also present in sediments and petroleum, they are often found at much lower concentrations, which complicates their isolation when using a single analytical method. For this reason, the application of a modified Brønsted-based silica gel phase by classical liquid chromatography has the potential to separate and characterize these three classes. However, variations in the factors can compromise effective component separation. A well-designed experiment can optimize the process by addressing multiple methodological aspects. The Brønsted bases tested to modify the silica gel were sodium and potassium hydroxides. The central composite design (CCD) showed that, besides the expected influence of the volume of Eluent 1 (<i>n-</i>hexane), there is a difference between the base types in the separation of saturated and aromatic compounds. The SiO<sub>2</sub>/KOH phase appears to be more resistant to the influence of Eluent 1. Therefore, the trend in separation efficiency is K > Na. All two silica gel modifications yield a fraction rich in carboxylic acids. The modification of the silica gel with a Brønsted base had no significant effect on the molecular geochemical parameters in the samples. This separation method saves time and material, making it a potential approach for routine analysis of saturated and aromatic biomarkers and carboxylic acids in molecular organic geochemistry.</p>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"40 2","pages":""},"PeriodicalIF":2.1,"publicationDate":"2026-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://analyticalsciencejournals.onlinelibrary.wiley.com/doi/epdf/10.1002/cem.70102","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146099423","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Optimizing Distance-Based Classification in Hyperspectral Imaging: a Tutorial on the Influence of Spectral Pretreatments 高光谱成像中基于距离的分类优化：光谱预处理影响教程

IF 2.1 4区化学 Q1 SOCIAL WORK

Journal of Chemometrics

Pub Date : 2026-01-15 DOI: 10.1002/cem.70101

Ana Herrero-Langreo, Nazan Altun, Aoife A. Gowen

One of the particularities of spectral imaging when compared to single point spectroscopy is the importance of spectral pretreatments to minimize environmental and sample-related effects, such as shadows, shapes, or variations in illumination. However, the influence of spectral pretreatments on distance metrics is rarely considered in any great depth. This work explores and discusses the effects of combining different spectral pretreatments with the most commonly used distance metrics. A case study on the classification of recyclable materials is used as an example. MATLAB code scripts and functions are provided and referenced through the article to allow the reader to follow and implement the process step by step. The theoretical basis for the calculation and choice of different pretreatments and distance metrics is explained in depth, and the classification performance of the different combinations of pretreatments and distance metrics is discussed in the light of these.

Results show that distance metrics based on angle or correlation (i.e., Spectral Angle Mapper or Spectral Correlation Mapper) could be used with or without pretreatments without significantly impacting the classification results. Classification based on Euclidean and Cityblock distances was the most computationally efficient but was also the most affected by multiplicative effects in the spectra and thus benefited the most from pretreatments such as standard normal variate (SNV) or from combining SNV and second derivative. Lastly, Mahalanobis distance showed the best classification performance for nonpretreated spectra but showed the worst performance for SNV pretreated spectra, illustrating the importance of assessing spectral similarity between calibration and validation datasets on pretreated spectra when using Mahalanobis distance, particularly when applying spectral pretreatments.

This work provides practical insights into the effects that the parameters used have on the results of distance-based classification in terms of the performance of classification models. Considering this issue can greatly improve classification performance when assessing the potential of hyperspectral imaging systems for a particular application.

与单点光谱学相比，光谱成像的一个特点是光谱预处理的重要性，以尽量减少环境和样品相关的影响，如阴影、形状或照明变化。然而，光谱预处理对距离度量的影响很少被深入考虑。这项工作探讨并讨论了将不同的光谱预处理与最常用的距离度量相结合的效果。以可回收材料分类为例进行了案例研究。通过文章提供和参考MATLAB代码脚本和函数，使读者能够逐步跟踪和实现该过程。深入解释了不同预处理和距离度量的计算和选择的理论依据，并在此基础上讨论了不同预处理和距离度量组合的分类性能。结果表明，基于角度或相关性的距离度量（即光谱角度映射器或光谱相关映射器）可以在不进行预处理的情况下使用，而不会显著影响分类结果。基于欧几里得距离和Cityblock距离的分类计算效率最高，但也受光谱乘法效应的影响最大，因此从标准正态变量（SNV）或SNV与二阶导数相结合的预处理中获益最多。最后，马氏距离在未预处理的光谱中表现出最好的分类性能，而在SNV预处理的光谱中表现出最差的分类性能，说明在使用马氏距离时，特别是在进行光谱预处理时，评估预处理光谱的校准数据集和验证数据集之间光谱相似性的重要性。这项工作为所使用的参数对基于距离的分类结果在分类模型性能方面的影响提供了实际的见解。在评估高光谱成像系统在特定应用中的潜力时，考虑到这个问题可以大大提高分类性能。

{"title":"Optimizing Distance-Based Classification in Hyperspectral Imaging: a Tutorial on the Influence of Spectral Pretreatments","authors":"Ana Herrero-Langreo, Nazan Altun, Aoife A. Gowen","doi":"10.1002/cem.70101","DOIUrl":"https://doi.org/10.1002/cem.70101","url":null,"abstract":"<div>\u0000 \u0000 <p>One of the particularities of spectral imaging when compared to single point spectroscopy is the importance of spectral pretreatments to minimize environmental and sample-related effects, such as shadows, shapes, or variations in illumination. However, the influence of spectral pretreatments on distance metrics is rarely considered in any great depth. This work explores and discusses the effects of combining different spectral pretreatments with the most commonly used distance metrics. A case study on the classification of recyclable materials is used as an example. MATLAB code scripts and functions are provided and referenced through the article to allow the reader to follow and implement the process step by step. The theoretical basis for the calculation and choice of different pretreatments and distance metrics is explained in depth, and the classification performance of the different combinations of pretreatments and distance metrics is discussed in the light of these.</p>\u0000 <p>Results show that distance metrics based on angle or correlation (i.e., Spectral Angle Mapper or Spectral Correlation Mapper) could be used with or without pretreatments without significantly impacting the classification results. Classification based on Euclidean and Cityblock distances was the most computationally efficient but was also the most affected by multiplicative effects in the spectra and thus benefited the most from pretreatments such as standard normal variate (SNV) or from combining SNV and second derivative. Lastly, Mahalanobis distance showed the best classification performance for nonpretreated spectra but showed the worst performance for SNV pretreated spectra, illustrating the importance of assessing spectral similarity between calibration and validation datasets on pretreated spectra when using Mahalanobis distance, particularly when applying spectral pretreatments.</p>\u0000 <p>This work provides practical insights into the effects that the parameters used have on the results of distance-based classification in terms of the performance of classification models. Considering this issue can greatly improve classification performance when assessing the potential of hyperspectral imaging systems for a particular application.</p>\u0000 </div>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"40 2","pages":""},"PeriodicalIF":2.1,"publicationDate":"2026-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145993960","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Characterising the Effect of Cultivar and Roasting Temperature on FT-NIR Spectral Data of Wheat Using ASCA 利用ASCA表征品种和焙烧温度对小麦FT-NIR光谱数据的影响

IF 2.1 4区化学 Q1 SOCIAL WORK

Journal of Chemometrics

Pub Date : 2026-01-15 DOI: 10.1002/cem.70096

Mia van Niekerk, Federico Marini, Stefan Hayward, Marena Manley

The physicochemical and functional properties of wheat can be modified by exposing the whole grains to thermal pretreatment. Conventional analytical methods used to investigate such modifications are expensive, labour-intensive and may be inaccurate due to interfering compounds. An economical alternative is to use Fourier transform near-infrared (FT-NIR) spectroscopy in combination with multivariate data analysis techniques. Analysis of variance simultaneous component analysis (ASCA) is an exploratory data analysis technique used to characterise the effects of experimental design factors on the chemical composition captured in the FT-NIR spectral data. In this study, two hard wheat cultivars were exposed to 10 different temperatures by means of forced convection continuous tumble roasting. ASCA was applied to the standard normal variate preprocessed spectral data to evaluate the effects of cultivar, roasting temperature and their interaction. All three factors, cultivar, roasting temperature and their interaction, had a significant effect (p $� � � < � � �$ 0.05) on the spectral data. Differences between the roasted wheat cultivars were associated with moisture, starch and aromatic compounds. The association with aromatic structures was supported by the differences in the phenolic contents of the two cultivars. Low roasting temperatures (108°C–150°C) were associated with starch and moisture changes particularly at approximately 1450, 1410 and 1940 nm. Water evaporated from the kernels, and the degree of starch polymerisation decreased. High roasting temperatures (170°C–232°C) were associated with starch and amino acids (ca. 2100 and 2294 nm), which likely underwent structural changes and participated in nonenzymatic browning reactions.

小麦的理化性质和功能特性可以通过热预处理来改变。用于研究这类修饰的传统分析方法是昂贵的，劳动密集型的，并且可能由于干扰化合物而不准确。一个经济的替代方案是使用傅立叶变换近红外（FT-NIR）光谱与多元数据分析技术相结合。方差分析同时成分分析（ASCA）是一种探索性数据分析技术，用于描述实验设计因素对FT-NIR光谱数据中捕获的化学成分的影响。采用强制对流连续滚筒式焙烧的方法，对两种硬质小麦品种进行了10种不同温度的焙烧试验。应用ASCA对标准正态变量预处理光谱数据进行分析，评价品种、焙烧温度及其相互作用的影响。品种、焙烧温度及其相互作用对光谱数据均有显著影响（p ＆lt; $$ < $$ 0.05）。不同烘焙小麦品种之间的差异与水分、淀粉和芳香化合物有关。两个品种酚类物质含量的差异支持了与芳香结构的关联。低焙烧温度（108°C - 150°C）与淀粉和水分变化有关，特别是在大约1450、1410和1940 nm处。水分从籽粒中蒸发，淀粉聚合度降低。高温（170°C - 232°C）与淀粉和氨基酸（约2100和2294 nm）有关，它们可能发生了结构变化并参与了非酶褐变反应。

{"title":"Characterising the Effect of Cultivar and Roasting Temperature on FT-NIR Spectral Data of Wheat Using ASCA","authors":"Mia van Niekerk, Federico Marini, Stefan Hayward, Marena Manley","doi":"10.1002/cem.70096","DOIUrl":"https://doi.org/10.1002/cem.70096","url":null,"abstract":"<p>The physicochemical and functional properties of wheat can be modified by exposing the whole grains to thermal pretreatment. Conventional analytical methods used to investigate such modifications are expensive, labour-intensive and may be inaccurate due to interfering compounds. An economical alternative is to use Fourier transform near-infrared (FT-NIR) spectroscopy in combination with multivariate data analysis techniques. Analysis of variance simultaneous component analysis (ASCA) is an exploratory data analysis technique used to characterise the effects of experimental design factors on the chemical composition captured in the FT-NIR spectral data. In this study, two hard wheat cultivars were exposed to 10 different temperatures by means of forced convection continuous tumble roasting. ASCA was applied to the standard normal variate preprocessed spectral data to evaluate the effects of cultivar, roasting temperature and their interaction. All three factors, cultivar, roasting temperature and their interaction, had a significant effect (<i>p</i> <span></span><math>\u0000 <semantics>\u0000 <mrow>\u0000 <mo><</mo>\u0000 </mrow>\u0000 <annotation>$$ < $$</annotation>\u0000 </semantics></math> 0.05) on the spectral data. Differences between the roasted wheat cultivars were associated with moisture, starch and aromatic compounds. The association with aromatic structures was supported by the differences in the phenolic contents of the two cultivars. Low roasting temperatures (108°C–150°C) were associated with starch and moisture changes particularly at approximately 1450, 1410 and 1940 nm. Water evaporated from the kernels, and the degree of starch polymerisation decreased. High roasting temperatures (170°C–232°C) were associated with starch and amino acids (ca. 2100 and 2294 nm), which likely underwent structural changes and participated in nonenzymatic browning reactions.</p>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"40 2","pages":""},"PeriodicalIF":2.1,"publicationDate":"2026-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://analyticalsciencejournals.onlinelibrary.wiley.com/doi/epdf/10.1002/cem.70096","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145987153","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Field Strength Distribution-Based Sample Selection Method 基于场强分布的样本选择方法

IF 2.1 4区化学 Q1 SOCIAL WORK

Journal of Chemometrics

Pub Date : 2026-01-08 DOI: 10.1002/cem.70094

Zhonghai He, Jialong Sun, Yi Zhang, Xiaofang Zhang

In the process of spectral modeling, the representativeness of samples to the overall space determines the modeling efficiency. A commonly used unsupervised sample selection method is based on the maximum–minimum distance of selected samples. However, this approach selects samples based solely on a single distance metric, which introduces a degree of randomness. Inspired by the spatial field strength distribution law of multiple point charges, we propose a novel sample selection method based on field strength. In this method, each sample point is treated as a point charge that generates an electric field in its vicinity. The field strength at any given position is the sum of the contributions from all point charges at that location, with a higher field strength indicating that the point is already well represented. By calculating the total field strength exerted by each selected sample on the candidate points and incorporating the point with the minimum field strength into the calibration set, the method maximizes the coverage of field strength in the calibration space. Sequentially selecting and adding points with the lowest field strength yields a highly representative sample set. This approach enables efficient and unsupervised selection of modeling samples.

在光谱建模过程中，样本对整体空间的代表性决定了建模效率。一种常用的无监督样本选择方法是基于所选样本的最大-最小距离。然而，这种方法仅基于单个距离度量来选择样本，这引入了一定程度的随机性。受多点电荷空间场强分布规律的启发，提出了一种基于场强的样品选择方法。在这种方法中，每个采样点被视为在其附近产生电场的点电荷。任何给定位置的场强是该位置所有点电荷贡献的总和，场强越高表明该点已经很好地表现出来。该方法通过计算每个选定样本对候选点施加的总场强，并将场强最小的点纳入校准集，使场强在校准空间的覆盖范围最大化。依次选择和添加具有最低场强的点产生一个高度代表性的样本集。这种方法能够有效地和无监督地选择建模样本。

{"title":"Field Strength Distribution-Based Sample Selection Method","authors":"Zhonghai He, Jialong Sun, Yi Zhang, Xiaofang Zhang","doi":"10.1002/cem.70094","DOIUrl":"https://doi.org/10.1002/cem.70094","url":null,"abstract":"<div>\u0000 \u0000 <p>In the process of spectral modeling, the representativeness of samples to the overall space determines the modeling efficiency. A commonly used unsupervised sample selection method is based on the maximum–minimum distance of selected samples. However, this approach selects samples based solely on a single distance metric, which introduces a degree of randomness. Inspired by the spatial field strength distribution law of multiple point charges, we propose a novel sample selection method based on field strength. In this method, each sample point is treated as a point charge that generates an electric field in its vicinity. The field strength at any given position is the sum of the contributions from all point charges at that location, with a higher field strength indicating that the point is already well represented. By calculating the total field strength exerted by each selected sample on the candidate points and incorporating the point with the minimum field strength into the calibration set, the method maximizes the coverage of field strength in the calibration space. Sequentially selecting and adding points with the lowest field strength yields a highly representative sample set. This approach enables efficient and unsupervised selection of modeling samples.</p>\u0000 </div>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"40 1","pages":""},"PeriodicalIF":2.1,"publicationDate":"2026-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145963821","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Fractional Kinetic Modelling of the Adsorption and Desorption Processes From Experimental SPR Curves 基于实验SPR曲线的吸附和解吸过程的分数动力学模型

IF 2.1 4区化学 Q1 SOCIAL WORK

Journal of Chemometrics

Pub Date : 2025-12-30 DOI: 10.1002/cem.70099

Higor V. M. Ferreira, Nelson H. T. Lemes, Yara L. Coelho, Luciano S. Virtuoso, Ana C. dos Santos Pires, Luis H. M. da Silva

<p>The application of surface plasmon resonance (SPR) has transformed the study of interactions between a ligand immobilized on the surface of a sensor chip (<span></span><math> <semantics> <mrow> <msub> <mrow> <mi>L</mi> </mrow> <mrow> <mi>S</mi> </mrow> </msub> </mrow> <annotation>$$ {L}_S $$</annotation> </semantics></math>) and an analyte in solution (<span></span><math> <semantics> <mrow> <mi>A</mi> </mrow> <annotation>$$ A $$</annotation> </semantics></math>). This technique enables the real-time monitoring of binding processes with high sensitivity. The adsorption–desorption dynamics, <span></span><math> <semantics> <mrow> <mi>A</mi> <mo>+</mo> <msub> <mrow> <mi>L</mi> </mrow> <mrow> <mi>S</mi> </mrow> </msub> <mo>→</mo> <mi>A</mi> <msub> <mrow> <mi>L</mi> </mrow> <mrow> <mi>S</mi> </mrow> </msub> </mrow> <annotation>$$ A+{L}_Sto A{L}_S $$</annotation> </semantics></math>, are commonly described by a set of coupled integer-order differential equations. However, such formulations exhibit limited ability to account for temperature distributions, diffusion, and transport effects involved in the reaction process. Fractional kinetic models provide a natural framework for incorporating nonlocal and memory effects into the description of complex reaction dynamics. In this study, a fractional-order kinetic model based on the Caputo derivative is applied to analyze experimental SPR data for the interaction between immobilized Baru protein (IBP) and Congo Red dye (CR), at concentrations ranging from 7.5 to 97.5 <span></span><math> <semantics> <mrow> <mi>μ</mi> </mrow> <annotation>$$ upmu $$</annotation> </semantics></math>M, pH 7.4, and 16°C. The dependence of the kinetic parameters on the model order is systematically investigated, and it is shown that the classical integer-order formulation fails to adequately reproduce the experimental sensorgrams. The results demonstrate that the fractional-order model captures the intrinsic co

表面等离子体共振（SPR）的应用改变了固定在传感器芯片（ls $$ {L}_S $$）表面的配体与溶液中分析物(A $$ A $$)。该技术能够以高灵敏度实时监测结合过程。吸附-解吸动力学；A + l s→A l s$$ A+{L}_Sto A{L}_S $$，通常用一组耦合的整阶微分方程来描述。然而，这些公式在解释反应过程中涉及的温度分布、扩散和输运效应方面的能力有限。分数动力学模型为将非局部效应和记忆效应纳入复杂反应动力学的描述提供了一个自然的框架。在本研究中，基于Caputo导数的分数级动力学模型分析了固定Baru蛋白（IBP）与刚果红染料（CR）在浓度为7.5 ～ 97.5 μ $$ upmu $$ M， pH为7.4，温度为16°C条件下相互作用的实验SPR数据。系统地研究了动力学参数对模型阶数的依赖关系，表明经典的整阶公式不能充分再现实验传感器图。结果表明，分数阶模型捕捉了SPR实验中观察到的吸附-解吸过程的内在复杂性，显著改善了实验数据的表征。

{"title":"Fractional Kinetic Modelling of the Adsorption and Desorption Processes From Experimental SPR Curves","authors":"Higor V. M. Ferreira, Nelson H. T. Lemes, Yara L. Coelho, Luciano S. Virtuoso, Ana C. dos Santos Pires, Luis H. M. da Silva","doi":"10.1002/cem.70099","DOIUrl":"https://doi.org/10.1002/cem.70099","url":null,"abstract":"<p>The application of surface plasmon resonance (SPR) has transformed the study of interactions between a ligand immobilized on the surface of a sensor chip (<span></span><math>\u0000 <semantics>\u0000 <mrow>\u0000 <msub>\u0000 <mrow>\u0000 <mi>L</mi>\u0000 </mrow>\u0000 <mrow>\u0000 <mi>S</mi>\u0000 </mrow>\u0000 </msub>\u0000 </mrow>\u0000 <annotation>$$ {L}_S $$</annotation>\u0000 </semantics></math>) and an analyte in solution (<span></span><math>\u0000 <semantics>\u0000 <mrow>\u0000 <mi>A</mi>\u0000 </mrow>\u0000 <annotation>$$ A $$</annotation>\u0000 </semantics></math>). This technique enables the real-time monitoring of binding processes with high sensitivity. The adsorption–desorption dynamics, <span></span><math>\u0000 <semantics>\u0000 <mrow>\u0000 <mi>A</mi>\u0000 <mo>+</mo>\u0000 <msub>\u0000 <mrow>\u0000 <mi>L</mi>\u0000 </mrow>\u0000 <mrow>\u0000 <mi>S</mi>\u0000 </mrow>\u0000 </msub>\u0000 <mo>→</mo>\u0000 <mi>A</mi>\u0000 <msub>\u0000 <mrow>\u0000 <mi>L</mi>\u0000 </mrow>\u0000 <mrow>\u0000 <mi>S</mi>\u0000 </mrow>\u0000 </msub>\u0000 </mrow>\u0000 <annotation>$$ A+{L}_Sto A{L}_S $$</annotation>\u0000 </semantics></math>, are commonly described by a set of coupled integer-order differential equations. However, such formulations exhibit limited ability to account for temperature distributions, diffusion, and transport effects involved in the reaction process. Fractional kinetic models provide a natural framework for incorporating nonlocal and memory effects into the description of complex reaction dynamics. In this study, a fractional-order kinetic model based on the Caputo derivative is applied to analyze experimental SPR data for the interaction between immobilized Baru protein (IBP) and Congo Red dye (CR), at concentrations ranging from 7.5 to 97.5 <span></span><math>\u0000 <semantics>\u0000 <mrow>\u0000 <mi>μ</mi>\u0000 </mrow>\u0000 <annotation>$$ upmu $$</annotation>\u0000 </semantics></math>M, pH 7.4, and 16°C. The dependence of the kinetic parameters on the model order is systematically investigated, and it is shown that the classical integer-order formulation fails to adequately reproduce the experimental sensorgrams. The results demonstrate that the fractional-order model captures the intrinsic co","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"40 1","pages":""},"PeriodicalIF":2.1,"publicationDate":"2025-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://analyticalsciencejournals.onlinelibrary.wiley.com/doi/epdf/10.1002/cem.70099","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145887831","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Stacked Target-Related Autoencoder-Extreme Learning Machine: A Novel Soft Measurement Modeling Approach for Near-Infrared Spectroscopy 堆叠目标相关自编码器-极限学习机：一种新的近红外光谱软测量建模方法

IF 2.1 4区化学 Q1 SOCIAL WORK

Journal of Chemometrics

Pub Date : 2025-12-22 DOI: 10.1002/cem.70095

Shun Li, Fangkun Zhang, Shuobo Chen, Baoming Shan, Qilei Xu

This paper proposes a novel quantitative modeling and prediction approach for near-infrared (NIR) spectroscopy, combining a stacked target-related autoencoder with an extreme learning machine (STAE-ELM). The STAE performs hierarchical pre-training using multiple improved target-related autoencoders (TAEs) to extract deep spectral features highly correlated with target values. Crucially, the top-level structure of the STAE is replaced by the ELM, which serves as the final prediction model. This integration streamlines training by reducing parameters and steps while simultaneously enhancing performance through optimized initialization of the ELM's weights and biases. Compared to conventional feature selection methods and stacked autoencoders, the STAE-ELM extracts more comprehensive and target-relevant deep features from spectral data, mitigating overfitting risks. The method's efficacy was validated on five open NIR datasets, benchmarking against three approaches: feature selection modeling, SAE-based feature extraction modeling, and backpropagation-based deep network modeling. Results demonstrate that calibration models built with STAE-ELM achieved average reductions in RMSEP of 18.48%, 5.74%, and 12.14%, respectively compared to these benchmarks. Furthermore, modeling efficiency was significantly improved over the backpropagation-based deep network approach.

本文提出了一种新的近红外光谱定量建模和预测方法，该方法将堆叠目标相关自编码器与极限学习机（STAE-ELM）相结合。STAE使用多个改进的目标相关自编码器（TAEs）进行分层预训练，以提取与目标值高度相关的深度光谱特征。至关重要的是，STAE的顶层结构被ELM取代，ELM作为最终的预测模型。这种集成通过减少参数和步骤来简化训练，同时通过优化初始化ELM的权重和偏差来提高性能。与传统的特征选择方法和堆叠式自编码器相比，STAE-ELM从光谱数据中提取更全面和与目标相关的深层特征，降低了过拟合风险。在五个开放的近红外数据集上验证了该方法的有效性，并对三种方法进行了基准测试：特征选择建模、基于sae的特征提取建模和基于反向传播的深度网络建模。结果表明，与这些基准相比，使用STAE-ELM构建的校准模型的RMSEP平均降低了18.48%，5.74%和12.14%。此外，与基于反向传播的深度网络方法相比，建模效率显著提高。

{"title":"Stacked Target-Related Autoencoder-Extreme Learning Machine: A Novel Soft Measurement Modeling Approach for Near-Infrared Spectroscopy","authors":"Shun Li, Fangkun Zhang, Shuobo Chen, Baoming Shan, Qilei Xu","doi":"10.1002/cem.70095","DOIUrl":"https://doi.org/10.1002/cem.70095","url":null,"abstract":"<div>\u0000 \u0000 <p>This paper proposes a novel quantitative modeling and prediction approach for near-infrared (NIR) spectroscopy, combining a stacked target-related autoencoder with an extreme learning machine (STAE-ELM). The STAE performs hierarchical pre-training using multiple improved target-related autoencoders (TAEs) to extract deep spectral features highly correlated with target values. Crucially, the top-level structure of the STAE is replaced by the ELM, which serves as the final prediction model. This integration streamlines training by reducing parameters and steps while simultaneously enhancing performance through optimized initialization of the ELM's weights and biases. Compared to conventional feature selection methods and stacked autoencoders, the STAE-ELM extracts more comprehensive and target-relevant deep features from spectral data, mitigating overfitting risks. The method's efficacy was validated on five open NIR datasets, benchmarking against three approaches: feature selection modeling, SAE-based feature extraction modeling, and backpropagation-based deep network modeling. Results demonstrate that calibration models built with STAE-ELM achieved average reductions in RMSEP of 18.48%, 5.74%, and 12.14%, respectively compared to these benchmarks. Furthermore, modeling efficiency was significantly improved over the backpropagation-based deep network approach.</p>\u0000 </div>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"40 1","pages":""},"PeriodicalIF":2.1,"publicationDate":"2025-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145891329","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Application of an ECAM-ConvNeXt Model With Multichannel Spectrogram Based on Vis–NIR for Soil Property Prediction 基于多通道光谱图的ECAM-ConvNeXt模型在近红外土壤性质预测中的应用

IF 2.1 4区化学 Q1 SOCIAL WORK

Journal of Chemometrics

Pub Date : 2025-12-10 DOI: 10.1002/cem.70092

Qinghao Shuai, Zhengguang Chen, Shuo Liu, Quan Wang

Vis–NIR spectroscopy is increasingly widely used for soil property analysis due to its rapid, cost-effective, and nondestructive advantages. In particular, deep learning models perform very well when working with large sample data. In this study, we propose a deep learning model based on three-channel ECAM-ConvNeXt. Firstly, the method applies three window functions, Bartlett, Gaussian, and Blackman, in the short-time Fourier transform to convert a one-dimensional spectral sequence signal into three different two-dimensional spectrograms. Next, we perform multichannel feature fusion and use the resulting triple-channel spectrograms as model inputs. This method fully preserves the temporal information and spectral characteristics of the spectral sequence, thereby improving the performance of the model. Secondly, this study introduces the Efficient Channel Attention Module in the ConvNeXt model. This module combines the advantages of the Convolutional Block Attention Module and Efficient Channel Attention Network, further enhancing the expressive ability of the network by highlighting useful information and suppressing irrelevant information. Finally, we also validate the effectiveness of multichannel inputs by deep learning models (AlexNet18, ResNet50, MobileNet-V3, EfficientNet, VIT) and compare them with existing techniques reported in the literature. The results indicate that the root-mean-square error (RMSE) of the TriCH-ECAM-ConvNeXt model in predicting soil nitrogen content (N (g/kg)), organic carbon content (OC (g/kg)), cation exchange capacity (CEC (cmol(+)/kg)), pH, clay content (%), and sand content (%) was reduced to 0.9847, 19.7347, 6.3380, 0.3812, 5.1537, and 12.9706, respectively, and the coefficient of determination (R²) increased to 0.9307, 0.9544, 0.7999, 0.9206, 0.8493, and 0.7526, respectively.

可见-近红外光谱由于其快速、经济、无损等优点，在土壤性质分析中得到越来越广泛的应用。特别是，深度学习模型在处理大样本数据时表现非常好。在本研究中，我们提出了一种基于三通道ECAM-ConvNeXt的深度学习模型。该方法首先在短时傅里叶变换中应用Bartlett、Gaussian和Blackman三个窗函数，将一维谱序列信号转换为三个不同的二维谱图。接下来，我们执行多通道特征融合，并使用得到的三通道频谱图作为模型输入。该方法充分保留了光谱序列的时间信息和光谱特征，从而提高了模型的性能。其次，在ConvNeXt模型中引入了高效通道注意模块。该模块结合了卷积块注意模块和高效通道注意网络的优点，通过突出有用信息，抑制无关信息，进一步增强网络的表达能力。最后，我们还通过深度学习模型（AlexNet18、ResNet50、MobileNet-V3、EfficientNet、VIT）验证了多通道输入的有效性，并将它们与文献中报道的现有技术进行了比较。结果表明：TriCH-ECAM-ConvNeXt模型预测土壤氮含量（N (g/kg)）、有机碳含量（OC (g/kg)）、阳离子交换容量(CEC （cmol(+)/kg）、pH、粘土含量（%）、砂土含量（%）的均方根误差（RMSE）分别降低至0.9847、19.7347、6.3380、0.3812、5.1537、12.9706，决定系数（R2）分别提高至0.9307、0.9544、0.7999、0.9206、0.8493、0.7526。

{"title":"Application of an ECAM-ConvNeXt Model With Multichannel Spectrogram Based on Vis–NIR for Soil Property Prediction","authors":"Qinghao Shuai, Zhengguang Chen, Shuo Liu, Quan Wang","doi":"10.1002/cem.70092","DOIUrl":"https://doi.org/10.1002/cem.70092","url":null,"abstract":"<div>\u0000 \u0000 <p>Vis–NIR spectroscopy is increasingly widely used for soil property analysis due to its rapid, cost-effective, and nondestructive advantages. In particular, deep learning models perform very well when working with large sample data. In this study, we propose a deep learning model based on three-channel ECAM-ConvNeXt. Firstly, the method applies three window functions, Bartlett, Gaussian, and Blackman, in the short-time Fourier transform to convert a one-dimensional spectral sequence signal into three different two-dimensional spectrograms. Next, we perform multichannel feature fusion and use the resulting triple-channel spectrograms as model inputs. This method fully preserves the temporal information and spectral characteristics of the spectral sequence, thereby improving the performance of the model. Secondly, this study introduces the Efficient Channel Attention Module in the ConvNeXt model. This module combines the advantages of the Convolutional Block Attention Module and Efficient Channel Attention Network, further enhancing the expressive ability of the network by highlighting useful information and suppressing irrelevant information. Finally, we also validate the effectiveness of multichannel inputs by deep learning models (AlexNet18, ResNet50, MobileNet-V3, EfficientNet, VIT) and compare them with existing techniques reported in the literature. The results indicate that the root-mean-square error (RMSE) of the TriCH-ECAM-ConvNeXt model in predicting soil nitrogen content (N (g/kg)), organic carbon content (OC (g/kg)), cation exchange capacity (CEC (cmol(+)/kg)), pH, clay content (%), and sand content (%) was reduced to 0.9847, 19.7347, 6.3380, 0.3812, 5.1537, and 12.9706, respectively, and the coefficient of determination (<i>R</i><sup>2</sup>) increased to 0.9307, 0.9544, 0.7999, 0.9206, 0.8493, and 0.7526, respectively.</p>\u0000 </div>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"39 12","pages":""},"PeriodicalIF":2.1,"publicationDate":"2025-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145750946","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Rapid Multi-Indicator Quality Evaluation of Ginger Using Genetic Algorithm and Near-Infrared Spectroscopy 基于遗传算法和近红外光谱的生姜多指标快速质量评价

IF 2.1 4区化学 Q1 SOCIAL WORK

Journal of Chemometrics

Pub Date : 2025-12-10 DOI: 10.1002/cem.70090

Tianshu Wang, Chengwu Chen, Hui Yan, Kongfa Hu, Xichen Yang, Xia Zhang, Guisheng Zhou, Jinao Duan

To achieve rapid and comprehensive evaluation of the quality of ginger, a rapid multi-indicator quality evaluation method based on genetic algorithm and near-infrared spectroscopy technology is proposed to detect the content of multiple compounds (6-gingerol, 8-gingerol, 10-gingerol, 6-shogaol, and zingerone). First, the near-infrared spectra of ginger samples is collected. Then, the spectra is preprocessed to reduce the noise. Next, features of the spectra are extracted through the genetic algorithm where the population initialization and fitness function methods are designed. Finally, the prediction model is generated through regression. Experimental results demonstrate that the proposed method achieves higher R values (0.9052, 0.9107, 0.9269, 0.9843, 0.9030) compared to the traditional PLSR model (0.6666, 0.51, 0.4358, 0.9248, 0.4846) for 6-gingerol, 8-gingerol, 10-gingerol, 6-shogaol, and zingerone, respectively. Therefore, the proposed method can reduce prediction errors and improve the performance of near-infrared spectroscopy quantitative analysis model for ginger.

为实现对生姜质量的快速综合评价，提出了一种基于遗传算法和近红外光谱技术的快速多指标质量评价方法，检测6-姜辣素、8-姜辣素、10-姜辣素、6-姜辣素和姜酮等多种化合物的含量。首先，采集生姜样品的近红外光谱。然后，对光谱进行预处理，去除噪声。其次，通过遗传算法提取光谱特征，设计种群初始化和适应度函数方法；最后，通过回归生成预测模型。实验结果表明，与传统PLSR模型（0.6666、0.51、0.4358、0.9248、0.4846）相比，该方法对6-姜辣素、8-姜辣素、10-姜辣素、6-姜辣素和姜辣素分别获得了更高的R值（0.9052、0.9107、0.9269、0.9843、0.9030）。因此，该方法可以降低生姜近红外光谱定量分析模型的预测误差，提高模型的性能。

{"title":"Rapid Multi-Indicator Quality Evaluation of Ginger Using Genetic Algorithm and Near-Infrared Spectroscopy","authors":"Tianshu Wang, Chengwu Chen, Hui Yan, Kongfa Hu, Xichen Yang, Xia Zhang, Guisheng Zhou, Jinao Duan","doi":"10.1002/cem.70090","DOIUrl":"https://doi.org/10.1002/cem.70090","url":null,"abstract":"<div>\u0000 \u0000 <p>To achieve rapid and comprehensive evaluation of the quality of ginger, a rapid multi-indicator quality evaluation method based on genetic algorithm and near-infrared spectroscopy technology is proposed to detect the content of multiple compounds (6-gingerol, 8-gingerol, 10-gingerol, 6-shogaol, and zingerone). First, the near-infrared spectra of ginger samples is collected. Then, the spectra is preprocessed to reduce the noise. Next, features of the spectra are extracted through the genetic algorithm where the population initialization and fitness function methods are designed. Finally, the prediction model is generated through regression. Experimental results demonstrate that the proposed method achieves higher R values (0.9052, 0.9107, 0.9269, 0.9843, 0.9030) compared to the traditional PLSR model (0.6666, 0.51, 0.4358, 0.9248, 0.4846) for 6-gingerol, 8-gingerol, 10-gingerol, 6-shogaol, and zingerone, respectively. Therefore, the proposed method can reduce prediction errors and improve the performance of near-infrared spectroscopy quantitative analysis model for ginger.</p>\u0000 </div>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"39 12","pages":""},"PeriodicalIF":2.1,"publicationDate":"2025-12-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145750947","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0