Journal of Chemometrics最新文献_第4页

Outlier Detection Using Immersive Analytics With Virtual Reality 使用沉浸式分析与虚拟现实的异常值检测

IF 2.1 4区化学 Q1 SOCIAL WORK

Journal of Chemometrics

Pub Date : 2025-10-20 DOI: 10.1002/cem.70079

Hyrum J. Redd, Jordan M. J. Peper, John H. Kalivas

This paper honors Paul Geladi with a new chemometric data visualization approach based on immersive analytics principles using virtual reality (VR). With the many technological advancements improving the accessibility of extended reality (XR), including VR, the frontier of immersive analytics is wide open for utilizing VR as a powerful chemometric data analysis tool. Immersion of human senses into a virtually generated three-dimensional (3D) world compels us to act more instinctively in complex data decision-making scenarios by using our inherent cognitive pattern recognition capacity learned over years of experience. Proposed is an immersive analytics application of VR for a hybrid human/computer-aided outlier detection process. In this application, each training set sample is virtually realized as a glyph to visually assess the inter- and intra-sample relationships present in datasets while mining for nonrepresentative samples. Sample glyph shape and size are developed using hundreds of sample similarity measurements for each sample based both on spectral and prediction property information. These sample glyphs are visually and spatially compared with one another in VR by the user. Outlier checking in VR safeguards against masking and swamping problems that are difficult to recognize with automatic algorithms. Results from dataset situations based on near infrared (NIR) and ultraviolet (UV) spectra and analyte reference values show the viability of using VR for data analysis and outlier detection. This VR application demonstrates the looming evolution of immersive analytics with an XR interface involving human reasoning in difficult chemometric data analysis settings.

本文以一种基于虚拟现实（VR）沉浸式分析原理的新的化学计量数据可视化方法来纪念Paul Geladi。随着包括VR在内的许多技术进步提高了扩展现实（XR）的可访问性，沉浸式分析的前沿为利用VR作为强大的化学计量数据分析工具敞开了大门。人类感官沉浸在虚拟生成的三维（3D）世界中，迫使我们在复杂的数据决策场景中更本能地行动，利用我们多年经验中习得的固有认知模式识别能力。提出了一种沉浸式虚拟现实分析应用程序，用于混合人/计算机辅助离群值检测过程。在这个应用程序中，每个训练集样本实际上被实现为一个符号，以便在挖掘非代表性样本时直观地评估数据集中存在的样本间和样本内关系。基于光谱和预测属性信息，对每个样本使用数百个样本相似性测量来开发样本字形形状和大小。这些样例符号在视觉上和空间上由用户在VR中相互比较。VR中的异常值检查可以防止自动算法难以识别的掩盖和淹没问题。基于近红外（NIR）和紫外（UV）光谱以及分析物参考值的数据集情况的结果表明，将VR用于数据分析和异常值检测是可行的。这个虚拟现实应用程序展示了沉浸式分析迫在眉睫的发展，其中包括在困难的化学计量数据分析设置中涉及人类推理的XR界面。

{"title":"Outlier Detection Using Immersive Analytics With Virtual Reality","authors":"Hyrum J. Redd, Jordan M. J. Peper, John H. Kalivas","doi":"10.1002/cem.70079","DOIUrl":"https://doi.org/10.1002/cem.70079","url":null,"abstract":"<div>\u0000 \u0000 <p>This paper honors Paul Geladi with a new chemometric data visualization approach based on immersive analytics principles using virtual reality (VR). With the many technological advancements improving the accessibility of extended reality (XR), including VR, the frontier of immersive analytics is wide open for utilizing VR as a powerful chemometric data analysis tool. Immersion of human senses into a virtually generated three-dimensional (3D) world compels us to act more instinctively in complex data decision-making scenarios by using our inherent cognitive pattern recognition capacity learned over years of experience. Proposed is an immersive analytics application of VR for a hybrid human/computer-aided outlier detection process. In this application, each training set sample is virtually realized as a glyph to visually assess the inter- and intra-sample relationships present in datasets while mining for nonrepresentative samples. Sample glyph shape and size are developed using hundreds of sample similarity measurements for each sample based both on spectral and prediction property information. These sample glyphs are visually and spatially compared with one another in VR by the user. Outlier checking in VR safeguards against masking and swamping problems that are difficult to recognize with automatic algorithms. Results from dataset situations based on near infrared (NIR) and ultraviolet (UV) spectra and analyte reference values show the viability of using VR for data analysis and outlier detection. This VR application demonstrates the looming evolution of immersive analytics with an XR interface involving human reasoning in difficult chemometric data analysis settings.</p>\u0000 </div>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"39 11","pages":""},"PeriodicalIF":2.1,"publicationDate":"2025-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145327814","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Immersive Analytics in Critical Spatial Domains: From Materials to Energy Systems 沉浸式分析在关键空间领域：从材料到能源系统

IF 2.1 4区化学 Q1 SOCIAL WORK

Journal of Chemometrics

Pub Date : 2025-10-14 DOI: 10.1002/cem.70076

Rajiv Khadka

Immersive analytics (IA) leverages virtual reality, augmented reality, and mixed reality to transform how users interact with complex datasets across domains such as science, industry, and education. These immersive technologies offer spatial and multimodal environments that foster intuitive exploration, but they also introduce challenges related to cognitive load, interface design, and system performance. This article presents a comprehensive review of visualization techniques, interaction models, and multimodal inputs utilized in IA. Drawing on case studies in scientific visualization, industrial training, and educational communication, we examine both the potential and limitations of current systems. Finally, we propose future research directions, focusing on real-time collaboration, adaptive user interfaces, and scalable data exploration strategies to advance the field.

沉浸式分析（IA）利用虚拟现实、增强现实和混合现实来改变用户与科学、工业和教育等领域复杂数据集的交互方式。这些沉浸式技术提供了空间和多模式环境，促进了直观的探索，但它们也引入了与认知负荷、界面设计和系统性能相关的挑战。本文全面回顾了可视化技术、交互模型和IA中使用的多模态输入。通过科学可视化、工业培训和教育交流方面的案例研究，我们考察了当前系统的潜力和局限性。最后，我们提出了未来的研究方向，重点是实时协作，自适应用户界面和可扩展的数据探索策略，以推进该领域的发展。

引用次数: 0

Comparison Between Portable and Bench-Top Near-Infrared Spectroscopy for Corn Silage Characterization Using Partial Least Square and Support Vector Regression Methods 基于偏最小二乘法和支持向量回归的便携式和台式近红外光谱玉米青贮特征分析比较

IF 2.1 4区化学 Q1 SOCIAL WORK

Journal of Chemometrics

Pub Date : 2025-10-08 DOI: 10.1002/cem.70073

Jefferson Tales Oliva, Vinicius Herique Kieling, Felipe Augusto Bueno Rossi, Erick Oliveira Rodrigues, Giovanni Alfredo Guarneri, Larissa Macedo dos Santos Tonial

In this work, bench-top and portable near-infrared (NIR) spectroscopy equipment are compared in the context of generating predictive models for the quantification of phosphorus (P), potassium (K), and nitrogen (N) components from corn silage samples. For this, 200 spectral samples resulting from bench-top and portable NIR are preprocessed by the following sequence of approaches: mean centering application for removing the spectral bias/offset, Savitzky–Golay filter for highlighting signal energy absorption in relation to spectral noise, interval partial least square (iPLS) for selection of spectral region, and Monte Carlo method for outlier detection and removal. Then, from the preprocessed spectra, predictive models were built using the partial least squares (PLS) and support vector regression (SVR) methods for each chemical component and NIR equipment. In this sense, six models are generated, three for each NIR spectroscopy (or two for each element). As a result, considering all components and machine learning (ML) methods, bench-top models achieved R² values between 0.66 (quantification of P using PLS or SVR) and 0.81 (prediction of K and N using SVR regressors) during the validation, whereas portable ones achieved values between 0.50 (prediction of K using SVR) and 0.67 (quantification of N using PLS). Our results can be considered competitive, as robust and accurate predictors are also generated.

在这项工作中，比较了台式和便携式近红外（NIR）光谱设备在玉米青贮样品中磷(P)、钾(K)和氮(N)成分定量预测模型的生成情况。为此，我们对200个来自台式和便携式近红外的光谱样本进行了预处理，采用了以下一系列方法：均值居中用于去除光谱偏置/偏置，Savitzky-Golay滤波用于突出与光谱噪声相关的信号能量吸收，区间偏最小二乘法（iPLS）用于选择光谱区域，蒙特卡罗方法用于异常值检测和去除。然后，根据预处理后的光谱，利用偏最小二乘（PLS）和支持向量回归（SVR）方法对各化学成分和近红外设备建立预测模型。在这个意义上，产生了六个模型，每个近红外光谱三个（或每个元素两个）。因此，考虑到所有组件和机器学习（ML）方法，在验证期间，台式模型的R2值在0.66（使用PLS或SVR量化P）和0.81（使用SVR回归量预测K和N）之间，而便携式模型的R2值在0.50（使用SVR预测K）和0.67（使用PLS量化N）之间。我们的结果可以被认为是有竞争力的，因为也产生了稳健和准确的预测因子。

{"title":"Comparison Between Portable and Bench-Top Near-Infrared Spectroscopy for Corn Silage Characterization Using Partial Least Square and Support Vector Regression Methods","authors":"Jefferson Tales Oliva, Vinicius Herique Kieling, Felipe Augusto Bueno Rossi, Erick Oliveira Rodrigues, Giovanni Alfredo Guarneri, Larissa Macedo dos Santos Tonial","doi":"10.1002/cem.70073","DOIUrl":"https://doi.org/10.1002/cem.70073","url":null,"abstract":"<p>In this work, bench-top and portable near-infrared (NIR) spectroscopy equipment are compared in the context of generating predictive models for the quantification of phosphorus (P), potassium (K), and nitrogen (N) components from corn silage samples. For this, 200 spectral samples resulting from bench-top and portable NIR are preprocessed by the following sequence of approaches: mean centering application for removing the spectral bias/offset, Savitzky–Golay filter for highlighting signal energy absorption in relation to spectral noise, interval partial least square (iPLS) for selection of spectral region, and Monte Carlo method for outlier detection and removal. Then, from the preprocessed spectra, predictive models were built using the partial least squares (PLS) and support vector regression (SVR) methods for each chemical component and NIR equipment. In this sense, six models are generated, three for each NIR spectroscopy (or two for each element). As a result, considering all components and machine learning (ML) methods, bench-top models achieved <i>R</i><sup>2</sup> values between 0.66 (quantification of P using PLS or SVR) and 0.81 (prediction of K and N using SVR regressors) during the validation, whereas portable ones achieved values between 0.50 (prediction of K using SVR) and 0.67 (quantification of N using PLS). Our results can be considered competitive, as robust and accurate predictors are also generated.</p>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"39 10","pages":""},"PeriodicalIF":2.1,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://analyticalsciencejournals.onlinelibrary.wiley.com/doi/epdf/10.1002/cem.70073","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145243059","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Expanding Multivariate Analysis Principles in Conventional Chemometric Processes 扩展传统化学计量过程中的多元分析原理

IF 2.1 4区化学 Q1 SOCIAL WORK

Journal of Chemometrics

Pub Date : 2025-09-14 DOI: 10.1002/cem.70067

John H. Kalivas, Robert C. Spiers

Chemometrics encompasses numerous facets such as experimental design, data collection and analysis, and many others. This paper, in honor of Paul Geladi, provides our perspective on growing the scientific intuition of multivariate analysis in conventional chemometric directions not generally practicing multivariate principles. The motivation for this perspective is to express our opinion on the need for chemometrics to expand the role of the Rashomon effect beyond “many models predict well” by integrating a more comprehensive consideration of the multivariate nature of matrix effects. Described are multiple chemometric techniques that have already been enhanced by broadening the application the Rashomon effect including model selection and explanation, figures of merit (FOM), sample similarity assessment for model reliability, outlier detection, and classification—all recent research topics from the authors. This expository discussion revolves around spectroscopic data such as near infrared and fluorescence, but the concepts are relevant to other chemometric data structures.

化学计量学包括许多方面，如实验设计，数据收集和分析，以及许多其他。为了纪念Paul Geladi，本文提供了我们在传统化学计量学方向上发展多元分析的科学直觉的观点，而不是通常实践多元原理。这一观点的动机是表达我们对化学计量学的需求，通过对矩阵效应的多变量性质进行更全面的考虑，将罗生门效应的作用扩展到“许多模型预测良好”之外。描述了多种化学计量学技术，这些技术已经通过扩大罗生门效应的应用而得到了加强，包括模型选择和解释、优点图（FOM）、模型可靠性的样本相似性评估、离群值检测和分类——所有这些都是作者最近的研究课题。这个说明性的讨论围绕光谱数据，如近红外和荧光，但概念是相关的其他化学计量数据结构。

引用次数: 0

IF 2.1 4区化学 Q1 SOCIAL WORK

Journal of Chemometrics

Pub Date : 2025-09-04 DOI: 10.1002/cem.70061

Tânia F. G. G. Cova, Alberto A. C. C. Pais

Clustering of binary data is central to various applications, particularly in the fields of medical diagnostics, chemistry, and chemoinformatics. However, standard similarity measures often fail to capture the informative value of rare features and matching absences, treating all attributes as equally relevant. This can lead to suboptimal clustering, especially when informative patterns are hidden in low-frequency features. This study proposes a probability-weighted approach to measuring similarity, which gives more weight to rare features and accounts for the value of shared absences based on their occurrence probabilities. We analyze how this adjustment impacts clustering results, using visual comparisons and experiments on real datasets. The results show consistent gains in clustering precision and stability compared to standard measures. Our findings suggest that incorporating the rarity of features into similarity computation can offer a more reliable basis for clustering binary data, especially in domains where rare signals carry meaningful information.

二进制数据的聚类是各种应用的核心，特别是在医学诊断、化学和化学信息学领域。然而，标准的相似性度量往往不能捕获稀有特征和匹配缺失的信息价值，将所有属性视为同等相关。这可能导致次优聚类，特别是当信息模式隐藏在低频特征中时。本文提出了一种概率加权方法来衡量相似性，该方法赋予罕见特征更多的权重，并根据它们的出现概率来计算共享缺席的值。我们使用视觉比较和真实数据集的实验来分析这种调整如何影响聚类结果。结果表明，与标准度量相比，聚类精度和稳定性得到了一致的提高。我们的研究结果表明，将特征的稀缺性纳入相似度计算可以为二元数据的聚类提供更可靠的基础，特别是在罕见信号携带有意义信息的领域。

引用次数: 0

Improving Grading Accuracy by Optimizing the Logistic Loss Function in PLS Modelling 通过优化PLS建模中的Logistic损失函数来提高分级精度

IF 2.1 4区化学 Q1 SOCIAL WORK

Journal of Chemometrics

Pub Date : 2025-09-02 DOI: 10.1002/cem.70064

Zhonghai He, Huilong Sheng, Yi Zhang, Xiaofang Zhang

The prediction results from Partial Least Squares (PLS) model are commonly used to assess whether a product meets quality standards, or whether adjustments are needed in production process parameters. It's easy to understand that misgrading is mostly occurred for marginal samples (samples near the threshold). We propose Logistic-Enhanced PLS (LE-PLS) model, which defines a logistic loss function and minimizes it via gradient descent to optimize the PLS projection vector. The prediction result of LE-PLS for marginal samples tends to be far away from the threshold value. This optimization enables LE-PLS to enhance grading capability while largely maintaining the regression accuracy of the PLS. LE-PLS was evaluated on two real-world datasets (bean pulp and corn gluten meal) and one simulated dataset, correcting 10 out of 19 misgraded samples, 6 out of 7, and 6 out of 12, respectively. Statistical analysis using paired t-tests confirmed that these improvements were significant. Although RMSEP increased slightly, the change remained within an acceptable range considering the substantial enhancement in grading performance. The algorithm has a computational complexity of $� � � O � (�, � iteration � * � samples � * � variables �, �) � � �$ during modeling training. However, its prediction-phase complexity is only $� � � O � (�, � samples � * � variables �, �) � � �$ . Given these advantages, LE-PLS is well-suited for practical applications in NIR-based quality grading of products.

偏最小二乘（PLS）模型的预测结果通常用于评估产品是否符合质量标准，或是否需要调整生产工艺参数。很容易理解，错误分级主要发生在边缘样本（接近阈值的样本）。本文提出了logistic增强型PLS （LE-PLS）模型，该模型定义了logistic损失函数，并通过梯度下降最小化logistic损失函数来优化PLS投影向量。边际样本的LE-PLS预测结果趋向于远离阈值。在两个真实数据集（豆浆和玉米蛋白粉）和一个模拟数据集上对LE-PLS进行了评估，分别纠正了19个错误样本中的10个、7个样本中的6个和12个样本中的6个。使用配对t检验的统计分析证实了这些改善是显著的。虽然RMSEP略有增加，但考虑到分级性能的实质性提高，变化仍在可接受的范围内。该算法在建模训练过程中的计算复杂度为O迭代*样本*变量$$ mathrm{O}left({mathrm{iteration}}^{ast }{mathrm{samples}}^{ast}mathrm{variables}right) $$。然而，其预测阶段复杂度仅为O个样本*个变量$$ mathrm{O}left({mathrm{samples}}^{ast}mathrm{variables}right) $$。鉴于这些优点，LE-PLS非常适合于基于nir的产品质量分级的实际应用。

{"title":"Improving Grading Accuracy by Optimizing the Logistic Loss Function in PLS Modelling","authors":"Zhonghai He, Huilong Sheng, Yi Zhang, Xiaofang Zhang","doi":"10.1002/cem.70064","DOIUrl":"10.1002/cem.70064","url":null,"abstract":"<div>\u0000 \u0000 <p>The prediction results from Partial Least Squares (PLS) model are commonly used to assess whether a product meets quality standards, or whether adjustments are needed in production process parameters. It's easy to understand that misgrading is mostly occurred for marginal samples (samples near the threshold). We propose Logistic-Enhanced PLS (LE-PLS) model, which defines a logistic loss function and minimizes it via gradient descent to optimize the PLS projection vector. The prediction result of LE-PLS for marginal samples tends to be far away from the threshold value. This optimization enables LE-PLS to enhance grading capability while largely maintaining the regression accuracy of the PLS. LE-PLS was evaluated on two real-world datasets (bean pulp and corn gluten meal) and one simulated dataset, correcting 10 out of 19 misgraded samples, 6 out of 7, and 6 out of 12, respectively. Statistical analysis using paired <i>t</i>-tests confirmed that these improvements were significant. Although RMSEP increased slightly, the change remained within an acceptable range considering the substantial enhancement in grading performance. The algorithm has a computational complexity of <span></span><math>\u0000 <semantics>\u0000 <mrow>\u0000 <mi>O</mi>\u0000 <mfenced>\u0000 <mrow>\u0000 <mtext>iteration</mtext>\u0000 <mo>*</mo>\u0000 <mtext>samples</mtext>\u0000 <mo>*</mo>\u0000 <mtext>variables</mtext>\u0000 </mrow>\u0000 </mfenced>\u0000 </mrow>\u0000 <annotation>$$ mathrm{O}left({mathrm{iteration}}^{ast }{mathrm{samples}}^{ast}mathrm{variables}right) $$</annotation>\u0000 </semantics></math> during modeling training. However, its prediction-phase complexity is only <span></span><math>\u0000 <semantics>\u0000 <mrow>\u0000 <mi>O</mi>\u0000 <mfenced>\u0000 <mrow>\u0000 <mtext>samples</mtext>\u0000 <mo>*</mo>\u0000 <mtext>variables</mtext>\u0000 </mrow>\u0000 </mfenced>\u0000 </mrow>\u0000 <annotation>$$ mathrm{O}left({mathrm{samples}}^{ast}mathrm{variables}right) $$</annotation>\u0000 </semantics></math>. Given these advantages, LE-PLS is well-suited for practical applications in NIR-based quality grading of products.</p>\u0000 </div>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"39 9","pages":""},"PeriodicalIF":2.1,"publicationDate":"2025-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144935016","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Paul Geladi Legacy: Pioneering Chemometrics for the Future Paul Geladi的遗产：未来化学计量学的先驱

IF 2.1 4区化学 Q1 SOCIAL WORK

Journal of Chemometrics

Pub Date : 2025-08-29 DOI: 10.1002/cem.70065

Beatriz Galindo-Prieto

<p>This special issue, entitled ‘Paul Geladi Legacy: Pioneering Chemometrics for the Future’, is a tribute to the remarkable scientific contributions of Professor Paul Geladi to the field of chemometrics. This very special issue brings together a comprehensive collection of topics that reflect the breadth and depth of Paul's work in chemometrics. While nice memories and Paul's interests in science have been shared by some of his friends and colleagues in recent publications, this editorial and its related special issue will focus on some of the most relevant scientific areas that Professor Paul Geladi explored throughout his prolific career. The title of this special issue honouring Paul is not trivial. For many years, Paul emphasized the future of chemometrics as an important and in-depth topic that should be part of scientific meetings, conferences and specialized literature. Indeed, as Paul remarked on several occasions, pioneering chemometrics for the future, not only by adapting its methodologies and advances to new challenges and technologies but also creating new chemometric research directions according to evolving trends in science, is crucial for the field of chemometrics to succeed. To achieve this, high-quality teaching and the education of the next generations in chemometrics is especially important, as well as fostering collaboration across research groups. An exemplar of the latter was the initiative led by Paul called ‘The Laboratory Profile’ (published at <i>Journal of Chemometrics</i> in the 90s), which strengthened the global network of chemometric laboratories and showcased the wide array of scientific activities taking place across university, research institutions and industry. The breadth of Paul's knowledge, enhanced from a rich network of scientists, enabled him to successfully apply the most suitable chemometric techniques across various applications.</p><p>Professor Paul Geladi was a dedicated educator. In 1986, when audiovisual resources were still rarely used in statistical lectures, Paul was ahead of his time publishing an article on the use of videotapes as pedagogic tools in chemometrics education. Besides, Paul wrote several tutorials on chemometric methods, two of which stand out as his most cited work. The first is his tutorial on principal component analysis (co-authored with Wold and Esbensen), which covers the most relevant aspects of PCA and its application, whilst the second tutorial focuses on partial least squares regression (co-authored with Kowalski) and covers the concept and algebra of the PLS algorithm. These tutorials published in international journals remain foundational references in the field. In addition, Paul authored three books of high relevance in the field of chemometrics. His book <i>Multi-Way Analysis with Applications in the Chemical Sciences</i> (co-authored with Smilde and Bro) provides chemometricians with the mathematical foundations needed to understand multi-way approaches and pra

本期特刊题为“Paul Geladi的遗产：未来化学计量学的先驱”，是对Paul Geladi教授对化学计量学领域卓越的科学贡献的致敬。这个非常特殊的问题汇集了一个全面的主题集合，反映了保罗在化学计量学方面工作的广度和深度。虽然保罗的一些朋友和同事在最近的出版物中分享了他对科学的美好回忆和兴趣，但这篇社论及其相关的特刊将重点关注保罗·格拉迪教授在其多产的职业生涯中探索的一些最相关的科学领域。这期纪念保罗的特刊的标题不是微不足道的。多年来，Paul强调化学计量学的未来是一个重要而深入的话题，应该成为科学会议、会议和专业文献的一部分。事实上，正如Paul在多个场合提到的，开拓未来的化学计量学，不仅要适应新的挑战和技术，而且要根据科学的发展趋势创造新的化学计量学研究方向，这对化学计量学领域的成功至关重要。为了实现这一目标，高质量的教学和下一代化学计量学的教育尤为重要，同时也促进了研究小组之间的合作。后者的一个例子是由Paul领导的名为“实验室简介”的倡议（发表在90年代的化学计量学杂志上），该倡议加强了化学计量实验室的全球网络，并展示了大学，研究机构和行业中发生的广泛的科学活动。Paul的知识广度，从丰富的科学家网络增强，使他能够成功地在各种应用中应用最合适的化学计量学技术。保罗·格拉迪教授是一位敬业的教育家。1986年，当视听资源还很少用于统计学讲座时，保罗已经走在了时代的前面，发表了一篇关于在化学计量学教育中使用录像带作为教学工具的文章。此外，保罗还写了几本关于化学计量学方法的教程，其中两本是他被引用最多的作品。第一本是他关于主成分分析的教程（与Wold和Esbensen合著），涵盖了PCA及其应用的最相关方面，而第二本教程侧重于偏最小二乘回归（与Kowalski合著），并涵盖了PLS算法的概念和代数。这些发表在国际期刊上的教程仍然是该领域的基础参考。此外，保罗还撰写了三本与化学计量学领域高度相关的书籍。他的著作《化学科学中的多路分析与应用》（与Smilde和Bro合著）为化学计量学家提供了理解多路方法并实际应用它们所需的数学基础。他的另外两本书《多元图像分析》和《高光谱图像分析技术与应用》（均与Grahn合著）在对图像分析感兴趣的化学计量学家中非常受欢迎。分析图像是保罗职业生涯的主要课题之一，这一点从他撰写的与多变量和高光谱成像相关的大量科学文章中可以看出。在他对方法发展的众多贡献中，保罗在80年代写的一篇最相关的文章讨论了肉类近红外反射光谱的线性化和散射校正（与MacDougall和Martens合著）。Paul出版了许多与化学计量学主题相关的优秀出版物和教材，如数据预处理，光谱学（特别是近红外），图像分析，多元校准，多路分析，变量选择，主成分分析（PCA），偏最小二乘（PLS），多组学数据分析，机器学习和化学计量学算法的开发。Paul的主要优势之一是他在广泛的学科中利用和适应化学计量学方法的新挑战的开创性能力。为了尽可能地代表Paul的科学遗产的广度，这期特刊包含了与化学、光谱学、高光谱成像、农业和食品科学、采样理论、环境健康、人工智能、分子生物学、组学和化学计量学方法的发展/优化有关的精选文章。我要感谢你们所有人的贡献和支持让这期特别的纪念Paul的《化学计量学杂志》成为可能。我相信保罗会很高兴看到这么多好朋友和同事的贡献，无论是通过写文章、评论还是以其他方式帮助他，以纪念他一生的工作。

{"title":"Paul Geladi Legacy: Pioneering Chemometrics for the Future","authors":"Beatriz Galindo-Prieto","doi":"10.1002/cem.70065","DOIUrl":"10.1002/cem.70065","url":null,"abstract":"<p>This special issue, entitled ‘Paul Geladi Legacy: Pioneering Chemometrics for the Future’, is a tribute to the remarkable scientific contributions of Professor Paul Geladi to the field of chemometrics. This very special issue brings together a comprehensive collection of topics that reflect the breadth and depth of Paul's work in chemometrics. While nice memories and Paul's interests in science have been shared by some of his friends and colleagues in recent publications, this editorial and its related special issue will focus on some of the most relevant scientific areas that Professor Paul Geladi explored throughout his prolific career. The title of this special issue honouring Paul is not trivial. For many years, Paul emphasized the future of chemometrics as an important and in-depth topic that should be part of scientific meetings, conferences and specialized literature. Indeed, as Paul remarked on several occasions, pioneering chemometrics for the future, not only by adapting its methodologies and advances to new challenges and technologies but also creating new chemometric research directions according to evolving trends in science, is crucial for the field of chemometrics to succeed. To achieve this, high-quality teaching and the education of the next generations in chemometrics is especially important, as well as fostering collaboration across research groups. An exemplar of the latter was the initiative led by Paul called ‘The Laboratory Profile’ (published at <i>Journal of Chemometrics</i> in the 90s), which strengthened the global network of chemometric laboratories and showcased the wide array of scientific activities taking place across university, research institutions and industry. The breadth of Paul's knowledge, enhanced from a rich network of scientists, enabled him to successfully apply the most suitable chemometric techniques across various applications.</p><p>Professor Paul Geladi was a dedicated educator. In 1986, when audiovisual resources were still rarely used in statistical lectures, Paul was ahead of his time publishing an article on the use of videotapes as pedagogic tools in chemometrics education. Besides, Paul wrote several tutorials on chemometric methods, two of which stand out as his most cited work. The first is his tutorial on principal component analysis (co-authored with Wold and Esbensen), which covers the most relevant aspects of PCA and its application, whilst the second tutorial focuses on partial least squares regression (co-authored with Kowalski) and covers the concept and algebra of the PLS algorithm. These tutorials published in international journals remain foundational references in the field. In addition, Paul authored three books of high relevance in the field of chemometrics. His book <i>Multi-Way Analysis with Applications in the Chemical Sciences</i> (co-authored with Smilde and Bro) provides chemometricians with the mathematical foundations needed to understand multi-way approaches and pra","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"39 9","pages":""},"PeriodicalIF":2.1,"publicationDate":"2025-08-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://analyticalsciencejournals.onlinelibrary.wiley.com/doi/epdf/10.1002/cem.70065","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144915074","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Online Simultaneous Determination of Astragalus Polysaccharides and Calycosin-7-O-β-D-Glucoside in Astragali Radix Percolate Based on Near-Infrared Spectroscopy Technology 近红外光谱技术在线同时测定过渗黄芪中黄芪多糖和毛蕊花素-7- o -β- d -葡萄糖苷

IF 2.1 4区化学 Q1 SOCIAL WORK

Journal of Chemometrics

Pub Date : 2025-08-22 DOI: 10.1002/cem.70062

Li Zha, Kaiqi Zhang, Die Xie, Yongming Luo, Xin Che, Lihong Wang

As a crucial extraction process in traditional Chinese medicine, quality control of percolation still faces challenges in real-time monitoring methods. To address this challenge, this study focused on the Astragalus percolation process and established an NIRS-based method for synchronous online monitoring of two bioactive markers in Astragalus percolates: Astragalus polysaccharides (APSs) and calycosin-7-O-β-D-glucoside (CG), achieving rapid and nondestructive analysis. In this study, near-infrared (NIR) spectra were collected online at different time points during percolation to determine APS and CG concentrations by means of NIRS technology, with high-performance liquid chromatography (HPLC) and ultraviolet–visible spectrophotometry (UV–Vis) used as reference methods. Two modeling approaches—partial least squares regression (PLSR) and support vector regression (SVR)—were employed to establish quantitative analytical models for these bioactive components, with model performance optimized through spectral preprocessing and feature variable selection. Results demonstrated that SVR-based models achieved superior predictive accuracy compared with PLSR. The optimal APS model showed calibration and validation set R² values of 0.9995 and 0.9874, respectively, while the CG model yielded 0.9811 (calibration) and 0.9632 (validation). Both components exhibited residual prediction deviation (RPD) values exceeding the threshold (RPD > 3), with 6.5349 for APS and 3.8357 for CG, confirming excellent predictive capability. Paired t-test analysis of external test sets (p > 0.05) revealed no statistically significant difference between measured and predicted values, further validating the model's robustness for unknown sample prediction. The concentrations of APS and CG in the Astragalus percolation solution can be simultaneously determined by this method within 30 s, significantly improving analytical efficiency compared with the conventional method (60–80 min per sample), while featuring simple operation, solvent-free consumption, low cost, and pollution-free advantages. This study demonstrates that the combination of NIRS and chemometrics enables real-time monitoring of multiple key substance concentrations during the percolation process. As a green analytical technology, NIRS shows significant potential for improving production efficiency and ensuring product quality consistency.

浸透作为中药提取的关键工艺，其质量控制在实时监测方法上仍面临挑战。为了解决这一挑战，本研究以黄芪的渗滤过程为研究对象，建立了基于nir的方法，对黄芪多糖（APSs）和毛蕊花苷-7- o -β- d -葡萄糖苷（CG）两种生物活性标志物进行同步在线监测，实现了快速无损分析。本研究以高效液相色谱法（HPLC）和紫外可见分光光度法（UV-Vis）为参比方法，在线采集渗透过程中不同时间点的近红外（NIR）光谱，测定APS和CG浓度。采用偏最小二乘回归（PLSR）和支持向量回归（SVR）两种建模方法建立生物活性成分定量分析模型，并通过光谱预处理和特征变量选择优化模型性能。结果表明，与PLSR相比，基于svr的模型具有更高的预测精度。最优APS模型的校正集R2为0.9995，验证集R2为0.9874，CG模型的校正集R2为0.9811，验证集R2为0.9632。两个分量的残差预测偏差（RPD）值均超过阈值（RPD > 3）， APS的残差预测偏差为6.5349，CG的残差预测偏差为3.8357，具有较好的预测能力。外部检验集配对t检验分析（p > 0.05）显示实测值与预测值之间无统计学差异，进一步验证了模型对未知样本预测的稳健性。该方法可在30 s内同时测定黄芪渗滤液中APS和CG的浓度，与常规方法（60-80 min /个样品）相比，分析效率显著提高，同时具有操作简单、无溶剂消耗、成本低、无污染等优点。本研究表明，近红外光谱和化学计量学的结合可以实时监测渗透过程中多种关键物质的浓度。近红外光谱作为一种绿色分析技术，在提高生产效率和确保产品质量一致性方面显示出巨大的潜力。

{"title":"Online Simultaneous Determination of Astragalus Polysaccharides and Calycosin-7-O-β-D-Glucoside in Astragali Radix Percolate Based on Near-Infrared Spectroscopy Technology","authors":"Li Zha, Kaiqi Zhang, Die Xie, Yongming Luo, Xin Che, Lihong Wang","doi":"10.1002/cem.70062","DOIUrl":"10.1002/cem.70062","url":null,"abstract":"<div>\u0000 \u0000 <p>As a crucial extraction process in traditional Chinese medicine, quality control of percolation still faces challenges in real-time monitoring methods. To address this challenge, this study focused on the Astragalus percolation process and established an NIRS-based method for synchronous online monitoring of two bioactive markers in Astragalus percolates: Astragalus polysaccharides (APSs) and calycosin-7-O-β-D-glucoside (CG), achieving rapid and nondestructive analysis. In this study, near-infrared (NIR) spectra were collected online at different time points during percolation to determine APS and CG concentrations by means of NIRS technology, with high-performance liquid chromatography (HPLC) and ultraviolet–visible spectrophotometry (UV–Vis) used as reference methods. Two modeling approaches—partial least squares regression (PLSR) and support vector regression (SVR)—were employed to establish quantitative analytical models for these bioactive components, with model performance optimized through spectral preprocessing and feature variable selection. Results demonstrated that SVR-based models achieved superior predictive accuracy compared with PLSR. The optimal APS model showed calibration and validation set <i>R</i><sup>2</sup> values of 0.9995 and 0.9874, respectively, while the CG model yielded 0.9811 (calibration) and 0.9632 (validation). Both components exhibited residual prediction deviation (RPD) values exceeding the threshold (RPD > 3), with 6.5349 for APS and 3.8357 for CG, confirming excellent predictive capability. Paired <i>t</i>-test analysis of external test sets (<i>p</i> > 0.05) revealed no statistically significant difference between measured and predicted values, further validating the model's robustness for unknown sample prediction. The concentrations of APS and CG in the Astragalus percolation solution can be simultaneously determined by this method within 30 s, significantly improving analytical efficiency compared with the conventional method (60–80 min per sample), while featuring simple operation, solvent-free consumption, low cost, and pollution-free advantages. This study demonstrates that the combination of NIRS and chemometrics enables real-time monitoring of multiple key substance concentrations during the percolation process. As a green analytical technology, NIRS shows significant potential for improving production efficiency and ensuring product quality consistency.</p>\u0000 </div>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"39 9","pages":""},"PeriodicalIF":2.1,"publicationDate":"2025-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144888491","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Deciphering the Distinctive Features of Alpha-D-mannopyranoside Structure From Similar Structures Against FimH Through ANN and PCA: Insights and Perspectives 利用人工神经网络和主成分分析法从抗FimH的相似结构中破译α - d -甘露吡喃苷结构的特征：见解和观点

IF 2.1 4区化学 Q1 SOCIAL WORK

Journal of Chemometrics

Pub Date : 2025-08-21 DOI: 10.1002/cem.70063

M. Dhanalakshmi, K. R. Jinuraj, Muhammed Iqbal, D. Sruthi, Kajari Das, Sushma Dave, N. Muthulakshmi Andal

This computational study aimed to demonstrate distinct characteristics of alpha-D-mannopyranoside structure, leveraging D-mannose and its analogs due to their known roles in host–pathogen interactions and potential to be used as nutraceuticals. Targeting bacterial adhesion is a critical strategy to combat urinary tract infections (UTIs), especially given rising antibiotic resistance. The FimH lectin on Escherichia coli is a key mediator of this adhesion, making it a compelling target for novel anti-adhesive therapies. We employed a multi-stage virtual screening pipeline to efficiently explore a vast chemical space around the ligands and their binding interactions. Ligand-based virtual screening, utilizing self-organizing maps (SOMs), clustered 5256 D-mannose-similar structures, identifying a promising subset of 141 molecules with 39 known bioassay actives. This was followed by structure-based ligand docking to precisely evaluate their inhibitory impact on FimH lectin. To understand the structural features driving activity, principal component analysis (PCA) was then applied to analyze the molecular structures and their physicochemical descriptors. Our analysis revealed that 15 compounds exhibited the highest binding energy and docking scores. Crucially, the alpha-D-mannopyranoside conformation demonstrated the most effective inhibitory profile. This superior activity, despite structural similarities, was differentiated by two 3D-matrix descriptors: HRG and Wi G, highlighting their significance in predicting subtle yet impactful conformational preferences.

本计算研究旨在展示- d -甘露糖pyranoside结构的独特特征，利用d -甘露糖及其类似物，因为它们在宿主-病原体相互作用中的已知作用和用作营养保健品的潜力。针对细菌粘连是对抗尿路感染（uti）的关键策略，特别是在抗生素耐药性上升的情况下。大肠杆菌上的FimH凝集素是这种粘附的关键介质，使其成为新型抗粘附疗法的引人注目的靶点。我们采用多级虚拟筛选管道来有效地探索配体及其结合相互作用周围的广阔化学空间。基于配体的虚拟筛选，利用自组织图（SOMs），聚集5256个d-甘露糖类似结构，鉴定出141个具有39种已知生物测定活性的分子。接下来是基于结构的配体对接，以精确评估它们对FimH凝集素的抑制作用。为了了解驱动活性的结构特征，应用主成分分析（PCA）对分子结构及其理化描述符进行了分析。我们的分析表明，15种化合物具有最高的结合能和对接分数。关键是，α - d -甘露pyranoside构象显示出最有效的抑制谱。尽管结构相似，但这种优越的活性由两个3d矩阵描述符区分：HRG和Wi G，突出了它们在预测微妙但有影响的构象偏好方面的重要性。

{"title":"Deciphering the Distinctive Features of Alpha-D-mannopyranoside Structure From Similar Structures Against FimH Through ANN and PCA: Insights and Perspectives","authors":"M. Dhanalakshmi, K. R. Jinuraj, Muhammed Iqbal, D. Sruthi, Kajari Das, Sushma Dave, N. Muthulakshmi Andal","doi":"10.1002/cem.70063","DOIUrl":"10.1002/cem.70063","url":null,"abstract":"<div>\u0000 \u0000 <p>This computational study aimed to demonstrate distinct characteristics of alpha-D-mannopyranoside structure, leveraging D-mannose and its analogs due to their known roles in host–pathogen interactions and potential to be used as nutraceuticals. Targeting bacterial adhesion is a critical strategy to combat urinary tract infections (UTIs), especially given rising antibiotic resistance. The FimH lectin on <i>Escherichia coli</i> is a key mediator of this adhesion, making it a compelling target for novel anti-adhesive therapies. We employed a multi-stage virtual screening pipeline to efficiently explore a vast chemical space around the ligands and their binding interactions. Ligand-based virtual screening, utilizing self-organizing maps (SOMs), clustered 5256 D-mannose-similar structures, identifying a promising subset of 141 molecules with 39 known bioassay actives. This was followed by structure-based ligand docking to precisely evaluate their inhibitory impact on FimH lectin. To understand the structural features driving activity, principal component analysis (PCA) was then applied to analyze the molecular structures and their physicochemical descriptors. Our analysis revealed that 15 compounds exhibited the highest binding energy and docking scores. Crucially, the alpha-D-mannopyranoside conformation demonstrated the most effective inhibitory profile. This superior activity, despite structural similarities, was differentiated by two 3D-matrix descriptors: HRG and Wi G, highlighting their significance in predicting subtle yet impactful conformational preferences.</p>\u0000 </div>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"39 9","pages":""},"PeriodicalIF":2.1,"publicationDate":"2025-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144881314","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Online Monitoring Scheme Using GLPP Through Kantorovich Distance Combined With a Sliding Window Technique for Nonlinear Dynamic Process Fault Detection 基于Kantorovich距离和滑动窗口技术的GLPP在线监测方案用于非线性动态过程故障检测

IF 2.1 4区化学 Q1 SOCIAL WORK

Journal of Chemometrics

Pub Date : 2025-08-14 DOI: 10.1002/cem.70058

Cheng Zhang, Lu Ren, Jing Zhang, Yuan Li

To address the issue of insufficient fault detection performance of global–local preserving projections (GLPP) in the detection of minor faults within nonlinear dynamic processes, a novel fault detection method based on GLPP and Kantorovich distance combined with a sliding window (GLPP-KD) is proposed. Firstly, the GLPP algorithm is used to construct a weight matrix to retain the key information of the data, and the objective function containing local and global information is transformed into a generalized eigenvector problem to obtain a projection matrix. Additionally, the sliding window technique integrated with the Kantorovich distance is employed to quantify the discrepancies between probability distributions, thereby capturing the local dynamic characteristics of the data. Eventually, the fault detection task is achieved by identifying the minor distinctions between normal and faulty states. Experimental results show that compared with traditional methods, GLPP-KD improves the fault detection accuracy and effectively reduces the false alarm rate. The proposed method provides a strong guarantee for the safe and stable operation of the industry and has high application value.

为了解决全局局部保持投影（GLPP）在非线性动态过程中检测小故障时故障检测性能不足的问题，提出了一种基于全局局部保持投影和Kantorovich距离结合滑动窗口的故障检测方法（GLPP- kd）。首先，利用GLPP算法构造权重矩阵以保留数据的关键信息，并将包含局部和全局信息的目标函数转化为广义特征向量问题，得到投影矩阵；此外，采用结合Kantorovich距离的滑动窗口技术来量化概率分布之间的差异，从而捕捉数据的局部动态特征。最终，通过识别正常状态和故障状态之间的细微差别来完成故障检测任务。实验结果表明，与传统方法相比，GLPP-KD提高了故障检测精度，有效降低了误报率。该方法为工业安全稳定运行提供了有力保障，具有较高的应用价值。

{"title":"Online Monitoring Scheme Using GLPP Through Kantorovich Distance Combined With a Sliding Window Technique for Nonlinear Dynamic Process Fault Detection","authors":"Cheng Zhang, Lu Ren, Jing Zhang, Yuan Li","doi":"10.1002/cem.70058","DOIUrl":"10.1002/cem.70058","url":null,"abstract":"<div>\u0000 \u0000 <p>To address the issue of insufficient fault detection performance of global–local preserving projections (GLPP) in the detection of minor faults within nonlinear dynamic processes, a novel fault detection method based on GLPP and Kantorovich distance combined with a sliding window (GLPP-KD) is proposed. Firstly, the GLPP algorithm is used to construct a weight matrix to retain the key information of the data, and the objective function containing local and global information is transformed into a generalized eigenvector problem to obtain a projection matrix. Additionally, the sliding window technique integrated with the Kantorovich distance is employed to quantify the discrepancies between probability distributions, thereby capturing the local dynamic characteristics of the data. Eventually, the fault detection task is achieved by identifying the minor distinctions between normal and faulty states. Experimental results show that compared with traditional methods, GLPP-KD improves the fault detection accuracy and effectively reduces the false alarm rate. The proposed method provides a strong guarantee for the safe and stable operation of the industry and has high application value.</p>\u0000 </div>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"39 9","pages":""},"PeriodicalIF":2.1,"publicationDate":"2025-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144832986","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0