首页 > 最新文献

Journal of Chemometrics最新文献

英文 中文
Special Issue for Celebrating Prof. Ruqin Yu's 90th Birthday 庆祝余汝琴教授九十大寿特刊
IF 2.1 4区 化学 Q1 SOCIAL WORK Pub Date : 2025-10-28 DOI: 10.1002/cem.70083
Hai-Long Wu, Zeng-Ping Chen, Tong Wang
<p>Prof. Ruqin Yu, born in November 1935, is a distinguished chemometrician and an academician of the Chinese Academy of Sciences. After graduating from the Department of Chemistry, St. Petersburg University, he pursued further research under Academician Shuquan Liang at the Institute of Chemistry, Chinese Academy of Sciences. Prof. Yu has made pioneering contributions to chemometrics, introducing morphological approaches and chaos concepts into algorithm design, and laying the theoretical foundation for robust chemometric multi-way resolution methods. Over his prolific career, he has published more than 1200 papers and received three National Natural Science Awards. He was awarded the Chemometrics Lifetime Achievement Prize in 2015. Besides, the 2016 Prize was awarded to another Chinese chemometrician Yizeng Liang, a former Prof. Yu's student who defended his PhD thesis under Yu's supervision in 1988. As a long-standing editor of the Journal of Chemometrics, Prof. Yu has played a vital role in shaping the development of the field worldwide, particularly fostering the growth of chemometrics in China. This special issue is dedicated to celebrating Prof. Yu's 90th birthday, paying tribute to his lifelong achievements and enduring influence on chemometrics and its community. The collected works not only highlight recent advances in chemometric theory and applications but also reflect the vibrancy and diversity of research in China, much of which has been inspired by Prof. Yu's vision and guidance.</p><p>The contributions in this issue span a wide range of chemometric methodologies, including spectral analysis, chromatographic data processing, data preprocessing and variable selection, and machine learning and deep learning approaches. The first element to emerge from papers within this special issue was the inclusion of two comprehensive reviews. One review focuses on the application of near-infrared spectroscopy combined with chemometric methods to explore water structures in chemical and biological systems, illustrating how chemometrics enables the resolution of subtle spectral features and reveals molecular interactions [<span>1</span>]. The other review provides a panoramic overview of process analysis chemistry based on modern spectroscopies such as infrared, Raman, and LIBS, and summarizes methodologies including preprocessing, feature selection, modeling, and optimization for process monitoring and control [<span>2</span>]. Together, these reviews demonstrate the indispensable role of chemometrics in both fundamental structural studies and practical process analysis.</p><p>The second element relates to the analysis of complex chromatographic and metabolomics data. Chemometric strategies for hyphenated data remain at the forefront of research, with multivariate curve resolution and multi-way calibration continuing to be recognized as core approaches. One study proposes a practical framework that integrates two-way and three-way methods to reso
总的来说,这些贡献表明深度学习如何成为化学计量学工具包的重要组成部分,使模式识别,预测建模和分子设计取得进展。总之,本期特刊汇集了九篇贡献,它们共同代表了化学计量学的连续性和创新性。他们展示了多元校准和曲线分辨率等经典基础如何继续发展,而包括深度学习和生成建模在内的新方法扩展了该领域的边界。我们衷心感谢所有作者和审稿人的宝贵努力,感谢编辑部的专业支持。最重要的是,我们热烈祝贺余汝勤教授90岁寿辰,并希望这期纪念刊将激发中国和世界化学计量学的进一步发展。作者声明无利益冲突。
{"title":"Special Issue for Celebrating Prof. Ruqin Yu's 90th Birthday","authors":"Hai-Long Wu,&nbsp;Zeng-Ping Chen,&nbsp;Tong Wang","doi":"10.1002/cem.70083","DOIUrl":"https://doi.org/10.1002/cem.70083","url":null,"abstract":"&lt;p&gt;Prof. Ruqin Yu, born in November 1935, is a distinguished chemometrician and an academician of the Chinese Academy of Sciences. After graduating from the Department of Chemistry, St. Petersburg University, he pursued further research under Academician Shuquan Liang at the Institute of Chemistry, Chinese Academy of Sciences. Prof. Yu has made pioneering contributions to chemometrics, introducing morphological approaches and chaos concepts into algorithm design, and laying the theoretical foundation for robust chemometric multi-way resolution methods. Over his prolific career, he has published more than 1200 papers and received three National Natural Science Awards. He was awarded the Chemometrics Lifetime Achievement Prize in 2015. Besides, the 2016 Prize was awarded to another Chinese chemometrician Yizeng Liang, a former Prof. Yu's student who defended his PhD thesis under Yu's supervision in 1988. As a long-standing editor of the Journal of Chemometrics, Prof. Yu has played a vital role in shaping the development of the field worldwide, particularly fostering the growth of chemometrics in China. This special issue is dedicated to celebrating Prof. Yu's 90th birthday, paying tribute to his lifelong achievements and enduring influence on chemometrics and its community. The collected works not only highlight recent advances in chemometric theory and applications but also reflect the vibrancy and diversity of research in China, much of which has been inspired by Prof. Yu's vision and guidance.&lt;/p&gt;&lt;p&gt;The contributions in this issue span a wide range of chemometric methodologies, including spectral analysis, chromatographic data processing, data preprocessing and variable selection, and machine learning and deep learning approaches. The first element to emerge from papers within this special issue was the inclusion of two comprehensive reviews. One review focuses on the application of near-infrared spectroscopy combined with chemometric methods to explore water structures in chemical and biological systems, illustrating how chemometrics enables the resolution of subtle spectral features and reveals molecular interactions [&lt;span&gt;1&lt;/span&gt;]. The other review provides a panoramic overview of process analysis chemistry based on modern spectroscopies such as infrared, Raman, and LIBS, and summarizes methodologies including preprocessing, feature selection, modeling, and optimization for process monitoring and control [&lt;span&gt;2&lt;/span&gt;]. Together, these reviews demonstrate the indispensable role of chemometrics in both fundamental structural studies and practical process analysis.&lt;/p&gt;&lt;p&gt;The second element relates to the analysis of complex chromatographic and metabolomics data. Chemometric strategies for hyphenated data remain at the forefront of research, with multivariate curve resolution and multi-way calibration continuing to be recognized as core approaches. One study proposes a practical framework that integrates two-way and three-way methods to reso","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"39 11","pages":""},"PeriodicalIF":2.1,"publicationDate":"2025-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://analyticalsciencejournals.onlinelibrary.wiley.com/doi/epdf/10.1002/cem.70083","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145406560","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Breast Cancer Prognosis Prediction Model Based on Cross-Modal Contrastive Learning 基于跨模态对比学习的乳腺癌预后预测模型
IF 2.1 4区 化学 Q1 SOCIAL WORK Pub Date : 2025-10-26 DOI: 10.1002/cem.70082
Fan Zhang, Sheng Chang, Binjie Wang, Xinhong Zhang

Breast cancer is a common malignant tumor that poses a serious threat to women's health. The incidence and mortality rates of breast cancer have shown an increasing trend worldwide in recent years; therefore, an accurate assessment of breast cancer prognosis is crucial for the development of individualized treatment plans and for the improvement of survival quality of patients. The traditional prognosis assessment of breast cancer mainly depended on doctors' clinical experience and multidisciplinary comprehensive judgment, which lacks unified objective evaluation criteria. This study proposes an innovative cross-modal contrastive learning model PreGAT based on graph neural networks and attention mechanism. The proposed model can efficiently integrate features from multiple sources of patient data, including clinical features and constructed graph structure features, and significantly improve the performance of the model through a novel contrastive learning loss function. The PreGAT model achieves excellent performance on the public METABRIC dataset with an average accuracy of 92.9% and an AUC value of 0.969. This research provides a promising technique for breast cancer prognosis prediction in clinical practice, which can provide more accurate and reliable decision support for the development of precise treatment programs.

乳腺癌是一种常见的恶性肿瘤,对妇女的健康构成严重威胁。近年来,乳腺癌的发病率和死亡率在世界范围内呈上升趋势;因此,准确评估乳腺癌预后对于制定个体化治疗方案和提高患者生存质量至关重要。传统的乳腺癌预后评估主要依靠医生的临床经验和多学科综合判断,缺乏统一的客观评价标准。本文提出了一种基于图神经网络和注意机制的跨模态对比学习模型PreGAT。该模型可以有效地整合多源患者数据的特征,包括临床特征和构建的图结构特征,并通过一种新的对比学习损失函数显著提高模型的性能。PreGAT模型在公共METABRIC数据集上取得了优异的性能,平均准确率为92.9%,AUC值为0.969。本研究在临床实践中为乳腺癌预后预测提供了一种有前景的技术,可为制定精准治疗方案提供更加准确可靠的决策支持。
{"title":"A Breast Cancer Prognosis Prediction Model Based on Cross-Modal Contrastive Learning","authors":"Fan Zhang,&nbsp;Sheng Chang,&nbsp;Binjie Wang,&nbsp;Xinhong Zhang","doi":"10.1002/cem.70082","DOIUrl":"https://doi.org/10.1002/cem.70082","url":null,"abstract":"<div>\u0000 \u0000 <p>Breast cancer is a common malignant tumor that poses a serious threat to women's health. The incidence and mortality rates of breast cancer have shown an increasing trend worldwide in recent years; therefore, an accurate assessment of breast cancer prognosis is crucial for the development of individualized treatment plans and for the improvement of survival quality of patients. The traditional prognosis assessment of breast cancer mainly depended on doctors' clinical experience and multidisciplinary comprehensive judgment, which lacks unified objective evaluation criteria. This study proposes an innovative cross-modal contrastive learning model PreGAT based on graph neural networks and attention mechanism. The proposed model can efficiently integrate features from multiple sources of patient data, including clinical features and constructed graph structure features, and significantly improve the performance of the model through a novel contrastive learning loss function. The PreGAT model achieves excellent performance on the public METABRIC dataset with an average accuracy of 92.9% and an AUC value of 0.969. This research provides a promising technique for breast cancer prognosis prediction in clinical practice, which can provide more accurate and reliable decision support for the development of precise treatment programs.</p>\u0000 </div>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"39 11","pages":""},"PeriodicalIF":2.1,"publicationDate":"2025-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145406668","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Ginseng Seed Variety Identification Based on Hyperspectral Imaging Technology and Transfer Learning 基于高光谱成像技术和迁移学习的人参种子品种鉴定
IF 2.1 4区 化学 Q1 SOCIAL WORK Pub Date : 2025-10-24 DOI: 10.1002/cem.70080
Zhan Shu, Xiong Li, Yande Liu

This study proposes a rapid and non-destructive method for ginseng seed classification at the individual seed level using hyperspectral imaging combined with transfer learning. A total of 900 seeds from three ginseng varieties were used, with 300 samples per variety. Spectral features were extracted from circular regions of interest, and classification models including PLS-DA, LSSVM, and random forest were built using preprocessed spectral data, achieving accuracies of 92%, 92%, and 93%, respectively. Principal component images (PC1–PC3) were fused into RGB format and fed into pre-trained deep learning models (AlexNet, GoogLeNet, Inception-V3, ResNet-50, SqueezeNet). Among them, the improved AlexNet model achieved the highest accuracy of 97%. The results demonstrate that transfer learning models integrating spectral and image features outperform traditional spectral models, offering an effective solution for precise classification and quality control of ginseng seeds.

本研究提出了一种结合迁移学习的高光谱成像快速无损的人参种子分类方法。总共使用了来自三个人参品种的900个种子,每个品种300个样本。从感兴趣的圆形区域提取光谱特征,利用预处理后的光谱数据构建PLS-DA、LSSVM和随机森林分类模型,准确率分别达到92%、92%和93%。将主成分图像(PC1-PC3)融合成RGB格式,并输入预训练的深度学习模型(AlexNet、GoogLeNet、Inception-V3、ResNet-50、SqueezeNet)。其中,改进的AlexNet模型准确率最高,达到97%。结果表明,结合光谱和图像特征的迁移学习模型优于传统的光谱模型,为人参种子的精确分类和质量控制提供了有效的解决方案。
{"title":"Ginseng Seed Variety Identification Based on Hyperspectral Imaging Technology and Transfer Learning","authors":"Zhan Shu,&nbsp;Xiong Li,&nbsp;Yande Liu","doi":"10.1002/cem.70080","DOIUrl":"https://doi.org/10.1002/cem.70080","url":null,"abstract":"<div>\u0000 \u0000 <p>This study proposes a rapid and non-destructive method for ginseng seed classification at the individual seed level using hyperspectral imaging combined with transfer learning. A total of 900 seeds from three ginseng varieties were used, with 300 samples per variety. Spectral features were extracted from circular regions of interest, and classification models including PLS-DA, LSSVM, and random forest were built using preprocessed spectral data, achieving accuracies of 92%, 92%, and 93%, respectively. Principal component images (PC1–PC3) were fused into RGB format and fed into pre-trained deep learning models (AlexNet, GoogLeNet, Inception-V3, ResNet-50, SqueezeNet). Among them, the improved AlexNet model achieved the highest accuracy of 97%. The results demonstrate that transfer learning models integrating spectral and image features outperform traditional spectral models, offering an effective solution for precise classification and quality control of ginseng seeds.</p>\u0000 </div>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"39 11","pages":""},"PeriodicalIF":2.1,"publicationDate":"2025-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145366597","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Data Fusion: Integrating Heterogeneous Information Sources in the Chemical Processing Industry 数据融合:化工加工行业异构信息源的集成
IF 2.1 4区 化学 Q1 SOCIAL WORK Pub Date : 2025-10-23 DOI: 10.1002/cem.70075
Eugeniu Strelet, Ivan Castillo, You Peng, Marco S. Reis

Industrial data have always played a pivotal role in the development, management, and optimization of processes and products. Over time, the intrinsic characteristics of data (modality, acquisition frequency, granularity, uncertainty, etc.) have evolved significantly and so have the computational methods and technology to process and integrate them, transforming these inputs into valuable insights. By integrating multiple heterogeneous data sources, data fusion provides an updated and improved status of the operations, enhancing information quality and leading to more comprehensive, accurate, and insightful analysis of the systems under examination and, therefore, better decision-making. In this work, we present a review of the data fusion methodologies from the perspective of practitioners, including engineers and data analysts, and how it can be systematically organized into classes of methods dedicated to achieving specific goals. The terminology adopted in the field, not always used consistently and without ambiguity, is also discussed and clarified. Additionally, several case studies are provided to showcase some of the applications and potential advantages of adopting data fusion frameworks.

工业数据一直在流程和产品的开发、管理和优化中发挥着关键作用。随着时间的推移,数据的内在特征(模态、获取频率、粒度、不确定性等)发生了显著变化,处理和整合数据的计算方法和技术也发生了显著变化,将这些输入转化为有价值的见解。通过集成多个异构数据源,数据融合提供了更新和改进的操作状态,提高了信息质量,并导致对所检查系统进行更全面、更准确和更有洞察力的分析,从而更好地做出决策。在这项工作中,我们从实践者(包括工程师和数据分析师)的角度对数据融合方法进行了回顾,以及如何将其系统地组织成致力于实现特定目标的方法类。还讨论和澄清了在该领域采用的术语,这些术语并不总是一致地使用,没有歧义。此外,还提供了几个案例研究,以展示采用数据融合框架的一些应用程序和潜在优势。
{"title":"Data Fusion: Integrating Heterogeneous Information Sources in the Chemical Processing Industry","authors":"Eugeniu Strelet,&nbsp;Ivan Castillo,&nbsp;You Peng,&nbsp;Marco S. Reis","doi":"10.1002/cem.70075","DOIUrl":"https://doi.org/10.1002/cem.70075","url":null,"abstract":"<p>Industrial data have always played a pivotal role in the development, management, and optimization of processes and products. Over time, the intrinsic characteristics of data (modality, acquisition frequency, granularity, uncertainty, etc.) have evolved significantly and so have the computational methods and technology to process and integrate them, transforming these inputs into valuable insights. By integrating multiple heterogeneous data sources, data fusion provides an updated and improved status of the operations, enhancing information quality and leading to more comprehensive, accurate, and insightful analysis of the systems under examination and, therefore, better decision-making. In this work, we present a review of the data fusion methodologies from the perspective of practitioners, including engineers and data analysts, and how it can be systematically organized into classes of methods dedicated to achieving specific goals. The terminology adopted in the field, not always used consistently and without ambiguity, is also discussed and clarified. Additionally, several case studies are provided to showcase some of the applications and potential advantages of adopting data fusion frameworks.</p>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"39 11","pages":""},"PeriodicalIF":2.1,"publicationDate":"2025-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://analyticalsciencejournals.onlinelibrary.wiley.com/doi/epdf/10.1002/cem.70075","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145366847","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fluorescence Excitation-Emission Matrices Decomposition Using Statistical Correction for Primary Inner Filtering 基于统计校正的一次内滤波荧光激发发射矩阵分解
IF 2.1 4区 化学 Q1 SOCIAL WORK Pub Date : 2025-10-22 DOI: 10.1002/cem.70077
Nikolay Maslov, Natalya Gubanova

Multi-excitation fluorescence spectroscopy allows characterizing the distribution of fluorophores in samples of various natures and is a powerful tool for chemical, biological, and medical diagnostics. However, interpreting large sets of excitation-dependent fluorescence spectra remains a significant challenge, especially when the fluorophores are initially unknown. In optically thin samples, spectral decomposition can be performed using several statistical methods. In practice, however, various light propagation effects cause deviations from the pure component spectra and prevent the direct application of these methods. A sample-specific inner filtering correction is required prior to their use. Unfortunately, performing such a correction is not always feasible due to complex light propagation geometry or the difficulty of obtaining additional measurements. Here, we suggest an indirect procedure for optically thick samples with negligible fluorescence reabsorption that does not rely on any assumptions about the light illumination and collection geometry, by iteratively determining appropriate correction coefficients for excitation variations. Due to the correlated nature of fluorescence spectra in excitation-emission matrices, this approach enables decomposition when other methods fail. Model experiments with simulated spectral data and real fluorophores showed good agreement between the calculated spectra and the actual fluorescence spectra of the corresponding substances.

多激发荧光光谱可以表征各种性质样品中荧光团的分布,是化学、生物和医学诊断的有力工具。然而,解释大量的激发依赖的荧光光谱仍然是一个重大的挑战,特别是当荧光团最初是未知的。在光学薄样品中,光谱分解可以使用几种统计方法进行。然而,在实际应用中,各种光的传播效应导致与纯组分光谱的偏差,阻碍了这些方法的直接应用。在使用它们之前,需要对样品进行特定的内部过滤校正。不幸的是,由于复杂的光传播几何形状或难以获得额外的测量,执行这样的校正并不总是可行的。在这里,我们建议一个间接的程序,光学厚样品可忽略荧光重吸收,不依赖于任何假设的光照明和收集几何,通过迭代确定适当的校正系数的激发变化。由于荧光光谱在激发-发射矩阵中的相关性,当其他方法失败时,这种方法可以进行分解。用模拟光谱数据和真实荧光团进行的模型实验表明,计算出的光谱与对应物质的实际荧光光谱吻合较好。
{"title":"Fluorescence Excitation-Emission Matrices Decomposition Using Statistical Correction for Primary Inner Filtering","authors":"Nikolay Maslov,&nbsp;Natalya Gubanova","doi":"10.1002/cem.70077","DOIUrl":"https://doi.org/10.1002/cem.70077","url":null,"abstract":"<div>\u0000 \u0000 <p>Multi-excitation fluorescence spectroscopy allows characterizing the distribution of fluorophores in samples of various natures and is a powerful tool for chemical, biological, and medical diagnostics. However, interpreting large sets of excitation-dependent fluorescence spectra remains a significant challenge, especially when the fluorophores are initially unknown. In optically thin samples, spectral decomposition can be performed using several statistical methods. In practice, however, various light propagation effects cause deviations from the pure component spectra and prevent the direct application of these methods. A sample-specific inner filtering correction is required prior to their use. Unfortunately, performing such a correction is not always feasible due to complex light propagation geometry or the difficulty of obtaining additional measurements. Here, we suggest an indirect procedure for optically thick samples with negligible fluorescence reabsorption that does not rely on any assumptions about the light illumination and collection geometry, by iteratively determining appropriate correction coefficients for excitation variations. Due to the correlated nature of fluorescence spectra in excitation-emission matrices, this approach enables decomposition when other methods fail. Model experiments with simulated spectral data and real fluorophores showed good agreement between the calculated spectra and the actual fluorescence spectra of the corresponding substances.</p>\u0000 </div>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"39 11","pages":""},"PeriodicalIF":2.1,"publicationDate":"2025-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145366575","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multiclass Partial Least Squares Discriminant Analysis: Taking New Ways—A Perspective 多类偏最小二乘判别分析:新方法——一个视角
IF 2.1 4区 化学 Q1 SOCIAL WORK Pub Date : 2025-10-20 DOI: 10.1002/cem.70078
O. Ye. Rodionova, A. L. Pomerantsev

An overview of various prospective PLS-DA strategies is presented. PLS-DA is considered not only as a discrimination method per se but also as a method for extracting low-dimensional features suitable for discriminating high-dimensional data, which makes it similar to PCA. Unlike PCA, PLS-DA extracts two sets of features: the matrix of predicted dummy responses (Y) and the matrix of PLS scores (T). Both sets are then used as input to hard (deterministic) and soft (probabilistic) discriminators, providing a wide selection of combinations. Moreover, the features can be enriched by using additional information provided by the PLS model. A total of 18 different strategies are considered, including 8 based on Y-features and 10 based on T-features. Two data sets are used to illustrate the theory, yielding the following results. Both the T- and the Y-based approaches are comparable; additional information is ineffective. The cons and pros of each strategy are discussed.

概述了各种前瞻性PLS-DA策略。PLS-DA不仅本身是一种判别方法,而且是一种提取适合于判别高维数据的低维特征的方法,这与PCA相似。与PCA不同,PLS- da提取两组特征:预测虚拟响应矩阵(Y)和PLS评分矩阵(T)。然后,这两个集合被用作硬(确定性)和软(概率)鉴别器的输入,提供广泛的组合选择。此外,可以通过使用PLS模型提供的附加信息来丰富这些特征。总共考虑了18种不同的策略,其中8种基于y特征,10种基于t特征。两个数据集被用来说明这个理论,得到以下结果。基于T的方法和基于T的方法都具有可比性;附加信息是无效的。讨论了每种策略的优缺点。
{"title":"Multiclass Partial Least Squares Discriminant Analysis: Taking New Ways—A Perspective","authors":"O. Ye. Rodionova,&nbsp;A. L. Pomerantsev","doi":"10.1002/cem.70078","DOIUrl":"https://doi.org/10.1002/cem.70078","url":null,"abstract":"<div>\u0000 \u0000 <p>An overview of various prospective PLS-DA strategies is presented. PLS-DA is considered not only as a discrimination method per se but also as a method for extracting low-dimensional features suitable for discriminating high-dimensional data, which makes it similar to PCA. Unlike PCA, PLS-DA extracts two sets of features: the matrix of predicted dummy responses (<b>Y</b>) and the matrix of PLS scores (<b>T</b>). Both sets are then used as input to hard (deterministic) and soft (probabilistic) discriminators, providing a wide selection of combinations. Moreover, the features can be enriched by using additional information provided by the PLS model. A total of 18 different strategies are considered, including 8 based on <b>Y</b>-features and 10 based on <b>T</b>-features. Two data sets are used to illustrate the theory, yielding the following results. Both the <b>T</b>- and the <b>Y</b>-based approaches are comparable; additional information is ineffective. The cons and pros of each strategy are discussed.</p>\u0000 </div>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"39 11","pages":""},"PeriodicalIF":2.1,"publicationDate":"2025-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145327813","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Outlier Detection Using Immersive Analytics With Virtual Reality 使用沉浸式分析与虚拟现实的异常值检测
IF 2.1 4区 化学 Q1 SOCIAL WORK Pub Date : 2025-10-20 DOI: 10.1002/cem.70079
Hyrum J. Redd, Jordan M. J. Peper, John H. Kalivas

This paper honors Paul Geladi with a new chemometric data visualization approach based on immersive analytics principles using virtual reality (VR). With the many technological advancements improving the accessibility of extended reality (XR), including VR, the frontier of immersive analytics is wide open for utilizing VR as a powerful chemometric data analysis tool. Immersion of human senses into a virtually generated three-dimensional (3D) world compels us to act more instinctively in complex data decision-making scenarios by using our inherent cognitive pattern recognition capacity learned over years of experience. Proposed is an immersive analytics application of VR for a hybrid human/computer-aided outlier detection process. In this application, each training set sample is virtually realized as a glyph to visually assess the inter- and intra-sample relationships present in datasets while mining for nonrepresentative samples. Sample glyph shape and size are developed using hundreds of sample similarity measurements for each sample based both on spectral and prediction property information. These sample glyphs are visually and spatially compared with one another in VR by the user. Outlier checking in VR safeguards against masking and swamping problems that are difficult to recognize with automatic algorithms. Results from dataset situations based on near infrared (NIR) and ultraviolet (UV) spectra and analyte reference values show the viability of using VR for data analysis and outlier detection. This VR application demonstrates the looming evolution of immersive analytics with an XR interface involving human reasoning in difficult chemometric data analysis settings.

本文以一种基于虚拟现实(VR)沉浸式分析原理的新的化学计量数据可视化方法来纪念Paul Geladi。随着包括VR在内的许多技术进步提高了扩展现实(XR)的可访问性,沉浸式分析的前沿为利用VR作为强大的化学计量数据分析工具敞开了大门。人类感官沉浸在虚拟生成的三维(3D)世界中,迫使我们在复杂的数据决策场景中更本能地行动,利用我们多年经验中习得的固有认知模式识别能力。提出了一种沉浸式虚拟现实分析应用程序,用于混合人/计算机辅助离群值检测过程。在这个应用程序中,每个训练集样本实际上被实现为一个符号,以便在挖掘非代表性样本时直观地评估数据集中存在的样本间和样本内关系。基于光谱和预测属性信息,对每个样本使用数百个样本相似性测量来开发样本字形形状和大小。这些样例符号在视觉上和空间上由用户在VR中相互比较。VR中的异常值检查可以防止自动算法难以识别的掩盖和淹没问题。基于近红外(NIR)和紫外(UV)光谱以及分析物参考值的数据集情况的结果表明,将VR用于数据分析和异常值检测是可行的。这个虚拟现实应用程序展示了沉浸式分析迫在眉睫的发展,其中包括在困难的化学计量数据分析设置中涉及人类推理的XR界面。
{"title":"Outlier Detection Using Immersive Analytics With Virtual Reality","authors":"Hyrum J. Redd,&nbsp;Jordan M. J. Peper,&nbsp;John H. Kalivas","doi":"10.1002/cem.70079","DOIUrl":"https://doi.org/10.1002/cem.70079","url":null,"abstract":"<div>\u0000 \u0000 <p>This paper honors Paul Geladi with a new chemometric data visualization approach based on immersive analytics principles using virtual reality (VR). With the many technological advancements improving the accessibility of extended reality (XR), including VR, the frontier of immersive analytics is wide open for utilizing VR as a powerful chemometric data analysis tool. Immersion of human senses into a virtually generated three-dimensional (3D) world compels us to act more instinctively in complex data decision-making scenarios by using our inherent cognitive pattern recognition capacity learned over years of experience. Proposed is an immersive analytics application of VR for a hybrid human/computer-aided outlier detection process. In this application, each training set sample is virtually realized as a glyph to visually assess the inter- and intra-sample relationships present in datasets while mining for nonrepresentative samples. Sample glyph shape and size are developed using hundreds of sample similarity measurements for each sample based both on spectral and prediction property information. These sample glyphs are visually and spatially compared with one another in VR by the user. Outlier checking in VR safeguards against masking and swamping problems that are difficult to recognize with automatic algorithms. Results from dataset situations based on near infrared (NIR) and ultraviolet (UV) spectra and analyte reference values show the viability of using VR for data analysis and outlier detection. This VR application demonstrates the looming evolution of immersive analytics with an XR interface involving human reasoning in difficult chemometric data analysis settings.</p>\u0000 </div>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"39 11","pages":""},"PeriodicalIF":2.1,"publicationDate":"2025-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145327814","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Immersive Analytics in Critical Spatial Domains: From Materials to Energy Systems 沉浸式分析在关键空间领域:从材料到能源系统
IF 2.1 4区 化学 Q1 SOCIAL WORK Pub Date : 2025-10-14 DOI: 10.1002/cem.70076
Rajiv Khadka

Immersive analytics (IA) leverages virtual reality, augmented reality, and mixed reality to transform how users interact with complex datasets across domains such as science, industry, and education. These immersive technologies offer spatial and multimodal environments that foster intuitive exploration, but they also introduce challenges related to cognitive load, interface design, and system performance. This article presents a comprehensive review of visualization techniques, interaction models, and multimodal inputs utilized in IA. Drawing on case studies in scientific visualization, industrial training, and educational communication, we examine both the potential and limitations of current systems. Finally, we propose future research directions, focusing on real-time collaboration, adaptive user interfaces, and scalable data exploration strategies to advance the field.

沉浸式分析(IA)利用虚拟现实、增强现实和混合现实来改变用户与科学、工业和教育等领域复杂数据集的交互方式。这些沉浸式技术提供了空间和多模式环境,促进了直观的探索,但它们也引入了与认知负荷、界面设计和系统性能相关的挑战。本文全面回顾了可视化技术、交互模型和IA中使用的多模态输入。通过科学可视化、工业培训和教育交流方面的案例研究,我们考察了当前系统的潜力和局限性。最后,我们提出了未来的研究方向,重点是实时协作,自适应用户界面和可扩展的数据探索策略,以推进该领域的发展。
{"title":"Immersive Analytics in Critical Spatial Domains: From Materials to Energy Systems","authors":"Rajiv Khadka","doi":"10.1002/cem.70076","DOIUrl":"https://doi.org/10.1002/cem.70076","url":null,"abstract":"<div>\u0000 \u0000 <p>Immersive analytics (IA) leverages virtual reality, augmented reality, and mixed reality to transform how users interact with complex datasets across domains such as science, industry, and education. These immersive technologies offer spatial and multimodal environments that foster intuitive exploration, but they also introduce challenges related to cognitive load, interface design, and system performance. This article presents a comprehensive review of visualization techniques, interaction models, and multimodal inputs utilized in IA. Drawing on case studies in scientific visualization, industrial training, and educational communication, we examine both the potential and limitations of current systems. Finally, we propose future research directions, focusing on real-time collaboration, adaptive user interfaces, and scalable data exploration strategies to advance the field.</p>\u0000 </div>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"39 10","pages":""},"PeriodicalIF":2.1,"publicationDate":"2025-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145316984","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparison Between Portable and Bench-Top Near-Infrared Spectroscopy for Corn Silage Characterization Using Partial Least Square and Support Vector Regression Methods 基于偏最小二乘法和支持向量回归的便携式和台式近红外光谱玉米青贮特征分析比较
IF 2.1 4区 化学 Q1 SOCIAL WORK Pub Date : 2025-10-08 DOI: 10.1002/cem.70073
Jefferson Tales Oliva, Vinicius Herique Kieling, Felipe Augusto Bueno Rossi, Erick Oliveira Rodrigues, Giovanni Alfredo Guarneri, Larissa Macedo dos Santos Tonial

In this work, bench-top and portable near-infrared (NIR) spectroscopy equipment are compared in the context of generating predictive models for the quantification of phosphorus (P), potassium (K), and nitrogen (N) components from corn silage samples. For this, 200 spectral samples resulting from bench-top and portable NIR are preprocessed by the following sequence of approaches: mean centering application for removing the spectral bias/offset, Savitzky–Golay filter for highlighting signal energy absorption in relation to spectral noise, interval partial least square (iPLS) for selection of spectral region, and Monte Carlo method for outlier detection and removal. Then, from the preprocessed spectra, predictive models were built using the partial least squares (PLS) and support vector regression (SVR) methods for each chemical component and NIR equipment. In this sense, six models are generated, three for each NIR spectroscopy (or two for each element). As a result, considering all components and machine learning (ML) methods, bench-top models achieved R2 values between 0.66 (quantification of P using PLS or SVR) and 0.81 (prediction of K and N using SVR regressors) during the validation, whereas portable ones achieved values between 0.50 (prediction of K using SVR) and 0.67 (quantification of N using PLS). Our results can be considered competitive, as robust and accurate predictors are also generated.

在这项工作中,比较了台式和便携式近红外(NIR)光谱设备在玉米青贮样品中磷(P)、钾(K)和氮(N)成分定量预测模型的生成情况。为此,我们对200个来自台式和便携式近红外的光谱样本进行了预处理,采用了以下一系列方法:均值居中用于去除光谱偏置/偏置,Savitzky-Golay滤波用于突出与光谱噪声相关的信号能量吸收,区间偏最小二乘法(iPLS)用于选择光谱区域,蒙特卡罗方法用于异常值检测和去除。然后,根据预处理后的光谱,利用偏最小二乘(PLS)和支持向量回归(SVR)方法对各化学成分和近红外设备建立预测模型。在这个意义上,产生了六个模型,每个近红外光谱三个(或每个元素两个)。因此,考虑到所有组件和机器学习(ML)方法,在验证期间,台式模型的R2值在0.66(使用PLS或SVR量化P)和0.81(使用SVR回归量预测K和N)之间,而便携式模型的R2值在0.50(使用SVR预测K)和0.67(使用PLS量化N)之间。我们的结果可以被认为是有竞争力的,因为也产生了稳健和准确的预测因子。
{"title":"Comparison Between Portable and Bench-Top Near-Infrared Spectroscopy for Corn Silage Characterization Using Partial Least Square and Support Vector Regression Methods","authors":"Jefferson Tales Oliva,&nbsp;Vinicius Herique Kieling,&nbsp;Felipe Augusto Bueno Rossi,&nbsp;Erick Oliveira Rodrigues,&nbsp;Giovanni Alfredo Guarneri,&nbsp;Larissa Macedo dos Santos Tonial","doi":"10.1002/cem.70073","DOIUrl":"https://doi.org/10.1002/cem.70073","url":null,"abstract":"<p>In this work, bench-top and portable near-infrared (NIR) spectroscopy equipment are compared in the context of generating predictive models for the quantification of phosphorus (P), potassium (K), and nitrogen (N) components from corn silage samples. For this, 200 spectral samples resulting from bench-top and portable NIR are preprocessed by the following sequence of approaches: mean centering application for removing the spectral bias/offset, Savitzky–Golay filter for highlighting signal energy absorption in relation to spectral noise, interval partial least square (iPLS) for selection of spectral region, and Monte Carlo method for outlier detection and removal. Then, from the preprocessed spectra, predictive models were built using the partial least squares (PLS) and support vector regression (SVR) methods for each chemical component and NIR equipment. In this sense, six models are generated, three for each NIR spectroscopy (or two for each element). As a result, considering all components and machine learning (ML) methods, bench-top models achieved <i>R</i><sup>2</sup> values between 0.66 (quantification of P using PLS or SVR) and 0.81 (prediction of K and N using SVR regressors) during the validation, whereas portable ones achieved values between 0.50 (prediction of K using SVR) and 0.67 (quantification of N using PLS). Our results can be considered competitive, as robust and accurate predictors are also generated.</p>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"39 10","pages":""},"PeriodicalIF":2.1,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://analyticalsciencejournals.onlinelibrary.wiley.com/doi/epdf/10.1002/cem.70073","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145243059","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Expanding Multivariate Analysis Principles in Conventional Chemometric Processes 扩展传统化学计量过程中的多元分析原理
IF 2.1 4区 化学 Q1 SOCIAL WORK Pub Date : 2025-09-14 DOI: 10.1002/cem.70067
John H. Kalivas, Robert C. Spiers

Chemometrics encompasses numerous facets such as experimental design, data collection and analysis, and many others. This paper, in honor of Paul Geladi, provides our perspective on growing the scientific intuition of multivariate analysis in conventional chemometric directions not generally practicing multivariate principles. The motivation for this perspective is to express our opinion on the need for chemometrics to expand the role of the Rashomon effect beyond “many models predict well” by integrating a more comprehensive consideration of the multivariate nature of matrix effects. Described are multiple chemometric techniques that have already been enhanced by broadening the application the Rashomon effect including model selection and explanation, figures of merit (FOM), sample similarity assessment for model reliability, outlier detection, and classification—all recent research topics from the authors. This expository discussion revolves around spectroscopic data such as near infrared and fluorescence, but the concepts are relevant to other chemometric data structures.

化学计量学包括许多方面,如实验设计,数据收集和分析,以及许多其他。为了纪念Paul Geladi,本文提供了我们在传统化学计量学方向上发展多元分析的科学直觉的观点,而不是通常实践多元原理。这一观点的动机是表达我们对化学计量学的需求,通过对矩阵效应的多变量性质进行更全面的考虑,将罗生门效应的作用扩展到“许多模型预测良好”之外。描述了多种化学计量学技术,这些技术已经通过扩大罗生门效应的应用而得到了加强,包括模型选择和解释、优点图(FOM)、模型可靠性的样本相似性评估、离群值检测和分类——所有这些都是作者最近的研究课题。这个说明性的讨论围绕光谱数据,如近红外和荧光,但概念是相关的其他化学计量数据结构。
{"title":"Expanding Multivariate Analysis Principles in Conventional Chemometric Processes","authors":"John H. Kalivas,&nbsp;Robert C. Spiers","doi":"10.1002/cem.70067","DOIUrl":"10.1002/cem.70067","url":null,"abstract":"<div>\u0000 \u0000 <p>Chemometrics encompasses numerous facets such as experimental design, data collection and analysis, and many others. This paper, in honor of Paul Geladi, provides our perspective on growing the scientific intuition of multivariate analysis in conventional chemometric directions not generally practicing multivariate principles. The motivation for this perspective is to express our opinion on the need for chemometrics to expand the role of the Rashomon effect beyond “many models predict well” by integrating a more comprehensive consideration of the multivariate nature of matrix effects. Described are multiple chemometric techniques that have already been enhanced by broadening the application the Rashomon effect including model selection and explanation, figures of merit (FOM), sample similarity assessment for model reliability, outlier detection, and classification—all recent research topics from the authors. This expository discussion revolves around spectroscopic data such as near infrared and fluorescence, but the concepts are relevant to other chemometric data structures.</p>\u0000 </div>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"39 9","pages":""},"PeriodicalIF":2.1,"publicationDate":"2025-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145057937","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Chemometrics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1