首页 > 最新文献

Journal of Chemometrics最新文献

英文 中文
Is Chemometrics the Most Important Advance in Analytical Chemistry Since the 1980s and Partial Least Squares the Jewel in the Crown? 化学计量学是自20世纪80年代以来分析化学最重要的进展,偏最小二乘法是皇冠上的宝石吗?
IF 2.1 4区 化学 Q1 SOCIAL WORK Pub Date : 2025-11-09 DOI: 10.1002/cem.70070
Richard G. Brereton

The first article in this column was published in 2014 [1], over 10 years ago. In this intermission, we look at the importance of chemometrics to analytical chemists and the central role of PLS (Partial Least Squares), which will be the subject of the next articles.

The name chemometrics (in Swedish) was first advocated in the literature in 1972 [2] but was far from the modern day emphasis on multivariate data and primarily involved univariate curve fitting. Although a few enthusiasts communicated together during the 1970s, it was not until the early 1980s that the subject we now recognise became organised.

Most of the major building blocks were developed in the 1980s, and we will look at how the subject fitted into analytical chemistry as from 1980. A good way to study the impact is via citations. We will use Web of Science (Clarivate) with a census date of 24 July 2025 to chart the progress of major publications in this area. We look at some of the more prominent relevant journals.

Analytica Chimica Acta is one of the first general analytical journals to publish a significant number of papers in chemometrics. The classic paper by Geladi and Kowalski on PLS (Partial Least Squares) [3] in 1986 is the most cited article ever in this journal since 1980 (5746 cites). The second most cited (2483 cites) is on a subject related to chemometrics (Box–Behnken designs), suggesting the dominant impact of chemometrics in this forum. What is remarkable is the longevity of this article. Most articles decline in importance after a few years. In Figure 1, we present the citations to this article as compared to all papers together in ACA since 1986. So not only is this, by a long way, the most highly cited article ever published in this journal, its importance is increasing with time over nearly a 40-year period, compared to general articles published in 1986. This is even clearer when we look at the percentage of citations to all papers in ACA published in 1986 over time, in Figure 2. By 2024, over 80% of all citations are to this one paper. This strongly suggests that PLS continues to remain a long-term advance in analytical chemistry over four decades.

In Elsevier's Chemometrics and Intelligent Laboratory Systems, which first published in 1986, the two most cited papers by a wide margin are about PCA (9016 cites) [4] and, again, PLS (7652 cites) [5]. However, as the former paper was published in 1987 and the latter in 2001, in fact, the interest in PLS has been greater than PCA.

The journal Analytical Chemistry, although it published some early reviews in chemometrics, does not prominently feature chemometrics articles. In the decade 1980–1989, the third most cited article was focused on chemometrics and involved PLS [6] suggesting the importance of PLS to chemometrics in its early days (2434 cites). The most cited article (7820 c

本专栏的第一篇文章发表于2014年,距今已有10多年。在这个间歇,我们看看化学计量学对分析化学家的重要性和PLS(偏最小二乘)的中心作用,这将是下一篇文章的主题。化学计量学(瑞典语)这个名称最早是在1972年的文献中提出的,但与现代强调的多变量数据相距甚远,主要涉及单变量曲线拟合。尽管在20世纪70年代有一些爱好者在一起交流,但直到20世纪80年代初,我们现在所认识的这个主题才有了组织。大多数主要的构建模块都是在20世纪80年代发展起来的,我们将从1980年开始研究这门学科是如何融入分析化学的。研究影响的一个好方法是通过引用。我们将使用人口普查日期为2025年7月24日的Web of Science (Clarivate)来绘制该领域主要出版物的进展情况。我们来看一些比较著名的相关期刊。《分析化学学报》是最早发表大量化学计量学论文的通用分析期刊之一。Geladi和Kowalski在1986年发表的关于偏最小二乘的经典论文是该杂志自1980年以来被引用最多的文章(5746次)。被引用次数第二多(2483次)的主题与化学计量学(Box-Behnken设计)有关,这表明化学计量学在这个论坛上的主导影响。值得注意的是这篇文章的篇幅很长。大多数文章几年后重要性就下降了。在图1中,我们将这篇文章的引用与1986年以来ACA所有论文的引用进行了比较。因此,这篇文章不仅是该杂志上发表的被引用次数最多的文章,而且与1986年发表的一般文章相比,在近40年的时间里,它的重要性随着时间的推移而增加。当我们观察1986年ACA发表的所有论文的引用百分比时,这一点就更加清晰了,如图2所示。到2024年,超过80%的引用都指向这篇论文。这强烈表明,PLS继续保持在分析化学的长期进步超过四十年。在1986年首次出版的Elsevier的chemometics and Intelligent Laboratory Systems中,被引用最多的两篇论文是关于PCA(9016次引用)[5]和PLS(7652次引用)[5]。然而,由于前一篇论文发表于1987年,后一篇发表于2001年,事实上,对PLS的兴趣大于PCA。《分析化学》(Analytical Chemistry)杂志虽然发表了一些关于化学计量学的早期评论,但并没有突出化学计量学的文章。在1980-1989年间,被引次数第三多的文章集中在化学计量学上,并涉及PLS b[6],这表明PLS在早期对化学计量学的重要性(2434篇引用)。自1980年以来,被引次数最多的文章是1996年发表的关于质谱法的文章(7820次)。仪器分析化学在质谱、微型化、蛋白质组学、成像等方面取得了许多惊人的进展,这篇文章代表了核心分析文献中影响力最高的文章。但其引用影响力小于b[4],与[5]相当,后者发表时间晚了5年。更引人注目的是,与分析化学中被引用最多的文章相比,关于PLS的文章的寿命更长,如图3所示。(2021年的暂时增长是由于大多数期刊通用的引用计数方法发生了变化)。可以看到,两篇关于PLS的论文都随着时间的推移而增加其影响,而论文b[7]与大多数文章一样,具有有效的生命周期,然后其影响随着时间的推移而降低。在最近的RSC期刊《分析方法》(Analytical Methods)上,其第一篇文章发表于2009年,被引用最多的论文是关于PCA的(2485次引用),第二篇是关于PARAFAC的文章,这两篇文章都是关于化学计量学的。《塔兰塔》(Talanta)杂志上有一篇关于响应面方法学(Response Surface Methodology)的文章,是1980年至今被引用最多的论文(4632次引用),许多人认为这是化学计量学的范畴。在该期刊中,被引用最多的文章是关于PLS判别分析b[10](2240次引用),第二篇被引用最多的论文也是关于PLS的一个变体。这表明在化学计量学文献中对PLS及其变体的主要兴趣。尽管本文主要基于对Web of Science中的引文的分析,但可以得出结论,化学计量学已经导致了分析化学的重大进步,可能是过去四十年来最广泛的进步。现代仪器可以从色谱和光谱中产生大量的多元数据,越来越多地被用作研究代谢组学、蛋白质组学、遗传科学、环境监测、食品化学等广泛问题的关键。
{"title":"Is Chemometrics the Most Important Advance in Analytical Chemistry Since the 1980s and Partial Least Squares the Jewel in the Crown?","authors":"Richard G. Brereton","doi":"10.1002/cem.70070","DOIUrl":"https://doi.org/10.1002/cem.70070","url":null,"abstract":"<p>The first article in this column was published in 2014 [<span>1</span>], over 10 years ago. In this intermission, we look at the importance of chemometrics to analytical chemists and the central role of PLS (Partial Least Squares), which will be the subject of the next articles.</p><p>The name chemometrics (in Swedish) was first advocated in the literature in 1972 [<span>2</span>] but was far from the modern day emphasis on multivariate data and primarily involved univariate curve fitting. Although a few enthusiasts communicated together during the 1970s, it was not until the early 1980s that the subject we now recognise became organised.</p><p>Most of the major building blocks were developed in the 1980s, and we will look at how the subject fitted into analytical chemistry as from 1980. A good way to study the impact is via citations. We will use Web of Science (Clarivate) with a census date of 24 July 2025 to chart the progress of major publications in this area. We look at some of the more prominent relevant journals.</p><p><i>Analytica Chimica Acta</i> is one of the first general analytical journals to publish a significant number of papers in chemometrics. The classic paper by Geladi and Kowalski on PLS (Partial Least Squares) [<span>3</span>] in 1986 is the most cited article ever in this journal since 1980 (5746 cites). The second most cited (2483 cites) is on a subject related to chemometrics (Box–Behnken designs), suggesting the dominant impact of chemometrics in this forum. What is remarkable is the longevity of this article. Most articles decline in importance after a few years. In Figure 1, we present the citations to this article as compared to all papers together in ACA since 1986. So not only is this, by a long way, the most highly cited article ever published in this journal, its importance is increasing with time over nearly a 40-year period, compared to general articles published in 1986. This is even clearer when we look at the percentage of citations to all papers in ACA published in 1986 over time, in Figure 2. By 2024, over 80% of all citations are to this one paper. This strongly suggests that PLS continues to remain a long-term advance in analytical chemistry over four decades.</p><p>In Elsevier's <i>Chemometrics and Intelligent Laboratory Systems</i>, which first published in 1986, the two most cited papers by a wide margin are about PCA (9016 cites) [<span>4</span>] and, again, PLS (7652 cites) [<span>5</span>]. However, as the former paper was published in 1987 and the latter in 2001, in fact, the interest in PLS has been greater than PCA.</p><p>The journal <i>Analytical Chemistry</i>, although it published some early reviews in chemometrics, does not prominently feature chemometrics articles. In the decade 1980–1989, the third most cited article was focused on chemometrics and involved PLS [<span>6</span>] suggesting the importance of PLS to chemometrics in its early days (2434 cites). The most cited article (7820 c","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"39 11","pages":""},"PeriodicalIF":2.1,"publicationDate":"2025-11-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://analyticalsciencejournals.onlinelibrary.wiley.com/doi/epdf/10.1002/cem.70070","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145529739","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Correction to “Transforming Hyperspectral Images Into Chemical Maps: A Novel End-to-End Deep Learning Approach” 更正“将高光谱图像转换为化学图:一种新颖的端到端深度学习方法”
IF 2.1 4区 化学 Q1 SOCIAL WORK Pub Date : 2025-10-28 DOI: 10.1002/cem.70084

O.-C. G. Engstrøm, M. Albano-Gaglio, E. S. Dreier, et al., “ Transforming Hyperspectral Images Into Chemical Maps: A Novel End-to-End Deep Learning Approach,” Journal of Chemometrics 39, no. 8 (2025): e70041, https://doi.org/10.1002/cem.70041.

In the original work, there was an apparent discrepancy between two ways of making predictions with the same PLS model. One way was predicting directly on mean spectra. The other was predicting on pixel-wise spectra and subsequently averaging the result. The discrepancy between these two ways of evaluation was solely due to the order of application of preprocessing.

In the original work, the mean spectrum was computed by taking the negative logarithm of each pixel (to transform from reflectance to pseudo absorbance), averaging all pixels, applying SNV (Standard Normal Variate) transform, and finally convolution with an SG (Savitzky–Golay) filter. However, when applying pixel-wise PLS, SNV and SG were applied to each pixel individually. As SNV is nonlinear (the standard deviation is not linear), it matters whether it is applied before or after averaging. So, this update now applies preprocessing to each pixel individually before taking the average to compute a mean spectrum. This removes any discrepancy between the results obtained by pixel-wise PLS and PLS on mean spectra. Any reference to this discrepancy has been removed from the work and all results and figures have been updated accordingly. Appendix D describes the updates in detail.

We apologize for this error.

符合。陈晓明,陈晓明,陈晓明,等,“基于深度学习的高光谱图像转换方法”,《化学计量学报》第39期。8 (2025): e70041, https://doi.org/10.1002/cem.70041。在最初的工作中,在同一PLS模型下,两种预测方法之间存在明显的差异。一种方法是直接预测平均光谱。另一种方法是对逐像素光谱进行预测,然后对结果进行平均。这两种评价方法之间的差异仅仅是由于预处理的应用顺序不同。在最初的工作中,平均光谱的计算方法是对每个像素取负对数(从反射率转换为伪吸光度),对所有像素取平均值,应用标准正态变量(SNV)变换,最后与SG (Savitzky-Golay)滤波器进行卷积。然而,当应用逐像素PLS时,SNV和SG分别应用于每个像素。由于SNV是非线性的(标准差不是线性的),所以它是在平均之前还是之后应用是很重要的。所以,这个更新现在对每个像素单独进行预处理,然后取平均值来计算平均谱。这消除了像素级PLS和平均光谱PLS结果之间的任何差异。任何提及这一差异的内容都已从工作中删除,所有结果和数据也已相应更新。附录D详细描述了更新内容。我们为这个错误道歉。
{"title":"Correction to “Transforming Hyperspectral Images Into Chemical Maps: A Novel End-to-End Deep Learning Approach”","authors":"","doi":"10.1002/cem.70084","DOIUrl":"https://doi.org/10.1002/cem.70084","url":null,"abstract":"<p>\u0000 <span>O.-C. G. Engstrøm</span>, <span>M. Albano-Gaglio</span>, <span>E. S. Dreier</span>, et al., “ <span>Transforming Hyperspectral Images Into Chemical Maps: A Novel End-to-End Deep Learning Approach</span>,” <i>Journal of Chemometrics</i> <span>39</span>, no. <span>8</span> (<span>2025</span>): e70041, https://doi.org/10.1002/cem.70041.\u0000 </p><p>In the original work, there was an apparent discrepancy between two ways of making predictions with the same PLS model. One way was predicting directly on mean spectra. The other was predicting on pixel-wise spectra and subsequently averaging the result. The discrepancy between these two ways of evaluation was solely due to the order of application of preprocessing.</p><p>In the original work, the mean spectrum was computed by taking the negative logarithm of each pixel (to transform from reflectance to pseudo absorbance), averaging all pixels, applying SNV (Standard Normal Variate) transform, and finally convolution with an SG (Savitzky–Golay) filter. However, when applying pixel-wise PLS, SNV and SG were applied to each pixel individually. As SNV is nonlinear (the standard deviation is not linear), it matters whether it is applied before or after averaging. So, this update now applies preprocessing to each pixel individually before taking the average to compute a mean spectrum. This removes any discrepancy between the results obtained by pixel-wise PLS and PLS on mean spectra. Any reference to this discrepancy has been removed from the work and all results and figures have been updated accordingly. Appendix D describes the updates in detail.</p><p>We apologize for this error.</p>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"39 11","pages":""},"PeriodicalIF":2.1,"publicationDate":"2025-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://analyticalsciencejournals.onlinelibrary.wiley.com/doi/epdf/10.1002/cem.70084","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145406846","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Chemometrics Study to Understand the Interaction of Starch–Protein Mixtures and Food Texture 用化学计量学研究了解淀粉-蛋白质混合物与食物质地的相互作用
IF 2.1 4区 化学 Q1 SOCIAL WORK Pub Date : 2025-10-28 DOI: 10.1002/cem.70081
Liziane Dantas Lacerda, Valmor Ziegler, Sarah Winck, Nádya Pesce da Silveira

Interactions that happen in the ingredients during the industrial process of food products, mainly in starch–protein systems, interfere directly with their typical characteristics. To investigate the effects of the individual components on textural properties blends with different soy protein isolate/gluten/starch ratios were studied. Three botanical sources of starch were also investigated by using wheat, corn, and cassava starches. The flowability, penetration force, rupture force, deformability, and hardness values of the gels, determined by texture analyzer under controlled conditions, were observed and allowed comparative study. Through chemometrics, linear, quadratic, and cubic fit models were applied to obtain polynomial equations that could adequately express the observed response surfaces for each evaluated textural parameter. The statistical significance of these models was evaluated through analysis of variance and a new experimental test, making it possible to indicate the best-fit model for each textural property studied. This study indicated that there are large correlation differences between the textural properties. The effect of soy protein isolate was greater than that of gluten and starch and a similarity of the behavior was observed from corn and wheat starches.

在食品工业生产过程中,主要是在淀粉-蛋白质系统中,成分之间的相互作用直接影响了它们的典型特性。为考察各组分对大豆分离蛋白/面筋/淀粉配比的影响,研究了大豆分离蛋白/面筋/淀粉配比对大豆分离蛋白/面筋/淀粉配比的影响。还研究了小麦、玉米和木薯淀粉的三种植物淀粉来源。在控制条件下,通过织构分析仪测定凝胶的流动性、穿透力、破裂力、变形性和硬度值,并进行对比研究。通过化学计量学,应用线性、二次和三次拟合模型,得到能够充分表达每个评估纹理参数的观测响应面的多项式方程。通过方差分析和新的实验检验来评估这些模型的统计显著性,从而可以为所研究的每种纹理特性指出最适合的模型。研究表明,两种材料的织构性能之间存在较大的相关差异。大豆分离蛋白的作用大于面筋和淀粉,玉米和小麦淀粉的作用也相似。
{"title":"Chemometrics Study to Understand the Interaction of Starch–Protein Mixtures and Food Texture","authors":"Liziane Dantas Lacerda,&nbsp;Valmor Ziegler,&nbsp;Sarah Winck,&nbsp;Nádya Pesce da Silveira","doi":"10.1002/cem.70081","DOIUrl":"https://doi.org/10.1002/cem.70081","url":null,"abstract":"<p>Interactions that happen in the ingredients during the industrial process of food products, mainly in starch–protein systems, interfere directly with their typical characteristics. To investigate the effects of the individual components on textural properties blends with different soy protein isolate/gluten/starch ratios were studied. Three botanical sources of starch were also investigated by using wheat, corn, and cassava starches. The flowability, penetration force, rupture force, deformability, and hardness values of the gels, determined by texture analyzer under controlled conditions, were observed and allowed comparative study. Through chemometrics, linear, quadratic, and cubic fit models were applied to obtain polynomial equations that could adequately express the observed response surfaces for each evaluated textural parameter. The statistical significance of these models was evaluated through analysis of variance and a new experimental test, making it possible to indicate the best-fit model for each textural property studied. This study indicated that there are large correlation differences between the textural properties. The effect of soy protein isolate was greater than that of gluten and starch and a similarity of the behavior was observed from corn and wheat starches.</p>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"39 11","pages":""},"PeriodicalIF":2.1,"publicationDate":"2025-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://analyticalsciencejournals.onlinelibrary.wiley.com/doi/epdf/10.1002/cem.70081","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145406847","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Special Issue for Celebrating Prof. Ruqin Yu's 90th Birthday 庆祝余汝琴教授九十大寿特刊
IF 2.1 4区 化学 Q1 SOCIAL WORK Pub Date : 2025-10-28 DOI: 10.1002/cem.70083
Hai-Long Wu, Zeng-Ping Chen, Tong Wang
<p>Prof. Ruqin Yu, born in November 1935, is a distinguished chemometrician and an academician of the Chinese Academy of Sciences. After graduating from the Department of Chemistry, St. Petersburg University, he pursued further research under Academician Shuquan Liang at the Institute of Chemistry, Chinese Academy of Sciences. Prof. Yu has made pioneering contributions to chemometrics, introducing morphological approaches and chaos concepts into algorithm design, and laying the theoretical foundation for robust chemometric multi-way resolution methods. Over his prolific career, he has published more than 1200 papers and received three National Natural Science Awards. He was awarded the Chemometrics Lifetime Achievement Prize in 2015. Besides, the 2016 Prize was awarded to another Chinese chemometrician Yizeng Liang, a former Prof. Yu's student who defended his PhD thesis under Yu's supervision in 1988. As a long-standing editor of the Journal of Chemometrics, Prof. Yu has played a vital role in shaping the development of the field worldwide, particularly fostering the growth of chemometrics in China. This special issue is dedicated to celebrating Prof. Yu's 90th birthday, paying tribute to his lifelong achievements and enduring influence on chemometrics and its community. The collected works not only highlight recent advances in chemometric theory and applications but also reflect the vibrancy and diversity of research in China, much of which has been inspired by Prof. Yu's vision and guidance.</p><p>The contributions in this issue span a wide range of chemometric methodologies, including spectral analysis, chromatographic data processing, data preprocessing and variable selection, and machine learning and deep learning approaches. The first element to emerge from papers within this special issue was the inclusion of two comprehensive reviews. One review focuses on the application of near-infrared spectroscopy combined with chemometric methods to explore water structures in chemical and biological systems, illustrating how chemometrics enables the resolution of subtle spectral features and reveals molecular interactions [<span>1</span>]. The other review provides a panoramic overview of process analysis chemistry based on modern spectroscopies such as infrared, Raman, and LIBS, and summarizes methodologies including preprocessing, feature selection, modeling, and optimization for process monitoring and control [<span>2</span>]. Together, these reviews demonstrate the indispensable role of chemometrics in both fundamental structural studies and practical process analysis.</p><p>The second element relates to the analysis of complex chromatographic and metabolomics data. Chemometric strategies for hyphenated data remain at the forefront of research, with multivariate curve resolution and multi-way calibration continuing to be recognized as core approaches. One study proposes a practical framework that integrates two-way and three-way methods to reso
总的来说,这些贡献表明深度学习如何成为化学计量学工具包的重要组成部分,使模式识别,预测建模和分子设计取得进展。总之,本期特刊汇集了九篇贡献,它们共同代表了化学计量学的连续性和创新性。他们展示了多元校准和曲线分辨率等经典基础如何继续发展,而包括深度学习和生成建模在内的新方法扩展了该领域的边界。我们衷心感谢所有作者和审稿人的宝贵努力,感谢编辑部的专业支持。最重要的是,我们热烈祝贺余汝勤教授90岁寿辰,并希望这期纪念刊将激发中国和世界化学计量学的进一步发展。作者声明无利益冲突。
{"title":"Special Issue for Celebrating Prof. Ruqin Yu's 90th Birthday","authors":"Hai-Long Wu,&nbsp;Zeng-Ping Chen,&nbsp;Tong Wang","doi":"10.1002/cem.70083","DOIUrl":"https://doi.org/10.1002/cem.70083","url":null,"abstract":"&lt;p&gt;Prof. Ruqin Yu, born in November 1935, is a distinguished chemometrician and an academician of the Chinese Academy of Sciences. After graduating from the Department of Chemistry, St. Petersburg University, he pursued further research under Academician Shuquan Liang at the Institute of Chemistry, Chinese Academy of Sciences. Prof. Yu has made pioneering contributions to chemometrics, introducing morphological approaches and chaos concepts into algorithm design, and laying the theoretical foundation for robust chemometric multi-way resolution methods. Over his prolific career, he has published more than 1200 papers and received three National Natural Science Awards. He was awarded the Chemometrics Lifetime Achievement Prize in 2015. Besides, the 2016 Prize was awarded to another Chinese chemometrician Yizeng Liang, a former Prof. Yu's student who defended his PhD thesis under Yu's supervision in 1988. As a long-standing editor of the Journal of Chemometrics, Prof. Yu has played a vital role in shaping the development of the field worldwide, particularly fostering the growth of chemometrics in China. This special issue is dedicated to celebrating Prof. Yu's 90th birthday, paying tribute to his lifelong achievements and enduring influence on chemometrics and its community. The collected works not only highlight recent advances in chemometric theory and applications but also reflect the vibrancy and diversity of research in China, much of which has been inspired by Prof. Yu's vision and guidance.&lt;/p&gt;&lt;p&gt;The contributions in this issue span a wide range of chemometric methodologies, including spectral analysis, chromatographic data processing, data preprocessing and variable selection, and machine learning and deep learning approaches. The first element to emerge from papers within this special issue was the inclusion of two comprehensive reviews. One review focuses on the application of near-infrared spectroscopy combined with chemometric methods to explore water structures in chemical and biological systems, illustrating how chemometrics enables the resolution of subtle spectral features and reveals molecular interactions [&lt;span&gt;1&lt;/span&gt;]. The other review provides a panoramic overview of process analysis chemistry based on modern spectroscopies such as infrared, Raman, and LIBS, and summarizes methodologies including preprocessing, feature selection, modeling, and optimization for process monitoring and control [&lt;span&gt;2&lt;/span&gt;]. Together, these reviews demonstrate the indispensable role of chemometrics in both fundamental structural studies and practical process analysis.&lt;/p&gt;&lt;p&gt;The second element relates to the analysis of complex chromatographic and metabolomics data. Chemometric strategies for hyphenated data remain at the forefront of research, with multivariate curve resolution and multi-way calibration continuing to be recognized as core approaches. One study proposes a practical framework that integrates two-way and three-way methods to reso","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"39 11","pages":""},"PeriodicalIF":2.1,"publicationDate":"2025-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://analyticalsciencejournals.onlinelibrary.wiley.com/doi/epdf/10.1002/cem.70083","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145406560","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Breast Cancer Prognosis Prediction Model Based on Cross-Modal Contrastive Learning 基于跨模态对比学习的乳腺癌预后预测模型
IF 2.1 4区 化学 Q1 SOCIAL WORK Pub Date : 2025-10-26 DOI: 10.1002/cem.70082
Fan Zhang, Sheng Chang, Binjie Wang, Xinhong Zhang

Breast cancer is a common malignant tumor that poses a serious threat to women's health. The incidence and mortality rates of breast cancer have shown an increasing trend worldwide in recent years; therefore, an accurate assessment of breast cancer prognosis is crucial for the development of individualized treatment plans and for the improvement of survival quality of patients. The traditional prognosis assessment of breast cancer mainly depended on doctors' clinical experience and multidisciplinary comprehensive judgment, which lacks unified objective evaluation criteria. This study proposes an innovative cross-modal contrastive learning model PreGAT based on graph neural networks and attention mechanism. The proposed model can efficiently integrate features from multiple sources of patient data, including clinical features and constructed graph structure features, and significantly improve the performance of the model through a novel contrastive learning loss function. The PreGAT model achieves excellent performance on the public METABRIC dataset with an average accuracy of 92.9% and an AUC value of 0.969. This research provides a promising technique for breast cancer prognosis prediction in clinical practice, which can provide more accurate and reliable decision support for the development of precise treatment programs.

乳腺癌是一种常见的恶性肿瘤,对妇女的健康构成严重威胁。近年来,乳腺癌的发病率和死亡率在世界范围内呈上升趋势;因此,准确评估乳腺癌预后对于制定个体化治疗方案和提高患者生存质量至关重要。传统的乳腺癌预后评估主要依靠医生的临床经验和多学科综合判断,缺乏统一的客观评价标准。本文提出了一种基于图神经网络和注意机制的跨模态对比学习模型PreGAT。该模型可以有效地整合多源患者数据的特征,包括临床特征和构建的图结构特征,并通过一种新的对比学习损失函数显著提高模型的性能。PreGAT模型在公共METABRIC数据集上取得了优异的性能,平均准确率为92.9%,AUC值为0.969。本研究在临床实践中为乳腺癌预后预测提供了一种有前景的技术,可为制定精准治疗方案提供更加准确可靠的决策支持。
{"title":"A Breast Cancer Prognosis Prediction Model Based on Cross-Modal Contrastive Learning","authors":"Fan Zhang,&nbsp;Sheng Chang,&nbsp;Binjie Wang,&nbsp;Xinhong Zhang","doi":"10.1002/cem.70082","DOIUrl":"https://doi.org/10.1002/cem.70082","url":null,"abstract":"<div>\u0000 \u0000 <p>Breast cancer is a common malignant tumor that poses a serious threat to women's health. The incidence and mortality rates of breast cancer have shown an increasing trend worldwide in recent years; therefore, an accurate assessment of breast cancer prognosis is crucial for the development of individualized treatment plans and for the improvement of survival quality of patients. The traditional prognosis assessment of breast cancer mainly depended on doctors' clinical experience and multidisciplinary comprehensive judgment, which lacks unified objective evaluation criteria. This study proposes an innovative cross-modal contrastive learning model PreGAT based on graph neural networks and attention mechanism. The proposed model can efficiently integrate features from multiple sources of patient data, including clinical features and constructed graph structure features, and significantly improve the performance of the model through a novel contrastive learning loss function. The PreGAT model achieves excellent performance on the public METABRIC dataset with an average accuracy of 92.9% and an AUC value of 0.969. This research provides a promising technique for breast cancer prognosis prediction in clinical practice, which can provide more accurate and reliable decision support for the development of precise treatment programs.</p>\u0000 </div>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"39 11","pages":""},"PeriodicalIF":2.1,"publicationDate":"2025-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145406668","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Ginseng Seed Variety Identification Based on Hyperspectral Imaging Technology and Transfer Learning 基于高光谱成像技术和迁移学习的人参种子品种鉴定
IF 2.1 4区 化学 Q1 SOCIAL WORK Pub Date : 2025-10-24 DOI: 10.1002/cem.70080
Zhan Shu, Xiong Li, Yande Liu

This study proposes a rapid and non-destructive method for ginseng seed classification at the individual seed level using hyperspectral imaging combined with transfer learning. A total of 900 seeds from three ginseng varieties were used, with 300 samples per variety. Spectral features were extracted from circular regions of interest, and classification models including PLS-DA, LSSVM, and random forest were built using preprocessed spectral data, achieving accuracies of 92%, 92%, and 93%, respectively. Principal component images (PC1–PC3) were fused into RGB format and fed into pre-trained deep learning models (AlexNet, GoogLeNet, Inception-V3, ResNet-50, SqueezeNet). Among them, the improved AlexNet model achieved the highest accuracy of 97%. The results demonstrate that transfer learning models integrating spectral and image features outperform traditional spectral models, offering an effective solution for precise classification and quality control of ginseng seeds.

本研究提出了一种结合迁移学习的高光谱成像快速无损的人参种子分类方法。总共使用了来自三个人参品种的900个种子,每个品种300个样本。从感兴趣的圆形区域提取光谱特征,利用预处理后的光谱数据构建PLS-DA、LSSVM和随机森林分类模型,准确率分别达到92%、92%和93%。将主成分图像(PC1-PC3)融合成RGB格式,并输入预训练的深度学习模型(AlexNet、GoogLeNet、Inception-V3、ResNet-50、SqueezeNet)。其中,改进的AlexNet模型准确率最高,达到97%。结果表明,结合光谱和图像特征的迁移学习模型优于传统的光谱模型,为人参种子的精确分类和质量控制提供了有效的解决方案。
{"title":"Ginseng Seed Variety Identification Based on Hyperspectral Imaging Technology and Transfer Learning","authors":"Zhan Shu,&nbsp;Xiong Li,&nbsp;Yande Liu","doi":"10.1002/cem.70080","DOIUrl":"https://doi.org/10.1002/cem.70080","url":null,"abstract":"<div>\u0000 \u0000 <p>This study proposes a rapid and non-destructive method for ginseng seed classification at the individual seed level using hyperspectral imaging combined with transfer learning. A total of 900 seeds from three ginseng varieties were used, with 300 samples per variety. Spectral features were extracted from circular regions of interest, and classification models including PLS-DA, LSSVM, and random forest were built using preprocessed spectral data, achieving accuracies of 92%, 92%, and 93%, respectively. Principal component images (PC1–PC3) were fused into RGB format and fed into pre-trained deep learning models (AlexNet, GoogLeNet, Inception-V3, ResNet-50, SqueezeNet). Among them, the improved AlexNet model achieved the highest accuracy of 97%. The results demonstrate that transfer learning models integrating spectral and image features outperform traditional spectral models, offering an effective solution for precise classification and quality control of ginseng seeds.</p>\u0000 </div>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"39 11","pages":""},"PeriodicalIF":2.1,"publicationDate":"2025-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145366597","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Data Fusion: Integrating Heterogeneous Information Sources in the Chemical Processing Industry 数据融合:化工加工行业异构信息源的集成
IF 2.1 4区 化学 Q1 SOCIAL WORK Pub Date : 2025-10-23 DOI: 10.1002/cem.70075
Eugeniu Strelet, Ivan Castillo, You Peng, Marco S. Reis

Industrial data have always played a pivotal role in the development, management, and optimization of processes and products. Over time, the intrinsic characteristics of data (modality, acquisition frequency, granularity, uncertainty, etc.) have evolved significantly and so have the computational methods and technology to process and integrate them, transforming these inputs into valuable insights. By integrating multiple heterogeneous data sources, data fusion provides an updated and improved status of the operations, enhancing information quality and leading to more comprehensive, accurate, and insightful analysis of the systems under examination and, therefore, better decision-making. In this work, we present a review of the data fusion methodologies from the perspective of practitioners, including engineers and data analysts, and how it can be systematically organized into classes of methods dedicated to achieving specific goals. The terminology adopted in the field, not always used consistently and without ambiguity, is also discussed and clarified. Additionally, several case studies are provided to showcase some of the applications and potential advantages of adopting data fusion frameworks.

工业数据一直在流程和产品的开发、管理和优化中发挥着关键作用。随着时间的推移,数据的内在特征(模态、获取频率、粒度、不确定性等)发生了显著变化,处理和整合数据的计算方法和技术也发生了显著变化,将这些输入转化为有价值的见解。通过集成多个异构数据源,数据融合提供了更新和改进的操作状态,提高了信息质量,并导致对所检查系统进行更全面、更准确和更有洞察力的分析,从而更好地做出决策。在这项工作中,我们从实践者(包括工程师和数据分析师)的角度对数据融合方法进行了回顾,以及如何将其系统地组织成致力于实现特定目标的方法类。还讨论和澄清了在该领域采用的术语,这些术语并不总是一致地使用,没有歧义。此外,还提供了几个案例研究,以展示采用数据融合框架的一些应用程序和潜在优势。
{"title":"Data Fusion: Integrating Heterogeneous Information Sources in the Chemical Processing Industry","authors":"Eugeniu Strelet,&nbsp;Ivan Castillo,&nbsp;You Peng,&nbsp;Marco S. Reis","doi":"10.1002/cem.70075","DOIUrl":"https://doi.org/10.1002/cem.70075","url":null,"abstract":"<p>Industrial data have always played a pivotal role in the development, management, and optimization of processes and products. Over time, the intrinsic characteristics of data (modality, acquisition frequency, granularity, uncertainty, etc.) have evolved significantly and so have the computational methods and technology to process and integrate them, transforming these inputs into valuable insights. By integrating multiple heterogeneous data sources, data fusion provides an updated and improved status of the operations, enhancing information quality and leading to more comprehensive, accurate, and insightful analysis of the systems under examination and, therefore, better decision-making. In this work, we present a review of the data fusion methodologies from the perspective of practitioners, including engineers and data analysts, and how it can be systematically organized into classes of methods dedicated to achieving specific goals. The terminology adopted in the field, not always used consistently and without ambiguity, is also discussed and clarified. Additionally, several case studies are provided to showcase some of the applications and potential advantages of adopting data fusion frameworks.</p>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"39 11","pages":""},"PeriodicalIF":2.1,"publicationDate":"2025-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://analyticalsciencejournals.onlinelibrary.wiley.com/doi/epdf/10.1002/cem.70075","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145366847","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fluorescence Excitation-Emission Matrices Decomposition Using Statistical Correction for Primary Inner Filtering 基于统计校正的一次内滤波荧光激发发射矩阵分解
IF 2.1 4区 化学 Q1 SOCIAL WORK Pub Date : 2025-10-22 DOI: 10.1002/cem.70077
Nikolay Maslov, Natalya Gubanova

Multi-excitation fluorescence spectroscopy allows characterizing the distribution of fluorophores in samples of various natures and is a powerful tool for chemical, biological, and medical diagnostics. However, interpreting large sets of excitation-dependent fluorescence spectra remains a significant challenge, especially when the fluorophores are initially unknown. In optically thin samples, spectral decomposition can be performed using several statistical methods. In practice, however, various light propagation effects cause deviations from the pure component spectra and prevent the direct application of these methods. A sample-specific inner filtering correction is required prior to their use. Unfortunately, performing such a correction is not always feasible due to complex light propagation geometry or the difficulty of obtaining additional measurements. Here, we suggest an indirect procedure for optically thick samples with negligible fluorescence reabsorption that does not rely on any assumptions about the light illumination and collection geometry, by iteratively determining appropriate correction coefficients for excitation variations. Due to the correlated nature of fluorescence spectra in excitation-emission matrices, this approach enables decomposition when other methods fail. Model experiments with simulated spectral data and real fluorophores showed good agreement between the calculated spectra and the actual fluorescence spectra of the corresponding substances.

多激发荧光光谱可以表征各种性质样品中荧光团的分布,是化学、生物和医学诊断的有力工具。然而,解释大量的激发依赖的荧光光谱仍然是一个重大的挑战,特别是当荧光团最初是未知的。在光学薄样品中,光谱分解可以使用几种统计方法进行。然而,在实际应用中,各种光的传播效应导致与纯组分光谱的偏差,阻碍了这些方法的直接应用。在使用它们之前,需要对样品进行特定的内部过滤校正。不幸的是,由于复杂的光传播几何形状或难以获得额外的测量,执行这样的校正并不总是可行的。在这里,我们建议一个间接的程序,光学厚样品可忽略荧光重吸收,不依赖于任何假设的光照明和收集几何,通过迭代确定适当的校正系数的激发变化。由于荧光光谱在激发-发射矩阵中的相关性,当其他方法失败时,这种方法可以进行分解。用模拟光谱数据和真实荧光团进行的模型实验表明,计算出的光谱与对应物质的实际荧光光谱吻合较好。
{"title":"Fluorescence Excitation-Emission Matrices Decomposition Using Statistical Correction for Primary Inner Filtering","authors":"Nikolay Maslov,&nbsp;Natalya Gubanova","doi":"10.1002/cem.70077","DOIUrl":"https://doi.org/10.1002/cem.70077","url":null,"abstract":"<div>\u0000 \u0000 <p>Multi-excitation fluorescence spectroscopy allows characterizing the distribution of fluorophores in samples of various natures and is a powerful tool for chemical, biological, and medical diagnostics. However, interpreting large sets of excitation-dependent fluorescence spectra remains a significant challenge, especially when the fluorophores are initially unknown. In optically thin samples, spectral decomposition can be performed using several statistical methods. In practice, however, various light propagation effects cause deviations from the pure component spectra and prevent the direct application of these methods. A sample-specific inner filtering correction is required prior to their use. Unfortunately, performing such a correction is not always feasible due to complex light propagation geometry or the difficulty of obtaining additional measurements. Here, we suggest an indirect procedure for optically thick samples with negligible fluorescence reabsorption that does not rely on any assumptions about the light illumination and collection geometry, by iteratively determining appropriate correction coefficients for excitation variations. Due to the correlated nature of fluorescence spectra in excitation-emission matrices, this approach enables decomposition when other methods fail. Model experiments with simulated spectral data and real fluorophores showed good agreement between the calculated spectra and the actual fluorescence spectra of the corresponding substances.</p>\u0000 </div>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"39 11","pages":""},"PeriodicalIF":2.1,"publicationDate":"2025-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145366575","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multiclass Partial Least Squares Discriminant Analysis: Taking New Ways—A Perspective 多类偏最小二乘判别分析:新方法——一个视角
IF 2.1 4区 化学 Q1 SOCIAL WORK Pub Date : 2025-10-20 DOI: 10.1002/cem.70078
O. Ye. Rodionova, A. L. Pomerantsev

An overview of various prospective PLS-DA strategies is presented. PLS-DA is considered not only as a discrimination method per se but also as a method for extracting low-dimensional features suitable for discriminating high-dimensional data, which makes it similar to PCA. Unlike PCA, PLS-DA extracts two sets of features: the matrix of predicted dummy responses (Y) and the matrix of PLS scores (T). Both sets are then used as input to hard (deterministic) and soft (probabilistic) discriminators, providing a wide selection of combinations. Moreover, the features can be enriched by using additional information provided by the PLS model. A total of 18 different strategies are considered, including 8 based on Y-features and 10 based on T-features. Two data sets are used to illustrate the theory, yielding the following results. Both the T- and the Y-based approaches are comparable; additional information is ineffective. The cons and pros of each strategy are discussed.

概述了各种前瞻性PLS-DA策略。PLS-DA不仅本身是一种判别方法,而且是一种提取适合于判别高维数据的低维特征的方法,这与PCA相似。与PCA不同,PLS- da提取两组特征:预测虚拟响应矩阵(Y)和PLS评分矩阵(T)。然后,这两个集合被用作硬(确定性)和软(概率)鉴别器的输入,提供广泛的组合选择。此外,可以通过使用PLS模型提供的附加信息来丰富这些特征。总共考虑了18种不同的策略,其中8种基于y特征,10种基于t特征。两个数据集被用来说明这个理论,得到以下结果。基于T的方法和基于T的方法都具有可比性;附加信息是无效的。讨论了每种策略的优缺点。
{"title":"Multiclass Partial Least Squares Discriminant Analysis: Taking New Ways—A Perspective","authors":"O. Ye. Rodionova,&nbsp;A. L. Pomerantsev","doi":"10.1002/cem.70078","DOIUrl":"https://doi.org/10.1002/cem.70078","url":null,"abstract":"<div>\u0000 \u0000 <p>An overview of various prospective PLS-DA strategies is presented. PLS-DA is considered not only as a discrimination method per se but also as a method for extracting low-dimensional features suitable for discriminating high-dimensional data, which makes it similar to PCA. Unlike PCA, PLS-DA extracts two sets of features: the matrix of predicted dummy responses (<b>Y</b>) and the matrix of PLS scores (<b>T</b>). Both sets are then used as input to hard (deterministic) and soft (probabilistic) discriminators, providing a wide selection of combinations. Moreover, the features can be enriched by using additional information provided by the PLS model. A total of 18 different strategies are considered, including 8 based on <b>Y</b>-features and 10 based on <b>T</b>-features. Two data sets are used to illustrate the theory, yielding the following results. Both the <b>T</b>- and the <b>Y</b>-based approaches are comparable; additional information is ineffective. The cons and pros of each strategy are discussed.</p>\u0000 </div>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"39 11","pages":""},"PeriodicalIF":2.1,"publicationDate":"2025-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145327813","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Outlier Detection Using Immersive Analytics With Virtual Reality 使用沉浸式分析与虚拟现实的异常值检测
IF 2.1 4区 化学 Q1 SOCIAL WORK Pub Date : 2025-10-20 DOI: 10.1002/cem.70079
Hyrum J. Redd, Jordan M. J. Peper, John H. Kalivas

This paper honors Paul Geladi with a new chemometric data visualization approach based on immersive analytics principles using virtual reality (VR). With the many technological advancements improving the accessibility of extended reality (XR), including VR, the frontier of immersive analytics is wide open for utilizing VR as a powerful chemometric data analysis tool. Immersion of human senses into a virtually generated three-dimensional (3D) world compels us to act more instinctively in complex data decision-making scenarios by using our inherent cognitive pattern recognition capacity learned over years of experience. Proposed is an immersive analytics application of VR for a hybrid human/computer-aided outlier detection process. In this application, each training set sample is virtually realized as a glyph to visually assess the inter- and intra-sample relationships present in datasets while mining for nonrepresentative samples. Sample glyph shape and size are developed using hundreds of sample similarity measurements for each sample based both on spectral and prediction property information. These sample glyphs are visually and spatially compared with one another in VR by the user. Outlier checking in VR safeguards against masking and swamping problems that are difficult to recognize with automatic algorithms. Results from dataset situations based on near infrared (NIR) and ultraviolet (UV) spectra and analyte reference values show the viability of using VR for data analysis and outlier detection. This VR application demonstrates the looming evolution of immersive analytics with an XR interface involving human reasoning in difficult chemometric data analysis settings.

本文以一种基于虚拟现实(VR)沉浸式分析原理的新的化学计量数据可视化方法来纪念Paul Geladi。随着包括VR在内的许多技术进步提高了扩展现实(XR)的可访问性,沉浸式分析的前沿为利用VR作为强大的化学计量数据分析工具敞开了大门。人类感官沉浸在虚拟生成的三维(3D)世界中,迫使我们在复杂的数据决策场景中更本能地行动,利用我们多年经验中习得的固有认知模式识别能力。提出了一种沉浸式虚拟现实分析应用程序,用于混合人/计算机辅助离群值检测过程。在这个应用程序中,每个训练集样本实际上被实现为一个符号,以便在挖掘非代表性样本时直观地评估数据集中存在的样本间和样本内关系。基于光谱和预测属性信息,对每个样本使用数百个样本相似性测量来开发样本字形形状和大小。这些样例符号在视觉上和空间上由用户在VR中相互比较。VR中的异常值检查可以防止自动算法难以识别的掩盖和淹没问题。基于近红外(NIR)和紫外(UV)光谱以及分析物参考值的数据集情况的结果表明,将VR用于数据分析和异常值检测是可行的。这个虚拟现实应用程序展示了沉浸式分析迫在眉睫的发展,其中包括在困难的化学计量数据分析设置中涉及人类推理的XR界面。
{"title":"Outlier Detection Using Immersive Analytics With Virtual Reality","authors":"Hyrum J. Redd,&nbsp;Jordan M. J. Peper,&nbsp;John H. Kalivas","doi":"10.1002/cem.70079","DOIUrl":"https://doi.org/10.1002/cem.70079","url":null,"abstract":"<div>\u0000 \u0000 <p>This paper honors Paul Geladi with a new chemometric data visualization approach based on immersive analytics principles using virtual reality (VR). With the many technological advancements improving the accessibility of extended reality (XR), including VR, the frontier of immersive analytics is wide open for utilizing VR as a powerful chemometric data analysis tool. Immersion of human senses into a virtually generated three-dimensional (3D) world compels us to act more instinctively in complex data decision-making scenarios by using our inherent cognitive pattern recognition capacity learned over years of experience. Proposed is an immersive analytics application of VR for a hybrid human/computer-aided outlier detection process. In this application, each training set sample is virtually realized as a glyph to visually assess the inter- and intra-sample relationships present in datasets while mining for nonrepresentative samples. Sample glyph shape and size are developed using hundreds of sample similarity measurements for each sample based both on spectral and prediction property information. These sample glyphs are visually and spatially compared with one another in VR by the user. Outlier checking in VR safeguards against masking and swamping problems that are difficult to recognize with automatic algorithms. Results from dataset situations based on near infrared (NIR) and ultraviolet (UV) spectra and analyte reference values show the viability of using VR for data analysis and outlier detection. This VR application demonstrates the looming evolution of immersive analytics with an XR interface involving human reasoning in difficult chemometric data analysis settings.</p>\u0000 </div>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"39 11","pages":""},"PeriodicalIF":2.1,"publicationDate":"2025-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145327814","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Chemometrics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1