首页 > 最新文献

Journal of Chemometrics最新文献

英文 中文
Dynamic Process Fault Detection and Diagnosis Method Based on Factor Analysis: Application on the Three-Tank System Process 基于因子分析的动态过程故障检测与诊断方法:在三罐系统过程中的应用
IF 2.3 4区 化学 Q1 SOCIAL WORK Pub Date : 2024-11-16 DOI: 10.1002/cem.3627
Cheng Zhang, Ze-hao Xu, Yu-yu Lao, Yuan Li

To address the issue of underreporting faults in the detection of tiny faults by dynamic factor analysis (DFA), a novel fault detection and diagnosis method based on DFA-sliding window combined with mean square error (DFA-SWMSE) is proposed. Firstly, the data matrix is augmented by introducing time lag shifts. Secondly, factor analysis (FA) is applied to the augmented data matrix, achieving dimensionality reduction and feature extraction while retaining most of the original data's information. Then, the sliding window technique is applied to calculate the mean square error of the dimensionally reduced data, allowing for the monitoring of the system's current state and the detection of tiny faults. Finally, effective fault diagnosis is achieved through the analysis of fault factors and variable contributions. The proposed method is validated using a complex dynamic numerical example and a three-tank system process named Sim3Tanks. This system has gained widespread application in the field of process fault detection due to its ability to simulate and generate various types of faults. The proposed method is compared with principal component analysis (PCA), dynamic principal component analysis (DPCA), PCA similarity factor (SPCA), FA, and DFA. The experimental results thoroughly validate the effectiveness of the proposed method in detecting and diagnosing tiny faults in dynamic processes.

针对动态因子分析(DFA)检测微小故障时漏报故障的问题,提出了一种基于DFA-滑动窗口结合均方误差(DFA- swmse)的故障检测与诊断方法。首先,通过引入时滞位移对数据矩阵进行增广。其次,对增广后的数据矩阵进行因子分析,在保留大部分原始数据信息的前提下实现降维和特征提取;然后,应用滑动窗口技术计算降维数据的均方误差,实现对系统当前状态的监测和微小故障的检测。最后,通过对故障因素和变量贡献的分析,实现有效的故障诊断。通过一个复杂的动态数值算例和一个名为Sim3Tanks的三罐系统过程验证了该方法的有效性。该系统由于能够模拟和生成各种类型的故障,在过程故障检测领域得到了广泛的应用。将该方法与主成分分析(PCA)、动态主成分分析(DPCA)、主成分相似因子分析(SPCA)、主成分分析(FA)和DFA进行了比较。实验结果充分验证了该方法对动态过程中微小故障的检测和诊断的有效性。
{"title":"Dynamic Process Fault Detection and Diagnosis Method Based on Factor Analysis: Application on the Three-Tank System Process","authors":"Cheng Zhang,&nbsp;Ze-hao Xu,&nbsp;Yu-yu Lao,&nbsp;Yuan Li","doi":"10.1002/cem.3627","DOIUrl":"https://doi.org/10.1002/cem.3627","url":null,"abstract":"<div>\u0000 \u0000 <p>To address the issue of underreporting faults in the detection of tiny faults by dynamic factor analysis (DFA), a novel fault detection and diagnosis method based on DFA-sliding window combined with mean square error (DFA-SWMSE) is proposed. Firstly, the data matrix is augmented by introducing time lag shifts. Secondly, factor analysis (FA) is applied to the augmented data matrix, achieving dimensionality reduction and feature extraction while retaining most of the original data's information. Then, the sliding window technique is applied to calculate the mean square error of the dimensionally reduced data, allowing for the monitoring of the system's current state and the detection of tiny faults. Finally, effective fault diagnosis is achieved through the analysis of fault factors and variable contributions. The proposed method is validated using a complex dynamic numerical example and a three-tank system process named Sim3Tanks. This system has gained widespread application in the field of process fault detection due to its ability to simulate and generate various types of faults. The proposed method is compared with principal component analysis (PCA), dynamic principal component analysis (DPCA), PCA similarity factor (SPCA), FA, and DFA. The experimental results thoroughly validate the effectiveness of the proposed method in detecting and diagnosing tiny faults in dynamic processes.</p>\u0000 </div>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"38 12","pages":""},"PeriodicalIF":2.3,"publicationDate":"2024-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142861405","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Forensic Comparison of Amphetamine Chemical Profiles by Bayesian Predictive Modelling 通过贝叶斯预测模型对苯丙胺化学特征进行法医比较
IF 2.3 4区 化学 Q1 SOCIAL WORK Pub Date : 2024-11-13 DOI: 10.1002/cem.3630
Tuomas Korpinsalo, Juhana Rautavirta, Sami Huhtala, Tapani Reinikainen, Jukka Corander

Forensic chemists frequently employ statistical profiling approaches to assess the degree of similarity between samples of illicit drugs. Such profiling information can help reveal connections between nodes in distribution networks and manufacturing laboratories. For amphetamine, the routine method of comparing a pair of samples includes the use of a dissimilarity measure based on the Pearson correlation coefficient calculated between their chemical profiles obtained through gas chromatography–mass spectrometry. This simple measure of (dis)similarity has been shown distinguish pairs sharing a common origin (e.g., same production batch) to a reasonable level of accuracy. However, Pearson correlation fails to capture all the relevant notions of similarity between chemical profiles of amphetamine. We present a new statistical method for forensic drug comparison that uses a more sophisticated statistical modelling approach to determine similarity between samples. We show that this leads to improved performance over the correlation-based approach. The proposed method is easily extendable and has an intuitive interpretation both from chemistry and forensic perspectives, which supports wide applicability to illicit drug profiling in practice.

法医化学家经常采用统计分析方法来评估非法药物样本之间的相似程度。这种分析信息可以帮助揭示分销网络和制造实验室节点之间的联系。对于安非他明,比较一对样品的常规方法包括使用基于Pearson相关系数的不相似性测量,该系数是通过气相色谱-质谱法获得的化学剖面之间的计算结果。这种简单的(非)相似性度量已被证明能在合理的精度水平上区分具有共同起源(例如,同一生产批次)的对。然而,皮尔逊相关性未能捕捉到安非他明化学特征之间的相似性的所有相关概念。我们提出了一种新的法医药物比较统计方法,使用更复杂的统计建模方法来确定样品之间的相似性。我们表明,与基于相关性的方法相比,这可以提高性能。所提出的方法易于扩展,并且从化学和法医的角度都具有直观的解释,这支持在实践中广泛适用于非法药物分析。
{"title":"Forensic Comparison of Amphetamine Chemical Profiles by Bayesian Predictive Modelling","authors":"Tuomas Korpinsalo,&nbsp;Juhana Rautavirta,&nbsp;Sami Huhtala,&nbsp;Tapani Reinikainen,&nbsp;Jukka Corander","doi":"10.1002/cem.3630","DOIUrl":"https://doi.org/10.1002/cem.3630","url":null,"abstract":"<div>\u0000 \u0000 <p>Forensic chemists frequently employ statistical profiling approaches to assess the degree of similarity between samples of illicit drugs. Such profiling information can help reveal connections between nodes in distribution networks and manufacturing laboratories. For amphetamine, the routine method of comparing a pair of samples includes the use of a dissimilarity measure based on the Pearson correlation coefficient calculated between their chemical profiles obtained through gas chromatography–mass spectrometry. This simple measure of (dis)similarity has been shown distinguish pairs sharing a common origin (e.g., same production batch) to a reasonable level of accuracy. However, Pearson correlation fails to capture all the relevant notions of similarity between chemical profiles of amphetamine. We present a new statistical method for forensic drug comparison that uses a more sophisticated statistical modelling approach to determine similarity between samples. We show that this leads to improved performance over the correlation-based approach. The proposed method is easily extendable and has an intuitive interpretation both from chemistry and forensic perspectives, which supports wide applicability to illicit drug profiling in practice.</p>\u0000 </div>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"38 12","pages":""},"PeriodicalIF":2.3,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142861077","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Chemometrics: A Vital Implement for Understanding the Water Structures by Near-Infrared Spectroscopy 化学计量学:利用近红外光谱了解水结构的重要工具
IF 2.3 4区 化学 Q1 SOCIAL WORK Pub Date : 2024-11-08 DOI: 10.1002/cem.3631
Haipeng Wang, Li Han, Wensheng Cai, Xueguang Shao

Water structures take an important role in chemical and biological systems, because the structure and function of a molecule may depend on the structure of water with which the molecule interacts. Near-infrared (NIR) spectroscopy has been proven to be powerful in analyzing the structure of water due to its sensitive response to OH. However, chemometrics is vitally important in the analysis of NIR spectrum of water due to the low resolution of the spectrum and the complexity of the water structures. In this review, chemometric methods for structural analysis of water in aqueous systems, particularly in chemical and biological processes, by NIR spectroscopy were summarized, from the improvement of spectral resolution to the effective extraction of the spectral information of different water structures. Through the changes of the spectral features of the water structures, the structural transformation of proteins, thermo-responsive polymers, antifreeze agents, as well as the structural variation of water in the transformation were elucidated. Water was proved to be a good probe for analyzing the structure and interactions in aqueous solutions and chemical/biological processes by NIR spectroscopy.

水的结构在化学和生物系统中起着重要作用,因为分子的结构和功能可能取决于与分子相互作用的水的结构。近红外(NIR)光谱因其对 OH 的灵敏反应,已被证明是分析水的结构的有力工具。然而,由于近红外光谱的分辨率较低,且水的结构复杂,因此化学计量学在分析水的近红外光谱时至关重要。在这篇综述中,从提高光谱分辨率到有效提取不同水结构的光谱信息,总结了利用近红外光谱对水系统中水的结构进行分析的化学计量学方法,特别是在化学和生物过程中。通过水结构光谱特征的变化,阐明了蛋白质、热响应聚合物、防冻剂的结构转变以及转变过程中水的结构变化。事实证明,水是利用近红外光谱分析水溶液结构和相互作用以及化学/生物过程的良好探针。
{"title":"Chemometrics: A Vital Implement for Understanding the Water Structures by Near-Infrared Spectroscopy","authors":"Haipeng Wang,&nbsp;Li Han,&nbsp;Wensheng Cai,&nbsp;Xueguang Shao","doi":"10.1002/cem.3631","DOIUrl":"https://doi.org/10.1002/cem.3631","url":null,"abstract":"<div>\u0000 \u0000 <p>Water structures take an important role in chemical and biological systems, because the structure and function of a molecule may depend on the structure of water with which the molecule interacts. Near-infrared (NIR) spectroscopy has been proven to be powerful in analyzing the structure of water due to its sensitive response to OH. However, chemometrics is vitally important in the analysis of NIR spectrum of water due to the low resolution of the spectrum and the complexity of the water structures. In this review, chemometric methods for structural analysis of water in aqueous systems, particularly in chemical and biological processes, by NIR spectroscopy were summarized, from the improvement of spectral resolution to the effective extraction of the spectral information of different water structures. Through the changes of the spectral features of the water structures, the structural transformation of proteins, thermo-responsive polymers, antifreeze agents, as well as the structural variation of water in the transformation were elucidated. Water was proved to be a good probe for analyzing the structure and interactions in aqueous solutions and chemical/biological processes by NIR spectroscopy.</p>\u0000 </div>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"38 12","pages":""},"PeriodicalIF":2.3,"publicationDate":"2024-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142860739","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Data Augmentation and Fault Diagnosis for Imbalanced Industrial Process Data Based on Residual Wasserstein Generative Adversarial Network With Gradient Penalty 基于梯度惩罚残差Wasserstein生成对抗网络的不平衡工业过程数据增强与故障诊断
IF 2.3 4区 化学 Q1 SOCIAL WORK Pub Date : 2024-11-08 DOI: 10.1002/cem.3624
Ying Tian, Jian Shen, Ao Wang, Zeqiu Li, Xiuhui Huang

In practical industrial applications, equipment usually operates normally and failures are relatively rare, resulting in serious imbalances in the collected data. This imbalance leads to issues such as overfitting, instability, and poor robustness, significantly reducing the accuracy and stability of fault diagnosis system. To address these challenges, this research proposes a method for imbalanced data augmentation and industrial process fault diagnosis based on improved Generative Adversarial Network (GAN). The method adopts Wasserstein distance with gradient penalty and integrates residual connections into the architecture of the generator. This innovation not only helps improve gradient transfer in the generator, but also significantly enhances the data generation capabilities of the generative model through improving the stability of training. Limited industrial process data is used by a generative model to produce synthetic samples with high similarity and diversity. These high-quality samples improve fault diagnosis by enriching the imbalanced dataset. Experimental results on two industrial datasets confirm the method's effectiveness in enhancing fault diagnosis performance with limited data.

在实际工业应用中,设备通常运行正常,故障相对较少,导致采集数据严重不平衡。这种不平衡导致了过拟合、不稳定、鲁棒性差等问题,大大降低了故障诊断系统的准确性和稳定性。为了解决这些问题,本研究提出了一种基于改进生成对抗网络(GAN)的不平衡数据增强和工业过程故障诊断方法。该方法采用带梯度惩罚的Wasserstein距离,并将残差连接集成到发电机的结构中。这一创新不仅有助于提高生成器中的梯度传递,而且通过提高训练的稳定性,显著增强了生成模型的数据生成能力。生成模型利用有限的工业过程数据生成具有高相似性和多样性的合成样品。这些高质量的样本通过丰富不平衡数据集来提高故障诊断。在两个工业数据集上的实验结果验证了该方法在有限数据条件下提高故障诊断性能的有效性。
{"title":"Data Augmentation and Fault Diagnosis for Imbalanced Industrial Process Data Based on Residual Wasserstein Generative Adversarial Network With Gradient Penalty","authors":"Ying Tian,&nbsp;Jian Shen,&nbsp;Ao Wang,&nbsp;Zeqiu Li,&nbsp;Xiuhui Huang","doi":"10.1002/cem.3624","DOIUrl":"https://doi.org/10.1002/cem.3624","url":null,"abstract":"<div>\u0000 \u0000 <p>In practical industrial applications, equipment usually operates normally and failures are relatively rare, resulting in serious imbalances in the collected data. This imbalance leads to issues such as overfitting, instability, and poor robustness, significantly reducing the accuracy and stability of fault diagnosis system. To address these challenges, this research proposes a method for imbalanced data augmentation and industrial process fault diagnosis based on improved Generative Adversarial Network (GAN). The method adopts Wasserstein distance with gradient penalty and integrates residual connections into the architecture of the generator. This innovation not only helps improve gradient transfer in the generator, but also significantly enhances the data generation capabilities of the generative model through improving the stability of training. Limited industrial process data is used by a generative model to produce synthetic samples with high similarity and diversity. These high-quality samples improve fault diagnosis by enriching the imbalanced dataset. Experimental results on two industrial datasets confirm the method's effectiveness in enhancing fault diagnosis performance with limited data.</p>\u0000 </div>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"38 12","pages":""},"PeriodicalIF":2.3,"publicationDate":"2024-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142860508","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Past, Present and Future of Research in Analytical Figures of Merit 功勋人物分析研究的过去、现在和未来
IF 2.3 4区 化学 Q1 SOCIAL WORK Pub Date : 2024-11-07 DOI: 10.1002/cem.3616
Alejandro Olivieri
{"title":"Past, Present and Future of Research in Analytical Figures of Merit","authors":"Alejandro Olivieri","doi":"10.1002/cem.3616","DOIUrl":"https://doi.org/10.1002/cem.3616","url":null,"abstract":"","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"38 11","pages":""},"PeriodicalIF":2.3,"publicationDate":"2024-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cem.3616","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142665740","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Characterization of Chemical Information and Content Prediction of Dendrobium officinale Based on ATR-FTIR 基于ATR-FTIR的铁皮石斛化学信息表征及含量预测
IF 2.3 4区 化学 Q1 SOCIAL WORK Pub Date : 2024-11-06 DOI: 10.1002/cem.3626
Peiyuan Li, Tao Shen, Shaobing Yang, Zhitian Zuo, Yuanzhong Wang, Qiang Hu

Dendrobium officinale is a medicinal and food plant with high commercial and medicinal value. Yunnan is known as China's “plant kingdom,” and although the climatic conditions are favorable, the large vertical climatic differences have led to a large difference in the quality of dendrobium from different origins. The analysis of quality differences between several origins with large ecological advantages has not been reported yet. Therefore, the aim of this study is to compare these regions in terms of both morphology and chemical composition and to analyze the variation of their chemical composition in spectral information. The PLS-DA, SVM, and PLSR models were developed to qualitatively and quantitatively evaluate Dendrobium from different production areas. The results show that the Menghai production area was superior to other production areas in terms of phenotypic morphology, quality, and yield. Within the appropriate range, the higher the specific absorbance, the higher the quercetin content.

铁皮石斛是一种具有很高商业价值和药用价值的药用和食用植物。云南被誉为中国的“植物王国”,虽然气候条件优越,但垂直气候差异较大,导致不同产地的石斛品质差异较大。几种具有较大生态优势的产地间的品质差异分析尚未见报道。因此,本研究的目的是比较这些区域在形态和化学成分方面的差异,并分析其光谱信息中化学成分的变化。采用PLS-DA、SVM和PLSR模型对不同产地的石斛进行定性和定量评价。结果表明,勐海产区在表型形态、品质和产量方面均优于其他产区。在适当范围内,比吸光度越高,槲皮素含量越高。
{"title":"Characterization of Chemical Information and Content Prediction of Dendrobium officinale Based on ATR-FTIR","authors":"Peiyuan Li,&nbsp;Tao Shen,&nbsp;Shaobing Yang,&nbsp;Zhitian Zuo,&nbsp;Yuanzhong Wang,&nbsp;Qiang Hu","doi":"10.1002/cem.3626","DOIUrl":"https://doi.org/10.1002/cem.3626","url":null,"abstract":"<div>\u0000 \u0000 <p><i>Dendrobium officinale</i> is a medicinal and food plant with high commercial and medicinal value. Yunnan is known as China's “plant kingdom,” and although the climatic conditions are favorable, the large vertical climatic differences have led to a large difference in the quality of dendrobium from different origins. The analysis of quality differences between several origins with large ecological advantages has not been reported yet. Therefore, the aim of this study is to compare these regions in terms of both morphology and chemical composition and to analyze the variation of their chemical composition in spectral information. The PLS-DA, SVM, and PLSR models were developed to qualitatively and quantitatively evaluate <i>Dendrobium</i> from different production areas. The results show that the Menghai production area was superior to other production areas in terms of phenotypic morphology, quality, and yield. Within the appropriate range, the higher the specific absorbance, the higher the quercetin content.</p>\u0000 </div>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"38 12","pages":""},"PeriodicalIF":2.3,"publicationDate":"2024-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142860198","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Three-Way Data Reduction Based on Essential Information 基于基本信息的三向数据约简
IF 2.3 4区 化学 Q1 SOCIAL WORK Pub Date : 2024-11-05 DOI: 10.1002/cem.3617
Raffaele Vitale, Azar Azizi, Mahdiyeh Ghaffari, Nematollah Omidikia, Cyril Ruckebusch

In this article, the idea of essential information-based compression is extended to trilinear datasets. This basically boils down to identifying and labelling the essential rows (ERs), columns (ECs) and tubes (ETs) of such three-dimensional datasets that allow by themselves to reconstruct in a linear way the entire space of the original measurements. ERs, ECs and ETs can be determined by exploiting convex geometry computational approaches such as convex hull or convex polytope estimations and can be used to generate a reduced version of the data at hand. These compressed data and their uncompressed counterpart share the same multilinear properties and their factorisation (carried out by means of, for example, parallel factor analysis–alternating least squares [PARAFAC-ALS]) yield, in principle, indistinguishable results. More in detail, an algorithm for the assessment and extraction of the essential information encoded in trilinear data structures is here proposed. Its performance was evaluated in both real-world and simulated scenarios which permitted to highlight the benefits that this novel data reduction strategy can bring in domains like multiway fluorescence spectroscopy and imaging.

本文将基于基本信息的压缩理念扩展到三维数据集。这基本上可以归结为识别和标注此类三维数据集的基本行(ER)、列(EC)和管(ET),这些基本行(ER)、列(EC)和管(ET)本身就能以线性方式重建原始测量的整个空间。ER、EC 和 ET 可以通过利用凸几何计算方法(如凸壳或凸多面体估算)来确定,并可用于生成手头数据的压缩版本。这些压缩数据及其未压缩的对应数据具有相同的多线性特性,其因式分解(通过并行因式分析--交替最小二乘法 [PARAFAC-ALS] 等方法进行)原则上可产生无差别的结果。更详细地说,本文提出了一种算法,用于评估和提取三线性数据结构中编码的基本信息。在真实世界和模拟场景中对其性能进行了评估,从而突出了这种新型数据缩减策略在多路荧光光谱和成像等领域的优势。
{"title":"Three-Way Data Reduction Based on Essential Information","authors":"Raffaele Vitale,&nbsp;Azar Azizi,&nbsp;Mahdiyeh Ghaffari,&nbsp;Nematollah Omidikia,&nbsp;Cyril Ruckebusch","doi":"10.1002/cem.3617","DOIUrl":"https://doi.org/10.1002/cem.3617","url":null,"abstract":"<p>In this article, the idea of essential information-based compression is extended to trilinear datasets. This basically boils down to identifying and labelling the essential rows (ERs), columns (ECs) and tubes (ETs) of such three-dimensional datasets that allow by themselves to reconstruct in a linear way the entire space of the original measurements. ERs, ECs and ETs can be determined by exploiting convex geometry computational approaches such as convex hull or convex polytope estimations and can be used to generate a reduced version of the data at hand. These compressed data and their uncompressed counterpart share the same multilinear properties and their factorisation (carried out by means of, for example, parallel factor analysis–alternating least squares [PARAFAC-ALS]) yield, in principle, indistinguishable results. More in detail, an algorithm for the assessment and extraction of the essential information encoded in trilinear data structures is here proposed. Its performance was evaluated in both real-world and simulated scenarios which permitted to highlight the benefits that this novel data reduction strategy can bring in domains like multiway fluorescence spectroscopy and imaging.</p>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"38 12","pages":""},"PeriodicalIF":2.3,"publicationDate":"2024-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cem.3617","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142860385","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Fault Detection Strategy of Partial Least Squares Based on Temporal Neighborhood Difference 基于时间邻域差分的偏最小二乘故障检测策略
IF 2.3 4区 化学 Q1 SOCIAL WORK Pub Date : 2024-11-05 DOI: 10.1002/cem.3621
Liwei Feng, Shaofeng Guo, Yifei Wu, Yu Xing, Yuan Li

Aiming at the difficulty of detecting time-lag faults in dynamic processes, a fault detection strategy based on time neighborhood difference (TND) is proposed, and it is introduced into the partial least squares (PLS) method to suggest the PLS-TND fault detection method. The TND method takes the mean to the multibatch training set to obtain a baseline training set, and it constructs the mean squared Euclidean distance (MSED) statistic by calculating the average distance between the sample's first k-moments neighborhood samples and samples at the same moment in the baseline training set. The TND method can help the PLS method to effectively detect time-lag faults and significantly improve the fault detection capability of PLS by measuring the overall positional difference between the temporal neighborhood sample set of the sample and its temporal neighborhood sample set in the baseline training set. The PLS-TND method is compared with some classical fault detection methods through a numerical simulation process and a Continuous Stirred Tank Reactor (CSTR) system design fault detection experiment. The PLS-TND method gives the best performance of fault detection.

针对动态过程中时滞故障检测困难的问题,提出了一种基于时间邻域差分(TND)的故障检测策略,并将其引入偏最小二乘(PLS)方法中,提出了PLS-TND故障检测方法。TND方法对多批训练集取均值,得到基线训练集,并通过计算样本的前k矩邻域样本与基线训练集中同一时刻样本之间的平均距离,构造均方欧氏距离(MSED)统计量。TND方法通过测量样本的时间邻域样本集与其在基线训练集中的时间邻域样本集的总体位置差,可以帮助PLS方法有效检测时滞故障,显著提高PLS的故障检测能力。通过数值模拟过程和连续搅拌槽式反应器(CSTR)系统设计故障检测实验,将PLS-TND方法与一些经典故障检测方法进行了比较。PLS-TND方法具有最佳的故障检测性能。
{"title":"Fault Detection Strategy of Partial Least Squares Based on Temporal Neighborhood Difference","authors":"Liwei Feng,&nbsp;Shaofeng Guo,&nbsp;Yifei Wu,&nbsp;Yu Xing,&nbsp;Yuan Li","doi":"10.1002/cem.3621","DOIUrl":"https://doi.org/10.1002/cem.3621","url":null,"abstract":"<div>\u0000 \u0000 <p>Aiming at the difficulty of detecting time-lag faults in dynamic processes, a fault detection strategy based on time neighborhood difference (TND) is proposed, and it is introduced into the partial least squares (PLS) method to suggest the PLS-TND fault detection method. The TND method takes the mean to the multibatch training set to obtain a baseline training set, and it constructs the mean squared Euclidean distance (MSED) statistic by calculating the average distance between the sample's first <i>k</i>-moments neighborhood samples and samples at the same moment in the baseline training set. The TND method can help the PLS method to effectively detect time-lag faults and significantly improve the fault detection capability of PLS by measuring the overall positional difference between the temporal neighborhood sample set of the sample and its temporal neighborhood sample set in the baseline training set. The PLS-TND method is compared with some classical fault detection methods through a numerical simulation process and a Continuous Stirred Tank Reactor (CSTR) system design fault detection experiment. The PLS-TND method gives the best performance of fault detection.</p>\u0000 </div>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"38 12","pages":""},"PeriodicalIF":2.3,"publicationDate":"2024-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142860386","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Dynamic Multiblock Regression for Process Modelling 过程建模的动态多块回归
IF 2.3 4区 化学 Q1 SOCIAL WORK Pub Date : 2024-11-04 DOI: 10.1002/cem.3618
Marco Cattaldo, Alberto Ferrer, Ingrid Måge

The study introduces three novel strategies for incorporating capabilities for dynamic modelling into multiblock regression methods by integrating sequentially orthogonalised partial least squares (SO-PLS) with different dynamic modelling techniques. The study evaluates these strategies using synthetic datasets and an industrial example, comparing their performance in predictive ability, identification of process dynamics, and quantification of block contributions. Results suggest that these approaches can effectively model the dynamics with performance comparable to state-of-the-art methods, providing, at the same time, insight into the dynamic order and block contributions. One of the strategies, sequentially orthogonalised dynamic augmented (SODA)–PLS, shows promise by ensuring that redundant information in the time dimension is not included, resulting in simpler and more easily interpretable dynamic models. These multiblock dynamic regression strategies have potential applications for improved process understanding in industrial settings, especially where multiple data sources and inherent time dynamics are present.

该研究介绍了三种新的策略,通过将顺序正交偏最小二乘(SO-PLS)与不同的动态建模技术相结合,将动态建模能力纳入多块回归方法。该研究使用合成数据集和一个工业实例来评估这些策略,比较它们在预测能力、过程动态识别和区块贡献量化方面的表现。结果表明,这些方法可以有效地模拟动态,其性能与最先进的方法相当,同时提供了对动态顺序和块贡献的洞察。其中一种策略,顺序正交化动态增强(SODA) -PLS,通过确保不包括时间维度上的冗余信息,从而产生更简单和更容易解释的动态模型,显示出了希望。这些多块动态回归策略在工业环境中有潜在的应用,可以提高对过程的理解,特别是在存在多个数据源和固有时间动态的情况下。
{"title":"Dynamic Multiblock Regression for Process Modelling","authors":"Marco Cattaldo,&nbsp;Alberto Ferrer,&nbsp;Ingrid Måge","doi":"10.1002/cem.3618","DOIUrl":"https://doi.org/10.1002/cem.3618","url":null,"abstract":"<p>The study introduces three novel strategies for incorporating capabilities for dynamic modelling into multiblock regression methods by integrating sequentially orthogonalised partial least squares (SO-PLS) with different dynamic modelling techniques. The study evaluates these strategies using synthetic datasets and an industrial example, comparing their performance in predictive ability, identification of process dynamics, and quantification of block contributions. Results suggest that these approaches can effectively model the dynamics with performance comparable to state-of-the-art methods, providing, at the same time, insight into the dynamic order and block contributions. One of the strategies, sequentially orthogonalised dynamic augmented (SODA)–PLS, shows promise by ensuring that redundant information in the time dimension is not included, resulting in simpler and more easily interpretable dynamic models. These multiblock dynamic regression strategies have potential applications for improved process understanding in industrial settings, especially where multiple data sources and inherent time dynamics are present.</p>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"38 12","pages":""},"PeriodicalIF":2.3,"publicationDate":"2024-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cem.3618","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142860169","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
SFMD-X: A New Functional Data Classifier Based on Shrinkage Functional Mahalanobis Distance SFMD-X:一种新的基于收缩函数马氏距离的功能数据分类器
IF 2.3 4区 化学 Q1 SOCIAL WORK Pub Date : 2024-11-04 DOI: 10.1002/cem.3615
Shunke Bao, Jiakun Guo, Zhouping Li

In this article, we propose a novel classification approach for functional data based on a shrinkage estimate of functional Mahalanobis distance. We first introduce a new shrinkage functional Mahalanobis distance (SFMD), by using this new distance, we transform the functional observations into a set of vector-valued pseudo-samples. Furthermore, we adopt some good classification algorithms designed for multivariate data to this pseudo-samples instead of the original functional data. The new approach has advantage of highly flexible and scalable, that is, it can easily combine with any classification algorithm, such as support vector machine, tree-based methods, and neural networks. We demonstrate the performance of the proposed functional classifier through both extensive simulation studies and two real data applications.

在本文中,我们提出了一种基于功能马氏距离收缩估计的功能数据分类方法。我们首先引入一个新的收缩函数马氏距离(SFMD),利用这个新距离将函数观测值转换为一组向量值伪样本。此外,我们采用了一些针对多元数据设计的好的分类算法来代替原始的功能数据。该方法具有高度的灵活性和可扩展性,即可以很容易地与任何分类算法(如支持向量机、基于树的方法和神经网络)相结合。我们通过广泛的仿真研究和两个实际数据应用来证明所提出的功能分类器的性能。
{"title":"SFMD-X: A New Functional Data Classifier Based on Shrinkage Functional Mahalanobis Distance","authors":"Shunke Bao,&nbsp;Jiakun Guo,&nbsp;Zhouping Li","doi":"10.1002/cem.3615","DOIUrl":"https://doi.org/10.1002/cem.3615","url":null,"abstract":"<div>\u0000 \u0000 <p>In this article, we propose a novel classification approach for functional data based on a shrinkage estimate of functional Mahalanobis distance. We first introduce a new shrinkage functional Mahalanobis distance (SFMD), by using this new distance, we transform the functional observations into a set of vector-valued pseudo-samples. Furthermore, we adopt some good classification algorithms designed for multivariate data to this pseudo-samples instead of the original functional data. The new approach has advantage of highly flexible and scalable, that is, it can easily combine with any classification algorithm, such as support vector machine, tree-based methods, and neural networks. We demonstrate the performance of the proposed functional classifier through both extensive simulation studies and two real data applications.</p>\u0000 </div>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"38 12","pages":""},"PeriodicalIF":2.3,"publicationDate":"2024-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142860168","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of Chemometrics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1