Paul-Albert Schneide, Rasmus Bro, Neal B. Gallagher
{"title":"移位不变三线性-一种解决非靶向气相色谱耦合质谱数据的新模型","authors":"Paul-Albert Schneide, Rasmus Bro, Neal B. Gallagher","doi":"10.1002/cem.3501","DOIUrl":null,"url":null,"abstract":"<p>Multi-way data analysis is popular in chemometrics for the decomposition of, for example, spectroscopic or chromatographic higher-order tensor datasets. Parallel factor analysis (PARAFAC) and its extension, PARAFAC2, are extensively employed methods in chemometrics. Applications of PARAFAC2 for untargeted data analysis of hyphenated gas chromatography coupled with mass spectrometric detection (GC-MS) have proven to be very successful. This is attributable to the ability of PARAFAC2 to account for retention time shifts and shape changes in chromatographic elution profiles. Despite its usefulness, the most common implementations of PARAFAC2 are considered quite slow. Furthermore, it is difficult to apply constraints (e.g., non-negativity) to the shifted mode in PARAFAC2 models. Both aspects are addressed by a new shift-invariant tri-linearity (SIT) algorithm proposed in this paper. It is shown on simulated and real GC-MS data that the SIT algorithm is 20–60 times faster than the latest PARAFAC2-alternating least squares (ALS) implementation and the PARAFAC2-flexible coupling algorithm. Further, the SIT method allows the implementation of constraints in all modes. Trials on real-world data indicate that the SIT algorithm compares well with alternatives. The new SIT method achieves better factor resolution than the benchmark in some cases and tends to need fewer latent variables to extract the same chemical information. Although SIT is not capable of modeling shape changes in elution profiles, trials on real-world data indicate the great robustness of the method even in those cases.</p>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"37 8","pages":""},"PeriodicalIF":2.3000,"publicationDate":"2023-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cem.3501","citationCount":"1","resultStr":"{\"title\":\"Shift-invariant tri-linearity—A new model for resolving untargeted gas chromatography coupled mass spectrometry data\",\"authors\":\"Paul-Albert Schneide, Rasmus Bro, Neal B. Gallagher\",\"doi\":\"10.1002/cem.3501\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Multi-way data analysis is popular in chemometrics for the decomposition of, for example, spectroscopic or chromatographic higher-order tensor datasets. Parallel factor analysis (PARAFAC) and its extension, PARAFAC2, are extensively employed methods in chemometrics. Applications of PARAFAC2 for untargeted data analysis of hyphenated gas chromatography coupled with mass spectrometric detection (GC-MS) have proven to be very successful. This is attributable to the ability of PARAFAC2 to account for retention time shifts and shape changes in chromatographic elution profiles. Despite its usefulness, the most common implementations of PARAFAC2 are considered quite slow. Furthermore, it is difficult to apply constraints (e.g., non-negativity) to the shifted mode in PARAFAC2 models. Both aspects are addressed by a new shift-invariant tri-linearity (SIT) algorithm proposed in this paper. It is shown on simulated and real GC-MS data that the SIT algorithm is 20–60 times faster than the latest PARAFAC2-alternating least squares (ALS) implementation and the PARAFAC2-flexible coupling algorithm. Further, the SIT method allows the implementation of constraints in all modes. Trials on real-world data indicate that the SIT algorithm compares well with alternatives. The new SIT method achieves better factor resolution than the benchmark in some cases and tends to need fewer latent variables to extract the same chemical information. Although SIT is not capable of modeling shape changes in elution profiles, trials on real-world data indicate the great robustness of the method even in those cases.</p>\",\"PeriodicalId\":15274,\"journal\":{\"name\":\"Journal of Chemometrics\",\"volume\":\"37 8\",\"pages\":\"\"},\"PeriodicalIF\":2.3000,\"publicationDate\":\"2023-06-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cem.3501\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Chemometrics\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/cem.3501\",\"RegionNum\":4,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"SOCIAL WORK\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Chemometrics","FirstCategoryId":"92","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/cem.3501","RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"SOCIAL WORK","Score":null,"Total":0}
Shift-invariant tri-linearity—A new model for resolving untargeted gas chromatography coupled mass spectrometry data
Multi-way data analysis is popular in chemometrics for the decomposition of, for example, spectroscopic or chromatographic higher-order tensor datasets. Parallel factor analysis (PARAFAC) and its extension, PARAFAC2, are extensively employed methods in chemometrics. Applications of PARAFAC2 for untargeted data analysis of hyphenated gas chromatography coupled with mass spectrometric detection (GC-MS) have proven to be very successful. This is attributable to the ability of PARAFAC2 to account for retention time shifts and shape changes in chromatographic elution profiles. Despite its usefulness, the most common implementations of PARAFAC2 are considered quite slow. Furthermore, it is difficult to apply constraints (e.g., non-negativity) to the shifted mode in PARAFAC2 models. Both aspects are addressed by a new shift-invariant tri-linearity (SIT) algorithm proposed in this paper. It is shown on simulated and real GC-MS data that the SIT algorithm is 20–60 times faster than the latest PARAFAC2-alternating least squares (ALS) implementation and the PARAFAC2-flexible coupling algorithm. Further, the SIT method allows the implementation of constraints in all modes. Trials on real-world data indicate that the SIT algorithm compares well with alternatives. The new SIT method achieves better factor resolution than the benchmark in some cases and tends to need fewer latent variables to extract the same chemical information. Although SIT is not capable of modeling shape changes in elution profiles, trials on real-world data indicate the great robustness of the method even in those cases.
期刊介绍:
The Journal of Chemometrics is devoted to the rapid publication of original scientific papers, reviews and short communications on fundamental and applied aspects of chemometrics. It also provides a forum for the exchange of information on meetings and other news relevant to the growing community of scientists who are interested in chemometrics and its applications. Short, critical review papers are a particularly important feature of the journal, in view of the multidisciplinary readership at which it is aimed.