Helene Fog Froriep Halberg, Marta Bevilacqua, Åsmund Rinnan
Fluorescence spectroscopy has been applied for analysis of complex samples, such as food and beverages. Parallel factor analysis (PARAFAC) is a well-known decomposition method for fluorescence excitation–emission matrices (EEMs). When the complexity of the system increases, it becomes considerably more difficult to determine the optimal number of PARAFAC components, especially when the fluorophores of the system are unknown. The two commonly applied diagnostics, core consistency and split-half analysis, appear to underestimate the model complexity due to covarying components and local minima, respectively. As a more robust alternative, we propose a resampling approach with multiple initializations and submodel comparisons for estimating the optimal number of PARAFAC components in complex data.
{"title":"Resampling as a Robust Measure of Model Complexity in PARAFAC Models","authors":"Helene Fog Froriep Halberg, Marta Bevilacqua, Åsmund Rinnan","doi":"10.1002/cem.3601","DOIUrl":"10.1002/cem.3601","url":null,"abstract":"<p>Fluorescence spectroscopy has been applied for analysis of complex samples, such as food and beverages. Parallel factor analysis (PARAFAC) is a well-known decomposition method for fluorescence excitation–emission matrices (EEMs). When the complexity of the system increases, it becomes considerably more difficult to determine the optimal number of PARAFAC components, especially when the fluorophores of the system are unknown. The two commonly applied diagnostics, core consistency and split-half analysis, appear to underestimate the model complexity due to covarying components and local minima, respectively. As a more robust alternative, we propose a resampling approach with multiple initializations and submodel comparisons for estimating the optimal number of PARAFAC components in complex data.</p>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"38 12","pages":""},"PeriodicalIF":2.3,"publicationDate":"2024-09-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cem.3601","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142209893","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Irene Mariñas-Collado, Juan M. Rodríguez-Díaz, M. Teresa Santos-Martín
This study addresses the complex dynamics of alcohol elimination in the human body, very important in forensic and healthcare areas. Existing models often oversimplify with the assumption of linear elimination kinetics, limiting practical application. This study presents a novel non-linear model for estimating blood alcohol concentration after multiple intakes. Initially developed for two different alcohol incorporations, it can be straightforwardly extended to the case of more intakes. Emphasising the significance of accurate parameter estimation, the research underscores the importance of precise experimental design, utilising optimal experimental design (OED) methodologies. Sensitivity analysis of model coefficients and the determination of D-optimal designs, considering correlation structures among observations, reveal a strong linear relationship between support points. This relationship can be used to obtain nearly optimal designs that are highly efficient and much easier to compute.
{"title":"A Non-Linear Model for Multiple Alcohol Intakes and Optimal Designs Strategies","authors":"Irene Mariñas-Collado, Juan M. Rodríguez-Díaz, M. Teresa Santos-Martín","doi":"10.1002/cem.3599","DOIUrl":"10.1002/cem.3599","url":null,"abstract":"<p>This study addresses the complex dynamics of alcohol elimination in the human body, very important in forensic and healthcare areas. Existing models often oversimplify with the assumption of linear elimination kinetics, limiting practical application. This study presents a novel non-linear model for estimating blood alcohol concentration after multiple intakes. Initially developed for two different alcohol incorporations, it can be straightforwardly extended to the case of more intakes. Emphasising the significance of accurate parameter estimation, the research underscores the importance of precise experimental design, utilising optimal experimental design (OED) methodologies. Sensitivity analysis of model coefficients and the determination of D-optimal designs, considering correlation structures among observations, reveal a strong linear relationship between support points. This relationship can be used to obtain nearly optimal designs that are highly efficient and much easier to compute.</p>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"38 12","pages":""},"PeriodicalIF":2.3,"publicationDate":"2024-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cem.3599","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142209898","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
In this paper, we revisit the power curves in ANOVA simultaneous component analysis (ASCA) based on permutation testing and introduce the population curves derived from population parameters describing the relative effect among factors and interactions. The relative effect has important practical implications: The statistical power of a given factor depends on the design of other factors in the experiment and not only of the sample size. Thus, understanding the relative power in a specific experimental design can be extremely useful to maximize our capability of success when planning the experiment. In the paper, we derive relative and absolute population curves, where the former represent statistical power in terms of the normalized effect size between structure and noise, and the latter in terms of the sample size. Both types of population curves allow us to make decisions regarding the number and nature (fixed/random) of factors, their relationships (crossed/nested), and the number of levels and replicates, among others, in an multivariate experimental design (e.g., an omics study) during the planning phase of the experiment. We illustrate both types of curves through simulation.
{"title":"Population Power Curves in ASCA With Permutation Testing","authors":"José Camacho, Michael Sorochan Armstrong","doi":"10.1002/cem.3596","DOIUrl":"10.1002/cem.3596","url":null,"abstract":"<p>In this paper, we revisit the power curves in ANOVA simultaneous component analysis (ASCA) based on permutation testing and introduce the population curves derived from population parameters describing the relative effect among factors and interactions. The relative effect has important practical implications: The statistical power of a given factor depends on the design of other factors in the experiment and not only of the sample size. Thus, understanding the relative power in a specific experimental design can be extremely useful to maximize our capability of success when planning the experiment. In the paper, we derive relative and absolute population curves, where the former represent statistical power in terms of the normalized effect size between structure and noise, and the latter in terms of the sample size. Both types of population curves allow us to make decisions regarding the number and nature (fixed/random) of factors, their relationships (crossed/nested), and the number of levels and replicates, among others, in an multivariate experimental design (e.g., an omics study) during the planning phase of the experiment. We illustrate both types of curves through simulation.</p>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":"38 12","pages":""},"PeriodicalIF":2.3,"publicationDate":"2024-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cem.3596","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142209894","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}