首页 > 最新文献

Chemometrics and Intelligent Laboratory Systems最新文献

英文 中文
Optimal design of experiments when not every test is equally expensive 当不是每项试验都同样昂贵时,实验的最优设计
IF 3.8 2区 化学 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-02-15 Epub Date: 2025-12-17 DOI: 10.1016/j.chemolab.2025.105617
Mohammed Saif Ismail Hameed, Robin van der Haar, Ying Chen, Peter Goos
When experimental tests differ in cost and the experiment is constrained by a fixed total budget, the optimal number of tests and the allocation between expensive and inexpensive tests cannot be determined a priori. We propose using a Variable Neighborhood Search (VNS) algorithm to generate optimal experimental designs for such problems. VNS is an intuitive and flexible metaheuristic that has been successfully applied to a wide range of optimization problems. We illustrate the effectiveness of the VNS algorithm by generating improved designs for a micronization experiment.
当实验测试成本不同且实验总预算固定时,不能先验地确定最优测试数量以及昂贵和廉价测试之间的分配。我们建议使用可变邻域搜索(VNS)算法来生成此类问题的最佳实验设计。VNS是一种直观、灵活的元启发式算法,已成功地应用于各种优化问题。我们通过生成微粉化实验的改进设计来说明VNS算法的有效性。
{"title":"Optimal design of experiments when not every test is equally expensive","authors":"Mohammed Saif Ismail Hameed,&nbsp;Robin van der Haar,&nbsp;Ying Chen,&nbsp;Peter Goos","doi":"10.1016/j.chemolab.2025.105617","DOIUrl":"10.1016/j.chemolab.2025.105617","url":null,"abstract":"<div><div>When experimental tests differ in cost and the experiment is constrained by a fixed total budget, the optimal number of tests and the allocation between expensive and inexpensive tests cannot be determined a priori. We propose using a Variable Neighborhood Search (VNS) algorithm to generate optimal experimental designs for such problems. VNS is an intuitive and flexible metaheuristic that has been successfully applied to a wide range of optimization problems. We illustrate the effectiveness of the VNS algorithm by generating improved designs for a micronization experiment.</div></div>","PeriodicalId":9774,"journal":{"name":"Chemometrics and Intelligent Laboratory Systems","volume":"269 ","pages":"Article 105617"},"PeriodicalIF":3.8,"publicationDate":"2026-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145836845","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An unsupervised approach to anomaly detection in near-infrared spectroscopy via Covariance-Shrunk Slow Feature Analysis 基于协方差收缩慢特征分析的无监督近红外光谱异常检测方法
IF 3.8 2区 化学 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-02-15 Epub Date: 2025-12-12 DOI: 10.1016/j.chemolab.2025.105615
Yinran Xiong , Jie Tang , Guangming Qiu , Peng Wang , Yuncan Chen , Jing Jing , Lijun Zhu
Agricultural products often exhibit substantial batch-to-batch variability in their chemical and physical properties due to environmental and other uncontrollable factors, making robust quality monitoring essential for ensuring product consistency and stability. Near-infrared (NIR) spectroscopy offers rich chemical and physical information for qualitative quality assessment, but its high dimensionality and the scarcity of abnormal samples, since non-conforming products are not intentionally manufactured, limit the applicability of conventional supervised learning approaches. To address these challenges, this study proposes Covariance-Shrunk Slow Feature Analysis (CSSFA), a novel unsupervised learning method that integrates covariance shrinkage into the Slow Feature Analysis (SFA) framework. CSSFA mitigates estimation bias in high-dimensional settings and improves the robustness and interpretability of extracted features. Experiments on two NIR tobacco datasets demonstrate that CSSFA effectively captures features related to product quality stability and achieves accurate anomaly detection without requiring large numbers of abnormal samples. This work provides a scalable and interpretable strategy for anomaly detection of agricultural products using NIR spectroscopy with abnormal samples which are rare or unavailable.
由于环境和其他不可控因素的影响,农产品的化学和物理特性往往在批次之间表现出很大的差异,因此强有力的质量监控对于确保产品的一致性和稳定性至关重要。近红外(NIR)光谱为定性质量评估提供了丰富的化学和物理信息,但由于不合格产品不是故意制造的,其高维数和异常样品的稀缺性限制了传统监督学习方法的适用性。为了解决这些挑战,本研究提出了协方差收缩慢特征分析(CSSFA),这是一种将协方差收缩集成到慢特征分析(SFA)框架中的新型无监督学习方法。CSSFA减轻了高维环境下的估计偏差,提高了提取特征的鲁棒性和可解释性。在两个近红外烟草数据集上的实验表明,CSSFA有效地捕获了与产品质量稳定性相关的特征,在不需要大量异常样本的情况下实现了准确的异常检测。这项工作提供了一种可扩展和可解释的策略,用于利用近红外光谱对罕见或不可用的异常样品进行农产品异常检测。
{"title":"An unsupervised approach to anomaly detection in near-infrared spectroscopy via Covariance-Shrunk Slow Feature Analysis","authors":"Yinran Xiong ,&nbsp;Jie Tang ,&nbsp;Guangming Qiu ,&nbsp;Peng Wang ,&nbsp;Yuncan Chen ,&nbsp;Jing Jing ,&nbsp;Lijun Zhu","doi":"10.1016/j.chemolab.2025.105615","DOIUrl":"10.1016/j.chemolab.2025.105615","url":null,"abstract":"<div><div>Agricultural products often exhibit substantial batch-to-batch variability in their chemical and physical properties due to environmental and other uncontrollable factors, making robust quality monitoring essential for ensuring product consistency and stability. Near-infrared (NIR) spectroscopy offers rich chemical and physical information for qualitative quality assessment, but its high dimensionality and the scarcity of abnormal samples, since non-conforming products are not intentionally manufactured, limit the applicability of conventional supervised learning approaches. To address these challenges, this study proposes Covariance-Shrunk Slow Feature Analysis (CSSFA), a novel unsupervised learning method that integrates covariance shrinkage into the Slow Feature Analysis (SFA) framework. CSSFA mitigates estimation bias in high-dimensional settings and improves the robustness and interpretability of extracted features. Experiments on two NIR tobacco datasets demonstrate that CSSFA effectively captures features related to product quality stability and achieves accurate anomaly detection without requiring large numbers of abnormal samples. This work provides a scalable and interpretable strategy for anomaly detection of agricultural products using NIR spectroscopy with abnormal samples which are rare or unavailable.</div></div>","PeriodicalId":9774,"journal":{"name":"Chemometrics and Intelligent Laboratory Systems","volume":"269 ","pages":"Article 105615"},"PeriodicalIF":3.8,"publicationDate":"2026-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145787024","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Raman mapping and Chemometrics: An open access Python-based routine to preprocess and generate chemical maps applying CLS, PCA and PLS methods 拉曼映射和化学计量学:一个开放访问的基于python的程序,用于预处理和生成化学图,应用CLS, PCA和PLS方法
IF 3.8 2区 化学 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-02-15 Epub Date: 2025-12-12 DOI: 10.1016/j.chemolab.2025.105616
Luiz Renato Rosa Leme de Souza, Carlos Alberto Rios, Márcia Cristina Breitkreitz

Context:

Python is a widely-known open-source and robust programming language used in many research fields. Artificial Intelligence (AI) is a growing tool with many applications that is capable of helping with long and difficult tasks. Routines for preprocessing spectra signals and applying chemometric models are usually part of expensive software. Despite the existence of isolated code snippets, libraries, and tutorials, it is a hard task to find an open-access routine that guides from the raw Raman mapping data set to the clear chemical information contained within the analyzed samples by means of chemical maps.

Objectives:

This paper presents an AI-assisted Python-based routine for preprocessing Raman mapping results and generating chemical maps of samples using the chemometric methods: CLS, PLS and PCA, with the goal of providing an open access routine for research purposes.

Methods:

Python programming language and AI tools were used as code generators, translators, and debugging tools to assist the creation of the routine, and the results were compared to the ones obtained by a Matlab routine.

Results:

The Python routine successfully performed the preprocessing of the Raman spectra and the calculations of the chemometric methods CLS, PLS and PCA generating chemical maps. The results were equivalent to those of Matlab for the same data set, leading to the same conclusions.

Conclusion:

This paper demonstrated the application of an open access Python-based AI-guided routine to preprocess and generate chemical maps applying CLS, PCA and PLS models, now available and editable to suit different needs.
上下文:Python是一种广为人知的开源和健壮的编程语言,用于许多研究领域。人工智能(AI)是一个不断发展的工具,有许多应用程序能够帮助完成长期和困难的任务。光谱信号的预处理和化学计量模型的应用通常是昂贵软件的一部分。尽管存在孤立的代码片段、库和教程,但很难找到一个开放访问的例程,通过化学图从原始拉曼映射数据集引导到分析样品中包含的清晰化学信息。目的:本文提出了一个人工智能辅助的基于python的程序,用于预处理拉曼映射结果,并使用化学计量学方法:CLS, PLS和PCA生成样品的化学图,目的是为研究目的提供一个开放获取的程序。方法:使用Python编程语言和人工智能工具作为代码生成器、翻译器和调试工具,辅助例程的创建,并将结果与Matlab例程的结果进行比较。结果:Python程序成功地完成了拉曼光谱的预处理和化学计量学方法CLS、PLS和PCA的计算,生成了化学图谱。对于相同的数据集,结果与Matlab等效,得出相同的结论。结论:本文展示了一个开放获取的基于python的人工智能引导程序的应用,该程序应用CLS、PCA和PLS模型预处理和生成化学图谱,现已可用并可编辑,以满足不同的需求。
{"title":"Raman mapping and Chemometrics: An open access Python-based routine to preprocess and generate chemical maps applying CLS, PCA and PLS methods","authors":"Luiz Renato Rosa Leme de Souza,&nbsp;Carlos Alberto Rios,&nbsp;Márcia Cristina Breitkreitz","doi":"10.1016/j.chemolab.2025.105616","DOIUrl":"10.1016/j.chemolab.2025.105616","url":null,"abstract":"<div><h3>Context:</h3><div>Python is a widely-known open-source and robust programming language used in many research fields. Artificial Intelligence (AI) is a growing tool with many applications that is capable of helping with long and difficult tasks. Routines for preprocessing spectra signals and applying chemometric models are usually part of expensive software. Despite the existence of isolated code snippets, libraries, and tutorials, it is a hard task to find an open-access routine that guides from the raw Raman mapping data set to the clear chemical information contained within the analyzed samples by means of chemical maps.</div></div><div><h3>Objectives:</h3><div>This paper presents an AI-assisted Python-based routine for preprocessing Raman mapping results and generating chemical maps of samples using the chemometric methods: CLS, PLS and PCA, with the goal of providing an open access routine for research purposes.</div></div><div><h3>Methods:</h3><div>Python programming language and AI tools were used as code generators, translators, and debugging tools to assist the creation of the routine, and the results were compared to the ones obtained by a Matlab routine.</div></div><div><h3>Results:</h3><div>The Python routine successfully performed the preprocessing of the Raman spectra and the calculations of the chemometric methods CLS, PLS and PCA generating chemical maps. The results were equivalent to those of Matlab for the same data set, leading to the same conclusions.</div></div><div><h3>Conclusion:</h3><div>This paper demonstrated the application of an open access Python-based AI-guided routine to preprocess and generate chemical maps applying CLS, PCA and PLS models, now available and editable to suit different needs.</div></div>","PeriodicalId":9774,"journal":{"name":"Chemometrics and Intelligent Laboratory Systems","volume":"269 ","pages":"Article 105616"},"PeriodicalIF":3.8,"publicationDate":"2026-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145787025","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A hybrid variable selection with cross-domain constrained ensemble (CCE) for large-scale spectroscopic data 大尺度光谱数据的跨域约束系综混合变量选择
IF 3.8 2区 化学 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-02-15 Epub Date: 2025-12-17 DOI: 10.1016/j.chemolab.2025.105614
Haoran Li , Xin Zhang , Pengchegn Wu , Yang Zhang , Jiyong Shi , Xiaobo Zou
Advances in spectral techniques have generated high-resolution data with thousands of variables. Although an increasing number of variables provides more comprehensive molecular information, it also brings more challenges for existing chemometrics methods, such as the risk of over-fitting and the lack of interpretability. Therefore, we propose a hybrid variable selection approach specifically designed for large-scale datasets. First, considering the continuous characteristics of spectral variables and their importance, interval partial least squares (iPLS) and variable combination population analysis (VCPA) were applied to select relevant variables while reducing the variable space. Second, we consider that truly relevant variables exhibit consistent importance across the sample domain for the same analytical tasks and are therefore more likely to be selected and retained. Consequently, a cross-domain constrained ensemble (CCE) strategy is developed using the least absolute shrinkage and selection operator (LASSO) to further enhance the performance of variable selection. Experiments on wine 1H NMR and pork Raman spectroscopy datasets demonstrate that the proposed method improves prediction performance in terms of RMSEP and RPD. In addition, the proposed CCE method demonstrates superior prediction improvement performance over other final selection methods. These results confirm the effectiveness of both the hybrid variable selection framework and the CCE strategy in handling large-scale spectral datasets.
光谱技术的进步产生了包含数千个变量的高分辨率数据。虽然越来越多的变量提供了更全面的分子信息,但也给现有的化学计量学方法带来了更多的挑战,如过度拟合的风险和缺乏可解释性。因此,我们提出了一种专门为大规模数据集设计的混合变量选择方法。首先,考虑到光谱变量的连续特征及其重要性,采用区间偏最小二乘(iPLS)和变量组合总体分析(VCPA)方法选择相关变量,同时减小变量空间;其次,我们认为真正相关的变量在相同的分析任务的样本域中表现出一致的重要性,因此更有可能被选择和保留。为此,提出了一种基于最小绝对收缩和选择算子(LASSO)的跨域约束集成(CCE)策略,以进一步提高变量选择的性能。在葡萄酒1H NMR和猪肉拉曼光谱数据集上的实验表明,该方法在RMSEP和RPD方面提高了预测性能。此外,所提出的CCE方法比其他最终选择方法具有更好的预测改进性能。这些结果证实了混合变量选择框架和CCE策略在处理大规模光谱数据集方面的有效性。
{"title":"A hybrid variable selection with cross-domain constrained ensemble (CCE) for large-scale spectroscopic data","authors":"Haoran Li ,&nbsp;Xin Zhang ,&nbsp;Pengchegn Wu ,&nbsp;Yang Zhang ,&nbsp;Jiyong Shi ,&nbsp;Xiaobo Zou","doi":"10.1016/j.chemolab.2025.105614","DOIUrl":"10.1016/j.chemolab.2025.105614","url":null,"abstract":"<div><div>Advances in spectral techniques have generated high-resolution data with thousands of variables. Although an increasing number of variables provides more comprehensive molecular information, it also brings more challenges for existing chemometrics methods, such as the risk of over-fitting and the lack of interpretability. Therefore, we propose a hybrid variable selection approach specifically designed for large-scale datasets. First, considering the continuous characteristics of spectral variables and their importance, interval partial least squares (iPLS) and variable combination population analysis (VCPA) were applied to select relevant variables while reducing the variable space. Second, we consider that truly relevant variables exhibit consistent importance across the sample domain for the same analytical tasks and are therefore more likely to be selected and retained. Consequently, a cross-domain constrained ensemble (CCE) strategy is developed using the least absolute shrinkage and selection operator (LASSO) to further enhance the performance of variable selection. Experiments on wine <span><math><msup><mrow></mrow><mrow><mn>1</mn></mrow></msup></math></span>H NMR and pork Raman spectroscopy datasets demonstrate that the proposed method improves prediction performance in terms of RMSEP and RPD. In addition, the proposed CCE method demonstrates superior prediction improvement performance over other final selection methods. These results confirm the effectiveness of both the hybrid variable selection framework and the CCE strategy in handling large-scale spectral datasets.</div></div>","PeriodicalId":9774,"journal":{"name":"Chemometrics and Intelligent Laboratory Systems","volume":"269 ","pages":"Article 105614"},"PeriodicalIF":3.8,"publicationDate":"2026-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145787026","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluation of a mathematical approach to detect fraudulent substitution of Darjeeling tea with other types of tea using the elemental profiles obtained by Energy Dispersive X-ray Fluorescence 利用能量色散x射线荧光所获得的元素谱,评估一种检测大吉岭茶与其他类型茶的欺诈性替代的数学方法
IF 3.8 2区 化学 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-02-15 Epub Date: 2025-12-08 DOI: 10.1016/j.chemolab.2025.105606
Sergej Papoci, Manuel Jiménez, Michele Ghidotti, María Beatriz de la Calle Guntiñas
The willingness of consumers to pay higher prices for high quality specialties, such as Darjeeling tea, goes hand in hand with an increase of fraudulent practices in which Darjeeling tea is substituted totally or partially by cheaper teas. Currently, to evaluate the percentage of substitution that a method can detect, Darjeeling tea is mixed in different proportions with non-Darjeeling teas, and after homogenisation the mixture is analysed. This time-consuming approach implies the use of valuable amounts of sample and, therefore an alternative approach is needed. Here a method is described to calculate the minimum detectable substitution percentage of Darjeeling tea by other teas without needing to prepare real mixtures. The approach is based on the use of virtual mixtures made with the results obtained for commercially available Darjeeling and non-Darjeeling teas. The method used for authentication purposes, made use of the elemental profiles of tea obtained by Energy Dispersive X-ray Fluorescence, combined with chemometrics and modelling by Partial Least Square-Discriminant Analysis. The false positives percentage at different substitution levels, was evaluated and compared with the results obtained with real mixtures of Darjeeling and non-Darjeeling teas. Comparable results were obtained with both approaches. Twenty percent was the lowest substitution level that could be detected with an acceptable sensitivity (94 %) and specificity (86 %). A fast, easy to implement approach has been developed and validated, to calculate the minimum substitution percentage that can be detected by an authentication analytical method, without the need to carry out additional laboratory experiments.
消费者愿意支付更高的价格来购买高质量的特产,如大吉岭茶,与此同时,欺诈行为也在增加,大吉岭茶全部或部分被更便宜的茶所取代。目前,为了评估一种方法可以检测到的替代百分比,将大吉岭茶与非大吉岭茶以不同比例混合,并在均质后对混合物进行分析。这种耗时的方法意味着要使用大量有价值的样本,因此需要另一种方法。这里描述了一种方法来计算大吉岭茶被其他茶的最小可检测替代百分比,而无需制备真正的混合物。该方法基于对市售的大吉岭茶和非大吉岭茶所获得的结果进行的虚拟混合物的使用。该方法利用能量色散x射线荧光获得的茶叶元素谱,结合化学计量学和偏最小二乘判别分析建模,用于鉴定目的。对不同替代水平下的假阳性率进行了评价,并与实际混合的大吉岭茶和非大吉岭茶进行了比较。两种方法获得的结果具有可比性。20%是可接受的灵敏度(94%)和特异性(86%)检测到的最低替代水平。已经开发并验证了一种快速,易于实施的方法,以计算可以通过认证分析方法检测到的最小替代百分比,而无需进行额外的实验室实验。
{"title":"Evaluation of a mathematical approach to detect fraudulent substitution of Darjeeling tea with other types of tea using the elemental profiles obtained by Energy Dispersive X-ray Fluorescence","authors":"Sergej Papoci,&nbsp;Manuel Jiménez,&nbsp;Michele Ghidotti,&nbsp;María Beatriz de la Calle Guntiñas","doi":"10.1016/j.chemolab.2025.105606","DOIUrl":"10.1016/j.chemolab.2025.105606","url":null,"abstract":"<div><div>The willingness of consumers to pay higher prices for high quality specialties, such as Darjeeling tea, goes hand in hand with an increase of fraudulent practices in which Darjeeling tea is substituted totally or partially by cheaper teas. Currently, to evaluate the percentage of substitution that a method can detect, Darjeeling tea is mixed in different proportions with non-Darjeeling teas, and after homogenisation the mixture is analysed. This time-consuming approach implies the use of valuable amounts of sample and, therefore an alternative approach is needed. Here a method is described to calculate the minimum detectable substitution percentage of Darjeeling tea by other teas without needing to prepare real mixtures. The approach is based on the use of virtual mixtures made with the results obtained for commercially available Darjeeling and non-Darjeeling teas. The method used for authentication purposes, made use of the elemental profiles of tea obtained by Energy Dispersive X-ray Fluorescence, combined with chemometrics and modelling by Partial Least Square-Discriminant Analysis. The false positives percentage at different substitution levels, was evaluated and compared with the results obtained with real mixtures of Darjeeling and non-Darjeeling teas. Comparable results were obtained with both approaches. Twenty percent was the lowest substitution level that could be detected with an acceptable sensitivity (94 %) and specificity (86 %). A fast, easy to implement approach has been developed and validated, to calculate the minimum substitution percentage that can be detected by an authentication analytical method, without the need to carry out additional laboratory experiments.</div></div>","PeriodicalId":9774,"journal":{"name":"Chemometrics and Intelligent Laboratory Systems","volume":"269 ","pages":"Article 105606"},"PeriodicalIF":3.8,"publicationDate":"2026-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145733576","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A semantic framework for drug-target affinity prediction using Mamba and graph convolutional networks for multimodal feature fusion 使用曼巴和图卷积网络进行多模态特征融合的药物靶点亲和力预测的语义框架
IF 3.8 2区 化学 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-02-15 Epub Date: 2025-12-01 DOI: 10.1016/j.chemolab.2025.105601
Maoyuan Zhou , Jingjie He , Xingyu Liu , Junmin Huang , Jirui Zhang , Jiaxing Li , Xiaorui Huang , Qianjin Guo
Accurately assessing drug-target interaction (DTA) strength is pivotal in drug development. Enhancing DTA prediction precision necessitates effective protein representation methods. This study introduces MAGTSF-DTA, a multi-modal feature fusion semantic framework leveraging Mamba and graph convolutional networks (GCN). For molecules, atomic-level graph structures are generated from SMILES sequences, and the Mamba module is integrated with GCN to achieve efficient semantic learning. Furthermore, protein-protein interaction (PPI) networks are incorporated, and hierarchical approaches (HMANet & LMANet) are designed to integrate diverse protein features, enriching protein semantic representations. Experiments demonstrate that the proposed model significantly improves prediction accuracy on benchmark datasets compared to state-of-the-art techniques, validating the effectiveness of the Mamba architecture in DTA prediction and showcasing the model's generalization and interpretability.
准确评估药物-靶标相互作用(DTA)强度是药物开发的关键。提高DTA预测精度需要有效的蛋白质表示方法。本研究介绍了MAGTSF-DTA,一种利用曼巴和图卷积网络(GCN)的多模态特征融合语义框架。对于分子,从SMILES序列生成原子级图结构,并将Mamba模块与GCN相结合,实现高效的语义学习。此外,蛋白质-蛋白质相互作用(PPI)网络被纳入,分层方法(HMANet & LMANet)被设计用于整合各种蛋白质特征,丰富蛋白质语义表示。实验表明,与最先进的技术相比,该模型显著提高了基准数据集的预测精度,验证了曼巴架构在DTA预测中的有效性,并展示了模型的泛化和可解释性。
{"title":"A semantic framework for drug-target affinity prediction using Mamba and graph convolutional networks for multimodal feature fusion","authors":"Maoyuan Zhou ,&nbsp;Jingjie He ,&nbsp;Xingyu Liu ,&nbsp;Junmin Huang ,&nbsp;Jirui Zhang ,&nbsp;Jiaxing Li ,&nbsp;Xiaorui Huang ,&nbsp;Qianjin Guo","doi":"10.1016/j.chemolab.2025.105601","DOIUrl":"10.1016/j.chemolab.2025.105601","url":null,"abstract":"<div><div>Accurately assessing drug-target interaction (DTA) strength is pivotal in drug development. Enhancing DTA prediction precision necessitates effective protein representation methods. This study introduces MAGTSF-DTA, a multi-modal feature fusion semantic framework leveraging Mamba and graph convolutional networks (GCN). For molecules, atomic-level graph structures are generated from SMILES sequences, and the Mamba module is integrated with GCN to achieve efficient semantic learning. Furthermore, protein-protein interaction (PPI) networks are incorporated, and hierarchical approaches (HMANet &amp; LMANet) are designed to integrate diverse protein features, enriching protein semantic representations. Experiments demonstrate that the proposed model significantly improves prediction accuracy on benchmark datasets compared to state-of-the-art techniques, validating the effectiveness of the Mamba architecture in DTA prediction and showcasing the model's generalization and interpretability.</div></div>","PeriodicalId":9774,"journal":{"name":"Chemometrics and Intelligent Laboratory Systems","volume":"269 ","pages":"Article 105601"},"PeriodicalIF":3.8,"publicationDate":"2026-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145682820","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multimodal fusion of CT features and density for rapid prediction of raw-coal ash CT特征与密度的多模态融合用于原煤灰分快速预测
IF 3.8 2区 化学 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-02-15 Epub Date: 2026-01-06 DOI: 10.1016/j.chemolab.2026.105628
Shuxian Su , Peixian Geng , Jiashan Yang , Gansu Zhang , Gehang Xue , Liang Dong , Zhaolin Lu
Ash content is a key quality index in coal preparation. To overcome the time-consuming, labor-intensive nature of offline assays, the interference sensitivity of online measurements, and the surface-only information of vision/spectroscopy, this study proposes a multimodal deep-learning framework for fast and accurate raw-coal ash prediction. First, a CT-based data acquisition system was designed to synchronously collect CT image slices of raw-coal particles and particle-density information. Through quantitative analysis of the association between density and ash content, the decisive role of density – as a key indicator of coal properties – in ash prediction is revealed. By further mining the frequency-domain and spatial features of coal CT images, and based on these findings, a multimodal approach is formulated that fuses CT features with density. The model adopts a three-branch architecture: an improved EfficientNet-B0 branch learns spatial cues, an AshFormer branch captures frequency patterns related to mineral distribution and microstructural discontinuities, and a multilayer perceptron encodes density. Cross-modal attention achieves deep fusion and complementarity across modalities, and a KAN-based regression head outputs ash content. On industrial data, the proposed method attains MAPE = 0.0468, RMSE = 0.0573, and R2=97.0%, outperforming single-modality image models (ΔMAPE=0.0096, ΔRMSE=0.0413, ΔR2=+4.48 percentage points). These results demonstrate the advantage of multimodal fusion in improving the accuracy and generalization of coal-quality analysis and provide a new approach for rapid analysis under small-sample conditions.
灰分是选煤过程中重要的质量指标。为了克服离线分析的耗时、劳动密集型、在线测量的干扰敏感性以及视觉/光谱的表面信息,本研究提出了一种多模态深度学习框架,用于快速、准确地预测原煤灰分。首先,设计了基于CT的数据采集系统,同步采集原煤颗粒CT图像切片和颗粒密度信息;通过定量分析密度与灰分之间的关系,揭示了密度作为煤质的关键指标在灰分预测中的决定性作用。通过进一步挖掘煤炭CT图像的频域和空间特征,并基于这些发现,制定了一种融合CT特征和密度的多模态方法。该模型采用三分支架构:改进的EfficientNet-B0分支学习空间线索,AshFormer分支捕获与矿物分布和微观结构不连续相关的频率模式,多层感知器编码密度。跨模态注意实现了模态间的深度融合和互补,基于kan的回归头输出灰分含量。在工业数据上,本文方法的MAPE= 0.0468, RMSE= 0.0573, R2=97.0%,优于单模态图像模型(ΔMAPE=−0.0096,ΔRMSE=−0.0413,ΔR2=+4.48个百分点)。这些结果表明了多模态融合在提高煤质分析精度和泛化方面的优势,为小样本条件下的快速分析提供了一种新的方法。
{"title":"Multimodal fusion of CT features and density for rapid prediction of raw-coal ash","authors":"Shuxian Su ,&nbsp;Peixian Geng ,&nbsp;Jiashan Yang ,&nbsp;Gansu Zhang ,&nbsp;Gehang Xue ,&nbsp;Liang Dong ,&nbsp;Zhaolin Lu","doi":"10.1016/j.chemolab.2026.105628","DOIUrl":"10.1016/j.chemolab.2026.105628","url":null,"abstract":"<div><div>Ash content is a key quality index in coal preparation. To overcome the time-consuming, labor-intensive nature of offline assays, the interference sensitivity of online measurements, and the surface-only information of vision/spectroscopy, this study proposes a multimodal deep-learning framework for fast and accurate raw-coal ash prediction. First, a CT-based data acquisition system was designed to synchronously collect CT image slices of raw-coal particles and particle-density information. Through quantitative analysis of the association between density and ash content, the decisive role of density – as a key indicator of coal properties – in ash prediction is revealed. By further mining the frequency-domain and spatial features of coal CT images, and based on these findings, a multimodal approach is formulated that fuses CT features with density. The model adopts a three-branch architecture: an improved EfficientNet-B0 branch learns spatial cues, an AshFormer branch captures frequency patterns related to mineral distribution and microstructural discontinuities, and a multilayer perceptron encodes density. Cross-modal attention achieves deep fusion and complementarity across modalities, and a KAN-based regression head outputs ash content. On industrial data, the proposed method attains MAPE = 0.0468, RMSE = 0.0573, and <span><math><mrow><msup><mrow><mi>R</mi></mrow><mrow><mn>2</mn></mrow></msup><mo>=</mo><mn>97</mn><mo>.</mo><mn>0</mn><mtext>%</mtext></mrow></math></span>, outperforming single-modality image models (<span><math><mrow><mi>Δ</mi><mi>M</mi><mi>A</mi><mi>P</mi><mi>E</mi><mo>=</mo><mo>−</mo><mn>0</mn><mo>.</mo><mn>0096</mn></mrow></math></span>, <span><math><mrow><mi>Δ</mi><mi>R</mi><mi>M</mi><mi>S</mi><mi>E</mi><mo>=</mo><mo>−</mo><mn>0</mn><mo>.</mo><mn>0413</mn></mrow></math></span>, <span><math><mrow><mi>Δ</mi><msup><mrow><mi>R</mi></mrow><mrow><mn>2</mn></mrow></msup><mo>=</mo><mo>+</mo><mn>4</mn><mo>.</mo><mn>48</mn></mrow></math></span> percentage points). These results demonstrate the advantage of multimodal fusion in improving the accuracy and generalization of coal-quality analysis and provide a new approach for rapid analysis under small-sample conditions.</div></div>","PeriodicalId":9774,"journal":{"name":"Chemometrics and Intelligent Laboratory Systems","volume":"269 ","pages":"Article 105628"},"PeriodicalIF":3.8,"publicationDate":"2026-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145920782","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Bayesian network-based robust framework for determining density, viscosity, and thermal conductivity of near-critical-state CO2 applied in CCUS 一个基于贝叶斯网络的鲁棒框架,用于确定CCUS中近临界状态CO2的密度、粘度和导热性
IF 3.8 2区 化学 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-02-15 Epub Date: 2026-01-09 DOI: 10.1016/j.chemolab.2026.105636
Bingtao Zhao , Lu Ding , Yaxin Su
Accurate modeling of the thermophysical properties of CO2 in the region around the critical point (RACP) or at the near-critical state is crucial for performance assessment and process design in CO2 capture, utilization, and storage. Despite its importance, significant challenges persist because of sharp fluctuations in these properties induced by critical effects, which directly influence flow behavior, interfacial tension, and process performance. To address this issue, we develop a Bayesian regularized neural network (BRNN)-based robust framework to predict the density, viscosity, and thermal conductivity of CO2 within the RACP. Initial steps involve constructing a backpropagation neural network facilitated by the Kennard-Stone algorithm for data partitioning. A comprehensive analysis is conducted to evaluate the impacts of training algorithms, the number of neurons in the hidden layer, and optimization methods on network performance. By refining the training procedure and optimizing weights and thresholds using the genetic algorithm, we ultimately propose a more accurate model named GA-BRNN. This model demonstrates superior generalization capabilities when compared to traditional correlations and other machine learning models for the prediction of CO2 properties in the RACP, yielding the mean squared error of 2.0484 × 10−4 (R2 = 0.9635) for density, 1.8680 (R2 = 0.9743) for viscosity, and 5.2196 (R2 = 0.9900) for thermal conductivity. The findings may provide a positive reference for modeling the thermophysical properties of near-critical-state CO2 applied in the processes related to carbon capture, utilization, and storage.
准确模拟二氧化碳在临界点(RACP)附近或近临界状态下的热物理性质,对二氧化碳捕集、利用和封存过程的性能评估和工艺设计至关重要。尽管它很重要,但由于临界效应导致这些特性急剧波动,直接影响流动行为、界面张力和工艺性能,因此仍然存在重大挑战。为了解决这个问题,我们开发了一个基于贝叶斯正则化神经网络(BRNN)的鲁棒框架来预测RACP内CO2的密度、粘度和导热性。最初的步骤包括构建一个反向传播神经网络,由Kennard-Stone算法促进数据分区。综合分析了训练算法、隐层神经元数量、优化方法对网络性能的影响。通过改进训练过程并使用遗传算法优化权值和阈值,我们最终提出了一个更精确的模型,命名为GA-BRNN。与传统的相关性和其他机器学习模型相比,该模型在预测RACP中CO2性质方面表现出了优越的泛化能力,密度的均方误差为2.0484 × 10−4 (R2 = 0.9635),粘度的均方误差为1.8680 (R2 = 0.9743),导热系数的均方误差为5.2196 (R2 = 0.9900)。研究结果可为碳捕获、利用和封存过程中近临界态CO2的热物理性质建模提供积极参考。
{"title":"A Bayesian network-based robust framework for determining density, viscosity, and thermal conductivity of near-critical-state CO2 applied in CCUS","authors":"Bingtao Zhao ,&nbsp;Lu Ding ,&nbsp;Yaxin Su","doi":"10.1016/j.chemolab.2026.105636","DOIUrl":"10.1016/j.chemolab.2026.105636","url":null,"abstract":"<div><div>Accurate modeling of the thermophysical properties of CO<sub>2</sub> in the region around the critical point (RACP) or at the near-critical state is crucial for performance assessment and process design in CO<sub>2</sub> capture, utilization, and storage. Despite its importance, significant challenges persist because of sharp fluctuations in these properties induced by critical effects, which directly influence flow behavior, interfacial tension, and process performance. To address this issue, we develop a Bayesian regularized neural network (BRNN)-based robust framework to predict the density, viscosity, and thermal conductivity of CO<sub>2</sub> within the RACP. Initial steps involve constructing a backpropagation neural network facilitated by the Kennard-Stone algorithm for data partitioning. A comprehensive analysis is conducted to evaluate the impacts of training algorithms, the number of neurons in the hidden layer, and optimization methods on network performance. By refining the training procedure and optimizing weights and thresholds using the genetic algorithm, we ultimately propose a more accurate model named GA-BRNN. This model demonstrates superior generalization capabilities when compared to traditional correlations and other machine learning models for the prediction of CO<sub>2</sub> properties in the RACP, yielding the mean squared error of 2.0484 × 10<sup>−4</sup> (R<sup>2</sup> = 0.9635) for density, 1.8680 (R<sup>2</sup> = 0.9743) for viscosity, and 5.2196 (R<sup>2</sup> = 0.9900) for thermal conductivity. The findings may provide a positive reference for modeling the thermophysical properties of near-critical-state CO<sub>2</sub> applied in the processes related to carbon capture, utilization, and storage.</div></div>","PeriodicalId":9774,"journal":{"name":"Chemometrics and Intelligent Laboratory Systems","volume":"269 ","pages":"Article 105636"},"PeriodicalIF":3.8,"publicationDate":"2026-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145972955","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep learning-driven components analysis of Raman spectral mixtures: An integrated masked autoencoder with convolutional neural network approach 深度学习驱动的拉曼光谱混合成分分析:基于卷积神经网络的集成掩模自编码器
IF 3.8 2区 化学 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-02-15 Epub Date: 2025-12-30 DOI: 10.1016/j.chemolab.2025.105627
Zichuan Bu , Jihong Liu , Jiageng Zhang , Chi Liu , Yihua Liu , Kaili Ren , Xuewen Yan , Wei Gao , Jun Dong
Raman spectroscopy is a pivotal tool in analytical and physical chemistry, yet its application in complex systems is hindered by spectral superposition and analysis challenges. The development of deep learning technology has provided new ideas for the component analysis of complex mixtures. This study proposes a mixture component identification method named MCI, which is based on the masked autoencoder and convolutional neural network. The aim is to effectively solve the problems of qualitative recognition and quantitative analysis in the Raman spectra of mixtures. The MCI method adopts a multi-stage framework: First, the Voigt function is used to accurately extract the characteristic peaks of the mixture. Second, the MAE model is employed to reconstruct the corresponding pure-substance spectra. Then, the CNN model is combined to conduct qualitative and quantitative analyses on the reconstructed spectra. Finally, the spectrum of the remaining components is obtained by subtracting the reconstructed spectrum from the mixture spectrum. By iterating the above process, the step-by-step unmixing of complex mixtures is achieved. In the generated mixed sample test data, the MCI outperforms the other three comparative models in terms of complete recognition accuracy in qualitative analysis and the evaluation indicators of each substance, while maintaining a lower average concentration error in quantitative analysis. Moreover, for complex mixtures containing interfering substances, the MCI shows strong anti-interference ability and maintains a high Identification accuracy. In the actual measurement of mixed sample Raman spectral identification detection, The MCI model achieved an average accuracy and F1_Score of 97 % in all test samples, further verifying its reliability and practicality in detecting the main components of real and complex mixtures. In summary, this study provides a new technical method for Raman spectral analysis of complex mixtures, which holds certain theoretical significance and practical value.
拉曼光谱是分析化学和物理化学中的关键工具,但其在复杂系统中的应用受到光谱叠加和分析挑战的阻碍。深度学习技术的发展为复杂混合物的成分分析提供了新的思路。本文提出了一种基于掩模自编码器和卷积神经网络的混合分量识别方法MCI。目的是有效地解决混合物拉曼光谱的定性识别和定量分析问题。MCI方法采用多阶段框架:首先,利用Voigt函数准确提取混合物的特征峰;其次,利用MAE模型重构相应的纯物质谱;然后结合CNN模型对重构光谱进行定性和定量分析。最后,在混合谱中减去重构谱,得到剩余分量的谱。通过重复上述过程,可以实现复杂混合物的逐步分解。在生成的混合样品测试数据中,MCI在定性分析和各物质评价指标上的完全识别准确率优于其他三种比较模型,同时在定量分析中保持较低的平均浓度误差。此外,对于含有干扰物质的复杂混合物,MCI显示出较强的抗干扰能力,并保持较高的识别精度。在混合样品拉曼光谱识别检测的实际测量中,MCI模型在所有测试样品中的平均精度和F1_Score均达到97%,进一步验证了其在检测真实和复杂混合物主要成分方面的可靠性和实用性。综上所述,本研究为复杂混合物的拉曼光谱分析提供了一种新的技术方法,具有一定的理论意义和实用价值。
{"title":"Deep learning-driven components analysis of Raman spectral mixtures: An integrated masked autoencoder with convolutional neural network approach","authors":"Zichuan Bu ,&nbsp;Jihong Liu ,&nbsp;Jiageng Zhang ,&nbsp;Chi Liu ,&nbsp;Yihua Liu ,&nbsp;Kaili Ren ,&nbsp;Xuewen Yan ,&nbsp;Wei Gao ,&nbsp;Jun Dong","doi":"10.1016/j.chemolab.2025.105627","DOIUrl":"10.1016/j.chemolab.2025.105627","url":null,"abstract":"<div><div>Raman spectroscopy is a pivotal tool in analytical and physical chemistry, yet its application in complex systems is hindered by spectral superposition and analysis challenges. The development of deep learning technology has provided new ideas for the component analysis of complex mixtures. This study proposes a mixture component identification method named MCI, which is based on the masked autoencoder and convolutional neural network. The aim is to effectively solve the problems of qualitative recognition and quantitative analysis in the Raman spectra of mixtures. The MCI method adopts a multi-stage framework: First, the Voigt function is used to accurately extract the characteristic peaks of the mixture. Second, the MAE model is employed to reconstruct the corresponding pure-substance spectra. Then, the CNN model is combined to conduct qualitative and quantitative analyses on the reconstructed spectra. Finally, the spectrum of the remaining components is obtained by subtracting the reconstructed spectrum from the mixture spectrum. By iterating the above process, the step-by-step unmixing of complex mixtures is achieved. In the generated mixed sample test data, the MCI outperforms the other three comparative models in terms of complete recognition accuracy in qualitative analysis and the evaluation indicators of each substance, while maintaining a lower average concentration error in quantitative analysis. Moreover, for complex mixtures containing interfering substances, the MCI shows strong anti-interference ability and maintains a high Identification accuracy. In the actual measurement of mixed sample Raman spectral identification detection, The MCI model achieved an average accuracy and <em>F1_Score</em> of 97 % in all test samples, further verifying its reliability and practicality in detecting the main components of real and complex mixtures. In summary, this study provides a new technical method for Raman spectral analysis of complex mixtures, which holds certain theoretical significance and practical value.</div></div>","PeriodicalId":9774,"journal":{"name":"Chemometrics and Intelligent Laboratory Systems","volume":"269 ","pages":"Article 105627"},"PeriodicalIF":3.8,"publicationDate":"2026-02-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145880468","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Boosting pneumonia diagnosis with machine learning and spectroscopic data fusion techniques 利用机器学习和光谱数据融合技术促进肺炎诊断
IF 3.8 2区 化学 Q2 AUTOMATION & CONTROL SYSTEMS Pub Date : 2026-01-15 Epub Date: 2025-11-19 DOI: 10.1016/j.chemolab.2025.105586
Ander Bastida Urkiza , Eneko Lopez , Renata Matekalo , Andreas Seifert
Photonic techniques combined with chemometrics offer promising opportunities for next-generation medical in vitro diagnostics. In this work, we evaluate the ability of vibrational spectroscopy to distinguish viral respiratory infections that have progressed to pneumonia. Pneumonia remains a major global health burden and is currently the eighth leading cause of death worldwide. Reliable and differentiated diagnoses are still challenging, as existing methods are time-consuming and require specialized laboratories and expertise.
We present a rapid, machine-learning–based in vitro approach for classifying influenza A and seasonal flu and for discriminating between these pathogenic strains. To enhance diagnostic performance, we employ data fusion of complementary Raman and Fourier-transform infrared absorption spectra acquired from microliter-scale droplets of human blood plasma.
By integrating spectral information from both modalities, the models capture a broader range of physiological changes and more comprehensively reflect the biochemical profile of the samples, leading to more robust classification. Using generalized linear models, we achieve accuracies of up to 95% in distinguishing healthy controls from influenza A– and seasonal flu–infected samples. The results further highlight specific scenarios in which data fusion yields measurable improvements in predictive power.
光子技术与化学计量学相结合为下一代医学体外诊断提供了有希望的机会。在这项工作中,我们评估振动光谱的能力,以区分病毒性呼吸道感染已进展为肺炎。肺炎仍然是全球主要的卫生负担,目前是全世界第八大死亡原因。由于现有的方法耗时且需要专门的实验室和专业知识,可靠和差异化的诊断仍然具有挑战性。我们提出了一种快速的、基于机器学习的体外方法,用于对甲型流感和季节性流感进行分类,并区分这些致病菌株。为了提高诊断性能,我们采用了从人血浆微升液滴中获得的互补拉曼变换和傅里叶变换红外吸收光谱的数据融合。通过整合两种模式的光谱信息,该模型捕获了更广泛的生理变化,更全面地反映了样品的生化特征,从而实现了更稳健的分类。使用广义线性模型,我们在区分健康对照与甲型流感和季节性流感感染样本方面实现了高达95%的准确性。结果进一步强调了数据融合在预测能力方面产生可衡量改进的特定场景。
{"title":"Boosting pneumonia diagnosis with machine learning and spectroscopic data fusion techniques","authors":"Ander Bastida Urkiza ,&nbsp;Eneko Lopez ,&nbsp;Renata Matekalo ,&nbsp;Andreas Seifert","doi":"10.1016/j.chemolab.2025.105586","DOIUrl":"10.1016/j.chemolab.2025.105586","url":null,"abstract":"<div><div>Photonic techniques combined with chemometrics offer promising opportunities for next-generation medical in vitro diagnostics. In this work, we evaluate the ability of vibrational spectroscopy to distinguish viral respiratory infections that have progressed to pneumonia. Pneumonia remains a major global health burden and is currently the eighth leading cause of death worldwide. Reliable and differentiated diagnoses are still challenging, as existing methods are time-consuming and require specialized laboratories and expertise.</div><div>We present a rapid, machine-learning–based in vitro approach for classifying influenza A and seasonal flu and for discriminating between these pathogenic strains. To enhance diagnostic performance, we employ data fusion of complementary Raman and Fourier-transform infrared absorption spectra acquired from microliter-scale droplets of human blood plasma.</div><div>By integrating spectral information from both modalities, the models capture a broader range of physiological changes and more comprehensively reflect the biochemical profile of the samples, leading to more robust classification. Using generalized linear models, we achieve accuracies of up to 95% in distinguishing healthy controls from influenza A– and seasonal flu–infected samples. The results further highlight specific scenarios in which data fusion yields measurable improvements in predictive power.</div></div>","PeriodicalId":9774,"journal":{"name":"Chemometrics and Intelligent Laboratory Systems","volume":"268 ","pages":"Article 105586"},"PeriodicalIF":3.8,"publicationDate":"2026-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145569651","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Chemometrics and Intelligent Laboratory Systems
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1