Chemometrics and Intelligent Laboratory Systems最新文献_第9页

A reliable deep neural network using the radial basis for the spreading virus in computers with kill signals 一种基于径向基的可靠深度神经网络，用于具有杀伤信号的计算机中病毒的传播

IF 3.8 2区化学 Q2 AUTOMATION & CONTROL SYSTEMS

Chemometrics and Intelligent Laboratory Systems

Pub Date : 2025-11-01 DOI: 10.1016/j.chemolab.2025.105560

Zulqurnain Sabir , Bahaa Basbous , Basma Souayeh , Muhammad Umar , Soheil Salahshour

Purpose

The purpose of this work is to provide a reliable neural network process for the spreading virus in computers with kill signals. The mathematical model shows susceptible, exposed, infected individuals to form the virus inactive, and kill signals classes.

Method

A structure of deep neural network (DNN) is designed by using two different hidden layers having radial basis activation functions in both layers, optimization through the Bayesian regularization, twenty and thirty numbers of neurons in primary and secondary hidden layers for the spreading virus in computers with kill signals. The stochastic DNN framework is presented to solve the spreading virus in computers with kill signals by selecting the data for training as 70 %, and 15 %, 15 % for both validation and testing.

Results

The accuracy of the scheme is observed through the overlapping of the solutions along with negligible absolute error for solving the model. The consistency of the solver is observed through the process of error histogram, regression, and state transition.

Novelty

The proposed DNN structure having radial basis activation function has never been applied for the spreading virus in computers with kill signals.

目的为具有杀伤信号的计算机中病毒的传播提供一种可靠的神经网络过程。该数学模型显示了易感、暴露、感染个体形成的病毒灭活和杀伤信号等级。方法采用两层具有径向基激活函数的不同隐层设计深度神经网络（DNN）结构，通过贝叶斯正则化优化，在主隐层和次隐层分别设置20和30个神经元，用于在具有杀伤信号的计算机中传播病毒。提出了随机深度神经网络框架，通过选择训练数据为70%，验证数据为15%，测试数据为15%，来解决具有杀死信号的计算机中病毒的传播问题。结果通过解的重叠观察到该方案的精度，求解模型的绝对误差可以忽略不计。通过误差直方图、回归和状态转移的过程来观察求解器的一致性。新颖提出的具有径向基激活函数的深度神经网络结构尚未应用于具有杀伤信号的计算机中病毒的传播。

{"title":"A reliable deep neural network using the radial basis for the spreading virus in computers with kill signals","authors":"Zulqurnain Sabir , Bahaa Basbous , Basma Souayeh , Muhammad Umar , Soheil Salahshour","doi":"10.1016/j.chemolab.2025.105560","DOIUrl":"10.1016/j.chemolab.2025.105560","url":null,"abstract":"<div><h3>Purpose</h3><div>The purpose of this work is to provide a reliable neural network process for the spreading virus in computers with kill signals. The mathematical model shows susceptible, exposed, infected individuals to form the virus inactive, and kill signals classes.</div></div><div><h3>Method</h3><div>A structure of deep neural network (DNN) is designed by using two different hidden layers having radial basis activation functions in both layers, optimization through the Bayesian regularization, twenty and thirty numbers of neurons in primary and secondary hidden layers for the spreading virus in computers with kill signals. The stochastic DNN framework is presented to solve the spreading virus in computers with kill signals by selecting the data for training as 70 %, and 15 %, 15 % for both validation and testing.</div></div><div><h3>Results</h3><div>The accuracy of the scheme is observed through the overlapping of the solutions along with negligible absolute error for solving the model. The consistency of the solver is observed through the process of error histogram, regression, and state transition.</div></div><div><h3>Novelty</h3><div>The proposed DNN structure having radial basis activation function has never been applied for the spreading virus in computers with kill signals.</div></div>","PeriodicalId":9774,"journal":{"name":"Chemometrics and Intelligent Laboratory Systems","volume":"268 ","pages":"Article 105560"},"PeriodicalIF":3.8,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145464512","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Instrumental prediction of in vivo sensory properties of emollients to allow the development of new biobased ingredients 用仪器预测润肤剂的体内感官特性，以开发新的生物基成分

IF 3.8 2区化学 Q2 AUTOMATION & CONTROL SYSTEMS

Chemometrics and Intelligent Laboratory Systems

Pub Date : 2025-10-30 DOI: 10.1016/j.chemolab.2025.105559

Floriane Rischard , Amandine Flourat , Ecaterina Gore , Géraldine Savary

An important step in the development of novel cosmetic ingredients is the setting up of sensory analyses to assess their tactile properties. A recent work allowed the obtention of 12 novel biobased emollients with interesting physico-chemical properties. Four of the most promising emollients were selected in the present study and their safety was tested to ensure they are suitable for use on human skin. Their tactile properties, along with ten commercial emollients, were assessed by 16 expert assessors: circular spreading behavior, thickness of residual film and slippery after feel. In addition to characterizing a wide range of emollients, the results made possible the establishment of three predictive models using Partial Least Squares regressions. These original models correspond to various sensory attributes of the emollients, both during and after their application on the skin. All predictive models were then validated by leave-one-out cross validations. Only three instrumental parameters (viscosity, friction, stickiness) were necessary to build the models and predict the tactile properties. This approach was then applied to the eight other biobased emollients that were not initially used to establish the predictions in order to validate the models. Results demonstrate the significant value of such models for developing new ingredients. Ultimately, these predictive models could override the time-consuming and costly process of safety testing and sensory analyses in the research in development of future newly produced emollients for dermocosmetic applications.

开发新型化妆品成分的一个重要步骤是建立感官分析来评估其触觉特性。最近的一项工作使人们注意到12种具有有趣的物理化学性质的新型生物基润肤剂。在本研究中选择了四种最有前途的润肤剂，并对其安全性进行了测试，以确保它们适合用于人体皮肤。他们的触觉特性，连同十种商业润肤剂，由16位专家评估：圆形扩散行为，残余膜的厚度和光滑后的感觉。除了表征范围广泛的润肤剂外，结果还可以使用偏最小二乘回归建立三种预测模型。这些原始模型对应于润肤剂的各种感官属性，无论是在他们的应用在皮肤上。然后通过留一交叉验证验证所有预测模型。只需要三个仪器参数（粘度，摩擦力，粘性）就可以建立模型并预测触觉特性。然后将这种方法应用于其他八种最初未用于建立预测的生物基润肤剂，以验证模型。结果表明，这些模型对开发新成分具有重要的价值。最终，这些预测模型可以在未来新生产的皮肤美容应用润肤剂的研究开发中超越耗时和昂贵的安全测试和感官分析过程。

{"title":"Instrumental prediction of in vivo sensory properties of emollients to allow the development of new biobased ingredients","authors":"Floriane Rischard , Amandine Flourat , Ecaterina Gore , Géraldine Savary","doi":"10.1016/j.chemolab.2025.105559","DOIUrl":"10.1016/j.chemolab.2025.105559","url":null,"abstract":"<div><div>An important step in the development of novel cosmetic ingredients is the setting up of sensory analyses to assess their tactile properties. A recent work allowed the obtention of 12 novel biobased emollients with interesting physico-chemical properties. Four of the most promising emollients were selected in the present study and their safety was tested to ensure they are suitable for use on human skin. Their tactile properties, along with ten commercial emollients, were assessed by 16 expert assessors: circular spreading behavior, thickness of residual film and slippery after feel. In addition to characterizing a wide range of emollients, the results made possible the establishment of three predictive models using Partial Least Squares regressions. These original models correspond to various sensory attributes of the emollients, both during and after their application on the skin. All predictive models were then validated by leave-one-out cross validations. Only three instrumental parameters (viscosity, friction, stickiness) were necessary to build the models and predict the tactile properties. This approach was then applied to the eight other biobased emollients that were not initially used to establish the predictions in order to validate the models. Results demonstrate the significant value of such models for developing new ingredients. Ultimately, these predictive models could override the time-consuming and costly process of safety testing and sensory analyses in the research in development of future newly produced emollients for dermocosmetic applications.</div></div>","PeriodicalId":9774,"journal":{"name":"Chemometrics and Intelligent Laboratory Systems","volume":"268 ","pages":"Article 105559"},"PeriodicalIF":3.8,"publicationDate":"2025-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145464520","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Enhancing IoT anomaly detection with the Dwarf Mongoose-Chaos optimized deep belief framework 利用矮猫鼬-混沌优化的深度信念框架增强物联网异常检测

IF 3.8 2区化学 Q2 AUTOMATION & CONTROL SYSTEMS

Chemometrics and Intelligent Laboratory Systems

Pub Date : 2025-10-30 DOI: 10.1016/j.chemolab.2025.105558

Veena Potdar , Mohan Govindasa Kabadi

Anomaly detection is essential for identifying deviations from normal patterns in data, enabling the detection of security breaches or system faults, particularly in Internet of Things (IoT) networks. However, traditional machine learning (ML) and deep learning (DL) methods often struggle with the dynamic and complex nature of IoT environments, where attack patterns are non-linear, continuously evolving, and context-dependent. These models typically require large labeled datasets and retraining to adapt to new threats, which limits their responsiveness and scalability. Additionally, their high computational demands make real-time deployment on resource-constrained IoT devices challenging. Furthermore, many ML/DL models exhibit poor generalization, performing well in controlled scenarios but failing to maintain accuracy across diverse, real-world IoT settings with varying devices, protocols, and data distributions. To address these issues, this work proposes the Dwarf Mongoose-Chaos Optimized Deep Belief (DCODB) Framework, which combines advanced preprocessing, feature selection (FS), and classification techniques. Initial preprocessing involves Min-Max Normalization and One-Hot Encoding to scale numerical features and transform categorical data for effective model input. FS is optimized by the novel Dwarf Mongoose-Chaos Fusion Optimization (DMCFO), which is a swarm intelligence algorithm that leverages chaotic maps to improve the effectiveness of the Dwarf Mongoose Optimization Algorithm (DMO), reducing dimensionality and improving classification accuracy. The refined features are then classified using a Deep Belief Network (DBN), which processes hierarchical feature representations to differentiate between normal and anomalous behaviors in the NSL-KDD dataset. The proposed framework has been thoroughly assessed using diverse metrics, demonstrating its effectiveness in anomaly detection by achieving above 99 % Balanced Accuracy, along with exceptional Precision, Recall, F1 Score, Specificity, and the AUC-ROC curve. These high-performance metrics affirm the model's capability to deliver reliable and scalable anomaly detection in IoT environments, strengthening overall security.

异常检测对于识别数据中正常模式的偏差至关重要，能够检测安全漏洞或系统故障，特别是在物联网（IoT）网络中。然而，传统的机器学习（ML）和深度学习（DL）方法经常与物联网环境的动态性和复杂性作斗争，其中攻击模式是非线性的，不断发展的，并且依赖于上下文。这些模型通常需要大型标记数据集和重新训练以适应新的威胁，这限制了它们的响应能力和可扩展性。此外，它们的高计算需求使得在资源受限的物联网设备上进行实时部署具有挑战性。此外，许多ML/DL模型表现出较差的泛化，在受控场景中表现良好，但无法在具有不同设备，协议和数据分布的各种现实世界物联网设置中保持准确性。为了解决这些问题，本工作提出了矮猫鼬-混沌优化深度信念（DCODB）框架，该框架结合了先进的预处理、特征选择（FS）和分类技术。初始预处理包括Min-Max归一化和One-Hot编码，以缩放数值特征并将分类数据转换为有效的模型输入。该算法是一种基于混沌映射的群智能算法，利用混沌映射来提高小猫鼬优化算法（DMO）的有效性，降低了分类维数，提高了分类精度。然后使用深度信念网络（DBN）对精炼的特征进行分类，DBN处理分层特征表示以区分NSL-KDD数据集中的正常和异常行为。所提出的框架已经使用不同的指标进行了彻底的评估，证明了其在异常检测方面的有效性，达到了99%以上的平衡准确率，以及出色的精度、召回率、F1评分、特异性和AUC-ROC曲线。这些高性能指标肯定了该模型在物联网环境中提供可靠和可扩展异常检测的能力，从而增强了整体安全性。

{"title":"Enhancing IoT anomaly detection with the Dwarf Mongoose-Chaos optimized deep belief framework","authors":"Veena Potdar , Mohan Govindasa Kabadi","doi":"10.1016/j.chemolab.2025.105558","DOIUrl":"10.1016/j.chemolab.2025.105558","url":null,"abstract":"<div><div>Anomaly detection is essential for identifying deviations from normal patterns in data, enabling the detection of security breaches or system faults, particularly in Internet of Things (IoT) networks. However, traditional machine learning (ML) and deep learning (DL) methods often struggle with the dynamic and complex nature of IoT environments, where attack patterns are non-linear, continuously evolving, and context-dependent. These models typically require large labeled datasets and retraining to adapt to new threats, which limits their responsiveness and scalability. Additionally, their high computational demands make real-time deployment on resource-constrained IoT devices challenging. Furthermore, many ML/DL models exhibit poor generalization, performing well in controlled scenarios but failing to maintain accuracy across diverse, real-world IoT settings with varying devices, protocols, and data distributions. To address these issues, this work proposes the Dwarf Mongoose-Chaos Optimized Deep Belief (DCODB) Framework, which combines advanced preprocessing, feature selection (FS), and classification techniques. Initial preprocessing involves Min-Max Normalization and One-Hot Encoding to scale numerical features and transform categorical data for effective model input. FS is optimized by the novel Dwarf Mongoose-Chaos Fusion Optimization (DMCFO), which is a swarm intelligence algorithm that leverages chaotic maps to improve the effectiveness of the Dwarf Mongoose Optimization Algorithm (DMO), reducing dimensionality and improving classification accuracy. The refined features are then classified using a Deep Belief Network (DBN), which processes hierarchical feature representations to differentiate between normal and anomalous behaviors in the NSL-KDD dataset. The proposed framework has been thoroughly assessed using diverse metrics, demonstrating its effectiveness in anomaly detection by achieving above 99 % Balanced Accuracy, along with exceptional Precision, Recall, F1 Score, Specificity, and the AUC-ROC curve. These high-performance metrics affirm the model's capability to deliver reliable and scalable anomaly detection in IoT environments, strengthening overall security.</div></div>","PeriodicalId":9774,"journal":{"name":"Chemometrics and Intelligent Laboratory Systems","volume":"268 ","pages":"Article 105558"},"PeriodicalIF":3.8,"publicationDate":"2025-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145464518","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Solving the puzzle: Simulation of multivariate data with control over the structure of columns and rows using the two-sided orthogonal procrustes problem 解决难题：用双侧正交procrustes问题控制多变量数据的列和行结构的模拟

IF 3.8 2区化学 Q2 AUTOMATION & CONTROL SYSTEMS

Chemometrics and Intelligent Laboratory Systems

Pub Date : 2025-10-30 DOI: 10.1016/j.chemolab.2025.105556

Francisco Arteaga , José Camacho , Alberto Ferrer

Researchers interested in developing new multivariate statistical methods often need to be able to generate multivariate datasets with specific characteristics to test the effectiveness of their data analysis algorithms under specific conditions.

In this paper, we present a family of methods for generating multivariate centred datasets by simultaneously controlling features of the cross-product matrices

X^{⊤} X

and

X X^{⊤}

. This provides an interesting trade-off to control for the variance structure in the data, important for the family of algorithms that operate on the data matrix, like, e.g., Principal Component Analysis, and control for the distances among objects, important for algorithms that operate on the distance matrix, like Multidimensional Scaling. The proposed methods form a general framework that can be understood as a jigsaw puzzle, joining pieces obtained from the spectral decomposition of a target covariance matrix and the singular value decomposition of a target data matrix. These methods have in common that they are derived from a two-sided orthogonal Procrustes problem.

对开发新的多元统计方法感兴趣的研究人员通常需要能够生成具有特定特征的多元数据集，以测试其数据分析算法在特定条件下的有效性。在本文中，我们提出了一组通过同时控制交叉积矩阵X， X和XX， X的特征来生成多元中心数据集的方法。这为控制数据中的方差结构提供了一个有趣的权衡，这对于在数据矩阵上操作的算法家族很重要，例如，主成分分析，以及控制对象之间的距离，对于在距离矩阵上操作的算法很重要，例如多维缩放。所提出的方法形成了一个总体框架，可以理解为拼图游戏，将从目标协方差矩阵的谱分解和目标数据矩阵的奇异值分解中获得的碎片连接起来。这些方法的共同之处在于它们都是由一个双边正交的Procrustes问题导出的。

{"title":"Solving the puzzle: Simulation of multivariate data with control over the structure of columns and rows using the two-sided orthogonal procrustes problem","authors":"Francisco Arteaga , José Camacho , Alberto Ferrer","doi":"10.1016/j.chemolab.2025.105556","DOIUrl":"10.1016/j.chemolab.2025.105556","url":null,"abstract":"<div><div>Researchers interested in developing new multivariate statistical methods often need to be able to generate multivariate datasets with specific characteristics to test the effectiveness of their data analysis algorithms under specific conditions.</div><div>In this paper, we present a family of methods for generating multivariate centred datasets by simultaneously controlling features of the cross-product matrices <span><math><mrow><msup><mi>X</mi><mo>⊤</mo></msup><mi>X</mi></mrow></math></span> and <span><math><mrow><mi>X</mi><msup><mi>X</mi><mo>⊤</mo></msup></mrow></math></span>. This provides an interesting trade-off to control for the variance structure in the data, important for the family of algorithms that operate on the data matrix, like, e.g., Principal Component Analysis, and control for the distances among objects, important for algorithms that operate on the distance matrix, like Multidimensional Scaling. The proposed methods form a general framework that can be understood as a jigsaw puzzle, joining pieces obtained from the spectral decomposition of a target covariance matrix and the singular value decomposition of a target data matrix. These methods have in common that they are derived from a two-sided orthogonal Procrustes problem.</div></div>","PeriodicalId":9774,"journal":{"name":"Chemometrics and Intelligent Laboratory Systems","volume":"268 ","pages":"Article 105556"},"PeriodicalIF":3.8,"publicationDate":"2025-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145464513","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Time-resolved fluorescence spectroscopy and improved parallel factor framework-clustering analysis for oil spill type identification and concentration quantification 时间分辨荧光光谱和改进的平行因子框架聚类分析用于溢油类型识别和浓度定量

IF 3.8 2区化学 Q2 AUTOMATION & CONTROL SYSTEMS

Chemometrics and Intelligent Laboratory Systems

Pub Date : 2025-10-30 DOI: 10.1016/j.chemolab.2025.105564

Peiliang Wu , Zhiwei Wang , Yuhan Zhao , Deming Kong

Oil spills hidden below the sea surface and in a suspended state are known as submerged oil. Determining the source of an oil spill and evaluating the amount of oil spilled can provide a basis for the effective development of oil spill emergency response strategies and policies. Because of this, this paper proposes an oil spill species identification and concentration quantification analytical method based on the combination of time-resolved fluorescence spectroscopy (TRFS) and improved parallel factor framework-clustering analysis (IPFFCA). The IPFFCA model first decomposes the oil TRFS data to extract the loading matrix and reconstructs the landscape maps corresponding to each component based on the loading matrix. Subsequently, the non-negative least squares algorithm was employed to fit the component landscape maps to the unfolded actual spectra, thereby estimating the score matrix of the samples. Building upon this, the score matrix was used as input to develop oil species identification and concentration quantification models via particle swarm optimization support vector machine (PSO-SVM) and extreme gradient boosting (XGBoost), respectively. To verify the effectiveness of the proposed analytical method, six typical submerged oil samples were experimentally prepared, and their TRFS data were collected and analyzed. The experimental results show that the analytical method proposed in this paper achieves 92 % accuracy in the oil species identification task, the average coefficient of determination of the concentration prediction in the validation set of the six types of samples reaches 0.95, and the root mean square error is 0.08, indicating strong predictive performance.

隐藏在海面以下并处于悬浮状态的石油泄漏被称为水下石油。确定溢油源和评估溢油量可以为有效制定溢油应急战略和政策提供基础。为此，本文提出了一种基于时间分辨荧光光谱（TRFS）与改进的平行因子框架聚类分析（IPFFCA）相结合的溢油物种识别与浓度定量分析方法。IPFFCA模型首先对油品TRFS数据进行分解，提取加载矩阵，并根据加载矩阵重构各分量对应的景观图。随后，采用非负最小二乘算法将组分景观图拟合到展开的实际光谱中，从而估计样本的得分矩阵。在此基础上，以分数矩阵为输入，分别利用粒子群优化支持向量机（PSO-SVM）和极限梯度提升（XGBoost）技术建立油种识别和浓度量化模型。为了验证该分析方法的有效性，实验制备了6个典型的水下油样，并对其TRFS数据进行了采集和分析。实验结果表明，本文提出的分析方法在油类识别任务中准确率达到92%，6种样品验证集中浓度预测的平均决定系数达到0.95，均方根误差为0.08，具有较强的预测性能。

{"title":"Time-resolved fluorescence spectroscopy and improved parallel factor framework-clustering analysis for oil spill type identification and concentration quantification","authors":"Peiliang Wu , Zhiwei Wang , Yuhan Zhao , Deming Kong","doi":"10.1016/j.chemolab.2025.105564","DOIUrl":"10.1016/j.chemolab.2025.105564","url":null,"abstract":"<div><div>Oil spills hidden below the sea surface and in a suspended state are known as submerged oil. Determining the source of an oil spill and evaluating the amount of oil spilled can provide a basis for the effective development of oil spill emergency response strategies and policies. Because of this, this paper proposes an oil spill species identification and concentration quantification analytical method based on the combination of time-resolved fluorescence spectroscopy (TRFS) and improved parallel factor framework-clustering analysis (IPFFCA). The IPFFCA model first decomposes the oil TRFS data to extract the loading matrix and reconstructs the landscape maps corresponding to each component based on the loading matrix. Subsequently, the non-negative least squares algorithm was employed to fit the component landscape maps to the unfolded actual spectra, thereby estimating the score matrix of the samples. Building upon this, the score matrix was used as input to develop oil species identification and concentration quantification models via particle swarm optimization support vector machine (PSO-SVM) and extreme gradient boosting (XGBoost), respectively. To verify the effectiveness of the proposed analytical method, six typical submerged oil samples were experimentally prepared, and their TRFS data were collected and analyzed. The experimental results show that the analytical method proposed in this paper achieves 92 % accuracy in the oil species identification task, the average coefficient of determination of the concentration prediction in the validation set of the six types of samples reaches 0.95, and the root mean square error is 0.08, indicating strong predictive performance.</div></div>","PeriodicalId":9774,"journal":{"name":"Chemometrics and Intelligent Laboratory Systems","volume":"268 ","pages":"Article 105564"},"PeriodicalIF":3.8,"publicationDate":"2025-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145464514","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Extraction of soil nutrient information from visible and near-infrared signals using deep learning models 利用深度学习模型从可见光和近红外信号中提取土壤养分信息

IF 3.8 2区化学 Q2 AUTOMATION & CONTROL SYSTEMS

Chemometrics and Intelligent Laboratory Systems

Pub Date : 2025-10-29 DOI: 10.1016/j.chemolab.2025.105561

Chunru Xiong , Jufang Hu , Ken Cai , Fangxiu Meng , Qinyong Lin , Huazhou Chen

This study aims to combine the deep learning algorithm and the visible and near-infrared (Vis-NIR) spectroscopy technology to build a soil nutrient information extraction model. A deep learning framework based on Long Short-Term Memory (LSTM) is proposed to establish optimal calibration model for the analysis of the full range of Vis-NIR spectral data. Moreover, an influence function is designed to select the informative wavelength variables, which is an important goal in engineering application of spectroscopy for reducing the model dimensionality and enhancing model robustness. Experiment was performed for the prediction of nitrogen (N), phosphorus (P) and potassium (K) contents of soil. The modeling results showed that the proposed model could improve the modeling efficiency of soil nutrient information extraction, and also obtained higher accuracy in the modeling and predictive procedures than the conventional model. This will provide effective response to the challenges in engineering applications, to promote the Vis-NIR spectroscopy technology be applied for fast detection, and to obtain robust models with high precisions in soil nutrient information extraction process.

本研究旨在将深度学习算法与可见光和近红外（Vis-NIR）光谱技术相结合，构建土壤养分信息提取模型。提出了一种基于长短期记忆（LSTM）的深度学习框架，建立了全范围可见光-近红外光谱数据分析的最优校准模型。此外，设计了一个影响函数来选择信息丰富的波长变量，这是光谱学工程应用中降低模型维数和增强模型鲁棒性的重要目标。进行了土壤氮(N)、磷(P)、钾(K)含量预测试验。建模结果表明，该模型可以提高土壤养分信息提取的建模效率，并且在建模和预测过程中获得了比传统模型更高的精度。这将有效应对工程应用中的挑战，促进可见光-近红外光谱技术在土壤养分信息提取过程中的快速检测应用，并获得高精度的鲁棒模型。

{"title":"Extraction of soil nutrient information from visible and near-infrared signals using deep learning models","authors":"Chunru Xiong , Jufang Hu , Ken Cai , Fangxiu Meng , Qinyong Lin , Huazhou Chen","doi":"10.1016/j.chemolab.2025.105561","DOIUrl":"10.1016/j.chemolab.2025.105561","url":null,"abstract":"<div><div>This study aims to combine the deep learning algorithm and the visible and near-infrared (Vis-NIR) spectroscopy technology to build a soil nutrient information extraction model. A deep learning framework based on Long Short-Term Memory (LSTM) is proposed to establish optimal calibration model for the analysis of the full range of Vis-NIR spectral data. Moreover, an influence function is designed to select the informative wavelength variables, which is an important goal in engineering application of spectroscopy for reducing the model dimensionality and enhancing model robustness. Experiment was performed for the prediction of nitrogen (N), phosphorus (P) and potassium (K) contents of soil. The modeling results showed that the proposed model could improve the modeling efficiency of soil nutrient information extraction, and also obtained higher accuracy in the modeling and predictive procedures than the conventional model. This will provide effective response to the challenges in engineering applications, to promote the Vis-NIR spectroscopy technology be applied for fast detection, and to obtain robust models with high precisions in soil nutrient information extraction process.</div></div>","PeriodicalId":9774,"journal":{"name":"Chemometrics and Intelligent Laboratory Systems","volume":"268 ","pages":"Article 105561"},"PeriodicalIF":3.8,"publicationDate":"2025-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145398088","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Decoding brain tumor patterns in MRI images: Unleashing optimized insights with Progressive Wasserstein generative adversarial network 解码MRI图像中的脑肿瘤模式：利用渐进式Wasserstein生成对抗网络释放优化的见解

IF 3.8 2区化学 Q2 AUTOMATION & CONTROL SYSTEMS

Chemometrics and Intelligent Laboratory Systems

Pub Date : 2025-10-29 DOI: 10.1016/j.chemolab.2025.105563

G. Jenifa , B.R. Tapas Bapu , Vivekanandan M. , J. Senthil Murugan

Brain tumor (BT) detection and segmentation are of vital importance for accurate diagnosis, but are still difficult because of intricate brain anatomy, non-spherical tumor shapes, and low contrast of MRI images. Conventional manual methods are time-consuming and invasive with observer variability, whereas traditional machine learning (ML) approaches based on handcrafted features tend to miss subtle patterns of tumor areas. Even the deep learning (DL) models like CNNs, despite their effectiveness, have limitations such as high computation expenses, poor generalization to heterogeneous data, and complexity in delineating tumor boundaries accurately, which are subtle. These drawbacks are sought to be overcome by this manuscript, suggesting an innovative technique for automatic BT detection in MRI samples. Initially, the normalized gamma corrected contrast-limited adaptive histogram equalization (NG-CCLAHE) is introduced for enhancing the MRI image quality. Then, the Faster 2D-Otsu Thresholding technique is introduced for segmenting the tumor regions from the MRI samples. Followed by this, the synchroextracting Transform (SET) technique is employed to extract features, which are then optimized with an Improved Ladybug Beetle Optimization Algorithm (ILBOA). The improved features are fed into the PWGAN, allowing for more accurate and effective tumor detection. Experimental assessment using the Br35H Brain Tumor Detection 2020 dataset reflects high-level performance with 98.6 % accuracy, 92 % DSC, 95 % PDR, 23 % classification error, 37.8s computation time, and an F1-score of 98.59 %. These aspects identify the proposed approach's efficiency and competency in brain tumor patterns from MRI images.

脑肿瘤（BT）的检测和分割对于准确诊断至关重要，但由于脑解剖结构复杂，肿瘤形状非球形，MRI图像对比度低，因此检测和分割仍然很困难。传统的人工方法既耗时又具有侵入性，而且观测者的可变性很大，而传统的基于手工特征的机器学习（ML）方法往往会错过肿瘤区域的微妙模式。即使是cnn这样的深度学习（DL）模型，尽管它们很有效，但也存在计算费用高、对异构数据的泛化能力差、准确描绘肿瘤边界的复杂性等局限性，这些都是微妙的。这些缺点是寻求克服这篇手稿，建议在MRI样品中自动BT检测的创新技术。首先，引入归一化伽马校正对比度限制自适应直方图均衡化（NG-CCLAHE）来提高MRI图像质量。然后，引入更快的2D-Otsu阈值分割技术，从MRI样本中分割肿瘤区域。然后，采用同步提取变换（SET）技术提取特征，并用改进的瓢虫甲虫优化算法（ILBOA）对特征进行优化。改进的特征被输入到PWGAN中，允许更准确和有效的肿瘤检测。使用Br35H脑肿瘤检测2020数据集的实验评估反映出高水平的性能，准确率为98.6%，DSC为92%，PDR为95%，分类误差为23%，计算时间为37.8s， f1评分为98.59%。这些方面确定了所提出的方法在MRI图像中脑肿瘤模式的效率和能力。

{"title":"Decoding brain tumor patterns in MRI images: Unleashing optimized insights with Progressive Wasserstein generative adversarial network","authors":"G. Jenifa , B.R. Tapas Bapu , Vivekanandan M. , J. Senthil Murugan","doi":"10.1016/j.chemolab.2025.105563","DOIUrl":"10.1016/j.chemolab.2025.105563","url":null,"abstract":"<div><div>Brain tumor (BT) detection and segmentation are of vital importance for accurate diagnosis, but are still difficult because of intricate brain anatomy, non-spherical tumor shapes, and low contrast of MRI images. Conventional manual methods are time-consuming and invasive with observer variability, whereas traditional machine learning (ML) approaches based on handcrafted features tend to miss subtle patterns of tumor areas. Even the deep learning (DL) models like CNNs, despite their effectiveness, have limitations such as high computation expenses, poor generalization to heterogeneous data, and complexity in delineating tumor boundaries accurately, which are subtle. These drawbacks are sought to be overcome by this manuscript, suggesting an innovative technique for automatic BT detection in MRI samples. Initially, the normalized gamma corrected contrast-limited adaptive histogram equalization (NG-CCLAHE) is introduced for enhancing the MRI image quality. Then, the Faster 2D-Otsu Thresholding technique is introduced for segmenting the tumor regions from the MRI samples. Followed by this, the synchroextracting Transform (SET) technique is employed to extract features, which are then optimized with an Improved Ladybug Beetle Optimization Algorithm (ILBOA). The improved features are fed into the PWGAN, allowing for more accurate and effective tumor detection. Experimental assessment using the Br35H Brain Tumor Detection 2020 dataset reflects high-level performance with 98.6 % accuracy, 92 % DSC, 95 % PDR, 23 % classification error, 37.8s computation time, and an F1-score of 98.59 %. These aspects identify the proposed approach's efficiency and competency in brain tumor patterns from MRI images.</div></div>","PeriodicalId":9774,"journal":{"name":"Chemometrics and Intelligent Laboratory Systems","volume":"268 ","pages":"Article 105563"},"PeriodicalIF":3.8,"publicationDate":"2025-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145569577","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

When just-in-time learning meets deep learning: An industrial quality prediction practice on deep partial least squares model 当即时学习与深度学习相遇：深度偏最小二乘模型的工业质量预测实践

IF 3.8 2区化学 Q2 AUTOMATION & CONTROL SYSTEMS

Chemometrics and Intelligent Laboratory Systems

Pub Date : 2025-10-27 DOI: 10.1016/j.chemolab.2025.105555

Junhua Zheng , Zeyu Yang , Zhiqiang Ge

While deep learning has made great progresses in various application domains, the nature of computational expensive and reliance on large-scale data makes it inefficient or even impossible for small data modeling, particularly under the just-in-time learning framework. Effective combination of deep learning and just-in-time learning may explore great potentials for both two learning paradigms, thus should be attractive and beneficial to the research community. In this paper, an improved form of the lightweight deep partial least squares (PLS) model is developed under the framework of Just-in-time learning. Without complicated backpropagation and time-consuming parameter tuning algorithms, deep PLS provides a transparent model structure which also works well for small training data. As a result, fusion of those two learning strategies makes the new proposed method as a very promising predictive modeling tool in industrial soft sensor applications, the performance of which is evaluated and confirmed through a real industrial example.

虽然深度学习在各个应用领域取得了很大的进步，但计算成本高和依赖大规模数据的性质使得小数据建模效率低下甚至不可能，特别是在即时学习框架下。深度学习和即时学习的有效结合可以为这两种学习范式探索巨大的潜力，因此应该对研究界具有吸引力和有益的意义。本文在实时学习的框架下，提出了一种改进的轻量级深度偏最小二乘模型。没有复杂的反向传播和耗时的参数调整算法，深度PLS提供了一个透明的模型结构，也适用于小的训练数据。结果表明，这两种学习策略的融合使该方法在工业软测量应用中成为一种非常有前途的预测建模工具，并通过实际工业实例对其性能进行了评价和验证。

引用次数: 0

Advancing chemical manufacturing processes through data-driven approaches: A survey 通过数据驱动的方法推进化学制造过程：一项调查

IF 3.8 2区化学 Q2 AUTOMATION & CONTROL SYSTEMS

Chemometrics and Intelligent Laboratory Systems

Pub Date : 2025-10-22 DOI: 10.1016/j.chemolab.2025.105553

Yellam Naidu Kottavalasa, Lauro Snidaro

The chemical industry is the backbone of global manufacturing, driving innovation across multiple sectors. Since chemical processes are complex and dynamic in nature, it is still difficult to maintain efficiency, consistency in product, and optimize process parameters. Traditional approaches often fall short in handling these complexities, prompting manufacturers to adopt data-driven methodologies, including statistical models, machine learning techniques, and deep learning architectures. This survey discusses how these models help in fault detection, process optimization, and quality control. We examine the role of statistical models in capturing process variation, machine learning models in detecting patterns and anomalies, and neural networks in predictive maintenance and real-time monitoring. Additionally, we explore fusion-based architectures, including hybrid statistical, machine learning, and deep learning methods, that facilitate better fault detection and parameter estimation. The survey also highlights how data-driven approaches support sustainable chemical manufacturing by enabling real-time decisions, adaptive control, and effective process monitoring.

化工行业是全球制造业的支柱，推动着多个行业的创新。由于化学过程的复杂性和动态性，保持效率、产品一致性和优化工艺参数仍然是困难的。传统方法往往无法处理这些复杂性，这促使制造商采用数据驱动的方法，包括统计模型、机器学习技术和深度学习架构。本调查讨论了这些模型如何帮助故障检测、过程优化和质量控制。我们研究了统计模型在捕获过程变化中的作用，机器学习模型在检测模式和异常中的作用，以及神经网络在预测性维护和实时监控中的作用。此外，我们还探索了基于融合的架构，包括混合统计、机器学习和深度学习方法，以促进更好的故障检测和参数估计。该调查还强调了数据驱动方法如何通过实现实时决策、自适应控制和有效的过程监控来支持可持续化工制造。

{"title":"Advancing chemical manufacturing processes through data-driven approaches: A survey","authors":"Yellam Naidu Kottavalasa, Lauro Snidaro","doi":"10.1016/j.chemolab.2025.105553","DOIUrl":"10.1016/j.chemolab.2025.105553","url":null,"abstract":"<div><div>The chemical industry is the backbone of global manufacturing, driving innovation across multiple sectors. Since chemical processes are complex and dynamic in nature, it is still difficult to maintain efficiency, consistency in product, and optimize process parameters. Traditional approaches often fall short in handling these complexities, prompting manufacturers to adopt data-driven methodologies, including statistical models, machine learning techniques, and deep learning architectures. This survey discusses how these models help in fault detection, process optimization, and quality control. We examine the role of statistical models in capturing process variation, machine learning models in detecting patterns and anomalies, and neural networks in predictive maintenance and real-time monitoring. Additionally, we explore fusion-based architectures, including hybrid statistical, machine learning, and deep learning methods, that facilitate better fault detection and parameter estimation. The survey also highlights how data-driven approaches support sustainable chemical manufacturing by enabling real-time decisions, adaptive control, and effective process monitoring.</div></div>","PeriodicalId":9774,"journal":{"name":"Chemometrics and Intelligent Laboratory Systems","volume":"267 ","pages":"Article 105553"},"PeriodicalIF":3.8,"publicationDate":"2025-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145412356","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Adversarial Domain Adaptation Guided by Farthest Distance for open set electronic nose drift compensation 基于最远距离制导的开集电子鼻漂移补偿对抗域自适应

IF 3.8 2区化学 Q2 AUTOMATION & CONTROL SYSTEMS

Chemometrics and Intelligent Laboratory Systems

Pub Date : 2025-10-22 DOI: 10.1016/j.chemolab.2025.105554

Yong Pan , Chuandong Li , Jiang Xiong , Ziye Hou , Youbin Yao

With advancements in modern science and technology, electronic noses (ENs) have gained significant attention for their applications in environmental monitoring, food quality inspection, and medical equipment. ENs mimic biological olfactory systems to classify gases using arrays of sensors and pattern recognition models. However, gas sensor drift poses a major challenge, leading to performance degradation in EN systems. To address this, Domain Adaptation (DA) methods align source domain data with target domain drift data. While traditional DA methods assume identical class compositions in both domains, this is often unrealistic in practice, leading to suboptimal results. Open Set Domain Adaptation (OSDA) methods address unknown classes in the target domain, but they often focus too much on distinguishing unknown classes, neglecting accurate recognition of known classes. To overcome these limitations, we propose the Adversarial Domain Adaptation Guided by Farthest Distance (ADA-FDG), comprising two complementary modules: Farthest Distance Guide (FDG) and Confidence Normalized Adaptive Factor (CNAF). FDG adaptively builds a guide set that lies farthest from the source distribution in feature space, ensuring adversarial alignment learns to the edge region distribution. CNAF assigns a weight to each batch proportional to its classification confidence, preventing unknown-class samples from contaminating the ADA process. By integrating FDG and CNAF in an adversarial training framework, ADA-FDG achieves more precise alignment of source and target distributions while preserving clear separation between known and unknown classes. Extensive experiments on two benchmark datasets demonstrate that ADA-FDG consistently outperforms state-of-the-art closed and open set DA methods, delivering significant improvements in overall, known-class, and unknown-class accuracy.

随着现代科学技术的进步，电子鼻在环境监测、食品质量检测、医疗设备等方面的应用越来越受到人们的重视。ENs模拟生物嗅觉系统，利用传感器阵列和模式识别模型对气体进行分类。然而，气体传感器漂移带来了重大挑战，导致EN系统的性能下降。为了解决这个问题，域适应（DA）方法将源域数据与目标域漂移数据对齐。虽然传统的数据处理方法在两个领域中假设相同的类组成，但这在实践中往往是不现实的，从而导致次优结果。开放集域自适应（Open Set Domain Adaptation， OSDA）方法主要针对目标域中的未知类，但往往过于关注识别未知类，而忽略了对已知类的准确识别。为了克服这些限制，我们提出了由最远距离引导的对抗域自适应（ADA-FDG），它由最远距离引导（FDG）和置信归一化自适应因子（CNAF）两个互补模块组成。FDG自适应地构建距离源分布在特征空间中最远的引导集，保证对抗性对齐学习到边缘区域分布。CNAF为每个批次分配与其分类置信度成比例的权重，防止未知类别的样品污染ADA过程。通过在对抗性训练框架中集成FDG和CNAF， ADA-FDG实现了更精确的源和目标分布对齐，同时保留了已知和未知类别之间的明确分离。在两个基准数据集上进行的大量实验表明，ADA-FDG始终优于最先进的封闭集和开放集数据分析方法，在总体、已知类和未知类精度方面都有显著提高。

{"title":"Adversarial Domain Adaptation Guided by Farthest Distance for open set electronic nose drift compensation","authors":"Yong Pan , Chuandong Li , Jiang Xiong , Ziye Hou , Youbin Yao","doi":"10.1016/j.chemolab.2025.105554","DOIUrl":"10.1016/j.chemolab.2025.105554","url":null,"abstract":"<div><div>With advancements in modern science and technology, electronic noses (ENs) have gained significant attention for their applications in environmental monitoring, food quality inspection, and medical equipment. ENs mimic biological olfactory systems to classify gases using arrays of sensors and pattern recognition models. However, gas sensor drift poses a major challenge, leading to performance degradation in EN systems. To address this, Domain Adaptation (DA) methods align source domain data with target domain drift data. While traditional DA methods assume identical class compositions in both domains, this is often unrealistic in practice, leading to suboptimal results. Open Set Domain Adaptation (OSDA) methods address unknown classes in the target domain, but they often focus too much on distinguishing unknown classes, neglecting accurate recognition of known classes. To overcome these limitations, we propose the Adversarial Domain Adaptation Guided by Farthest Distance (ADA-FDG), comprising two complementary modules: Farthest Distance Guide (FDG) and Confidence Normalized Adaptive Factor (CNAF). FDG adaptively builds a guide set that lies farthest from the source distribution in feature space, ensuring adversarial alignment learns to the edge region distribution. CNAF assigns a weight to each batch proportional to its classification confidence, preventing unknown-class samples from contaminating the ADA process. By integrating FDG and CNAF in an adversarial training framework, ADA-FDG achieves more precise alignment of source and target distributions while preserving clear separation between known and unknown classes. Extensive experiments on two benchmark datasets demonstrate that ADA-FDG consistently outperforms state-of-the-art closed and open set DA methods, delivering significant improvements in overall, known-class, and unknown-class accuracy.</div></div>","PeriodicalId":9774,"journal":{"name":"Chemometrics and Intelligent Laboratory Systems","volume":"267 ","pages":"Article 105554"},"PeriodicalIF":3.8,"publicationDate":"2025-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145358703","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0