首页 > 最新文献

Journal of Chemometrics最新文献

英文 中文
Research progress on the application of hyperspectral imaging techniques in tea science 高光谱成像技术在茶叶科学中的应用研究进展
IF 2.4 4区 化学 Q1 Mathematics Pub Date : 2023-04-12 DOI: 10.1002/cem.3481
Dongxia Liang, Qiaoyi Zhou, Caijin Ling, Liyang Gao, Xiaoting Mu, Zhencheng Liao

Hyperspectral imaging technology combines two-dimensional imaging and spectral technology, which can simultaneously obtain spatial and spectral information of the object to be measured and is an advanced technical method. With the development of science and technology, the detection of tea has also been continuously improved, and it has developed in the direction of being nondestructive, fast, real-time, and accurate. In this paper, the principle of hyperspectral imaging technology is introduced, and according to research on hyperspectral imaging technology in the nondestructive rapid detection of tea in the past 5 years, the application of hyperspectral imaging technology in the detection of tea biochemical components, accurate classification, determination of mildew degree, and stress monitoring and the application progress in planting production management are analyzed. Additionally, the main challenges existing in the current research are analyzed, and future application prospects are proposed to provide a reference for the application and promotion of hyperspectral imaging technology in the actual production of tea.

高光谱成像技术结合了二维成像和光谱技术,可以同时获得被测物体的空间和光谱信息,是一种先进的技术方法。随着科学技术的发展,茶叶的检测也在不断改进,并朝着无损、快速、实时、准确的方向发展。本文介绍了高光谱成像技术的原理,并根据近5年来高光谱成像在茶叶无损快速检测中的研究 分析了近年来高光谱成像技术在茶叶生化成分检测、准确分类、霉变程度测定、胁迫监测等方面的应用,以及在种植生产管理中的应用进展。此外,分析了当前研究中存在的主要挑战,并提出了未来的应用前景,为高光谱成像技术在茶叶实际生产中的应用和推广提供参考。
{"title":"Research progress on the application of hyperspectral imaging techniques in tea science","authors":"Dongxia Liang,&nbsp;Qiaoyi Zhou,&nbsp;Caijin Ling,&nbsp;Liyang Gao,&nbsp;Xiaoting Mu,&nbsp;Zhencheng Liao","doi":"10.1002/cem.3481","DOIUrl":"10.1002/cem.3481","url":null,"abstract":"<p>Hyperspectral imaging technology combines two-dimensional imaging and spectral technology, which can simultaneously obtain spatial and spectral information of the object to be measured and is an advanced technical method. With the development of science and technology, the detection of tea has also been continuously improved, and it has developed in the direction of being nondestructive, fast, real-time, and accurate. In this paper, the principle of hyperspectral imaging technology is introduced, and according to research on hyperspectral imaging technology in the nondestructive rapid detection of tea in the past 5 years, the application of hyperspectral imaging technology in the detection of tea biochemical components, accurate classification, determination of mildew degree, and stress monitoring and the application progress in planting production management are analyzed. Additionally, the main challenges existing in the current research are analyzed, and future application prospects are proposed to provide a reference for the application and promotion of hyperspectral imaging technology in the actual production of tea.</p>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":null,"pages":null},"PeriodicalIF":2.4,"publicationDate":"2023-04-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47785080","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
An algorithm for robust multiblock partial least squares predictive modelling 一种鲁棒多块偏最小二乘预测建模算法
IF 2.4 4区 化学 Q1 Mathematics Pub Date : 2023-04-03 DOI: 10.1002/cem.3480
Puneet Mishra, Kristian Hovde Liland

A new algorithm for robust multiblock (data fusion) modelling in the presence of outlying observations is presented. The method is a combination of a robust modelling technique called iterative reweighted partial least squares and the block order and scale-independent component-wise multiblock partial least squares modelling. The method is based on automatic down-weighting of outlying observations such that their contribution is minimal during the estimation of block-wise partial least squares models, thus leading to robust modelling minimally affected by outliers. The algorithm and test of the methods for modelling multiblock data sets (simulated and real) in the presence of outlying observation are demonstrated.

提出了一种新的多块数据融合鲁棒建模算法。该方法是一种称为迭代重加权偏最小二乘法的鲁棒建模技术与块顺序和尺度无关的分量明智的多块偏最小二乘法建模的结合。该方法基于离群观测值的自动降权,使得它们在块偏最小二乘模型估计期间的贡献最小,从而导致受离群值影响最小的鲁棒建模。给出了多块数据集(模拟数据集和真实数据集)在外围观测条件下的建模算法和测试。
{"title":"An algorithm for robust multiblock partial least squares predictive modelling","authors":"Puneet Mishra,&nbsp;Kristian Hovde Liland","doi":"10.1002/cem.3480","DOIUrl":"10.1002/cem.3480","url":null,"abstract":"<p>A new algorithm for robust multiblock (data fusion) modelling in the presence of outlying observations is presented. The method is a combination of a robust modelling technique called <i>iterative reweighted partial least squares</i> and the block order and scale-independent component-wise multiblock partial least squares modelling. The method is based on automatic down-weighting of outlying observations such that their contribution is minimal during the estimation of block-wise partial least squares models, thus leading to robust modelling minimally affected by outliers. The algorithm and test of the methods for modelling multiblock data sets (simulated and real) in the presence of outlying observation are demonstrated.</p>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":null,"pages":null},"PeriodicalIF":2.4,"publicationDate":"2023-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cem.3480","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43070986","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Evaluation of spectral collection strategies for identification of Dalbergia spp. using handheld laser-induced breakdown spectroscopy 手持式激光诱导击穿光谱(LIBS)鉴定黄檀的光谱采集策略评价
IF 2.4 4区 化学 Q1 Mathematics Pub Date : 2023-03-30 DOI: 10.1002/cem.3479
Caelin P. Celani, Rachel A. McCormick, Amelia M. Speed, William Johnston, James A. Jordan, Tyler B. Coplen, Karl S. Booksh

The illegal timber trade has significant impact on the survival of endangered tropical hardwood species like Dalbergia spp. (rosewood), a world-wide protected genus from the Convention on International Trade in Endangered Species of Wild Fauna and Flora (CITES). Due to increased threat to Dalbergia spp., and lack of action to reduce threats, port of entry analysis methods are required to identify Dalbergia spp. Handheld laser-induced breakdown spectroscopy (LIBS) has been shown to be capable of identifying species and establishing provenance of Dalbergia spp. and other tropical hardwoods, but analysis methods for this work have yet to be investigated in detail. The present work investigates five well-known algorithms—partial least squares discriminant analysis (PLS-DA), classification and regression trees (CART), k-nearest neighbor (k-NN), random forest (RF), and support vector machine (SVM)—two training/test set sampling regimes, and data collection at two signal-to-noise (S/N) ratios to assess the potential for handheld LIBS analyses. Additionally, imbalanced classes are addressed. For this application, SVM and RF yield near identical results (though RF takes nearly 100 longer to compute), while the S/N ratio has a significant effect on model success assuming all else is equal. It was found that forming a training set with replicate low S/N analyses can perform as well as higher precision training sets for true prediction, even if the predicted samples have low signal to noise! This work confirms handheld LIBS analyzers can provide a viable method for classification of hardwood species, even within the same genus.

非法木材贸易对濒危热带硬木物种的生存造成了重大影响,如达尔伯里木属(花梨木),它是《濒危野生动植物种国际贸易公约》(CITES)中的世界性保护木属。手持式激光诱导击穿光谱仪(LIBS)已被证明能够识别 Dalbergia spp.和其他热带硬木的物种并确定其来源,但这项工作的分析方法还有待详细研究。本研究调查了五种著名的算法--部分最小二乘判别分析 (PLS-DA)、分类和回归树 (CART)、k-近邻 (k-NN)、随机森林 (RF) 和支持向量机 (SVM)--两种训练/测试集采样制度,以及两种信噪比 (S/N) 下的数据采集,以评估手持式 LIBS 分析的潜力。此外,还解决了不平衡类的问题。在这一应用中,SVM 和 RF 得出的结果几乎相同(尽管 RF 的计算时间长了近 100 倍),而信噪比对模型成功与否有显著影响(假设其他条件相同)。研究发现,即使预测样本的信噪比很低,用重复的低信噪比分析结果组成训练集,在真实预测方面的表现也不亚于精度更高的训练集!这项工作证实了手持式 LIBS 分析仪可以为硬木树种的分类提供一种可行的方法,即使是同一属中的树种。
{"title":"Evaluation of spectral collection strategies for identification of Dalbergia spp. using handheld laser-induced breakdown spectroscopy","authors":"Caelin P. Celani,&nbsp;Rachel A. McCormick,&nbsp;Amelia M. Speed,&nbsp;William Johnston,&nbsp;James A. Jordan,&nbsp;Tyler B. Coplen,&nbsp;Karl S. Booksh","doi":"10.1002/cem.3479","DOIUrl":"10.1002/cem.3479","url":null,"abstract":"<p>The illegal timber trade has significant impact on the survival of endangered tropical hardwood species like <i>Dalbergia</i> spp. (rosewood), a world-wide protected genus from the Convention on International Trade in Endangered Species of Wild Fauna and Flora (CITES). Due to increased threat to <i>Dalbergia</i> spp., and lack of action to reduce threats, port of entry analysis methods are required to identify <i>Dalbergia</i> spp. Handheld laser-induced breakdown spectroscopy (LIBS) has been shown to be capable of identifying species and establishing provenance of <i>Dalbergia</i> spp. and other tropical hardwoods, but analysis methods for this work have yet to be investigated in detail. The present work investigates five well-known algorithms—partial least squares discriminant analysis (PLS-DA), classification and regression trees (CART), <i>k</i>-nearest neighbor (<i>k</i>-NN), random forest (RF), and support vector machine (SVM)—two training/test set sampling regimes, and data collection at two signal-to-noise (S/N) ratios to assess the potential for handheld LIBS analyses. Additionally, imbalanced classes are addressed. For this application, SVM and RF yield near identical results (though RF takes nearly 100 longer to compute), while the S/N ratio has a significant effect on model success assuming all else is equal. It was found that forming a training set with replicate low S/N analyses can perform as well as higher precision training sets for true prediction, even if the predicted samples have low signal to noise! This work confirms handheld LIBS analyzers can provide a viable method for classification of hardwood species, even within the same genus.</p>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":null,"pages":null},"PeriodicalIF":2.4,"publicationDate":"2023-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45393298","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Weighted multivariate curve resolution—Alternating least squares based on sample relevance 加权多元曲线解析-基于样本相关性的交替最小二乘
IF 2.4 4区 化学 Q1 Mathematics Pub Date : 2023-03-26 DOI: 10.1002/cem.3478
Mohamad Ahmad, Raffaele Vitale, Marina Cocchi, Cyril Ruckebusch

Alternating least squares within the multivariate curve resolution framework has seen a lot of practical applications and shows their distinction with their relatively simple and flexible implementation. However, the limitations of least squares should be carefully considered when deviating from the standard assumed data structure. Within this work, we highlight the effects of noise in the presence of minor components, and we propose a novel weighting scheme within the weighted multivariate curve-resolution-alternating least squares framework to resolve it. Two simulated and one Raman imaging case are investigated by comparing the novel methodology against standard multivariate curve resolution-alternating least squares and essential spectral pixel selection. A trade-off is observed between current methods, whereas the novel weighting scheme demonstrates a balance where the benefits of the previous two methods are retained.

交替最小二乘在多变量曲线解析框架中得到了大量的实际应用,并以其相对简单和灵活的实现显示出其独特之处。然而,当偏离标准假设的数据结构时,应该仔细考虑最小二乘的局限性。在这项工作中,我们强调了小分量存在时噪声的影响,并在加权多元曲线-分辨率-交替最小二乘框架内提出了一种新的加权方案来解决它。通过将新方法与标准多元曲线分辨率-交替最小二乘法和基本光谱像素选择进行比较,研究了两个模拟和一个拉曼成像案例。在当前方法之间观察到一种权衡,而新的加权方案显示了一种平衡,其中保留了前两种方法的优点。
{"title":"Weighted multivariate curve resolution—Alternating least squares based on sample relevance","authors":"Mohamad Ahmad,&nbsp;Raffaele Vitale,&nbsp;Marina Cocchi,&nbsp;Cyril Ruckebusch","doi":"10.1002/cem.3478","DOIUrl":"10.1002/cem.3478","url":null,"abstract":"<p>Alternating least squares within the multivariate curve resolution framework has seen a lot of practical applications and shows their distinction with their relatively simple and flexible implementation. However, the limitations of least squares should be carefully considered when deviating from the standard assumed data structure. Within this work, we highlight the effects of noise in the presence of minor components, and we propose a novel weighting scheme within the weighted multivariate curve-resolution-alternating least squares framework to resolve it. Two simulated and one Raman imaging case are investigated by comparing the novel methodology against standard multivariate curve resolution-alternating least squares and essential spectral pixel selection. A trade-off is observed between current methods, whereas the novel weighting scheme demonstrates a balance where the benefits of the previous two methods are retained.</p>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":null,"pages":null},"PeriodicalIF":2.4,"publicationDate":"2023-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cem.3478","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43409795","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Partial least squares regression with multiple domains 多域偏最小二乘回归
IF 2.4 4区 化学 Q1 Mathematics Pub Date : 2023-03-26 DOI: 10.1002/cem.3477
Bianca Mikulasek, Valeria Fonseca Diaz, David Gabauer, Christoph Herwig, Ramin Nikzad-Langerodi

This paper introduces the multiple domain-invariant partial least squares (mdi-PLS) method, which generalizes the recently introduced domain-invariant partial least squares method (di-PLS). In contrast to di-PLS which solely allows transferring of knowledge from a single source to a single target domain, the proposed approach enables the incorporation of data from an arbitrary number of domains. Additionally, mdi-PLS offers a high level of flexibility by accepting labeled (supervised) and unlabeled (unsupervised) data to cope with dataset shifts. We demonstrate the application of the mdi-PLS method on a simulated and one real-world dataset. Our results show a clear outperformance of both PLS and di-PLS when data from multiple related domains are available for training multivariate calibration models underpinning the benefit of mdi-PLS.

本文介绍了多域不变偏最小二乘法(mdi‐PLS),它推广了最近引入的域不变偏最小平方法(di‐PLS)。与仅允许将知识从单个源转移到单个目标域的di-PLS相比,所提出的方法能够合并任意数量域的数据。此外,mdi-PLS通过接受标记(监督)和未标记(无监督)数据来应对数据集的变化,提供了高度的灵活性。我们在模拟和一个真实世界的数据集上演示了mdi-PLS方法的应用。我们的结果表明,当来自多个相关领域的数据可用于训练支持mdi-PLS益处的多变量校准模型时,PLS和di-PLS的性能都明显优于PLS。
{"title":"Partial least squares regression with multiple domains","authors":"Bianca Mikulasek,&nbsp;Valeria Fonseca Diaz,&nbsp;David Gabauer,&nbsp;Christoph Herwig,&nbsp;Ramin Nikzad-Langerodi","doi":"10.1002/cem.3477","DOIUrl":"10.1002/cem.3477","url":null,"abstract":"<p>This paper introduces the multiple domain-invariant partial least squares (mdi-PLS) method, which generalizes the recently introduced domain-invariant partial least squares method (di-PLS). In contrast to di-PLS which solely allows transferring of knowledge from a single source to a single target domain, the proposed approach enables the incorporation of data from an arbitrary number of domains. Additionally, mdi-PLS offers a high level of flexibility by accepting labeled (supervised) and unlabeled (unsupervised) data to cope with dataset shifts. We demonstrate the application of the mdi-PLS method on a simulated and one real-world dataset. Our results show a clear outperformance of both PLS and di-PLS when data from multiple related domains are available for training multivariate calibration models underpinning the benefit of mdi-PLS.</p>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":null,"pages":null},"PeriodicalIF":2.4,"publicationDate":"2023-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"44547298","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Intelligent component detection of quaternary blended oil based on near infrared spectroscopy technology 基于近红外光谱技术的四元调合油智能成分检测
IF 2.4 4区 化学 Q1 Mathematics Pub Date : 2023-03-18 DOI: 10.1002/cem.3476
Zhide Zhao, Laijun Sun, Hongyi Bai, Hong Zhang, Yujie Tian

In this study, near infrared spectroscopy (NIRS) technique was used for quantitative detection of quaternary blended oil. After a series of preprocessing, the prediction effects of the three models and their preprocessing combinations were compared. Taking soybean oil content prediction as an example, random forest (RF) model had better performance after second derivative (D2) optimization. In feature selection, a two-step feature selection method was adopted to extract the feature wavelength. First, the elastic net (EN) was used for the initial screening of feature wavelengths, and most irrelevant features were eliminated. The number of feature wavelengths was reduced from 1048 to 134. After that, the competitive adaptive re-weighted sampling (CARS) method was used to screen the remaining characteristic wavelengths more carefully, and 20 effective characteristic wavelengths were selected. Finally, a quantitative detection model was established based on 20 effective characteristic wavelengths selected by EN + CARS. Evaluated by the test set, The correlation coefficient of determination (R2), root-mean-square error of prediction (RMSEP) and Relative Percent Difference (RPD) values of 2D + EN + CARS + RF model were 0.97953, 1.34306 and 7.08875, respectively. The results showed that the two-step feature selection method can effectively extract the feature wavelength, and the NIRS technology can realize the intelligent detection of blended oil components.

采用近红外光谱(NIRS)技术对四元调合油进行定量检测。经过一系列预处理后,比较了三种模型及其预处理组合的预测效果。以大豆油含量预测为例,随机森林(RF)模型经过二阶导数(D2)优化后,具有较好的预测效果。在特征选择方面,采用两步特征选择方法提取特征波长。首先,利用弹性网(EN)对特征波长进行初步筛选,剔除大部分不相关的特征;特征波长的数量从1048个减少到134个。之后,采用竞争自适应加权采样(CARS)方法对剩余特征波长进行更细致的筛选,选出20个有效特征波长。最后,基于EN + CARS选择的20个有效特征波长建立了定量检测模型。经检验集评估,2D + EN + CARS + RF模型的判定相关系数(R2)、预测均方根误差(RMSEP)和相对百分比差(RPD)值分别为0.97953、1.34306和7.08875。结果表明,两步特征选择方法可以有效地提取特征波长,近红外光谱技术可以实现混合油成分的智能检测。
{"title":"Intelligent component detection of quaternary blended oil based on near infrared spectroscopy technology","authors":"Zhide Zhao,&nbsp;Laijun Sun,&nbsp;Hongyi Bai,&nbsp;Hong Zhang,&nbsp;Yujie Tian","doi":"10.1002/cem.3476","DOIUrl":"10.1002/cem.3476","url":null,"abstract":"<p>In this study, near infrared spectroscopy (NIRS) technique was used for quantitative detection of quaternary blended oil. After a series of preprocessing, the prediction effects of the three models and their preprocessing combinations were compared. Taking soybean oil content prediction as an example, random forest (RF) model had better performance after second derivative (D2) optimization. In feature selection, a two-step feature selection method was adopted to extract the feature wavelength. First, the elastic net (EN) was used for the initial screening of feature wavelengths, and most irrelevant features were eliminated. The number of feature wavelengths was reduced from 1048 to 134. After that, the competitive adaptive re-weighted sampling (CARS) method was used to screen the remaining characteristic wavelengths more carefully, and 20 effective characteristic wavelengths were selected. Finally, a quantitative detection model was established based on 20 effective characteristic wavelengths selected by EN + CARS. Evaluated by the test set, The correlation coefficient of determination (<i>R</i><sup>2</sup>), root-mean-square error of prediction (RMSEP) and Relative Percent Difference (RPD) values of 2D + EN + CARS + RF model were 0.97953, 1.34306 and 7.08875, respectively. The results showed that the two-step feature selection method can effectively extract the feature wavelength, and the NIRS technology can realize the intelligent detection of blended oil components.</p>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":null,"pages":null},"PeriodicalIF":2.4,"publicationDate":"2023-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46026258","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Artificial neural network (ANN) modeling for simultaneous removal of a binary mixture of Pb(II) and Cu(II) by cobalt hydroxide nano-flakes 氢氧化钴纳米薄片同时去除Pb(II)和Cu(II)二元混合物的人工神经网络(ANN)建模
IF 2.4 4区 化学 Q1 Mathematics Pub Date : 2023-03-16 DOI: 10.1002/cem.3475
Javad Zolgharnein, Tahere Shariatmanesh, Saeideh Dermanaki Farahani

A three-layer artificial neural network (ANN) model was developed to predict the efficiency of Cu(II) and Pb(II) ion removal from aqueous solution by cobalt hydroxide nano-flakes. It is based on experimental sets obtained from a D-optimal design. The input variables to the neural network were as follows: the initial concentration of Pb(II) and Cu (II) ions (mg L−1), initial pH, and sorbent mass (g). The configuration of the backpropagation neural network for both Cu(II) and Pb (II) ions was a tangent sigmoid transfer function (tansig) at the hidden layer, linear transfer function (purelin) at the output layer, and Levenberg–Marquardt training algorithm (LMA). ANN-predicted results were very close to the experimental results with a coefficient of determination (R2) of 0.9970 and mean square error (MSE) 0.000376. Analysis based on the ANN model indicated that sorbent mass appeared to be the most influential factor in the adsorption process of Cu(II) and Pb(II). Characterization of the cobalt hydroxide nano-flakes and possible metal ions-adsorbent interactions were confirmed by Fourier transform infrared spectroscopy (FT-IR), X-ray diffraction (XRD), and scanning electron microscopy (SEM).

建立了一个三层人工神经网络(ANN)模型来预测氢氧化钴纳米薄片对水溶液中Cu(II)和Pb(II)离子的去除效率。它是基于从D‐最优设计中获得的实验集。神经网络的输入变量为:Pb(II)和Cu(II)离子的初始浓度(mg L−1)、初始pH和吸附剂质量(g)。Cu(II)和Pb(II)离子的反向传播神经网络配置为隐藏层的正切s型传递函数(tansig)、输出层的线性传递函数(purelin)和Levenberg-Marquardt训练算法(LMA)。ANN‐预测结果与实验结果非常接近,决定系数(R2)为0.9970,均方误差(MSE)为0.000376。基于人工神经网络模型的分析表明,吸附剂质量是铜(II)和铅(II)吸附过程中影响最大的因素。通过傅里叶变换红外光谱(FT - IR)、X射线衍射(XRD)和扫描电子显微镜(SEM)证实了氢氧化钴纳米薄片和可能的金属离子-吸附剂相互作用的表征。
{"title":"Artificial neural network (ANN) modeling for simultaneous removal of a binary mixture of Pb(II) and Cu(II) by cobalt hydroxide nano-flakes","authors":"Javad Zolgharnein,&nbsp;Tahere Shariatmanesh,&nbsp;Saeideh Dermanaki Farahani","doi":"10.1002/cem.3475","DOIUrl":"10.1002/cem.3475","url":null,"abstract":"<p>A three-layer artificial neural network (ANN) model was developed to predict the efficiency of Cu(II) and Pb(II) ion removal from aqueous solution by cobalt hydroxide nano-flakes. It is based on experimental sets obtained from a D-optimal design. The input variables to the neural network were as follows: the initial concentration of Pb(II) and Cu (II) ions (mg L<sup>−1</sup>), initial pH, and sorbent mass (g). The configuration of the backpropagation neural network for both Cu(II) and Pb (II) ions was a tangent sigmoid transfer function (tansig) at the hidden layer, linear transfer function (purelin) at the output layer, and Levenberg–Marquardt training algorithm (LMA). ANN-predicted results were very close to the experimental results with a coefficient of determination (<i>R</i><sup>2</sup>) of 0.9970 and mean square error (MSE) 0.000376. Analysis based on the ANN model indicated that sorbent mass appeared to be the most influential factor in the adsorption process of Cu(II) and Pb(II). Characterization of the cobalt hydroxide nano-flakes and possible metal ions-adsorbent interactions were confirmed by Fourier transform infrared spectroscopy (FT-IR), X-ray diffraction (XRD), and scanning electron microscopy (SEM).</p>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":null,"pages":null},"PeriodicalIF":2.4,"publicationDate":"2023-03-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42119878","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A novel eco-friendly methods for simultaneous determination of aspirin, clopidogrel, and atorvastatin or rosuvastatin in their fixed-dose combination using chemometric techniques and artificial neural networks 利用化学计量学技术和人工神经网络同时测定阿司匹林、氯吡格雷和阿托伐他汀或瑞舒伐他汀固定剂量组合的新型生态友好方法
IF 2.4 4区 化学 Q1 Mathematics Pub Date : 2023-03-13 DOI: 10.1002/cem.3474
Norhan S. AlSawy, Ehab F. ElKady, Eman A. Mostafa

In this study, the simultaneous determination of aspirin, clopidogrel, and either atorvastatin or rosuvastatin in their fixed-dose combination (FDC) formulations has been reported. As a straightforward substitute for employing distinct models for each component, UV spectrophotometry was applied with chemometric approaches and artificial neural networks to achieve this. Three chemometric techniques, including principal component regression (PCR), partial least-squares (PLS), and classical least-squares (CLS), were applied in addition to the radial basis function-artificial neural network (RBF-ANN). The validation of a set of laboratory-prepared combinations of aspirin, clopidogrel, and atorvastatin in one ternary mixture and aspirin, clopidogrel, and rosuvastatin in a second ternary mixture was assessed, and the results from the use of these approaches were recorded and compared. The absorbance data matrix matching the concentration data matrix in CLS, PCR, and PLS was created using measurements of absorbances in the range of 250–280 nm at intervals of 0.2 nm in their zero-order spectra. Then, in order to forecast the unknown concentrations, calibration or regression was created utilizing the concentration and absorbance data matrices. Using RBF-ANN for the simultaneous determination of aspirin, clopidogrel, and atorvastatin or rosuvastatin in their formulations was achieved by providing the input layer with 151 neurons; there are 2 hidden layers and 3 output neurons were obtained. The green profile of the developed methods has been assessed and compared with previously reported spectrophotometric methods. The suggested techniques were effectively applied to FDC dosage forms that contained the cited medications.

在这项研究中,同时测定阿司匹林、氯吡格雷和阿托伐他汀或瑞舒伐他汀的固定剂量组合(FDC)制剂。作为对每种成分采用不同模型的直接替代品,紫外分光光度法与化学计量方法和人工神经网络一起应用来实现这一目标。除了径向基函数-人工神经网络(RBF - ANN)之外,还应用了三种化学计量学技术,包括主成分回归(PCR)、偏最小二乘(PLS)和经典最小二乘(CLS)。对一组实验室制备的阿司匹林、氯吡格雷和阿托伐他汀三元混合物和阿司匹林、氯吡格雷和瑞舒伐他汀三元混合物的有效性进行了评估,并记录和比较了使用这些方法的结果。在250-280 nm的零阶光谱中,以0.2 nm的间隔测量吸光度,建立了与CLS、PCR和PLS中浓度数据矩阵相匹配的吸光度数据矩阵。然后,为了预测未知浓度,利用浓度和吸光度数据矩阵创建校准或回归。使用RBF - ANN同时测定阿司匹林,氯吡格雷,阿托伐他汀或瑞舒伐他汀的配方是通过提供151个神经元输入层实现的;有2个隐藏层和3个输出神经元。已经评估了所开发方法的绿色轮廓,并与以前报道的分光光度法进行了比较。建议的技术有效地应用于含有引用药物的FDC剂型。
{"title":"A novel eco-friendly methods for simultaneous determination of aspirin, clopidogrel, and atorvastatin or rosuvastatin in their fixed-dose combination using chemometric techniques and artificial neural networks","authors":"Norhan S. AlSawy,&nbsp;Ehab F. ElKady,&nbsp;Eman A. Mostafa","doi":"10.1002/cem.3474","DOIUrl":"10.1002/cem.3474","url":null,"abstract":"<p>In this study, the simultaneous determination of aspirin, clopidogrel, and either atorvastatin or rosuvastatin in their fixed-dose combination (FDC) formulations has been reported. As a straightforward substitute for employing distinct models for each component, UV spectrophotometry was applied with chemometric approaches and artificial neural networks to achieve this. Three chemometric techniques, including principal component regression (PCR), partial least-squares (PLS), and classical least-squares (CLS), were applied in addition to the radial basis function-artificial neural network (RBF-ANN). The validation of a set of laboratory-prepared combinations of aspirin, clopidogrel, and atorvastatin in one ternary mixture and aspirin, clopidogrel, and rosuvastatin in a second ternary mixture was assessed, and the results from the use of these approaches were recorded and compared. The absorbance data matrix matching the concentration data matrix in CLS, PCR, and PLS was created using measurements of absorbances in the range of 250–280 nm at intervals of 0.2 nm in their zero-order spectra. Then, in order to forecast the unknown concentrations, calibration or regression was created utilizing the concentration and absorbance data matrices. Using RBF-ANN for the simultaneous determination of aspirin, clopidogrel, and atorvastatin or rosuvastatin in their formulations was achieved by providing the input layer with 151 neurons; there are 2 hidden layers and 3 output neurons were obtained. The green profile of the developed methods has been assessed and compared with previously reported spectrophotometric methods. The suggested techniques were effectively applied to FDC dosage forms that contained the cited medications.</p>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":null,"pages":null},"PeriodicalIF":2.4,"publicationDate":"2023-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45648486","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Ink source prediction and assessment based on direct analysis in real-time mass spectrometry via the likelihood ratio 墨水源预测和评估基于直接分析在实时质谱通过似然比
IF 2.4 4区 化学 Q1 Mathematics Pub Date : 2023-03-09 DOI: 10.1002/cem.3473
Xiaohong Chen, Xu Yang, Jing-wei Zhang, Hao Tang, Qing-hua Zhang, Ya-chen Wang, Zi-feng Jiang, Yan-ling Liu

Ink classification is the ability to distinguish unknown inks into different groups, and ink source prediction is the ability to predict the manufacturer or model of an unknown ink. These are regular tasks in forensic analysis. The latter is more challenging than the former, as ink source prediction has expanded beyond ink classification. In this work, we reported on an approach to predict the source of black inks based on direct analysis in real time mass spectrometry and assess the strength of black ink source prediction conclusion via the likelihood ratio, using a dataset that included 39 inks from three manufacturers with a high market share. Most of these inks contain similar or identical chemical components. Dimensionality reduction based on the principal component analysis and unified manifold approximation and projection algorithms was implemented, and subsequently, the distribution plots illustrated the variations between and within the inks. Unified manifold approximation and projection showed significant priority in avoiding overcrowding of cluster representation versus principal component analysis, with results as high as 99.83% for the prediction of the ink source using 41,432 spectra data (70% data for training and 30% data for testing) after dimensionality reduction. A likelihood ratio was used to evaluate the strength of ink evidence, and the pool-adjacent-violators algorithm and logistic algorithms were used to calibrate the likelihood ratio. The results showed that the pool-adjacent-violators algorithm and logistic algorithms both had an excellent equal error rate of 0.004 but slightly different results in the rates of misleading evidence in favor of the prosecutor's hypothesis, rates of misleading evidence in favor of the defense's hypothesis, log likelihood ratio costs after calibration (Cllrcal), and minimum log likelihood ratio costs (Cllrmin). A blind test validated the robustness of the methods.

油墨分类是将未知油墨区分为不同组的能力,而油墨来源预测是预测未知油墨的制造商或型号的能力。这些是法医分析中的常规任务。后者比前者更具挑战性,因为墨水来源预测已经扩展到墨水分类之外。在这项工作中,我们报告了一种基于实时质谱直接分析预测黑色墨水来源的方法,并通过似然比评估黑色墨水来源预测结论的强度,使用的数据集包括来自三家市场份额较高的制造商的39种墨水。这些油墨大多含有相似或相同的化学成分。实现了基于主成分分析和统一流形近似和投影算法的降维,随后,分布图显示了油墨之间和内部的变化。与主成分分析相比,统一的流形近似和投影在避免聚类表示过度拥挤方面显示出显著的优先级,在降维后,使用41432个光谱数据(70%的数据用于训练,30%的数据用于测试)预测墨源的结果高达99.83%。似然比用于评估墨迹证据的强度,池相邻违规者算法和逻辑算法用于校准似然比。结果表明,池相邻违反者算法和逻辑算法都有0.004的优秀等误差率,但在有利于检察官假设的误导性证据率、有利于辩方假设的误导证据率、校准后的对数似然比成本(Cllrcal),以及最小对数似然比成本(Cllrmin)。盲测试验证了这些方法的稳健性。
{"title":"Ink source prediction and assessment based on direct analysis in real-time mass spectrometry via the likelihood ratio","authors":"Xiaohong Chen,&nbsp;Xu Yang,&nbsp;Jing-wei Zhang,&nbsp;Hao Tang,&nbsp;Qing-hua Zhang,&nbsp;Ya-chen Wang,&nbsp;Zi-feng Jiang,&nbsp;Yan-ling Liu","doi":"10.1002/cem.3473","DOIUrl":"10.1002/cem.3473","url":null,"abstract":"<p>Ink classification is the ability to distinguish unknown inks into different groups, and ink source prediction is the ability to predict the manufacturer or model of an unknown ink. These are regular tasks in forensic analysis. The latter is more challenging than the former, as ink source prediction has expanded beyond ink classification. In this work, we reported on an approach to predict the source of black inks based on direct analysis in real time mass spectrometry and assess the strength of black ink source prediction conclusion via the likelihood ratio, using a dataset that included 39 inks from three manufacturers with a high market share. Most of these inks contain similar or identical chemical components. Dimensionality reduction based on the principal component analysis and unified manifold approximation and projection algorithms was implemented, and subsequently, the distribution plots illustrated the variations between and within the inks. Unified manifold approximation and projection showed significant priority in avoiding overcrowding of cluster representation versus principal component analysis, with results as high as 99.83% for the prediction of the ink source using 41,432 spectra data (70% data for training and 30% data for testing) after dimensionality reduction. A likelihood ratio was used to evaluate the strength of ink evidence, and the pool-adjacent-violators algorithm and logistic algorithms were used to calibrate the likelihood ratio. The results showed that the pool-adjacent-violators algorithm and logistic algorithms both had an excellent equal error rate of 0.004 but slightly different results in the rates of misleading evidence in favor of the prosecutor's hypothesis, rates of misleading evidence in favor of the defense's hypothesis, log likelihood ratio costs after calibration (\u0000<math>\u0000 <msubsup>\u0000 <mi>C</mi>\u0000 <mi>llr</mi>\u0000 <mi>cal</mi>\u0000 </msubsup></math>), and minimum log likelihood ratio costs (\u0000<math>\u0000 <msubsup>\u0000 <mi>C</mi>\u0000 <mi>llr</mi>\u0000 <mi>min</mi>\u0000 </msubsup></math>). A blind test validated the robustness of the methods.</p>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":null,"pages":null},"PeriodicalIF":2.4,"publicationDate":"2023-03-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43916924","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Noise simulation in classification with the noisemodel R package: Applications analyzing the impact of errors with chemical data 用noisemodel R包进行分类中的噪声模拟:分析化学数据误差影响的应用
IF 2.4 4区 化学 Q1 Mathematics Pub Date : 2023-03-06 DOI: 10.1002/cem.3472
José A. Sáez

Classification datasets created from chemical processes can be affected by errors, which impair the accuracy of the models built. This fact highlights the importance of analyzing the robustness of classifiers against different types and levels of noise to know their behavior against potential errors. In this context, noise models have been proposed to study noise-related phenomenology in a controlled environment, allowing errors to be introduced into the data in a supervised manner. This paper introduces the noisemodel R package, which contains the first extensive implementation of noise models for classification datasets, proposing it as support tool to analyze the impact of errors related to chemical data. It provides 72 noise models found in the specialized literature that allow errors to be introduced in different ways in classes and attributes. Each of them is properly documented and referenced, unifying their results through a specific S3 class, which benefits from customized print, summary and plot methods. The usage of the package is illustrated through four application examples considering real-world chemical datasets, where errors are prone to occur. The software presented will help to deepen the understanding of the problem of noisy chemical data, as well as to develop new robust algorithms and noise preprocessing methods properly adapted to different types of errors in this scenario.

从化学过程中创建的分类数据集可能受到错误的影响,这会损害所建立模型的准确性。这一事实突出了分析分类器对不同类型和级别的噪声的鲁棒性以了解其对潜在错误的行为的重要性。在此背景下,噪声模型被提出用于在受控环境中研究与噪声相关的现象学,允许以监督的方式将误差引入数据中。本文介绍了noisemodel R包,它首次广泛实现了分类数据集的噪声模型,并将其作为分析与化学数据相关的误差影响的支持工具。它提供了在专业文献中发现的72个噪声模型,这些模型允许在类和属性中以不同的方式引入错误。它们中的每一个都有适当的文档和引用,并通过特定的S3类统一它们的结果,该类受益于定制的打印、摘要和绘图方法。通过四个应用实例说明了该软件包的使用,这些应用实例考虑了现实世界的化学数据集,其中容易发生错误。所提供的软件将有助于加深对噪声化学数据问题的理解,以及开发新的鲁棒算法和噪声预处理方法,以适应这种情况下不同类型的误差。
{"title":"Noise simulation in classification with the noisemodel R package: Applications analyzing the impact of errors with chemical data","authors":"José A. Sáez","doi":"10.1002/cem.3472","DOIUrl":"10.1002/cem.3472","url":null,"abstract":"<p>Classification datasets created from chemical processes can be affected by errors, which impair the accuracy of the models built. This fact highlights the importance of analyzing the robustness of classifiers against different types and levels of noise to know their behavior against potential errors. In this context, noise models have been proposed to study noise-related phenomenology in a controlled environment, allowing errors to be introduced into the data in a supervised manner. This paper introduces the <i>noisemodel</i> R package, which contains the first extensive implementation of noise models for classification datasets, proposing it as support tool to analyze the impact of errors related to chemical data. It provides 72 noise models found in the specialized literature that allow errors to be introduced in different ways in classes and attributes. Each of them is properly documented and referenced, unifying their results through a specific S3 class, which benefits from customized print, summary and plot methods. The usage of the package is illustrated through four application examples considering real-world chemical datasets, where errors are prone to occur. The software presented will help to deepen the understanding of the problem of noisy chemical data, as well as to develop new robust algorithms and noise preprocessing methods properly adapted to different types of errors in this scenario.</p>","PeriodicalId":15274,"journal":{"name":"Journal of Chemometrics","volume":null,"pages":null},"PeriodicalIF":2.4,"publicationDate":"2023-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/cem.3472","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41398798","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
期刊
Journal of Chemometrics
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1