Enhanced data point importance: Layered significance of variables in multivariate calibration

IF 5.7 2区 化学 Q1 CHEMISTRY, ANALYTICAL Analytica Chimica Acta Pub Date : 2024-10-22 DOI:10.1016/j.aca.2024.343357
Somaye Vali Zade , Klaus Neymeyr , Mathias Sawall , Hamid Abdollahi
{"title":"Enhanced data point importance: Layered significance of variables in multivariate calibration","authors":"Somaye Vali Zade ,&nbsp;Klaus Neymeyr ,&nbsp;Mathias Sawall ,&nbsp;Hamid Abdollahi","doi":"10.1016/j.aca.2024.343357","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>The Enhanced Data Point Importance (EDPI) method, a systematic approach for evaluating the importance of data points in multivariate calibration, is introduced. Factor decomposition methods allow for the evaluation of the impact of variables on maintaining the structural pattern of data in the abstract space. Essential data points play a key role in these patterns and the method of Data Point Importance (DPI) aims to evaluate the essential data points in terms of their importance. All other points are rated by zero. In this contribution, DPI is extended to include inner points to evaluate the importance of these points in the absence of the essential points. The EDPI method employs convex peeling to sort data points systematically.</div></div><div><h3>Results</h3><div>EDPI method was applied to near-infrared and Raman spectroscopy data sets, including corn and alcohol mixtures and simulated data, to rank and select important variables. EDPI effectively identified variables that contributed to the preservation of the data structure and highlighted key spectral regions with different degrees of selectivity. In the alcohol dataset, EDPI revealed important physicochemical insights by focusing on specific regions where non-analytes spectra overlapped. It performed in a similar way to the Variable Importance in Projection (VIP) method, but with fewer variables selected.</div></div><div><h3>Significance</h3><div>The experimental results obtained from calibrating near-infrared and Raman spectroscopic datasets using partial least squares highlight the effectiveness of the proposed EDPI strategy when contrasted with the conventional variable importance in projection (VIP) method for variable selection.</div></div>","PeriodicalId":240,"journal":{"name":"Analytica Chimica Acta","volume":"1332 ","pages":"Article 343357"},"PeriodicalIF":5.7000,"publicationDate":"2024-10-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Analytica Chimica Acta","FirstCategoryId":"92","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0003267024011589","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, ANALYTICAL","Score":null,"Total":0}
引用次数: 0

Abstract

Background

The Enhanced Data Point Importance (EDPI) method, a systematic approach for evaluating the importance of data points in multivariate calibration, is introduced. Factor decomposition methods allow for the evaluation of the impact of variables on maintaining the structural pattern of data in the abstract space. Essential data points play a key role in these patterns and the method of Data Point Importance (DPI) aims to evaluate the essential data points in terms of their importance. All other points are rated by zero. In this contribution, DPI is extended to include inner points to evaluate the importance of these points in the absence of the essential points. The EDPI method employs convex peeling to sort data points systematically.

Results

EDPI method was applied to near-infrared and Raman spectroscopy data sets, including corn and alcohol mixtures and simulated data, to rank and select important variables. EDPI effectively identified variables that contributed to the preservation of the data structure and highlighted key spectral regions with different degrees of selectivity. In the alcohol dataset, EDPI revealed important physicochemical insights by focusing on specific regions where non-analytes spectra overlapped. It performed in a similar way to the Variable Importance in Projection (VIP) method, but with fewer variables selected.

Significance

The experimental results obtained from calibrating near-infrared and Raman spectroscopic datasets using partial least squares highlight the effectiveness of the proposed EDPI strategy when contrasted with the conventional variable importance in projection (VIP) method for variable selection.

Abstract Image

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
增强数据点重要性:多变量校准中变量的分层重要性
背景介绍增强数据点重要性(EDPI)方法是一种评估多元校准中数据点重要性的系统方法。因子分解方法可以评估变量对维持抽象空间中数据结构模式的影响。基本数据点在这些模式中起着关键作用,数据点重要性(DPI)方法旨在评估基本数据点的重要性。所有其他点的评分均为零。在本文中,DPI 扩展到包括内部点,以便在没有基本点的情况下评估这些点的重要性。结果将 EDPI 方法应用于近红外和拉曼光谱数据集,包括玉米和酒精混合物以及模拟数据,对重要变量进行排序和选择。EDPI 有效地识别了有助于保持数据结构的变量,并突出了具有不同选择性的关键光谱区域。在酒精数据集中,EDPI 通过关注非分析物光谱重叠的特定区域,揭示了重要的物理化学观点。它的性能与投影变量重要性(VIP)方法类似,但选择的变量更少。重要意义使用偏最小二乘法校准近红外和拉曼光谱数据集所获得的实验结果,与传统的投影变量重要性(VIP)变量选择方法相比,凸显了所提出的 EDPI 策略的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Analytica Chimica Acta
Analytica Chimica Acta 化学-分析化学
CiteScore
10.40
自引率
6.50%
发文量
1081
审稿时长
38 days
期刊介绍: Analytica Chimica Acta has an open access mirror journal Analytica Chimica Acta: X, sharing the same aims and scope, editorial team, submission system and rigorous peer review. Analytica Chimica Acta provides a forum for the rapid publication of original research, and critical, comprehensive reviews dealing with all aspects of fundamental and applied modern analytical chemistry. The journal welcomes the submission of research papers which report studies concerning the development of new and significant analytical methodologies. In determining the suitability of submitted articles for publication, particular scrutiny will be placed on the degree of novelty and impact of the research and the extent to which it adds to the existing body of knowledge in analytical chemistry.
期刊最新文献
Polypeptide functionalized gold nanoprobes for SERS-fluorescence detection and imaging of caspase-9 during apoptosis In-situ nucleic acid amplification induced by DNA self-assembly for rapid and ultrasensitive detection of miRNA Comprehensive Host Cell Proteins Profiling in Biopharmaceuticals by A Sensitivity Enhanced Mass Spectrometry Strategy Using TMT-labeling and Signal Boosting Development of a fully automated slurry sampling introduction system for GF-AAS and its application for the determination of cadmium in different matrices Measurement of β-Nuclides in Various Solutions Using Plastic Scintillation Microspheres
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1