A knowledge-informed burst-sparsity learning (BSL) with non-uniform pattern-coupled prior for spectroscopic regression

IF 3.8 2区 化学 Q2 AUTOMATION & CONTROL SYSTEMS Chemometrics and Intelligent Laboratory Systems Pub Date : 2025-03-20 DOI:10.1016/j.chemolab.2025.105378
Haoran Li , Pengcheng Wu , Shihong Ding , Tao Chen , Xiaobo Zou , Jisheng Dai
{"title":"A knowledge-informed burst-sparsity learning (BSL) with non-uniform pattern-coupled prior for spectroscopic regression","authors":"Haoran Li ,&nbsp;Pengcheng Wu ,&nbsp;Shihong Ding ,&nbsp;Tao Chen ,&nbsp;Xiaobo Zou ,&nbsp;Jisheng Dai","doi":"10.1016/j.chemolab.2025.105378","DOIUrl":null,"url":null,"abstract":"<div><div>In this paper, we propose a knowledge-informed spectroscopic regression method named burst-sparsity learning (BSL) to address limitations in interpretability and consistency analysis. The concept of burst-sparsity (BS) refers to the distribution of chemically relevant structures inspired by spectral response mechanisms, characterized by significant variables that are sparse and occur in clusters. First, we formulate spectroscopic regression as a sparse recovery problem using the sparse Bayesian learning (SBL) model, which leverages the flexibility of SBL to provide an accurate sparse representation and allows for the integration of prior knowledge. Second, since the BS structure is unavailable, an enhanced non-uniform pattern-coupled (PC) prior was developed to capture more BS structures by considering adjacent coefficients. Extensive experiments are conducted to verify the efficacy of the BSL method. The results show that the BSL enhances the prediction performance in term of RMSEP and <strong>R<sub>p</sub><sup>2</sup></strong> across various spectroscopic techniques and dataset scales, highlighting its impressive potential for real-world applications. In additional, the deep combination of domain knowledge into machine learning provides deeper insights into how chemically relevant features contribute to the model’s predictions.</div></div>","PeriodicalId":9774,"journal":{"name":"Chemometrics and Intelligent Laboratory Systems","volume":"262 ","pages":"Article 105378"},"PeriodicalIF":3.8000,"publicationDate":"2025-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chemometrics and Intelligent Laboratory Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0169743925000632","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

In this paper, we propose a knowledge-informed spectroscopic regression method named burst-sparsity learning (BSL) to address limitations in interpretability and consistency analysis. The concept of burst-sparsity (BS) refers to the distribution of chemically relevant structures inspired by spectral response mechanisms, characterized by significant variables that are sparse and occur in clusters. First, we formulate spectroscopic regression as a sparse recovery problem using the sparse Bayesian learning (SBL) model, which leverages the flexibility of SBL to provide an accurate sparse representation and allows for the integration of prior knowledge. Second, since the BS structure is unavailable, an enhanced non-uniform pattern-coupled (PC) prior was developed to capture more BS structures by considering adjacent coefficients. Extensive experiments are conducted to verify the efficacy of the BSL method. The results show that the BSL enhances the prediction performance in term of RMSEP and Rp2 across various spectroscopic techniques and dataset scales, highlighting its impressive potential for real-world applications. In additional, the deep combination of domain knowledge into machine learning provides deeper insights into how chemically relevant features contribute to the model’s predictions.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
一种具有非均匀模式耦合先验的知识渊博的突发稀疏学习(BSL)用于光谱回归
本文提出了一种基于知识的光谱回归方法——突发稀疏学习(burst-sparsity learning, BSL),以解决可解释性和一致性分析的局限性。突发稀疏性(BS)的概念是指受光谱响应机制启发的化学相关结构的分布,其特征是显著变量稀疏且出现在簇中。首先,我们使用稀疏贝叶斯学习(SBL)模型将光谱回归表述为稀疏恢复问题,该模型利用SBL的灵活性提供准确的稀疏表示并允许先验知识的集成。其次,由于BS结构不可用,开发了一种增强的非均匀模式耦合(PC)先验,通过考虑相邻系数来捕获更多BS结构。为了验证BSL方法的有效性,进行了大量的实验。结果表明,BSL在各种光谱技术和数据集尺度上提高了RMSEP和Rp2的预测性能,突出了其在实际应用中的巨大潜力。此外,将领域知识深入结合到机器学习中,可以更深入地了解化学相关特征如何有助于模型的预测。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
7.50
自引率
7.70%
发文量
169
审稿时长
3.4 months
期刊介绍: Chemometrics and Intelligent Laboratory Systems publishes original research papers, short communications, reviews, tutorials and Original Software Publications reporting on development of novel statistical, mathematical, or computer techniques in Chemistry and related disciplines. Chemometrics is the chemical discipline that uses mathematical and statistical methods to design or select optimal procedures and experiments, and to provide maximum chemical information by analysing chemical data. The journal deals with the following topics: 1) Development of new statistical, mathematical and chemometrical methods for Chemistry and related fields (Environmental Chemistry, Biochemistry, Toxicology, System Biology, -Omics, etc.) 2) Novel applications of chemometrics to all branches of Chemistry and related fields (typical domains of interest are: process data analysis, experimental design, data mining, signal processing, supervised modelling, decision making, robust statistics, mixture analysis, multivariate calibration etc.) Routine applications of established chemometrical techniques will not be considered. 3) Development of new software that provides novel tools or truly advances the use of chemometrical methods. 4) Well characterized data sets to test performance for the new methods and software. The journal complies with International Committee of Medical Journal Editors'' Uniform requirements for manuscripts.
期刊最新文献
Multimodal fusion of CT features and density for rapid prediction of raw-coal ash Ensemble robust SIMPLS with block-penalized smoothing for scalar-on-function regression Distributed learning of deep residual principal component analysis for large-scale industrial process monitoring A novel statistical framework to address FTIR spectral challenges: Hybrid MARS–PCA/KPCA models for pollutants analysis in honey samples An efficient automated deep spatio-temporal feature learning framework for industrial soft sensing
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1