Integrating feature attribution and symbolic regression for automatic model structure identification and strategic sampling

IF 3.9 2区 工程技术 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS Computers & Chemical Engineering Pub Date : 2025-06-01 Epub Date: 2025-02-11 DOI:10.1016/j.compchemeng.2025.109036
Alexander W. Rogers , Amanda Lane , Cesar Mendoza , Simon Watson , Adam Kowalski , Philip Martin , Dongda Zhang
{"title":"Integrating feature attribution and symbolic regression for automatic model structure identification and strategic sampling","authors":"Alexander W. Rogers ,&nbsp;Amanda Lane ,&nbsp;Cesar Mendoza ,&nbsp;Simon Watson ,&nbsp;Adam Kowalski ,&nbsp;Philip Martin ,&nbsp;Dongda Zhang","doi":"10.1016/j.compchemeng.2025.109036","DOIUrl":null,"url":null,"abstract":"<div><div>In today's competitive and dynamic global markets, rapidly designing processes for formulated products – complex blends such as cosmetics, detergents, or personal care goods – is both essential and challenging. Understanding how processing conditions and chemical composition interact to determine product key performance indicators (KPIs) often remains unclear. In this work, we introduce a novel model-based design of experiments (MbDoE) framework that combines artificial neural network feature attribution with symbolic regression (SR) to uncover interpretable physical relationships. By leveraging feature attribution to guide the search within SR's large combinatorial space, our method efficiently targets structural improvements in candidate models. Additionally, a strategic sampling approach determining the most informative time points to measure KPI determining attributes ensures that each experiment yields maximum information. Applied to a comprehensive in-silico case study, the framework successfully recovered the differential equations for the underlying mechanisms driving the rate of change in the KPIs during the formulated product manufacturing process and reduced the required number of experiments threefold, even with limited data availability. These results highlight the significant potential of artificial neural network guided SR-MbDoE to accelerate process flow diagram development, enhance understanding of complex formulated processes, and improve decision-making in the chemical industry.</div></div>","PeriodicalId":286,"journal":{"name":"Computers & Chemical Engineering","volume":"197 ","pages":"Article 109036"},"PeriodicalIF":3.9000,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Chemical Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0098135425000407","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/11 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

Abstract

In today's competitive and dynamic global markets, rapidly designing processes for formulated products – complex blends such as cosmetics, detergents, or personal care goods – is both essential and challenging. Understanding how processing conditions and chemical composition interact to determine product key performance indicators (KPIs) often remains unclear. In this work, we introduce a novel model-based design of experiments (MbDoE) framework that combines artificial neural network feature attribution with symbolic regression (SR) to uncover interpretable physical relationships. By leveraging feature attribution to guide the search within SR's large combinatorial space, our method efficiently targets structural improvements in candidate models. Additionally, a strategic sampling approach determining the most informative time points to measure KPI determining attributes ensures that each experiment yields maximum information. Applied to a comprehensive in-silico case study, the framework successfully recovered the differential equations for the underlying mechanisms driving the rate of change in the KPIs during the formulated product manufacturing process and reduced the required number of experiments threefold, even with limited data availability. These results highlight the significant potential of artificial neural network guided SR-MbDoE to accelerate process flow diagram development, enhance understanding of complex formulated processes, and improve decision-making in the chemical industry.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
结合特征属性和符号回归实现模型结构自动识别和策略采样
在当今竞争激烈、充满活力的全球市场中,快速设计配方产品(如化妆品、洗涤剂或个人护理用品等复杂混合物)的工艺既重要又具有挑战性。了解加工条件和化学成分如何相互作用以确定产品关键性能指标(kpi)通常仍然不清楚。在这项工作中,我们引入了一种新的基于模型的实验设计(MbDoE)框架,该框架将人工神经网络特征归因与符号回归(SR)相结合,以揭示可解释的物理关系。通过利用特征属性来指导在SR的大组合空间中的搜索,我们的方法有效地针对候选模型的结构改进。此外,确定最具信息量的时间点以测量KPI确定属性的策略抽样方法可确保每个实验产生最大的信息。应用于一个全面的计算机案例研究,该框架成功地恢复了在配方产品制造过程中驱动kpi变化率的潜在机制的微分方程,并将所需的实验数量减少了三倍,即使数据可用性有限。这些结果凸显了人工神经网络引导SR-MbDoE在加速工艺流程图开发、增强对复杂配方工艺的理解以及改善化工行业决策方面的巨大潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Computers & Chemical Engineering
Computers & Chemical Engineering 工程技术-工程:化工
CiteScore
8.70
自引率
14.00%
发文量
374
审稿时长
70 days
期刊介绍: Computers & Chemical Engineering is primarily a journal of record for new developments in the application of computing and systems technology to chemical engineering problems.
期刊最新文献
Kolmogorov-Arnold network driven soft sensors for chemical processes with distributed output Reproducibility of GPU-based Large Eddy Simulations for mixing in stirred tank reactors PlantGraphExpert: A knowledge graph-driven tool for chemical plant operator assistance Strategic design of decentralized multi-hub hydrogen supply chains with LNG value chain integration for global trade Adaptive physics-informed neural network-based digital twins integrated with Ensemble Kalman Filter
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1