Alexander W. Rogers , Amanda Lane , Cesar Mendoza , Simon Watson , Adam Kowalski , Philip Martin , Dongda Zhang
{"title":"Integrating feature attribution and symbolic regression for automatic model structure identification and strategic sampling","authors":"Alexander W. Rogers , Amanda Lane , Cesar Mendoza , Simon Watson , Adam Kowalski , Philip Martin , Dongda Zhang","doi":"10.1016/j.compchemeng.2025.109036","DOIUrl":null,"url":null,"abstract":"<div><div>In today's competitive and dynamic global markets, rapidly designing processes for formulated products – complex blends such as cosmetics, detergents, or personal care goods – is both essential and challenging. Understanding how processing conditions and chemical composition interact to determine product key performance indicators (KPIs) often remains unclear. In this work, we introduce a novel model-based design of experiments (MbDoE) framework that combines artificial neural network feature attribution with symbolic regression (SR) to uncover interpretable physical relationships. By leveraging feature attribution to guide the search within SR's large combinatorial space, our method efficiently targets structural improvements in candidate models. Additionally, a strategic sampling approach determining the most informative time points to measure KPI determining attributes ensures that each experiment yields maximum information. Applied to a comprehensive in-silico case study, the framework successfully recovered the differential equations for the underlying mechanisms driving the rate of change in the KPIs during the formulated product manufacturing process and reduced the required number of experiments threefold, even with limited data availability. These results highlight the significant potential of artificial neural network guided SR-MbDoE to accelerate process flow diagram development, enhance understanding of complex formulated processes, and improve decision-making in the chemical industry.</div></div>","PeriodicalId":286,"journal":{"name":"Computers & Chemical Engineering","volume":"197 ","pages":"Article 109036"},"PeriodicalIF":3.9000,"publicationDate":"2025-02-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Chemical Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0098135425000407","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
In today's competitive and dynamic global markets, rapidly designing processes for formulated products – complex blends such as cosmetics, detergents, or personal care goods – is both essential and challenging. Understanding how processing conditions and chemical composition interact to determine product key performance indicators (KPIs) often remains unclear. In this work, we introduce a novel model-based design of experiments (MbDoE) framework that combines artificial neural network feature attribution with symbolic regression (SR) to uncover interpretable physical relationships. By leveraging feature attribution to guide the search within SR's large combinatorial space, our method efficiently targets structural improvements in candidate models. Additionally, a strategic sampling approach determining the most informative time points to measure KPI determining attributes ensures that each experiment yields maximum information. Applied to a comprehensive in-silico case study, the framework successfully recovered the differential equations for the underlying mechanisms driving the rate of change in the KPIs during the formulated product manufacturing process and reduced the required number of experiments threefold, even with limited data availability. These results highlight the significant potential of artificial neural network guided SR-MbDoE to accelerate process flow diagram development, enhance understanding of complex formulated processes, and improve decision-making in the chemical industry.
期刊介绍:
Computers & Chemical Engineering is primarily a journal of record for new developments in the application of computing and systems technology to chemical engineering problems.