首页 > 最新文献

Digital Chemical Engineering最新文献

英文 中文
Computational applications using data driven modeling in process Systems: A review 过程系统中使用数据驱动建模的计算应用:综述
Q2 ENGINEERING, CHEMICAL Pub Date : 2023-09-01 DOI: 10.1016/j.dche.2023.100111
Sumit K. Bishnu , Sabla Y. Alnouri , Dhabia M. Al-Mohannadi

Modeling and optimization of various processes enable more efficient operations and better planning activities for new process developments. With recent advances in computing power, data driven models, such as Machine Learning (ML), are being extensively applied in many areas of chemical engineering topics. Compared to mechanistic models that often do not reflect the realities of field conditions and the high costs associated with them, these techniques are relatively easier to implement. Data-driven models generated via ML techniques can be regularly updated, thereby giving an accurate picture of the system. Due to these inherent benefits, such tools are increasingly gaining a lot of traction in process systems. Even though data-driven models have the potential to be used as a replacement for traditional optimization tools that can be implemented in various process industries, it was found that applications of such models in process systems were quite limited to reactor modeling, molecular design, as well as safety, and relatability. The challenge still exists for data-driven modeling due to the lack of specialized tools tailored for macro systems and scale up. Most datasets were found to be derived from experimental studies which are limited in nature and only fit into microsystems. Hence, this paper provides a state of the art review on recent applications for data driven modeling research in process systems, and discusses the prominent challenges and future outlooks that were observed.

各种过程的建模和优化使得更有效的操作和更好的新过程开发计划活动成为可能。随着计算能力的进步,数据驱动模型,如机器学习(ML),正在广泛应用于化学工程主题的许多领域。与机械模型相比,机械模型往往不能反映现场的实际情况,而且与此相关的成本很高,这些技术相对容易实施。通过ML技术生成的数据驱动模型可以定期更新,从而给出系统的准确图像。由于这些固有的好处,这些工具在过程系统中越来越受欢迎。尽管数据驱动模型有可能被用作传统优化工具的替代品,可以在各种过程工业中实现,但我们发现,这些模型在过程系统中的应用非常局限于反应器建模、分子设计、以及安全性和相关性。由于缺乏针对宏观系统和扩展的专门工具,数据驱动建模的挑战仍然存在。发现大多数数据集来自实验研究,这些研究在本质上是有限的,只适合微系统。因此,本文对过程系统中数据驱动建模研究的最新应用进行了回顾,并讨论了所观察到的突出挑战和未来前景。
{"title":"Computational applications using data driven modeling in process Systems: A review","authors":"Sumit K. Bishnu ,&nbsp;Sabla Y. Alnouri ,&nbsp;Dhabia M. Al-Mohannadi","doi":"10.1016/j.dche.2023.100111","DOIUrl":"10.1016/j.dche.2023.100111","url":null,"abstract":"<div><p>Modeling and optimization of various processes enable more efficient operations and better planning activities for new process developments. With recent advances in computing power, data driven models, such as Machine Learning (ML), are being extensively applied in many areas of chemical engineering topics. Compared to mechanistic models that often do not reflect the realities of field conditions and the high costs associated with them, these techniques are relatively easier to implement. Data-driven models generated via ML techniques can be regularly updated, thereby giving an accurate picture of the system. Due to these inherent benefits, such tools are increasingly gaining a lot of traction in process systems. Even though data-driven models have the potential to be used as a replacement for traditional optimization tools that can be implemented in various process industries, it was found that applications of such models in process systems were quite limited to reactor modeling, molecular design, as well as safety, and relatability. The challenge still exists for data-driven modeling due to the lack of specialized tools tailored for macro systems and scale up. Most datasets were found to be derived from experimental studies which are limited in nature and only fit into microsystems. Hence, this paper provides a state of the art review on recent applications for data driven modeling research in process systems, and discusses the prominent challenges and future outlooks that were observed.</p></div>","PeriodicalId":72815,"journal":{"name":"Digital Chemical Engineering","volume":"8 ","pages":"Article 100111"},"PeriodicalIF":0.0,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48016266","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A one-class support vector machine for detecting valve stiction 一种检测气门静摩擦力的一类支持向量机
Q2 ENGINEERING, CHEMICAL Pub Date : 2023-09-01 DOI: 10.1016/j.dche.2023.100116
Harrison O’Neill , Yousaf Khalid , Graham Spink , Patrick Thorpe

In industrial processes, control valve stiction is known to be one of the primary causes for poor control loop performance. Stiction introduces oscillatory behaviour in the process, leading to increased energy consumption, variations in product quality, shortened equipment lifespan and a reduction in overall plant profitability. Several detection algorithms using routine operating data have been developed over the last few decades. However, with the exception of a handful of recent publications, few attempts to apply classical supervised learning techniques have been published thus far. In this work, principal component analysis, linear discriminant analysis and a one-class support vector machine are trained to detect stiction using time series features as input. These features are extracted from the data using the tsfresh package for Python. The training data consists of simulated stiction examples generated using the XCH stiction model as well as other sources of oscillation. The classifier is subsequently benchmarked against closed-loop stiction data collected in an industrial setting, with performance exceeding that of existing methods.

在工业过程中,控制阀的粘滞是导致控制回路性能差的主要原因之一。粘滞在过程中引入振荡行为,导致能源消耗增加,产品质量变化,设备寿命缩短,工厂整体盈利能力降低。在过去的几十年里,已经开发了几种使用常规操作数据的检测算法。然而,除了少数最近的出版物外,迄今为止很少有应用经典监督学习技术的尝试发表。在这项工作中,主成分分析、线性判别分析和一类支持向量机被训练成使用时间序列特征作为输入来检测粘滞。这些特性是使用Python的tsfresh包从数据中提取的。训练数据包括使用XCH粘滞模型和其他振荡源生成的模拟粘滞样例。分类器随后对在工业环境中收集的闭环伸缩数据进行基准测试,其性能超过现有方法。
{"title":"A one-class support vector machine for detecting valve stiction","authors":"Harrison O’Neill ,&nbsp;Yousaf Khalid ,&nbsp;Graham Spink ,&nbsp;Patrick Thorpe","doi":"10.1016/j.dche.2023.100116","DOIUrl":"10.1016/j.dche.2023.100116","url":null,"abstract":"<div><p>In industrial processes, control valve stiction is known to be one of the primary causes for poor control loop performance. Stiction introduces oscillatory behaviour in the process, leading to increased energy consumption, variations in product quality, shortened equipment lifespan and a reduction in overall plant profitability. Several detection algorithms using routine operating data have been developed over the last few decades. However, with the exception of a handful of recent publications, few attempts to apply classical supervised learning techniques have been published thus far. In this work, principal component analysis, linear discriminant analysis and a one-class support vector machine are trained to detect stiction using time series features as input. These features are extracted from the data using the <span>tsfresh</span> package for Python. The training data consists of simulated stiction examples generated using the XCH stiction model as well as other sources of oscillation. The classifier is subsequently benchmarked against closed-loop stiction data collected in an industrial setting, with performance exceeding that of existing methods.</p></div>","PeriodicalId":72815,"journal":{"name":"Digital Chemical Engineering","volume":"8 ","pages":"Article 100116"},"PeriodicalIF":0.0,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49218076","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Insight into evolutionary optimization approach of batch and fed-batch fermenters for lactic acid production 乳酸生产分批和补料分批发酵器进化优化方法初探
Q2 ENGINEERING, CHEMICAL Pub Date : 2023-09-01 DOI: 10.1016/j.dche.2023.100105
Ashish M. Gujarathi , Swaprabha P. Patel , Badria Al Siyabi

Differential evolution (DE) algorithm and genetic algorithm (GA) are used in this study to estimate a set of kinetic parameters for Arabic date juice-based lactic acid production via batch and fed-batch mode of fermentation. Different feeding approaches like feed-forward control, exponential-, and modified exponential- feed are employed to obtain optimum kinetic parameters. The global optimum sets of kinetic parameters for both fermentation methods are found by minimizing the least square error between the experimental data and the simulated model results. In both batch and fed-batch fermentation methods (including different feeding strategies) the DE algorithm resulted in either the least value of the objective function or the least value of the sum of the square of residual errors between the experimental and model-predicted values for biomass growth (X), substrate consumption (S), and product formation (P). Six different strategies of the DE algorithm are used and their performance is compared for exponential feeding fed-batch fermenter. For exponential feeding fed-batch fermenter best suitable DE strategies were found to be best/1/bin and current to best/1/bin based on algorithm control parameters analysis. This manuscript highlights the limitations and improvements in the performance of individual algorithms on the given biochemical fermenters.

采用差分进化(DE)算法和遗传算法(GA)对分批发酵和补料分批发酵方式生产阿拉伯枣汁乳酸的动力学参数进行了估计。采用前馈控制、指数进给和改进指数进给等不同的进给方法来获得最优的动力学参数。通过最小化实验数据与模拟模型结果之间的最小二乘误差,找到两种发酵方法的全局最优动力学参数集。在分批和补料分批发酵方法(包括不同的投料策略)中,DE算法的结果要么是目标函数的最小值,要么是生物量生长(X)、底物消耗(S)和产物形成(P)的实验值与模型预测值之间的残差平方和的最小值。使用了六种不同的DE算法策略,并比较了它们在指数投料分批发酵罐中的性能。通过对算法控制参数的分析,确定了指数进料间歇式发酵罐最适合的DE策略为best/1/bin和current to best/1/bin。这篇手稿强调了在给定的生化发酵罐上单个算法性能的局限性和改进。
{"title":"Insight into evolutionary optimization approach of batch and fed-batch fermenters for lactic acid production","authors":"Ashish M. Gujarathi ,&nbsp;Swaprabha P. Patel ,&nbsp;Badria Al Siyabi","doi":"10.1016/j.dche.2023.100105","DOIUrl":"10.1016/j.dche.2023.100105","url":null,"abstract":"<div><p>Differential evolution (DE) algorithm and genetic algorithm (GA) are used in this study to estimate a set of kinetic parameters for Arabic date juice-based lactic acid production via batch and fed-batch mode of fermentation. Different feeding approaches like feed-forward control, exponential-, and modified exponential- feed are employed to obtain optimum kinetic parameters. The global optimum sets of kinetic parameters for both fermentation methods are found by minimizing the least square error between the experimental data and the simulated model results. In both batch and fed-batch fermentation methods (including different feeding strategies) the DE algorithm resulted in either the least value of the objective function or the least value of the sum of the square of residual errors between the experimental and model-predicted values for biomass growth (X), substrate consumption (S), and product formation (P). Six different strategies of the DE algorithm are used and their performance is compared for exponential feeding fed-batch fermenter. For exponential feeding fed-batch fermenter best suitable DE strategies were found to be best/1/bin and current to best/1/bin based on algorithm control parameters analysis. This manuscript highlights the limitations and improvements in the performance of individual algorithms on the given biochemical fermenters.</p></div>","PeriodicalId":72815,"journal":{"name":"Digital Chemical Engineering","volume":"8 ","pages":"Article 100105"},"PeriodicalIF":0.0,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46534728","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Machine learning based modelling and optimization of post-combustion carbon capture process using MEA supporting carbon neutrality 使用支持碳中和的MEA对燃烧后碳捕获过程进行基于机器学习的建模和优化
Q2 ENGINEERING, CHEMICAL Pub Date : 2023-09-01 DOI: 10.1016/j.dche.2023.100115
Waqar Muhammad Ashraf, Vivek Dua

The role of carbon capture technology using monoethanolamine (MEA) is critical for achieving the carbon-neutrality goal. However, maintaining the efficient operation of the post-combustion carbon capture is challenging considering the hyperdimensional design space and nonlinear characteristics of the process. In this work, CO2 capture level from the flue gas in the absorption column is investigated for the post-combustion carbon capture process using MEA. Artificial neural network (ANN) and support vector machine (SVM) models are constructed to model CO2 capture level under extensive hyperparameters tuning. The comparative performance analysis based on external validation test confirmed the superior modelling and generalization ability of ANN for the carbon capture process. Later, partial derivative-based sensitivity analysis is carried out and it is the found that absorbent-based input variables like lean solvent temperature and lean solvent flow rate are the two most significant input variables on CO2 capture level in the absorption column. The optimization problem with the ANN model embedded in the nonlinear programming-based optimization environment is solved under different operating scenarios to determine the optimum operating ranges for the input variables corresponding to the maximum CO2 capture level. This research presents the optimum operating conditions for CO2 removal from the flue gas for the post-combustion carbon capture process using MEA that contributes to achieving the carbon neutrality goal.

使用单乙醇胺(MEA)的碳捕集技术的作用对于实现碳中和目标至关重要。然而,考虑到该过程的超维设计空间和非线性特征,保持燃烧后碳捕获的有效运行是具有挑战性的。在这项工作中,利用MEA研究了燃烧后碳捕集过程中吸收塔烟气中的CO2捕集水平。建立了人工神经网络(ANN)和支持向量机(SVM)模型,对CO2捕获水平进行了广泛超参数整定。基于外部验证试验的性能对比分析证实了人工神经网络在碳捕集过程中具有优越的建模和泛化能力。随后,进行了基于偏导数的敏感性分析,发现基于吸收剂的输入变量如贫溶剂温度和贫溶剂流速是影响吸收塔CO2捕集水平的两个最显著的输入变量。将人工神经网络模型嵌入到基于非线性规划的优化环境中,求解不同运行场景下的优化问题,确定最大CO2捕集水平对应的输入变量的最佳运行范围。本研究提出了利用MEA从燃烧后碳捕集过程中烟气中去除二氧化碳的最佳操作条件,有助于实现碳中和目标。
{"title":"Machine learning based modelling and optimization of post-combustion carbon capture process using MEA supporting carbon neutrality","authors":"Waqar Muhammad Ashraf,&nbsp;Vivek Dua","doi":"10.1016/j.dche.2023.100115","DOIUrl":"10.1016/j.dche.2023.100115","url":null,"abstract":"<div><p>The role of carbon capture technology using monoethanolamine (MEA) is critical for achieving the carbon-neutrality goal. However, maintaining the efficient operation of the post-combustion carbon capture is challenging considering the hyperdimensional design space and nonlinear characteristics of the process. In this work, CO<sub>2</sub> capture level from the flue gas in the absorption column is investigated for the post-combustion carbon capture process using MEA. Artificial neural network (ANN) and support vector machine (SVM) models are constructed to model CO<sub>2</sub> capture level under extensive hyperparameters tuning. The comparative performance analysis based on external validation test confirmed the superior modelling and generalization ability of ANN for the carbon capture process. Later, partial derivative-based sensitivity analysis is carried out and it is the found that absorbent-based input variables like lean solvent temperature and lean solvent flow rate are the two most significant input variables on CO<sub>2</sub> capture level in the absorption column. The optimization problem with the ANN model embedded in the nonlinear programming-based optimization environment is solved under different operating scenarios to determine the optimum operating ranges for the input variables corresponding to the maximum CO<sub>2</sub> capture level. This research presents the optimum operating conditions for CO<sub>2</sub> removal from the flue gas for the post-combustion carbon capture process using MEA that contributes to achieving the carbon neutrality goal.</p></div>","PeriodicalId":72815,"journal":{"name":"Digital Chemical Engineering","volume":"8 ","pages":"Article 100115"},"PeriodicalIF":0.0,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49369797","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Artificial intelligence driven smart operation of large industrial complexes supporting the net-zero goal: Coal power plants 人工智能驱动的大型工业综合体智能运营支持净零目标:煤电厂
Q2 ENGINEERING, CHEMICAL Pub Date : 2023-09-01 DOI: 10.1016/j.dche.2023.100119
Waqar Muhammad Ashraf, Vivek Dua

The true potential of artificial intelligence (AI) is to contribute towards the performance enhancement and informed decision making for the operation of the large industrial complexes like coal power plants. In this paper, AI based modelling and optimization framework is developed and deployed for the smart and efficient operation of a 660 MW supercritical coal power plant. The industrial data under various power generation capacity of the plant is collected, visualized, processed and subsequently, utilized to train artificial neural network (ANN) model for predicting the power generation. The ANN model presents good predictability and generalization performance in external validation test with R2 = 0.99 and RMSE =2.69 MW. The partial derivative of the ANN model is taken with respect to the input variable to evaluate the variable’ sensitivity on the power generation. It is found that main steam flow rate is the most significant variable having percentage significance value of 75.3 %. Nonlinear programming (NLP) technique is applied to maximize the power generation. The NLP-simulated optimized values of the input variables are verified on the power generation operation. The plant-level performance indicators are improved under optimum operating mode of power generation: savings in fuel consumption (3 t/h), improvement in thermal efficiency (1.3 %) and reduction in emissions discharge (50.5 kt/y). It is also investigated that maximum power production capacity of the plant is reduced from 660 MW to 635 MW when the emissions discharge limit is changed from 510 t/h to 470 t/h. It is concluded that the improved plant-level performance indicators and informed decision making present the potential of AI based modelling and optimization analysis to reliably contribute to net-zero goal from the coal power plant.

人工智能(AI)的真正潜力是为煤电厂等大型工业园区的运营提高绩效和明智决策做出贡献。为实现660 MW超临界燃煤电厂的智能高效运行,开发并部署了基于人工智能的建模与优化框架。对电厂不同发电量下的工业数据进行采集、可视化、处理后,用于训练人工神经网络(ANN)模型进行发电量预测。在外部验证试验中,ANN模型具有良好的可预测性和泛化性能,R2 = 0.99, RMSE =2.69 MW。对输入变量求神经网络模型的偏导数,以评估该变量对发电的敏感性。发现主蒸汽流量是最显著的变量,其百分比显著性值为75.3%。采用非线性规划(NLP)技术实现发电最大化。在发电运行中验证了nlp模拟的输入变量优化值。在发电的最佳运行模式下,工厂一级的性能指标得到改善:节省燃料消耗(3吨/小时),提高热效率(1.3%)和减少排放(50.5千吨/年)。研究发现,当排放限值由510 t/h提高到470 t/h时,电厂的最大发电能力由660 MW降低到635 MW。结论是,改进的工厂级绩效指标和知情决策显示了基于人工智能的建模和优化分析的潜力,可以可靠地为燃煤电厂的净零目标做出贡献。
{"title":"Artificial intelligence driven smart operation of large industrial complexes supporting the net-zero goal: Coal power plants","authors":"Waqar Muhammad Ashraf,&nbsp;Vivek Dua","doi":"10.1016/j.dche.2023.100119","DOIUrl":"10.1016/j.dche.2023.100119","url":null,"abstract":"<div><p>The true potential of artificial intelligence (AI) is to contribute towards the performance enhancement and informed decision making for the operation of the large industrial complexes like coal power plants. In this paper, AI based modelling and optimization framework is developed and deployed for the smart and efficient operation of a 660 MW supercritical coal power plant. The industrial data under various power generation capacity of the plant is collected, visualized, processed and subsequently, utilized to train artificial neural network (ANN) model for predicting the power generation. The ANN model presents good predictability and generalization performance in external validation test with R<sup>2</sup> = 0.99 and RMSE =2.69 MW. The partial derivative of the ANN model is taken with respect to the input variable to evaluate the variable’ sensitivity on the power generation. It is found that main steam flow rate is the most significant variable having percentage significance value of 75.3 %. Nonlinear programming (NLP) technique is applied to maximize the power generation. The NLP-simulated optimized values of the input variables are verified on the power generation operation. The plant-level performance indicators are improved under optimum operating mode of power generation: savings in fuel consumption (3 t/h), improvement in thermal efficiency (1.3 %) and reduction in emissions discharge (50.5 kt/y). It is also investigated that maximum power production capacity of the plant is reduced from 660 MW to 635 MW when the emissions discharge limit is changed from 510 t/h to 470 t/h. It is concluded that the improved plant-level performance indicators and informed decision making present the potential of AI based modelling and optimization analysis to reliably contribute to net-zero goal from the coal power plant.</p></div>","PeriodicalId":72815,"journal":{"name":"Digital Chemical Engineering","volume":"8 ","pages":"Article 100119"},"PeriodicalIF":0.0,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45624522","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Process control of mAb production using multi-actor proximal policy optimization 基于多因素近端策略优化的单克隆抗体生产过程控制
Q2 ENGINEERING, CHEMICAL Pub Date : 2023-09-01 DOI: 10.1016/j.dche.2023.100108
Nikita Gupta , Shikhar Anand , Tanuja Joshi , Deepak Kumar , Manojkumar Ramteke , Hariprasad Kodamana

Monoclonal antibodies (mAb) are biopharmaceutical products that improve human immunity. In this work, we propose a multi-actor proximal policy optimization-based reinforcement learning (RL) for the control of mAb production. Here, manipulated variable is flowrate and the control variable is mAb concentration. Based on root mean square error (RMSE) values and convergence performance, it has been observed that multi-actor PPO has performed better as compared to other RL algorithms. It is observed that PPO predicts a 40 % reduction in the number of days to reach the desired concentration. Moreover, the performance of PPO is improved as the number of actors increases. PPO agent shows the best performance with three actors, but on further increasing, its performance deteriorated. These results are verified based on three case studies, namely, (i) for nominal conditions, (ii) in the presence of noise in raw materials and measurements, and (iii) in the presence of stochastic disturbance in temperature and noise in measurements. The results indicate that the proposed approach outperforms the deep deterministic policy gradient (DDPG), twin delayed deep deterministic policy gradient (TD3), and proximal policy optimization (PPO) algorithms for the control of the bioreactor system.

单克隆抗体(mAb)是提高人体免疫力的生物制药产品。在这项工作中,我们提出了一种基于多参与者近端策略优化的强化学习(RL)来控制单克隆抗体的产生。这里,操纵变量为流速,控制变量为mAb浓度。基于均方根误差(RMSE)值和收敛性能,已经观察到与其他RL算法相比,多参与者PPO表现更好。据观察,PPO预测达到所需浓度的天数减少40%。此外,随着参与者数量的增加,PPO的性能也有所提高。PPO助剂在添加三种药剂时表现出最佳的性能,但随着添加量的增加,其性能逐渐下降。这些结果基于三个案例研究进行验证,即(i)标称条件下,(ii)原材料和测量中存在噪声的情况下,以及(iii)测量中存在温度随机干扰和噪声的情况下。结果表明,该方法在生物反应器系统控制方面优于深度确定性策略梯度(DDPG)、双延迟深度确定性策略梯度(TD3)和近端策略优化(PPO)算法。
{"title":"Process control of mAb production using multi-actor proximal policy optimization","authors":"Nikita Gupta ,&nbsp;Shikhar Anand ,&nbsp;Tanuja Joshi ,&nbsp;Deepak Kumar ,&nbsp;Manojkumar Ramteke ,&nbsp;Hariprasad Kodamana","doi":"10.1016/j.dche.2023.100108","DOIUrl":"10.1016/j.dche.2023.100108","url":null,"abstract":"<div><p>Monoclonal antibodies (mAb) are biopharmaceutical products that improve human immunity. In this work, we propose a multi-actor proximal policy optimization-based reinforcement learning (RL) for the control of mAb production. Here, manipulated variable is flowrate and the control variable is mAb concentration. Based on root mean square error (RMSE) values and convergence performance, it has been observed that multi-actor PPO has performed better as compared to other RL algorithms. It is observed that PPO predicts a 40 % reduction in the number of days to reach the desired concentration. Moreover, the performance of PPO is improved as the number of actors increases. PPO agent shows the best performance with three actors, but on further increasing, its performance deteriorated. These results are verified based on three case studies, namely, (i) for nominal conditions, (ii) in the presence of noise in raw materials and measurements, and (iii) in the presence of stochastic disturbance in temperature and noise in measurements. The results indicate that the proposed approach outperforms the deep deterministic policy gradient (DDPG), twin delayed deep deterministic policy gradient (TD3), and proximal policy optimization (PPO) algorithms for the control of the bioreactor system.</p></div>","PeriodicalId":72815,"journal":{"name":"Digital Chemical Engineering","volume":"8 ","pages":"Article 100108"},"PeriodicalIF":0.0,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45164605","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Studies on crystallization process for pharmaceutical compounds using ANN modeling and model based control 药物结晶过程的神经网络建模与模型控制研究
Q2 ENGINEERING, CHEMICAL Pub Date : 2023-09-01 DOI: 10.1016/j.dche.2023.100114
P. Swapna Reddy , Amancha Sucharitha , Narendra Akiti , F. Fenila , Surendra Sasikumar Jampa

Solvent selection and Controlling of operating parameters play a crucial role in batch cooling crystallization process. Choosing a best solvent for crystallization process involves more experimentation and time. To overcome this problem, an Artificial Neural Network (ANN) model technique is used to predict the carbamazepine form Ⅲ solubility by considering the thermodynamic properties of different solvents i.e. critical temperature, critical pressure, temperature, molecular weight, and acentric factor. The ANN model was trained and evaluated for solubility at various input data sets using experimental solubility data available in the literature. The ANN model with 20 hidden neurons has given the R2 value of 0.9943 which shows that the developed ANN model can be used for the selection of best solvent for batch crystallization process. Further, to determine the optimal cooling profile of batch cooling crystallization process, a multi-objective optimization problem is formulated by considering objectives as minimizing the coefficient of variation (CV) and maximizing the Number mean size (NMS) of crystals subjected to population balance equations using “method of moments” technique. Two types of temperature strategies i.e., piece-wise constant and piece-wise linear are developed and solved using NSGA-Ⅱ dynamic optimization procedure. The optimal NMS value attained through piece-wise linear strategy was 197.1 µm. This value has been increased by 28.3 µm from the nominal case (without optimization) and the coefficient of variation has decreased from 0.951 to 0.76. Further, optimal NMS value attained through piece-wise constant strategy was 205 µm. The value has been increased by 36.2 µm and the coefficient of variation has decreased from 0.951 to 0.73. This proves that the crystal attributes can be improved by optimal cooling temperature profile obtained by multi-objective optimization framework. For implementing the optimal cooling profile an advanced model-based control, i.e., Generic Model Control (GMC) was developed. It was observed that the GMC controller has the good tracking profile with no offset with/without disturbances and small value of root mean square error (RMSE) of 0.0016 using piece-wise constant as set point temperature. Using piece-wise linear as set point temperature, the RMSE value was 0.0018. In particular, it is advantageous to operate the batch cooling crystallization process with piece-wise linear strategy for set point trajectory tracking problems.

溶剂的选择和操作参数的控制在间歇冷却结晶过程中起着至关重要的作用。为结晶过程选择最佳溶剂需要更多的实验和时间。为了克服这一问题,采用人工神经网络(ANN)模型技术,综合考虑不同溶剂的热力学性质,即临界温度、临界压力、温度、分子量和偏心因子,预测卡马西平形式Ⅲ溶解度。使用文献中可用的实验溶解度数据,对人工神经网络模型进行训练并评估其在各种输入数据集上的溶解度。有20个隐藏神经元的人工神经网络模型的R2值为0.9943,表明所建立的人工神经网络模型可用于间歇结晶过程中最佳溶剂的选择。此外,为了确定间歇冷却结晶过程的最佳冷却方式,利用“矩量法”技术,以最小变异系数(CV)和最大服从种群平衡方程的晶体平均尺寸(NMS)为目标,建立了多目标优化问题。提出了分段常数和分段线性两种温度策略,并采用NSGA-Ⅱ动态优化程序进行求解。通过分段线性策略获得的最优NMS值为197.1µm。该值比标称情况(未经优化)增加了28.3µm,变异系数从0.951降至0.76。此外,通过分段常数策略获得的最佳NMS值为205µm。该值增加了36.2µm,变异系数从0.951降低到0.73。这证明了通过多目标优化框架得到的最优冷却温度分布可以改善晶体属性。为了实现最优的冷却轮廓,一种先进的基于模型的控制,即通用模型控制(GMC)被开发出来。结果表明,GMC控制器具有良好的跟踪轮廓,无扰动偏移,以分段常数为设定点温度的均方根误差(RMSE)值较小,为0.0016。采用分段线性作为设定点温度,RMSE值为0.0018。尤其对于设定点轨迹跟踪问题,采用分段线性策略操作间歇冷却结晶过程是有利的。
{"title":"Studies on crystallization process for pharmaceutical compounds using ANN modeling and model based control","authors":"P. Swapna Reddy ,&nbsp;Amancha Sucharitha ,&nbsp;Narendra Akiti ,&nbsp;F. Fenila ,&nbsp;Surendra Sasikumar Jampa","doi":"10.1016/j.dche.2023.100114","DOIUrl":"10.1016/j.dche.2023.100114","url":null,"abstract":"<div><p>Solvent selection and Controlling of operating parameters play a crucial role in batch cooling crystallization process. Choosing a best solvent for crystallization process involves more experimentation and time. To overcome this problem, an Artificial Neural Network (ANN) model technique is used to predict the carbamazepine form Ⅲ solubility by considering the thermodynamic properties of different solvents i.e. critical temperature, critical pressure, temperature, molecular weight, and acentric factor. The ANN model was trained and evaluated for solubility at various input data sets using experimental solubility data available in the literature. The ANN model with 20 hidden neurons has given the R<sup>2</sup> value of 0.9943 which shows that the developed ANN model can be used for the selection of best solvent for batch crystallization process. Further, to determine the optimal cooling profile of batch cooling crystallization process, a multi-objective optimization problem is formulated by considering objectives as minimizing the coefficient of variation (CV) and maximizing the Number mean size (NMS) of crystals subjected to population balance equations using “method of moments” technique. Two types of temperature strategies i.e., piece-wise constant and piece-wise linear are developed and solved using NSGA-Ⅱ dynamic optimization procedure. The optimal NMS value attained through piece-wise linear strategy was 197.1 µm. This value has been increased by 28.3 µm from the nominal case (without optimization) and the coefficient of variation has decreased from 0.951 to 0.76. Further, optimal NMS value attained through piece-wise constant strategy was 205 µm. The value has been increased by 36.2 µm and the coefficient of variation has decreased from 0.951 to 0.73. This proves that the crystal attributes can be improved by optimal cooling temperature profile obtained by multi-objective optimization framework. For implementing the optimal cooling profile an advanced model-based control, i.e., Generic Model Control (GMC) was developed. It was observed that the GMC controller has the good tracking profile with no offset with/without disturbances and small value of root mean square error (RMSE) of 0.0016 using piece-wise constant as set point temperature. Using piece-wise linear as set point temperature, the RMSE value was 0.0018. In particular, it is advantageous to operate the batch cooling crystallization process with piece-wise linear strategy for set point trajectory tracking problems.</p></div>","PeriodicalId":72815,"journal":{"name":"Digital Chemical Engineering","volume":"8 ","pages":"Article 100114"},"PeriodicalIF":0.0,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46297285","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A framework for enhancing industrial soft sensor learning models 增强工业软传感器学习模型的框架
Q2 ENGINEERING, CHEMICAL Pub Date : 2023-09-01 DOI: 10.1016/j.dche.2023.100112
João Guilherme Mattos , Patrick Nigri Happ , William Fernandes , Helio Côrtes Vieira Lopes , Simone D J Barbosa , Marcos Kalinowski , Luisa Silveira Rosa , Cassia Novello , Leonardo Dorigo Ribeiro , Patricia Rodrigues Ventura , Marcelo Cardoso Marques , Renato Neves Pitta , Valmir Jose Camolesi , Livia Pereira Lemos Costa , Bruno Itagyba Paravidino , Cristiane Salgado Pereira

Refinery industrial processes are very complex with nonlinear dynamics resulting from varying feedstock characteristics and also from changes in product prioritization. Along these processes, there are key properties of intermediate compounds that must be monitored and controlled since they directly affect the quality of the end products commercialized by these manufacturers. However, most of these properties can only be measured through time-consuming and expensive laboratory analysis, which is impossible to obtain in high frequencies, as required to properly monitor them. In this sense, developing soft sensors is the most common way to obtain high-frequency estimations for these measurements, helping advanced control systems to establish the correct setpoints for temperatures, pressures, and other sensors along the refining process, controlling the quality of end products. Since the amount of labeled data is scarce, most academic research has focused on employing semi- supervised learning strategies to develop machine learning (ML) models as soft sensors. Our research, on the other hand, goes in another direction. We aim to elaborate a framework that leverages the knowledge of domain experts and employs data augmentation techniques to build an enhanced fully labeled dataset that could be fed to any supervised ML algorithm to generate a quality soft sensor. We applied our framework together with Automated ML to train a model capable of predicting a specific key property associated with the production of Naphtha compounds in a refinery: the ASTM 95% distillation temperature of the Heavy Naphtha. Although our framework is model agnostic, we opted by using Automated ML for the optimization strategy, since it applies a diverse set of models to the dataset, reducing the bias of utilizing a single optimization algorithm. We evaluated the proposed framework on a case study carried out in an industrial refinery in Brazil, where the previous model in production for estimating the ASTM 95% distillation temperature of the Heavy Naphtha was based entirely on the physicochemical knowledge of the process. By adopting our framework with Automated ML, we were capable of improving the R2 score by 120%. The resulting ML model is currently operating in real-time inside the refinery, leading to significant economic gains.

炼油工业过程是非常复杂的,由于原料特性的变化和产品优先级的变化而产生非线性动力学。在这些过程中,中间化合物的一些关键特性必须被监测和控制,因为它们直接影响到这些制造商商业化的最终产品的质量。然而,大多数这些特性只能通过耗时和昂贵的实验室分析来测量,这是不可能在高频率下获得的,因为需要适当地监测它们。从这个意义上说,开发软传感器是获得这些测量的高频估计的最常见方法,有助于先进的控制系统在精炼过程中为温度、压力和其他传感器建立正确的设定值,控制最终产品的质量。由于标记数据的数量很少,大多数学术研究都集中在使用半监督学习策略来开发机器学习(ML)模型作为软传感器。另一方面,我们的研究则走向了另一个方向。我们的目标是制定一个框架,利用领域专家的知识,并采用数据增强技术来构建一个增强的完全标记数据集,该数据集可以馈送到任何有监督的ML算法,以生成高质量的软传感器。我们将我们的框架与Automated ML一起应用于训练一个模型,该模型能够预测炼油厂中与石脑油化合物生产相关的特定关键属性:重石脑油的ASTM 95%蒸馏温度。虽然我们的框架是模型不可知的,但我们选择使用自动化机器学习进行优化策略,因为它将不同的模型集应用于数据集,减少了使用单一优化算法的偏差。我们在巴西的一家工业精炼厂进行的一个案例研究中评估了拟议的框架,在该案例中,以前用于估计重石脑油的ASTM 95%蒸馏温度的生产模型完全基于该过程的物理化学知识。通过采用我们的自动化机器学习框架,我们能够将R2分数提高120%。由此产生的机器学习模型目前在炼油厂内实时运行,带来了显著的经济效益。
{"title":"A framework for enhancing industrial soft sensor learning models","authors":"João Guilherme Mattos ,&nbsp;Patrick Nigri Happ ,&nbsp;William Fernandes ,&nbsp;Helio Côrtes Vieira Lopes ,&nbsp;Simone D J Barbosa ,&nbsp;Marcos Kalinowski ,&nbsp;Luisa Silveira Rosa ,&nbsp;Cassia Novello ,&nbsp;Leonardo Dorigo Ribeiro ,&nbsp;Patricia Rodrigues Ventura ,&nbsp;Marcelo Cardoso Marques ,&nbsp;Renato Neves Pitta ,&nbsp;Valmir Jose Camolesi ,&nbsp;Livia Pereira Lemos Costa ,&nbsp;Bruno Itagyba Paravidino ,&nbsp;Cristiane Salgado Pereira","doi":"10.1016/j.dche.2023.100112","DOIUrl":"10.1016/j.dche.2023.100112","url":null,"abstract":"<div><p>Refinery industrial processes are very complex with nonlinear dynamics resulting from varying feedstock characteristics and also from changes in product prioritization. Along these processes, there are key properties of intermediate compounds that must be monitored and controlled since they directly affect the quality of the end products commercialized by these manufacturers. However, most of these properties can only be measured through time-consuming and expensive laboratory analysis, which is impossible to obtain in high frequencies, as required to properly monitor them. In this sense, developing soft sensors is the most common way to obtain high-frequency estimations for these measurements, helping advanced control systems to establish the correct setpoints for temperatures, pressures, and other sensors along the refining process, controlling the quality of end products. Since the amount of labeled data is scarce, most academic research has focused on employing semi- supervised learning strategies to develop machine learning (ML) models as soft sensors. Our research, on the other hand, goes in another direction. We aim to elaborate a framework that leverages the knowledge of domain experts and employs data augmentation techniques to build an enhanced fully labeled dataset that could be fed to any supervised ML algorithm to generate a quality soft sensor. We applied our framework together with Automated ML to train a model capable of predicting a specific key property associated with the production of Naphtha compounds in a refinery: the ASTM <span>95</span><svg><path></path></svg>% distillation temperature of the Heavy Naphtha. Although our framework is model agnostic, we opted by using Automated ML for the optimization strategy, since it applies a diverse set of models to the dataset, reducing the bias of utilizing a single optimization algorithm. We evaluated the proposed framework on a case study carried out in an industrial refinery in Brazil, where the previous model in production for estimating the ASTM <span>95</span><svg><path></path></svg>% distillation temperature of the Heavy Naphtha was based entirely on the physicochemical knowledge of the process. By adopting our framework with Automated ML, we were capable of improving the R<sup>2</sup> score by 120%. The resulting ML model is currently operating in real-time inside the refinery, leading to significant economic gains.</p></div>","PeriodicalId":72815,"journal":{"name":"Digital Chemical Engineering","volume":"8 ","pages":"Article 100112"},"PeriodicalIF":0.0,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45171976","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Machine learning applications in biomass pyrolysis: From biorefinery to end-of-life product management 生物质热解中的机器学习应用:从生物炼制到报废产品管理
Q2 ENGINEERING, CHEMICAL Pub Date : 2023-09-01 DOI: 10.1016/j.dche.2023.100103
David Akorede Akinpelu , Oluwaseun A. Adekoya , Peter Olusakin Oladoye , Chukwuma C. Ogbaga , Jude A. Okolie

The thermochemical conversion of biomass is a promising technology due to its cost-effectiveness and feedstock flexibility, with pyrolysis being a particularly noteworthy method for its diverse product range. Despite the potential of pyrolysis, commercialization remains elusive, and there is a growing need to fully understand its dynamics to facilitate process scaling up. However, waste biomass pyrolysis is complex, time-consuming, and capital-intensive. Machine Learning (ML) has emerged as a possible means of supporting and accelerating pyrolysis research despite these challenges. This study provides a comprehensive overview of the use of ML in pyrolysis, from biorefinery to end-of-life product management. In addition, the success of ML in process optimization and control, predicting product yield, real-time monitoring, life-cycle assessment (LCA), and techno-economic analysis (TEA) during biomass pyrolysis is highlighted. Several ML methods have been utilized in a bid to study pyrolysis; the potentiality of artificial neural networks (ANNs) to learn extremely non-linear input-output correlations has led to the widespread adoption of these networks. Furthermore, the current knowledge gaps in ML research in pyrolysis and future recommendations for its application are identified. Finally, this study demonstrates the potential of ML in accelerating research and development as well as the scalability of pyrolysis of biomass.

由于其成本效益和原料灵活性,生物质的热化学转化是一种很有前途的技术,热解是一种特别值得注意的方法,因为其产品范围广泛。尽管热解具有潜力,但商业化仍然难以捉摸,人们越来越需要充分了解其动力学,以促进工艺规模的扩大。然而,废弃生物质热解是复杂、耗时和资本密集型的。尽管存在这些挑战,机器学习(ML)已成为支持和加速热解研究的一种可能手段。本研究全面概述了ML在热解中的应用,从生物精炼到报废产品管理。此外,还强调了ML在生物质热解过程中的工艺优化和控制、产品产量预测、实时监测、生命周期评估(LCA)和技术经济分析(TEA)方面的成功。已经使用了几种ML方法来研究热解;人工神经网络学习极其非线性的输入输出相关性的潜力导致了这些网络的广泛采用。此外,还确定了ML在热解研究中的当前知识差距及其未来应用建议。最后,本研究证明了ML在加速生物质热解研究和开发方面的潜力以及可扩展性。
{"title":"Machine learning applications in biomass pyrolysis: From biorefinery to end-of-life product management","authors":"David Akorede Akinpelu ,&nbsp;Oluwaseun A. Adekoya ,&nbsp;Peter Olusakin Oladoye ,&nbsp;Chukwuma C. Ogbaga ,&nbsp;Jude A. Okolie","doi":"10.1016/j.dche.2023.100103","DOIUrl":"https://doi.org/10.1016/j.dche.2023.100103","url":null,"abstract":"<div><p>The thermochemical conversion of biomass is a promising technology due to its cost-effectiveness and feedstock flexibility, with pyrolysis being a particularly noteworthy method for its diverse product range. Despite the potential of pyrolysis, commercialization remains elusive, and there is a growing need to fully understand its dynamics to facilitate process scaling up. However, waste biomass pyrolysis is complex, time-consuming, and capital-intensive. Machine Learning (ML) has emerged as a possible means of supporting and accelerating pyrolysis research despite these challenges. This study provides a comprehensive overview of the use of ML in pyrolysis, from biorefinery to end-of-life product management. In addition, the success of ML in process optimization and control, predicting product yield, real-time monitoring, life-cycle assessment (LCA), and techno-economic analysis (TEA) during biomass pyrolysis is highlighted. Several ML methods have been utilized in a bid to study pyrolysis; the potentiality of artificial neural networks (ANNs) to learn extremely non-linear input-output correlations has led to the widespread adoption of these networks. Furthermore, the current knowledge gaps in ML research in pyrolysis and future recommendations for its application are identified. Finally, this study demonstrates the potential of ML in accelerating research and development as well as the scalability of pyrolysis of biomass.</p></div>","PeriodicalId":72815,"journal":{"name":"Digital Chemical Engineering","volume":"8 ","pages":"Article 100103"},"PeriodicalIF":0.0,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49715152","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Design of a machine learning-aided screening framework for antibiofilm peptides 一种基于机器学习的抗生物膜肽筛选框架的设计
Q2 ENGINEERING, CHEMICAL Pub Date : 2023-09-01 DOI: 10.1016/j.dche.2023.100107
Hema Chandra Puchakayala , Pranshul Bhatnagar , Pranav Nambiar, Arnab Dutta, Debirupa Mitra

Biofilms are formed by multicellular colonies of microorganisms that are protected by hard extracellular matrices. Eradication of biofilms is a challenging task due to their recalcitrant nature and thus biofilm formation poses a global threat to public health. In this regard, antibiofilm peptides are a promising class of therapeutics that are active against biofilms. However, large-scale experimental screening and testing of peptides for antibiofilm activity is a resource-intensive task. In this study, a machine learning-aided design framework is proposed to aid in screening of antibiofilm peptides. An SVM-based binary classification model is developed using amino acid compositions, sequence, and physicochemical properties of peptides as independent features. The physicochemical property-based model developed in this study achieved the highest accuracy of 97.9%, which is found to be substantially higher than the other feature representation techniques. The explainability of this model is performed using SHAP analysis. Results obtained show that amphiphilicity, aliphaticity and cationicity have positive correlation whereas steric parameter, length, and volume have negative correlation with antibiofilm activity of peptides. The developed model can be accessed freely via web tool: AntiBFP.

生物膜是由坚硬的细胞外基质保护的微生物的多细胞菌落形成的。由于生物膜的顽固性,根除生物膜是一项具有挑战性的任务,因此生物膜的形成对公众健康构成了全球性威胁。在这方面,抗生物膜肽是一类很有前途的治疗药物,对生物膜有活性。然而,大规模的实验筛选和测试肽的抗生物膜活性是一项资源密集型的任务。在这项研究中,提出了一个机器学习辅助设计框架,以帮助筛选抗生素膜肽。以肽的氨基酸组成、序列和理化性质为独立特征,建立了基于支持向量机的二分类模型。本研究开发的基于物理化学性质的模型达到了97.9%的最高准确率,大大高于其他特征表示技术。该模型的可解释性是使用SHAP分析来执行的。结果表明,多肽的两亲性、脂肪性和阳离子性与抗菌膜活性呈正相关,而空间参数、长度和体积与抗菌膜活性呈负相关。开发的模型可以通过web工具AntiBFP自由访问。
{"title":"Design of a machine learning-aided screening framework for antibiofilm peptides","authors":"Hema Chandra Puchakayala ,&nbsp;Pranshul Bhatnagar ,&nbsp;Pranav Nambiar,&nbsp;Arnab Dutta,&nbsp;Debirupa Mitra","doi":"10.1016/j.dche.2023.100107","DOIUrl":"10.1016/j.dche.2023.100107","url":null,"abstract":"<div><p>Biofilms are formed by multicellular colonies of microorganisms that are protected by hard extracellular matrices. Eradication of biofilms is a challenging task due to their recalcitrant nature and thus biofilm formation poses a global threat to public health. In this regard, antibiofilm peptides are a promising class of therapeutics that are active against biofilms. However, large-scale experimental screening and testing of peptides for antibiofilm activity is a resource-intensive task. In this study, a machine learning-aided design framework is proposed to aid in screening of antibiofilm peptides. An SVM-based binary classification model is developed using amino acid compositions, sequence, and physicochemical properties of peptides as independent features. The physicochemical property-based model developed in this study achieved the highest accuracy of 97.9%, which is found to be substantially higher than the other feature representation techniques. The explainability of this model is performed using SHAP analysis. Results obtained show that amphiphilicity, aliphaticity and cationicity have positive correlation whereas steric parameter, length, and volume have negative correlation with antibiofilm activity of peptides. The developed model can be accessed freely via web tool: <span>AntiBFP</span><svg><path></path></svg>.</p></div>","PeriodicalId":72815,"journal":{"name":"Digital Chemical Engineering","volume":"8 ","pages":"Article 100107"},"PeriodicalIF":0.0,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"43990573","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Digital Chemical Engineering
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1