{"title":"预测木质纤维素生物质热液液化的机器学习方法","authors":"Tossapon Katongtung, Sanphawat Phromphithak, Thossaporn Onsree, Nakorn Tippayawong","doi":"10.1007/s12155-024-10773-0","DOIUrl":null,"url":null,"abstract":"<div><p>Hydrothermal liquefaction (HTL) of lignocellulosic biomass has gained attention as a promising technology for the production of biofuels and other value-added products. HTL process optimization is complex and involves various parameters such as reaction time, temperature, and pressure. In recent years, machine learning (ML) approaches have been adopted as a tool to optimize and predict the HTL process performance. The purposes of this study were to investigate the ML-based prediction of bio-crude yield (BCY) and their higher heating values (HHVs) from HTL of lignocellulosic biomass and to elucidate the relationship of features affecting the output of interest. Pre-processing and normalization were applied to a dataset of 215 data points with 17 input features. Feature selection using the Shapley value method identified key predictors. ML models including multilayer perceptron, kernel ridge regression, random forest, and extreme gradient boosting (XGB) were trained and evaluated. XGB algorithm shows superior performance in predicting the yields and their calorific values to within 5–8% of experimental values. Temperature was the most influential feature for both BCY and HHV prediction accounting for about 30%, followed by other feedstock and operational characteristics. In addition, a user interface was presented for ease of use in the ML modeling.</p></div>","PeriodicalId":487,"journal":{"name":"BioEnergy Research","volume":"17 4","pages":"2246 - 2258"},"PeriodicalIF":3.1000,"publicationDate":"2024-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Machine Learning Approach for Predicting Hydrothermal Liquefaction of Lignocellulosic Biomass\",\"authors\":\"Tossapon Katongtung, Sanphawat Phromphithak, Thossaporn Onsree, Nakorn Tippayawong\",\"doi\":\"10.1007/s12155-024-10773-0\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Hydrothermal liquefaction (HTL) of lignocellulosic biomass has gained attention as a promising technology for the production of biofuels and other value-added products. HTL process optimization is complex and involves various parameters such as reaction time, temperature, and pressure. In recent years, machine learning (ML) approaches have been adopted as a tool to optimize and predict the HTL process performance. The purposes of this study were to investigate the ML-based prediction of bio-crude yield (BCY) and their higher heating values (HHVs) from HTL of lignocellulosic biomass and to elucidate the relationship of features affecting the output of interest. Pre-processing and normalization were applied to a dataset of 215 data points with 17 input features. Feature selection using the Shapley value method identified key predictors. ML models including multilayer perceptron, kernel ridge regression, random forest, and extreme gradient boosting (XGB) were trained and evaluated. XGB algorithm shows superior performance in predicting the yields and their calorific values to within 5–8% of experimental values. Temperature was the most influential feature for both BCY and HHV prediction accounting for about 30%, followed by other feedstock and operational characteristics. In addition, a user interface was presented for ease of use in the ML modeling.</p></div>\",\"PeriodicalId\":487,\"journal\":{\"name\":\"BioEnergy Research\",\"volume\":\"17 4\",\"pages\":\"2246 - 2258\"},\"PeriodicalIF\":3.1000,\"publicationDate\":\"2024-05-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BioEnergy Research\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s12155-024-10773-0\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ENERGY & FUELS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BioEnergy Research","FirstCategoryId":"5","ListUrlMain":"https://link.springer.com/article/10.1007/s12155-024-10773-0","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENERGY & FUELS","Score":null,"Total":0}
引用次数: 0
摘要
木质纤维素生物质的水热液化(HTL)技术作为生产生物燃料和其他高附加值产品的一项前景广阔的技术,受到了广泛关注。HTL 工艺优化非常复杂,涉及反应时间、温度和压力等多个参数。近年来,机器学习(ML)方法已被用作优化和预测 HTL 工艺性能的工具。本研究的目的是研究基于 ML 的木质纤维素生物质 HTL 生物原油产量(BCY)及其高热值(HHVs)预测,并阐明影响相关产出的特征之间的关系。对包含 215 个数据点和 17 个输入特征的数据集进行了预处理和归一化。使用 Shapley 值法进行特征选择,确定关键预测因子。对包括多层感知器、核岭回归、随机森林和极梯度提升(XGB)在内的 ML 模型进行了训练和评估。XGB 算法在预测产量及其热值方面表现出色,其预测值在实验值的 5-8% 范围内。温度是对 BCY 和 HHV 预测影响最大的特征,约占 30%,其次是其他原料和操作特征。此外,为了便于使用 ML 建模,还提供了一个用户界面。
Machine Learning Approach for Predicting Hydrothermal Liquefaction of Lignocellulosic Biomass
Hydrothermal liquefaction (HTL) of lignocellulosic biomass has gained attention as a promising technology for the production of biofuels and other value-added products. HTL process optimization is complex and involves various parameters such as reaction time, temperature, and pressure. In recent years, machine learning (ML) approaches have been adopted as a tool to optimize and predict the HTL process performance. The purposes of this study were to investigate the ML-based prediction of bio-crude yield (BCY) and their higher heating values (HHVs) from HTL of lignocellulosic biomass and to elucidate the relationship of features affecting the output of interest. Pre-processing and normalization were applied to a dataset of 215 data points with 17 input features. Feature selection using the Shapley value method identified key predictors. ML models including multilayer perceptron, kernel ridge regression, random forest, and extreme gradient boosting (XGB) were trained and evaluated. XGB algorithm shows superior performance in predicting the yields and their calorific values to within 5–8% of experimental values. Temperature was the most influential feature for both BCY and HHV prediction accounting for about 30%, followed by other feedstock and operational characteristics. In addition, a user interface was presented for ease of use in the ML modeling.
期刊介绍:
BioEnergy Research fills a void in the rapidly growing area of feedstock biology research related to biomass, biofuels, and bioenergy. The journal publishes a wide range of articles, including peer-reviewed scientific research, reviews, perspectives and commentary, industry news, and government policy updates. Its coverage brings together a uniquely broad combination of disciplines with a common focus on feedstock biology and science, related to biomass, biofeedstock, and bioenergy production.