Prediction of microalgae harvesting efficiency and identification of important parameters for ballasted flotation using an optimized machine learning model
{"title":"Prediction of microalgae harvesting efficiency and identification of important parameters for ballasted flotation using an optimized machine learning model","authors":"Kaiwei Xu , Zihan Zhu , Haining Yu , Xiaotong Zou","doi":"10.1016/j.algal.2025.103985","DOIUrl":null,"url":null,"abstract":"<div><div>Ballasted flotation is an innovative and effective technique for the separation and recovery of microalgae. However, conventional experimental approaches to determine the optimal harvesting efficiency of microalgae are often inefficient and subjective, largely due to the varying properties of microalgae, types of ballasted agents (low-density materials, LDMs), and operational conditions. This study aims to develop a machine learning approach to establish the relationship between various features and harvesting efficiency in ballasted flotation, offering new insights for achieving efficient microalgal harvesting. The results showed that the performance of the Backpropagation Neural Network (BPNN) model outperformed other machine learning models examined in the study. To further enhance the predictive accuracy of the BPNN model, two additional optimization algorithms, Genetic Algorithm (GA) and Particle Swarm Optimization (PSO), were used to optimize the initial parameters of BPNN model. The findings demonstrated that both optimization models effectively improved the predictive ability of the BPNN model, with GA-BPNN exhibiting smaller testing Mean Absolute Error and Root Mean Square Error values (0.041 and 0.007, respectively), and a higher testing R<sup>2</sup> value (0.923), indicating superior performance compared to PSO-BPNN. SHAP analysis identified that microalgal concentration and the diameter of LDMs were the two most influential parameters affecting microalgal harvesting. Finally, experimental validation of microalgae harvesting confirmed the model's accuracy, with results falling within a 5 % error margin of the predicted values. These insights obtained through machine learning analysis can facilitate the development of high-throughput experimental designs, which can significantly enhance the harvesting efficiency of microalgae.</div></div>","PeriodicalId":7855,"journal":{"name":"Algal Research-Biomass Biofuels and Bioproducts","volume":"87 ","pages":"Article 103985"},"PeriodicalIF":4.6000,"publicationDate":"2025-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Algal Research-Biomass Biofuels and Bioproducts","FirstCategoryId":"99","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2211926425000943","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOTECHNOLOGY & APPLIED MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Ballasted flotation is an innovative and effective technique for the separation and recovery of microalgae. However, conventional experimental approaches to determine the optimal harvesting efficiency of microalgae are often inefficient and subjective, largely due to the varying properties of microalgae, types of ballasted agents (low-density materials, LDMs), and operational conditions. This study aims to develop a machine learning approach to establish the relationship between various features and harvesting efficiency in ballasted flotation, offering new insights for achieving efficient microalgal harvesting. The results showed that the performance of the Backpropagation Neural Network (BPNN) model outperformed other machine learning models examined in the study. To further enhance the predictive accuracy of the BPNN model, two additional optimization algorithms, Genetic Algorithm (GA) and Particle Swarm Optimization (PSO), were used to optimize the initial parameters of BPNN model. The findings demonstrated that both optimization models effectively improved the predictive ability of the BPNN model, with GA-BPNN exhibiting smaller testing Mean Absolute Error and Root Mean Square Error values (0.041 and 0.007, respectively), and a higher testing R2 value (0.923), indicating superior performance compared to PSO-BPNN. SHAP analysis identified that microalgal concentration and the diameter of LDMs were the two most influential parameters affecting microalgal harvesting. Finally, experimental validation of microalgae harvesting confirmed the model's accuracy, with results falling within a 5 % error margin of the predicted values. These insights obtained through machine learning analysis can facilitate the development of high-throughput experimental designs, which can significantly enhance the harvesting efficiency of microalgae.
期刊介绍:
Algal Research is an international phycology journal covering all areas of emerging technologies in algae biology, biomass production, cultivation, harvesting, extraction, bioproducts, biorefinery, engineering, and econometrics. Algae is defined to include cyanobacteria, microalgae, and protists and symbionts of interest in biotechnology. The journal publishes original research and reviews for the following scope: algal biology, including but not exclusive to: phylogeny, biodiversity, molecular traits, metabolic regulation, and genetic engineering, algal cultivation, e.g. phototrophic systems, heterotrophic systems, and mixotrophic systems, algal harvesting and extraction systems, biotechnology to convert algal biomass and components into biofuels and bioproducts, e.g., nutraceuticals, pharmaceuticals, animal feed, plastics, etc. algal products and their economic assessment