{"title":"An optimized machine learning framework for predicting and interpreting corporate ESG greenwashing behavior.","authors":"Fanlong Zeng, Jintao Wang, Chaoyan Zeng","doi":"10.1371/journal.pone.0316287","DOIUrl":null,"url":null,"abstract":"<p><p>The accurate prediction and interpretation of corporate Environmental, Social, and Governance (ESG) greenwashing behavior is crucial for enhancing information transparency and improving regulatory effectiveness. This paper addresses the limitations in hyperparameter optimization and interpretability of existing prediction models by introducing an optimized machine learning framework. The framework integrates an Improved Hunter-Prey Optimization (IHPO) algorithm, an eXtreme Gradient Boosting (XGBoost) model, and SHapley Additive exPlanations (SHAP) theory to predict and interpret corporate ESG greenwashing behavior. Initially, a comprehensive ESG greenwashing prediction dataset was developed through an extensive literature review and expert interviews. The IHPO algorithm was then employed to optimize the hyperparameters of the XGBoost model, forming an IHPO-XGBoost ensemble learning model for predicting corporate ESG greenwashing behavior. Finally, SHAP was used to interpret the model's prediction outcomes. The results demonstrate that the IHPO-XGBoost model achieves outstanding performance in predicting corporate ESG greenwashing, with R², RMSE, MAE, and adjusted R² values of 0.9790, 0.1376, 0.1000, and 0.9785, respectively. Compared to traditional HPO-XGBoost models and XGBoost models combined with other optimization algorithms, the IHPO-XGBoost model exhibits superior overall performance. The interpretability analysis using SHAP theory highlights the key features influencing the prediction outcomes, revealing the specific contributions of feature interactions and the impacts of individual sample features. The findings provide valuable insights for regulators and investors to more effectively identify and assess potential corporate ESG greenwashing behavior, thereby enhancing regulatory efficiency and investment decision-making.</p>","PeriodicalId":20189,"journal":{"name":"PLoS ONE","volume":"20 3","pages":"e0316287"},"PeriodicalIF":2.9000,"publicationDate":"2025-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11884703/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"PLoS ONE","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1371/journal.pone.0316287","RegionNum":3,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
The accurate prediction and interpretation of corporate Environmental, Social, and Governance (ESG) greenwashing behavior is crucial for enhancing information transparency and improving regulatory effectiveness. This paper addresses the limitations in hyperparameter optimization and interpretability of existing prediction models by introducing an optimized machine learning framework. The framework integrates an Improved Hunter-Prey Optimization (IHPO) algorithm, an eXtreme Gradient Boosting (XGBoost) model, and SHapley Additive exPlanations (SHAP) theory to predict and interpret corporate ESG greenwashing behavior. Initially, a comprehensive ESG greenwashing prediction dataset was developed through an extensive literature review and expert interviews. The IHPO algorithm was then employed to optimize the hyperparameters of the XGBoost model, forming an IHPO-XGBoost ensemble learning model for predicting corporate ESG greenwashing behavior. Finally, SHAP was used to interpret the model's prediction outcomes. The results demonstrate that the IHPO-XGBoost model achieves outstanding performance in predicting corporate ESG greenwashing, with R², RMSE, MAE, and adjusted R² values of 0.9790, 0.1376, 0.1000, and 0.9785, respectively. Compared to traditional HPO-XGBoost models and XGBoost models combined with other optimization algorithms, the IHPO-XGBoost model exhibits superior overall performance. The interpretability analysis using SHAP theory highlights the key features influencing the prediction outcomes, revealing the specific contributions of feature interactions and the impacts of individual sample features. The findings provide valuable insights for regulators and investors to more effectively identify and assess potential corporate ESG greenwashing behavior, thereby enhancing regulatory efficiency and investment decision-making.
期刊介绍:
PLOS ONE is an international, peer-reviewed, open-access, online publication. PLOS ONE welcomes reports on primary research from any scientific discipline. It provides:
* Open-access—freely accessible online, authors retain copyright
* Fast publication times
* Peer review by expert, practicing researchers
* Post-publication tools to indicate quality and impact
* Community-based dialogue on articles
* Worldwide media coverage