{"title":"Machine learning-based prediction of the C/N ratio in municipal organic waste","authors":"Aliakbar Dehghan , Vahide Oskoei , Taherh Khajavi , Mansour Baziar , Mahmood Yousefi","doi":"10.1016/j.eti.2024.103977","DOIUrl":null,"url":null,"abstract":"<div><div>Carbon-to-nitrogen (C/N) ratio plays a crucial role in managing organic waste in urban settings as it facilitates composting processes and nutrient reclamation. Encouraging composting and nutrient recovery aids in diminishing the waste disposal in landfills and mitigating the associated greenhouse gas emissions. This research uses machine learning techniques to predict carbon-to-nitrogen (C/N) ratio of organic waste present in municipal solid waste (MSW). The actual data, sourced from the Solid Waste Management Organization in Mashhad County, Iran consists of chemical analyses related to organic waste component in 17 cities. Factors such as percentage of organic waste, moisture content, ash content, pH level, and C/N ratio offer valuable information on the characteristics of organic waste. Cubic spline curve fitting is employed to interpolate the data, and subsequently, the dataset is partitioned into training and testing sets to aid in model development and evaluation. Five machine learning models (AdaBoost, Random Forest, Extra Trees, Decision Tree, and CatBoost) are utilized, and a systematic exploration of hyperparameters is conducted. The Extra Trees model exhibited outstanding accuracy, with R² values of 1.0 for the training phase and 0.97 for the testing phase, accompanied by minimal Mean Squared Error (MSE) values of 0 and 0.114, respectively. Furthermore, this investigation utilized SHAP analysis to examine the importance of features, uncovering that ash content (%) emerged as the most significant factor in forecasting the C/N ratio. Thus, the Extra Trees model emerges as a reliable instrument for forecasting the C/N ratio across 17 municipalities within Mashhad County.</div></div>","PeriodicalId":11725,"journal":{"name":"Environmental Technology & Innovation","volume":"37 ","pages":"Article 103977"},"PeriodicalIF":7.1000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Technology & Innovation","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S235218642400453X","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/12/18 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"BIOTECHNOLOGY & APPLIED MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Carbon-to-nitrogen (C/N) ratio plays a crucial role in managing organic waste in urban settings as it facilitates composting processes and nutrient reclamation. Encouraging composting and nutrient recovery aids in diminishing the waste disposal in landfills and mitigating the associated greenhouse gas emissions. This research uses machine learning techniques to predict carbon-to-nitrogen (C/N) ratio of organic waste present in municipal solid waste (MSW). The actual data, sourced from the Solid Waste Management Organization in Mashhad County, Iran consists of chemical analyses related to organic waste component in 17 cities. Factors such as percentage of organic waste, moisture content, ash content, pH level, and C/N ratio offer valuable information on the characteristics of organic waste. Cubic spline curve fitting is employed to interpolate the data, and subsequently, the dataset is partitioned into training and testing sets to aid in model development and evaluation. Five machine learning models (AdaBoost, Random Forest, Extra Trees, Decision Tree, and CatBoost) are utilized, and a systematic exploration of hyperparameters is conducted. The Extra Trees model exhibited outstanding accuracy, with R² values of 1.0 for the training phase and 0.97 for the testing phase, accompanied by minimal Mean Squared Error (MSE) values of 0 and 0.114, respectively. Furthermore, this investigation utilized SHAP analysis to examine the importance of features, uncovering that ash content (%) emerged as the most significant factor in forecasting the C/N ratio. Thus, the Extra Trees model emerges as a reliable instrument for forecasting the C/N ratio across 17 municipalities within Mashhad County.
碳氮比(C/N)在城市环境中的有机废物管理中起着至关重要的作用,因为它有助于堆肥过程和养分回收。鼓励堆肥和养分回收有助于减少垃圾填埋场的废物处置,并减轻相关的温室气体排放。本研究使用机器学习技术来预测城市固体废物(MSW)中有机废物的碳氮比(C/N)。实际数据来自伊朗马什哈德县固体废物管理组织,包括对17个城市有机废物成分的化学分析。有机废物的百分比、水分含量、灰分含量、pH值和碳氮比等因素为有机废物的特性提供了有价值的信息。采用三次样条曲线拟合对数据进行插值,然后将数据集划分为训练集和测试集,以帮助模型开发和评估。利用了五种机器学习模型(AdaBoost, Random Forest, Extra Trees, Decision Tree和CatBoost),并对超参数进行了系统的探索。Extra Trees模型在训练阶段的R²值为1.0,测试阶段的R²值为0.97,最小均方误差(MSE)值分别为0和0.114。此外,本研究利用SHAP分析来检验特征的重要性,发现灰分含量(%)成为预测碳氮比的最重要因素。因此,Extra Trees模型成为预测马什哈德县17个城市的碳氮比的可靠工具。
期刊介绍:
Environmental Technology & Innovation adopts a challenge-oriented approach to solutions by integrating natural sciences to promote a sustainable future. The journal aims to foster the creation and development of innovative products, technologies, and ideas that enhance the environment, with impacts across soil, air, water, and food in rural and urban areas.
As a platform for disseminating scientific evidence for environmental protection and sustainable development, the journal emphasizes fundamental science, methodologies, tools, techniques, and policy considerations. It emphasizes the importance of science and technology in environmental benefits, including smarter, cleaner technologies for environmental protection, more efficient resource processing methods, and the evidence supporting their effectiveness.