A novel framework for developing accurate and explainable leaf nitrogen content estimation model for aquilaria sinensis seedlings using canopy RGB imagery
{"title":"A novel framework for developing accurate and explainable leaf nitrogen content estimation model for aquilaria sinensis seedlings using canopy RGB imagery","authors":"Zhulin Chen , Xuefeng Wang","doi":"10.1016/j.biosystemseng.2025.02.003","DOIUrl":null,"url":null,"abstract":"<div><div>Leaf nitrogen content (LNC) is crucial for the cultivation and health management of the endangered tree species <em>Aquilaria sinensis</em>. Although RGB imagery combined with machine learning has been effective for non-destructive LNC estimation, current models often neglect colour index texture features and face feature selection and interpretability challenges. This study introduces a framework to address these issues. Firstly, the canopy RGB imagery colour indices and the texture features of <em>Aquilaria sinensis</em> seedlings were collected as an initial feature set. Then, an improved hybrid feature selection algorithm combining SHapley Additive exPlanation (SHAP) with a dynamic ranking strategy was applied with a regression algorithm. This approach was tested using random forest (RF), support vector regression (SVR), and deep neural network (DNN) models. Optimal feature subsets were identified for each model, and performance comparisons determined the best LNC estimation model. Results show that texture features derived from colour indices significantly enhance LNC estimation accuracy. The dynamic SHAP ranking method outperformed RF and fixed SHAP rankings in feature selection. The optimal model, a DNN with an R<sup>2</sup> of 0.946 and RMSE of 1.859 g kg<sup>−1</sup> included two colour indices and five colour index texture features. While the normalised red colour index had the highest contribution, texture features contributed more overall to model accuracy. This method can be extended to other biophysical and biochemical parameter estimations.</div></div>","PeriodicalId":9173,"journal":{"name":"Biosystems Engineering","volume":"251 ","pages":"Pages 128-144"},"PeriodicalIF":4.4000,"publicationDate":"2025-02-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biosystems Engineering","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1537511025000297","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURAL ENGINEERING","Score":null,"Total":0}
引用次数: 0
Abstract
Leaf nitrogen content (LNC) is crucial for the cultivation and health management of the endangered tree species Aquilaria sinensis. Although RGB imagery combined with machine learning has been effective for non-destructive LNC estimation, current models often neglect colour index texture features and face feature selection and interpretability challenges. This study introduces a framework to address these issues. Firstly, the canopy RGB imagery colour indices and the texture features of Aquilaria sinensis seedlings were collected as an initial feature set. Then, an improved hybrid feature selection algorithm combining SHapley Additive exPlanation (SHAP) with a dynamic ranking strategy was applied with a regression algorithm. This approach was tested using random forest (RF), support vector regression (SVR), and deep neural network (DNN) models. Optimal feature subsets were identified for each model, and performance comparisons determined the best LNC estimation model. Results show that texture features derived from colour indices significantly enhance LNC estimation accuracy. The dynamic SHAP ranking method outperformed RF and fixed SHAP rankings in feature selection. The optimal model, a DNN with an R2 of 0.946 and RMSE of 1.859 g kg−1 included two colour indices and five colour index texture features. While the normalised red colour index had the highest contribution, texture features contributed more overall to model accuracy. This method can be extended to other biophysical and biochemical parameter estimations.
期刊介绍:
Biosystems Engineering publishes research in engineering and the physical sciences that represent advances in understanding or modelling of the performance of biological systems for sustainable developments in land use and the environment, agriculture and amenity, bioproduction processes and the food chain. The subject matter of the journal reflects the wide range and interdisciplinary nature of research in engineering for biological systems.