{"title":"Feature Selection and Spectral Indices for Identifying Maize Stress Types.","authors":"Yanru Li, Keming Yang, Bing Wu","doi":"10.1177/00037028241279328","DOIUrl":null,"url":null,"abstract":"<p><p>This study aims to identify different types of stress on maize leaves using feature selection and spectral index methods. Spectral data were collected from leaves under heavy metal, water, fertilizer stress, as well as under normal healthy conditions. Preprocessing steps such as continuum removal (CR), standard normal variable (SNV) transformation, multiple scattering correction (MSC), detrend correction (DT), and first-order derivative (FOD) were applied to the raw spectra. Various feature selection methods including ReliefF, chi-square test, recursive feature elimination (FRE), mutual information (MI), random forest (RF), and gradient boosting tree (GBT) were employed to determine the importance scores of different spectral bands, thus identifying sensitive spectral features capable of distinguishing various stress types. Spectral indices for stress type differentiation were constructed using label correlation method. Classification models were built using support vector machine (SVM), K-nearest neighbors (KNN), Gaussian naive Bayes (GNB), extreme gradient boosting (XGBoost), RF, and adaptive boosting (AdaBoost) algorithms. Results indicate that the characteristic spectral bands for differentiating stress types are primarily distributed around the red edge (near 700-800 nm) and water absorption valley (near 1900 nm). Spectral indices constructed using combinations of spectral bands around the near-infrared plateau absorption valley (near 1185 nm) and water absorption valley (near 1460 nm) effectively differentiate maize stress types. Among the modeling classification algorithms, RF and AdaBoost algorithms exhibited optimal performance, demonstrating high classification accuracy on both training and validation sets. These findings hold promise for providing new technical support for maize stress monitoring and diagnosis in agricultural production.</p>","PeriodicalId":2,"journal":{"name":"ACS Applied Bio Materials","volume":null,"pages":null},"PeriodicalIF":4.6000,"publicationDate":"2024-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Applied Bio Materials","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1177/00037028241279328","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATERIALS SCIENCE, BIOMATERIALS","Score":null,"Total":0}
引用次数: 0
Abstract
This study aims to identify different types of stress on maize leaves using feature selection and spectral index methods. Spectral data were collected from leaves under heavy metal, water, fertilizer stress, as well as under normal healthy conditions. Preprocessing steps such as continuum removal (CR), standard normal variable (SNV) transformation, multiple scattering correction (MSC), detrend correction (DT), and first-order derivative (FOD) were applied to the raw spectra. Various feature selection methods including ReliefF, chi-square test, recursive feature elimination (FRE), mutual information (MI), random forest (RF), and gradient boosting tree (GBT) were employed to determine the importance scores of different spectral bands, thus identifying sensitive spectral features capable of distinguishing various stress types. Spectral indices for stress type differentiation were constructed using label correlation method. Classification models were built using support vector machine (SVM), K-nearest neighbors (KNN), Gaussian naive Bayes (GNB), extreme gradient boosting (XGBoost), RF, and adaptive boosting (AdaBoost) algorithms. Results indicate that the characteristic spectral bands for differentiating stress types are primarily distributed around the red edge (near 700-800 nm) and water absorption valley (near 1900 nm). Spectral indices constructed using combinations of spectral bands around the near-infrared plateau absorption valley (near 1185 nm) and water absorption valley (near 1460 nm) effectively differentiate maize stress types. Among the modeling classification algorithms, RF and AdaBoost algorithms exhibited optimal performance, demonstrating high classification accuracy on both training and validation sets. These findings hold promise for providing new technical support for maize stress monitoring and diagnosis in agricultural production.