Jorge Enrique Chaparro , José Edinson Aedo , Felipe Lumbreras Ruiz
{"title":"Machine Learning for the estimation of foliar nitrogen content in pineapple crops using multispectral images and Internet of Things (IoT) platforms","authors":"Jorge Enrique Chaparro , José Edinson Aedo , Felipe Lumbreras Ruiz","doi":"10.1016/j.jafr.2024.101208","DOIUrl":null,"url":null,"abstract":"<div><p>Nitrogen is the most important nutritional element during the vegetative growth phase of the pineapple crop; however, its presence in the soil is insufficient to meet plant demands. In this study, nine machine learning techniques were validated to estimate the total nitrogen (TN) content in MD2 pineapple crops from data from multiple sources. These sources included multispectral images captured by an unmanned aerial vehicle (UAV); in situ sensors, which collected information on ecological factors such as pH, temperature, solar radiation, relative humidity, soil moisture, wind speed and direction, as well as SPAD values indicating leaf chlorophyll content.</p><p>Total nitrogen (TN) values were taken from leaf tissue samples, which were then analyzed in a laboratory. To introduce nitrogen variability, a complete randomized block experimental design was implemented, applying five different treatments in five blocks, each with 12 replications, during a 6-month period in a pineapple crop located in Tauramena, Colombia. To address the inherent variability in the agricultural and environmental data, dimensionality was reduced using Principal Component Analysis (PCA). In addition, regularization techniques were applied, including cross-validation, feature selection, boost methods, L1 (Lasso) and L2 (Ridge) regularization, as well as hyperparameter optimization. These strategies generated more robust and accurate models, with the multilayer perceptron regressor (MLP regressor) and extreme gradient boosting (XGBoost) algorithms standing out. On the first sampling date, XGBoost achieved an <em>R</em><sup>2</sup> of 86.98 %, being the highest. On the following dates, MLP achieved a <em>R</em><sup>2</sup> of 59.11 % on the second date; XGBoost achieved a <em>R</em><sup>2</sup> of 68.00 % on the third date, and on the last date, MLP achieved a <em>R</em><sup>2</sup> of 69.4 %. These results indicate that the integration of data from multiple sources and the use of machine learning models could greatly improve the precision of nitro-gen (N) diagnostics in pineapple crops, especially in real-time applications. These findings highlight the promising potential of developing machine learning models that integrate multisensor data fusion for various applications in agriculture.</p></div>","PeriodicalId":34393,"journal":{"name":"Journal of Agriculture and Food Research","volume":"18 ","pages":"Article 101208"},"PeriodicalIF":4.8000,"publicationDate":"2024-07-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S266615432400245X/pdfft?md5=e5cef71b09614a7f561f276f62b2f68c&pid=1-s2.0-S266615432400245X-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Agriculture and Food Research","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S266615432400245X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Nitrogen is the most important nutritional element during the vegetative growth phase of the pineapple crop; however, its presence in the soil is insufficient to meet plant demands. In this study, nine machine learning techniques were validated to estimate the total nitrogen (TN) content in MD2 pineapple crops from data from multiple sources. These sources included multispectral images captured by an unmanned aerial vehicle (UAV); in situ sensors, which collected information on ecological factors such as pH, temperature, solar radiation, relative humidity, soil moisture, wind speed and direction, as well as SPAD values indicating leaf chlorophyll content.
Total nitrogen (TN) values were taken from leaf tissue samples, which were then analyzed in a laboratory. To introduce nitrogen variability, a complete randomized block experimental design was implemented, applying five different treatments in five blocks, each with 12 replications, during a 6-month period in a pineapple crop located in Tauramena, Colombia. To address the inherent variability in the agricultural and environmental data, dimensionality was reduced using Principal Component Analysis (PCA). In addition, regularization techniques were applied, including cross-validation, feature selection, boost methods, L1 (Lasso) and L2 (Ridge) regularization, as well as hyperparameter optimization. These strategies generated more robust and accurate models, with the multilayer perceptron regressor (MLP regressor) and extreme gradient boosting (XGBoost) algorithms standing out. On the first sampling date, XGBoost achieved an R2 of 86.98 %, being the highest. On the following dates, MLP achieved a R2 of 59.11 % on the second date; XGBoost achieved a R2 of 68.00 % on the third date, and on the last date, MLP achieved a R2 of 69.4 %. These results indicate that the integration of data from multiple sources and the use of machine learning models could greatly improve the precision of nitro-gen (N) diagnostics in pineapple crops, especially in real-time applications. These findings highlight the promising potential of developing machine learning models that integrate multisensor data fusion for various applications in agriculture.