Manuela Ortega Monsalve, Mario Cerón-Muñoz, Luis Galeano-Vasco, Marisol Medina-Sierra
{"title":"Use of Machine Learning Models for Prediction of Organic Carbon and Nitrogen in Soil from Hyperspectral Imagery in Laboratory","authors":"Manuela Ortega Monsalve, Mario Cerón-Muñoz, Luis Galeano-Vasco, Marisol Medina-Sierra","doi":"10.1155/2023/4389885","DOIUrl":null,"url":null,"abstract":"Organic carbon and total nitrogen are essential nutrients for plant growth. The presence of these nutrients at acceptable levels can create an optimal environment for the development of crops of interest. The application of spectroscopic techniques and the use of machine learning algorithms have made it possible to calibrate models capable of predicting the number of elements present in the soil. One of these techniques is hyperspectral imaging, which captures portions of the electromagnetic spectrum where the materials present in the soil can be differentiated due to the vibrations of chemical bonds. The objective of this research is to use statistical models to predict OC and N in soils from hyperspectral images. Transformations were applied to spectral and chemical data and the models used were Random Forest (RF) and Support Vector Machine (SVM). To select the best model, the values of the coefficient of determination (<span><svg height=\"11.7978pt\" style=\"vertical-align:-0.2063999pt\" version=\"1.1\" viewbox=\"-0.0498162 -11.5914 13.2276 11.7978\" width=\"13.2276pt\" xmlns=\"http://www.w3.org/2000/svg\" xmlns:xlink=\"http://www.w3.org/1999/xlink\"><g transform=\"matrix(.013,0,0,-0.013,0,0)\"></path></g><g transform=\"matrix(.0091,0,0,-0.0091,8.151,-5.741)\"></path></g></svg>),</span> root mean square error of prediction (RMSEP), and the ratio of performance to deviation (RPD) were considered. For OC, the values found for the RF model were an <svg height=\"11.7978pt\" style=\"vertical-align:-0.2063999pt\" version=\"1.1\" viewbox=\"-0.0498162 -11.5914 13.2276 11.7978\" width=\"13.2276pt\" xmlns=\"http://www.w3.org/2000/svg\" xmlns:xlink=\"http://www.w3.org/1999/xlink\"><g transform=\"matrix(.013,0,0,-0.013,0,0)\"><use xlink:href=\"#g113-83\"></use></g><g transform=\"matrix(.0091,0,0,-0.0091,8.151,-5.741)\"><use xlink:href=\"#g50-51\"></use></g></svg> of 0.87, an RMSEP of 0.10, and an RPD of 6.74; the SVM model presented an <svg height=\"11.7978pt\" style=\"vertical-align:-0.2063999pt\" version=\"1.1\" viewbox=\"-0.0498162 -11.5914 13.2276 11.7978\" width=\"13.2276pt\" xmlns=\"http://www.w3.org/2000/svg\" xmlns:xlink=\"http://www.w3.org/1999/xlink\"><g transform=\"matrix(.013,0,0,-0.013,0,0)\"><use xlink:href=\"#g113-83\"></use></g><g transform=\"matrix(.0091,0,0,-0.0091,8.151,-5.741)\"><use xlink:href=\"#g50-51\"></use></g></svg> of 0.92, an RMSEP of 0.20, and an RPD of 3.56. For the variable N, the values found for the RF model were an <svg height=\"11.7978pt\" style=\"vertical-align:-0.2063999pt\" version=\"1.1\" viewbox=\"-0.0498162 -11.5914 13.2276 11.7978\" width=\"13.2276pt\" xmlns=\"http://www.w3.org/2000/svg\" xmlns:xlink=\"http://www.w3.org/1999/xlink\"><g transform=\"matrix(.013,0,0,-0.013,0,0)\"><use xlink:href=\"#g113-83\"></use></g><g transform=\"matrix(.0091,0,0,-0.0091,8.151,-5.741)\"><use xlink:href=\"#g50-51\"></use></g></svg> of 0.79, an RMSEP of 0.03, and an RPD of 5.44; for the SVM model, they were an <svg height=\"11.7978pt\" style=\"vertical-align:-0.2063999pt\" version=\"1.1\" viewbox=\"-0.0498162 -11.5914 13.2276 11.7978\" width=\"13.2276pt\" xmlns=\"http://www.w3.org/2000/svg\" xmlns:xlink=\"http://www.w3.org/1999/xlink\"><g transform=\"matrix(.013,0,0,-0.013,0,0)\"><use xlink:href=\"#g113-83\"></use></g><g transform=\"matrix(.0091,0,0,-0.0091,8.151,-5.741)\"><use xlink:href=\"#g50-51\"></use></g></svg> of 0.87, an RMSEP of 0.08, and an RPD of 2.76. The RF model showed a better fit for both variables. The SVM model also produced acceptable results. The results show that machine learning models are a good alternative for analysing soil-related variables.","PeriodicalId":17079,"journal":{"name":"Journal of Spectroscopy","volume":"28 1","pages":""},"PeriodicalIF":1.7000,"publicationDate":"2023-12-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Spectroscopy","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1155/2023/4389885","RegionNum":4,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
Organic carbon and total nitrogen are essential nutrients for plant growth. The presence of these nutrients at acceptable levels can create an optimal environment for the development of crops of interest. The application of spectroscopic techniques and the use of machine learning algorithms have made it possible to calibrate models capable of predicting the number of elements present in the soil. One of these techniques is hyperspectral imaging, which captures portions of the electromagnetic spectrum where the materials present in the soil can be differentiated due to the vibrations of chemical bonds. The objective of this research is to use statistical models to predict OC and N in soils from hyperspectral images. Transformations were applied to spectral and chemical data and the models used were Random Forest (RF) and Support Vector Machine (SVM). To select the best model, the values of the coefficient of determination (), root mean square error of prediction (RMSEP), and the ratio of performance to deviation (RPD) were considered. For OC, the values found for the RF model were an of 0.87, an RMSEP of 0.10, and an RPD of 6.74; the SVM model presented an of 0.92, an RMSEP of 0.20, and an RPD of 3.56. For the variable N, the values found for the RF model were an of 0.79, an RMSEP of 0.03, and an RPD of 5.44; for the SVM model, they were an of 0.87, an RMSEP of 0.08, and an RPD of 2.76. The RF model showed a better fit for both variables. The SVM model also produced acceptable results. The results show that machine learning models are a good alternative for analysing soil-related variables.
期刊介绍:
Journal of Spectroscopy (formerly titled Spectroscopy: An International Journal) is a peer-reviewed, open access journal that publishes original research articles as well as review articles in all areas of spectroscopy.