The global attention to the utilization of unmanned aerial vehicle remote sensing drones in crop disease-wide detection has led to the urgent need to find an adapted model for different environmental conditions. Therefore, the current study has focused on spatiotemporal usage of different multispectral cameras in acquiring spectral reflectance models of in-field rice bacterial blight stresses. Where, long short-term memory (LSTM) model was compared with the other models in transfer learning strategy for assessing the blight stress severity. The results revealed that by extracting 30% of the data from the target domain and transferring it to the source domain, the adaptability of the model across different sites was effectively enhanced. Besides, LSTM showed high tuning transfer efficiency that demonstrated optimal predictive performance and the shortest training time in transfer tasks. Its coefficient of the prediction set was 0.82, and its residual prediction deviation has reached 2.26. In practice, LSTM enabled the acquisition of reliable prediction results at a minimal sample collection cost while circumventing feature reduction resulting from inter-domain data alignment. When the transfer ratio reached 20%, the coefficient of determination of the prediction set reached 0.71, and the residual prediction deviation reached 1.79. The novelty of this study came from the transfer learning efficiency in improving the model’s application capabilities across the different sites, environment, and unmanned aerial vehicle in farmland disease detection.
{"title":"Transfer learning for plant disease detection model based on low-altitude UAV remote sensing","authors":"Zhenyu Huang, Xiulin Bai, Mostafa Gouda, Hui Hu, Ningyuan Yang, Yong He, Xuping Feng","doi":"10.1007/s11119-024-10217-x","DOIUrl":"https://doi.org/10.1007/s11119-024-10217-x","url":null,"abstract":"<p>The global attention to the utilization of unmanned aerial vehicle remote sensing drones in crop disease-wide detection has led to the urgent need to find an adapted model for different environmental conditions. Therefore, the current study has focused on spatiotemporal usage of different multispectral cameras in acquiring spectral reflectance models of in-field rice bacterial blight stresses. Where, long short-term memory (LSTM) model was compared with the other models in transfer learning strategy for assessing the blight stress severity. The results revealed that by extracting 30% of the data from the target domain and transferring it to the source domain, the adaptability of the model across different sites was effectively enhanced. Besides, LSTM showed high tuning transfer efficiency that demonstrated optimal predictive performance and the shortest training time in transfer tasks. Its coefficient of the prediction set was 0.82, and its residual prediction deviation has reached 2.26. In practice, LSTM enabled the acquisition of reliable prediction results at a minimal sample collection cost while circumventing feature reduction resulting from inter-domain data alignment. When the transfer ratio reached 20%, the coefficient of determination of the prediction set reached 0.71, and the residual prediction deviation reached 1.79. The novelty of this study came from the transfer learning efficiency in improving the model’s application capabilities across the different sites, environment, and unmanned aerial vehicle in farmland disease detection.</p>","PeriodicalId":20423,"journal":{"name":"Precision Agriculture","volume":"22 1","pages":""},"PeriodicalIF":6.2,"publicationDate":"2024-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142849052","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-12-19DOI: 10.1007/s11119-024-10196-z
Salvador J. Vicencio-Medina, Yasmin A. Rios-Solis, Nestor M. Cid-Garcia
The first stage in the precision agriculture cycle has been a vital study area in recent years because it allows soil testing followed by data analysis. In this stage, a strategic delineation of site-specific management zones acquires a particular interest because it enables site-specific treatment to improve crop yield by efficiently using the input of resources. The delineation of site-specific management zones problem is to determine the minimum number of zones that cover the entire field so that each zone’s homogeneity is significant according to a specific biological, chemical, or physical soil property. Furthermore, the delineated zones should be orthogonal-shaped to be practical for agricultural machinery. This work has proposed a new bio-inspired algorithm, specifically an Estimation of Distribution Algorithm, based on a decoder that heavily relies on the Disjoint-Set algorithm and a new reactive penalized fitness function that detects unfeasible solutions. The new methodology improves the solutions presented in the literature by using a new search engine that drastically reduces the computational times of similar algorithms. Our algorithm has been tested with the literature benchmark, considering a new reactive penalization in the fitness function. It obtains the best solutions for 66.66% of the instances benchmark compared to the best literature method. Due to the algorithm’s efficiency, a new set of larger instances is introduced to test the scalability and robustness of the method. It obtained an efficiency of 79.3%.
{"title":"A bio-inspired optimization algorithm with disjoint sets to delineate orthogonal site-specific management zones","authors":"Salvador J. Vicencio-Medina, Yasmin A. Rios-Solis, Nestor M. Cid-Garcia","doi":"10.1007/s11119-024-10196-z","DOIUrl":"https://doi.org/10.1007/s11119-024-10196-z","url":null,"abstract":"<p>The first stage in the precision agriculture cycle has been a vital study area in recent years because it allows soil testing followed by data analysis. In this stage, a strategic delineation of site-specific management zones acquires a particular interest because it enables site-specific treatment to improve crop yield by efficiently using the input of resources. The delineation of site-specific management zones problem is to determine the minimum number of zones that cover the entire field so that each zone’s homogeneity is significant according to a specific biological, chemical, or physical soil property. Furthermore, the delineated zones should be orthogonal-shaped to be practical for agricultural machinery. This work has proposed a new bio-inspired algorithm, specifically an Estimation of Distribution Algorithm, based on a decoder that heavily relies on the Disjoint-Set algorithm and a new reactive penalized fitness function that detects unfeasible solutions. The new methodology improves the solutions presented in the literature by using a new search engine that drastically reduces the computational times of similar algorithms. Our algorithm has been tested with the literature benchmark, considering a new reactive penalization in the fitness function. It obtains the best solutions for 66.66% of the instances benchmark compared to the best literature method. Due to the algorithm’s efficiency, a new set of larger instances is introduced to test the scalability and robustness of the method. It obtained an efficiency of 79.3%.</p>","PeriodicalId":20423,"journal":{"name":"Precision Agriculture","volume":"52 1","pages":""},"PeriodicalIF":6.2,"publicationDate":"2024-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142849014","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-12-18DOI: 10.1007/s11119-024-10198-x
Yue Wang, Lola Suarez, Alberto Hornero, Tomas Poblete, Dongryeol Ryu, Victoria Gonzalez-Dugo, Pablo J. Zarco-Tejada
Introduction
Optimizing fruit quality and yield in agriculture requires accurately monitoring leaf nitrogen (N) status spatially and temporally throughout the growing season. Standard remote sensing approaches for assessing leaf N rely on proxies like vegetation indices or leaf chlorophyll a + b (Cab) content. However, limitations exist due to the Cab-N relationship’s saturation and early nutrient deficiency insensitivity.
Methods
The study utilized Sentinel-2 satellite imagery to estimate a set of plant biochemical traits in large almond orchards in a two-year study. These traits, including leaf dry matter, leaf water content, and leaf Cab retrieved from the radiative transfer model, were used to explain the observed variability of leaf N. Airborne hyperspectral imagery-derived leaf N using Cab and solar-induced fluorescence served as a benchmark for validation.
Results
Results demonstrate that plant traits quantified from Sentinel-2 were strongly associated with leaf N variability across the orchard, with a strong contribution from the estimated leaf Cab content and leaf dry matter biochemical constituent, outperforming the consistency of vegetation indices. The Sentinel-2 model explaining leaf N variability yielded r2 = 0.82 and nRMSE = 13% in a two-year dataset, obtaining consistent performance and trait contribution across both years.
Conclusion
This study highlights the potential application of Sentinel-2 satellite imagery for monitoring leaf N variability in almond tree orchards. Incorporating plant biochemical traits allows for a more consistent and reliable prediction of leaf N compared to traditional vegetation indices over two years, making it a promising method for precision agriculture applications.
在农业中,优化水果品质和产量需要在整个生长季节准确监测叶片氮(N)的时空状态。评估叶片氮含量的标准遥感方法依赖于植被指数或叶片叶绿素a + b (Cab)含量等替代指标。然而,由于Cab-N关系的饱和和早期营养缺乏的不敏感性,存在局限性。方法利用Sentinel-2卫星图像,对大型杏仁果园进行为期两年的植物生化性状研究。这些性状,包括叶片干物质、叶片含水量和从辐射转移模型中获取的叶片驾驶室,被用来解释观测到的叶片氮的变化。利用驾驶室和太阳诱导荧光获得的机载高光谱图像衍生的叶片氮作为验证的基准。结果表明,Sentinel-2量化的植物性状与整个果园叶片N变异密切相关,其中叶片Cab含量和叶片干物质生化成分的贡献较大,优于植被指数的一致性。在两年的数据集中,解释叶片N变异的Sentinel-2模型的r2 = 0.82, nRMSE = 13%,在两年中获得一致的性能和性状贡献。结论Sentinel-2卫星影像在杏树果园叶片氮变异监测中的应用前景广阔。与传统的植被指数相比,结合植物生化性状可以更一致、更可靠地预测两年的叶片氮,使其成为一种有前景的精准农业应用方法。
{"title":"Assessing plant traits derived from Sentinel-2 to characterize leaf nitrogen variability in almond orchards: modeling and validation with airborne hyperspectral imagery","authors":"Yue Wang, Lola Suarez, Alberto Hornero, Tomas Poblete, Dongryeol Ryu, Victoria Gonzalez-Dugo, Pablo J. Zarco-Tejada","doi":"10.1007/s11119-024-10198-x","DOIUrl":"https://doi.org/10.1007/s11119-024-10198-x","url":null,"abstract":"<h3 data-test=\"abstract-sub-heading\">Introduction</h3><p>Optimizing fruit quality and yield in agriculture requires accurately monitoring leaf nitrogen (N) status spatially and temporally throughout the growing season. Standard remote sensing approaches for assessing leaf N rely on proxies like vegetation indices or leaf chlorophyll <i>a</i> + <i>b</i> (C<sub>ab</sub>) content. However, limitations exist due to the C<sub>ab</sub>-N relationship’s saturation and early nutrient deficiency insensitivity.</p><h3 data-test=\"abstract-sub-heading\">Methods</h3><p>The study utilized Sentinel-2 satellite imagery to estimate a set of plant biochemical traits in large almond orchards in a two-year study. These traits, including leaf dry matter, leaf water content, and leaf C<sub>ab</sub> retrieved from the radiative transfer model, were used to explain the observed variability of leaf N. Airborne hyperspectral imagery-derived leaf N using C<sub>ab</sub> and solar-induced fluorescence served as a benchmark for validation.</p><h3 data-test=\"abstract-sub-heading\">Results</h3><p>Results demonstrate that plant traits quantified from Sentinel-2 were strongly associated with leaf N variability across the orchard, with a strong contribution from the estimated leaf C<sub>ab</sub> content and leaf dry matter biochemical constituent, outperforming the consistency of vegetation indices. The Sentinel-2 model explaining leaf N variability yielded <i>r</i><sup>2</sup> = 0.82 and nRMSE = 13% in a two-year dataset, obtaining consistent performance and trait contribution across both years.</p><h3 data-test=\"abstract-sub-heading\">Conclusion</h3><p>This study highlights the potential application of Sentinel-2 satellite imagery for monitoring leaf N variability in almond tree orchards. Incorporating plant biochemical traits allows for a more consistent and reliable prediction of leaf N compared to traditional vegetation indices over two years, making it a promising method for precision agriculture applications.</p>","PeriodicalId":20423,"journal":{"name":"Precision Agriculture","volume":"82 1","pages":""},"PeriodicalIF":6.2,"publicationDate":"2024-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142849012","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-12-14DOI: 10.1007/s11119-024-10210-4
A. A. Gauci, A. Lindsey, S. A. Shearer, D. Barker, E. M. Hawkins, John P. Fulton
On-farm experiments (OFE) typically do not account for limitations of grain yield monitors such as the dynamics of grain flow through a large combine. A common question asked within OFE is how ground speed impacts yield estimates from grain yield monitors. Therefore, the objective of this study was to determine if combine ground speed influences the ability of grain yield monitors to report yield differences for OFE. Six sub-plot treatment resolutions that differed in length (7.6, 15.2, 30.5, 61.0, 121.9, and 243.8 m) of imposed yield variation were harvested at combine ground speeds of 3.2 and 6.4 km h−1. Treatments were replicated 3 times. The intentional yield variability in maize (Zea mays L.) was created by alternating nitrogen application (0–202 kg N ha−1) across the treatment lengths. A factory installed yield monitor (YM3) and a third-party platform (P1) using the controller area network (CAN) bus data were used to collect yield data and compared to plot combine data collected from adjacent rows for each treatment length along a pass. Comparisons were made between each YM and plot combine yield estimates for each low and high yield treatment lengths. Combine ground speed did not significantly impact yield estimates (p ≥ 0.31 for all speed interactions) except speed * method due to lack of calibration. There were no significant differences the computed yield differences (all speed interactions p ≥ 0.40). Combine ground speed did not significantly influence the ability of yield monitoring technologies (i.e. mass flow sensor) to estimate the average low and high yields (p ≥ 0.31 for all speed interactions for individual plot lengths except when operating outside the calibrated flow range of the mass flow sensor. Operating outside the calibrated flow range of the mass flow sensor resulted in mass flow rate being overestimated by an average of 23% for both yield monitors (YM3 and P1).
农场试验(OFE)通常没有考虑到粮食产量监测的局限性,例如大型联合收割机中粮食流动的动态。OFE内部经常被问到的一个问题是,地面速度如何影响谷物产量监测器对产量的估计。因此,本研究的目的是确定联合地面速度是否影响粮食产量监测仪报告OFE产量差异的能力。在3.2和6.4 km h−1的联合地面速度下,收获了6个不同长度的子地块处理分辨率(7.6、15.2、30.5、61.0、121.9和243.8 m)的强制产量变化。处理重复3次。玉米(Zea mays L.)在不同处理期间交替施氮(0 ~ 202 kg N ha−1),造成有意产量变异。使用工厂安装的产量监控器(YM3)和第三方平台(P1)使用控制器局域网(CAN)总线数据收集产量数据,并与相邻行收集的沿着通道每个处理长度的组合数据进行比较。在每个低产量和高产量处理长度下,对每个YM和小区组合的产量估计值进行了比较。由于缺乏校准,除速度*法外,联合地面速度对产量估计没有显著影响(所有速度相互作用的p≥0.31)。计算产率差异无显著性差异(所有速度相互作用p≥0.40)。联合地面速度对产量监测技术(即质量流量传感器)估计平均低产量和高产量的能力没有显著影响(p≥0.31),所有速度相互作用对单个地块长度的影响,除非在质量流量传感器的校准流量范围之外运行。在质量流量传感器的校准流量范围之外工作,导致两个产量监测器(YM3和P1)的质量流量平均高估了23%。
{"title":"On-farm experimentation: assessing the effect of combine ground speed on grain yield monitor data estimates","authors":"A. A. Gauci, A. Lindsey, S. A. Shearer, D. Barker, E. M. Hawkins, John P. Fulton","doi":"10.1007/s11119-024-10210-4","DOIUrl":"https://doi.org/10.1007/s11119-024-10210-4","url":null,"abstract":"<p>On-farm experiments (OFE) typically do not account for limitations of grain yield monitors such as the dynamics of grain flow through a large combine. A common question asked within OFE is how ground speed impacts yield estimates from grain yield monitors. Therefore, the objective of this study was to determine if combine ground speed influences the ability of grain yield monitors to report yield differences for OFE. Six sub-plot treatment resolutions that differed in length (7.6, 15.2, 30.5, 61.0, 121.9, and 243.8 m) of imposed yield variation were harvested at combine ground speeds of 3.2 and 6.4 km h<sup>−1</sup>. Treatments were replicated 3 times. The intentional yield variability in maize (<i>Zea mays L.</i>) was created by alternating nitrogen application (0–202 kg N ha<sup>−1</sup>) across the treatment lengths. A factory installed yield monitor (YM3) and a third-party platform (P1) using the controller area network (CAN) bus data were used to collect yield data and compared to plot combine data collected from adjacent rows for each treatment length along a pass. Comparisons were made between each YM and plot combine yield estimates for each low and high yield treatment lengths. Combine ground speed did not significantly impact yield estimates (<i>p</i> ≥ 0.31 for all speed interactions) except speed * method due to lack of calibration. There were no significant differences the computed yield differences (all speed interactions <i>p</i> ≥ 0.40). Combine ground speed did not significantly influence the ability of yield monitoring technologies (i.e. mass flow sensor) to estimate the average low and high yields (<i>p</i> ≥ 0.31 for all speed interactions for individual plot lengths except when operating outside the calibrated flow range of the mass flow sensor. Operating outside the calibrated flow range of the mass flow sensor resulted in mass flow rate being overestimated by an average of 23% for both yield monitors (YM3 and P1).</p>","PeriodicalId":20423,"journal":{"name":"Precision Agriculture","volume":"14 1","pages":""},"PeriodicalIF":6.2,"publicationDate":"2024-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142820891","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-12-14DOI: 10.1007/s11119-024-10209-x
Larissa A. Gonçalves, Eduardo G. de Souza, Lúcia H. P. Nóbrega, Vanderlei Artur Bier, Marcio F. Maggi, Claudio L. Bazzi, Miguel Angel Uribe-Opazo
Spatial and temporal variability of the soil’s apparent electrical conductivity (ECa) and other soil attributes can be analyzed using specific digital platforms for precision agriculture, contributing to agricultural management decision-making. Understanding these variations enables more efficient and sustainable management practices tailored to each area’s characteristics, leading to higher crop yields and reduced environmental impacts. A critical question arises: should ECa measurement be done regularly or just once? This study aims to evaluate the spatial and temporal variability of soil’s apparent electrical conductivity to determine if a single ECa measurement can characterize spatial soil variability. The experiment was conducted in two areas under different management practices in Céu Azul, PR, Brazil. One area operates under a direct planting system, cultivating soybeans in the summer and rotating with wheat or corn during the winter. The second area is used as pasture during the winter and planted with corn or soybeans in the summer. ECa data from 2013 to 2016, along with chemical and physical soil attributes from 2013, were retrieved from our laboratory database. Additionally, ECa data were collected on 19/05/2022, 18/10/2022, and 10/03/2023. All ECa measurements were performed using an EM38-MK2 conductivity meter in horizontal dipolar and drag mode. ECa normalization methods such as range, average, and standard score were employed to mitigate temporal influences partially. Data was processed using the AgDataBox web platform, which included data cleaning, data interpolation, creation of thematic maps, delineation of management zones, and spatial correlation matrix procedures. Thematic maps revealed that ECa spatial variability exhibited a stable pattern. Both areas showed significant cross-correlation among topography and most soil chemical and physical attributes. The study concluded that ECa measurement could be performed once as a co-variable for interpolating other variables since the ECa pattern remained stable in both areas. The average method was the most effective normalization method in both areas. Furthermore, management zones (MZs) were delineated using equivalent normalized ECa (ECa_Eq) (mS/m) with the three data normalization methods. The agreement between MZs was sufficient to conclude that the influence of the normalization methods can be ignored.
{"title":"Spatial and temporal variability of soil apparent electrical conductivity","authors":"Larissa A. Gonçalves, Eduardo G. de Souza, Lúcia H. P. Nóbrega, Vanderlei Artur Bier, Marcio F. Maggi, Claudio L. Bazzi, Miguel Angel Uribe-Opazo","doi":"10.1007/s11119-024-10209-x","DOIUrl":"https://doi.org/10.1007/s11119-024-10209-x","url":null,"abstract":"<p>Spatial and temporal variability of the soil’s apparent electrical conductivity (ECa) and other soil attributes can be analyzed using specific digital platforms for precision agriculture, contributing to agricultural management decision-making. Understanding these variations enables more efficient and sustainable management practices tailored to each area’s characteristics, leading to higher crop yields and reduced environmental impacts. A critical question arises: should ECa measurement be done regularly or just once? This study aims to evaluate the spatial and temporal variability of soil’s apparent electrical conductivity to determine if a single ECa measurement can characterize spatial soil variability. The experiment was conducted in two areas under different management practices in Céu Azul, PR, Brazil. One area operates under a direct planting system, cultivating soybeans in the summer and rotating with wheat or corn during the winter. The second area is used as pasture during the winter and planted with corn or soybeans in the summer. ECa data from 2013 to 2016, along with chemical and physical soil attributes from 2013, were retrieved from our laboratory database. Additionally, ECa data were collected on 19/05/2022, 18/10/2022, and 10/03/2023. All ECa measurements were performed using an EM38-MK2 conductivity meter in horizontal dipolar and drag mode. ECa normalization methods such as range, average, and standard score were employed to mitigate temporal influences partially. Data was processed using the AgDataBox web platform, which included data cleaning, data interpolation, creation of thematic maps, delineation of management zones, and spatial correlation matrix procedures. Thematic maps revealed that ECa spatial variability exhibited a stable pattern. Both areas showed significant cross-correlation among topography and most soil chemical and physical attributes. The study concluded that ECa measurement could be performed once as a co-variable for interpolating other variables since the ECa pattern remained stable in both areas. The average method was the most effective normalization method in both areas. Furthermore, management zones (MZs) were delineated using equivalent normalized ECa (ECa_Eq) (mS/m) with the three data normalization methods. The agreement between MZs was sufficient to conclude that the influence of the normalization methods can be ignored.</p>","PeriodicalId":20423,"journal":{"name":"Precision Agriculture","volume":"10 1","pages":""},"PeriodicalIF":6.2,"publicationDate":"2024-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142820857","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-12-14DOI: 10.1007/s11119-024-10207-z
Damian Oswald, Alireza Pourreza, Momtanu Chakraborty, Sat Darshan S. Khalsa, Patrick H. Brown
Nitrogen (N) is vital for plant growth, but its imbalance can negatively affect crop yields, the environment, and water quality. This is especially crucial for California’s almond orchards, which are the most N-hungry nut crop and require substantial N for high productivity. The current practices of uniform and extensive N application lead to N leaching into the groundwater, creating environmental hazards. Traditional remote sensing methods often rely on data-driven approaches that work well statistically (achieving a high R2 value) with one dataset but aren’t adaptable across different datasets. To create a more robust, data-driven model, one would typically need a vast and varied collection of datasets. Our goal, however, is to develop a more universally applicable model using smaller datasets, typical of commercial orchards, that can accurately estimate N content in tree canopies, regardless of differences in spatial, spectral, and temporal data. In this study, we investigate and evaluate multiple remote sensing approaches for estimating N concentration in Californian almonds, utilizing hyperspectral imaging at the canopy level. We assess various classical vegetation indices, machine learning models, and a physics-informed 3D radiative transfer model. While cross-validated results show comparable results for radiative transfer models and best-performing machine learning models, most single vegetation indices are not capable of exceeding the baseline model (:fleft(mathbf{x}right)=bar{y}) and thus had R2 value less than 0. Despite being less commonly used, 3D radiative transfer modeling shows promise as a strong and adaptable method, producing results that are comparable to the best machine learning models.
{"title":"3D radiative transfer modeling of almond canopy for nitrogen estimation by hyperspectral imaging","authors":"Damian Oswald, Alireza Pourreza, Momtanu Chakraborty, Sat Darshan S. Khalsa, Patrick H. Brown","doi":"10.1007/s11119-024-10207-z","DOIUrl":"https://doi.org/10.1007/s11119-024-10207-z","url":null,"abstract":"<p>Nitrogen (N) is vital for plant growth, but its imbalance can negatively affect crop yields, the environment, and water quality. This is especially crucial for California’s almond orchards, which are the most N-hungry nut crop and require substantial N for high productivity. The current practices of uniform and extensive N application lead to N leaching into the groundwater, creating environmental hazards. Traditional remote sensing methods often rely on data-driven approaches that work well statistically (achieving a high R<sup>2</sup> value) with one dataset but aren’t adaptable across different datasets. To create a more robust, data-driven model, one would typically need a vast and varied collection of datasets. Our goal, however, is to develop a more universally applicable model using smaller datasets, typical of commercial orchards, that can accurately estimate N content in tree canopies, regardless of differences in spatial, spectral, and temporal data. In this study, we investigate and evaluate multiple remote sensing approaches for estimating N concentration in Californian almonds, utilizing hyperspectral imaging at the canopy level. We assess various classical vegetation indices, machine learning models, and a physics-informed 3D radiative transfer model. While cross-validated results show comparable results for radiative transfer models and best-performing machine learning models, most single vegetation indices are not capable of exceeding the baseline model <span>(:fleft(mathbf{x}right)=bar{y})</span> and thus had R<sup>2</sup> value less than 0. Despite being less commonly used, 3D radiative transfer modeling shows promise as a strong and adaptable method, producing results that are comparable to the best machine learning models.</p>","PeriodicalId":20423,"journal":{"name":"Precision Agriculture","volume":"13 1","pages":""},"PeriodicalIF":6.2,"publicationDate":"2024-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142820892","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-12-13DOI: 10.1007/s11119-024-10197-y
Anna Petrovskaia, Mikhail Gasanov, Artyom Nikitin, Polina Tregubova, Ivan Oseledets
Soil sampling is crucial for capturing soil variability and obtaining comprehensive soil information for agricultural planning. This article evaluates the potential of MaxVol, an optimal design method for soil sampling based on selecting locations with significant dissimilarities. We compared MaxVol with conditional Latin hypercube sampling (cLHS), simple random sampling (SRS) and Kennard-Stone algorithm (KS) to evaluate their ability to capture soil data distribution. We modeled spatial distributions of soil properties using simple kriging (SK) and regression kriging (RK) interpolation techniques and assessed the interpolation quality using Root Mean Square Error. According to the results, MaxVol performs similarly or better than popular sampling designs in describing soil distributions, particularly with a smaller number of points. This is valuable for costly and time-consuming field surveys. Both MaxVol and Kennard-Stone are deterministic algorithms, unlike cLHS and random sampling, providing a reliable sampling scheme. Thus, the proposed MaxVol algorithm enables obtaining soil property distributions based on environmental features.
{"title":"Maximizing dataset variability in agricultural surveys with spatial sampling based on MaxVol matrix approximation","authors":"Anna Petrovskaia, Mikhail Gasanov, Artyom Nikitin, Polina Tregubova, Ivan Oseledets","doi":"10.1007/s11119-024-10197-y","DOIUrl":"https://doi.org/10.1007/s11119-024-10197-y","url":null,"abstract":"<p>Soil sampling is crucial for capturing soil variability and obtaining comprehensive soil information for agricultural planning. This article evaluates the potential of MaxVol, an optimal design method for soil sampling based on selecting locations with significant dissimilarities. We compared MaxVol with conditional Latin hypercube sampling (cLHS), simple random sampling (SRS) and Kennard-Stone algorithm (KS) to evaluate their ability to capture soil data distribution. We modeled spatial distributions of soil properties using simple kriging (SK) and regression kriging (RK) interpolation techniques and assessed the interpolation quality using Root Mean Square Error. According to the results, MaxVol performs similarly or better than popular sampling designs in describing soil distributions, particularly with a smaller number of points. This is valuable for costly and time-consuming field surveys. Both MaxVol and Kennard-Stone are deterministic algorithms, unlike cLHS and random sampling, providing a reliable sampling scheme. Thus, the proposed MaxVol algorithm enables obtaining soil property distributions based on environmental features.</p>","PeriodicalId":20423,"journal":{"name":"Precision Agriculture","volume":"12 1","pages":""},"PeriodicalIF":6.2,"publicationDate":"2024-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142816458","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-12-07DOI: 10.1007/s11119-024-10212-2
Patrick Filippi, Si Yang Han, Thomas F.A. Bishop
There has been a recent surge in the number of studies that aim to model crop yield using data-driven approaches. This has largely come about due to the increasing amounts of remote sensing (e.g. satellite imagery) and precision agriculture data available (e.g. high-resolution crop yield monitor data), as well as the abundance of machine learning modelling approaches. However, there are several common issues in published studies in the field of precision agriculture (PA) that must be addressed. This includes the terminology used in relation to crop yield modelling, predicting, forecasting, and interpolating, as well as the way that models are calibrated and validated. As a typical example, many studies will take a crop yield map or several plots within a field from a single season, build a model with satellite or Unmanned Aerial Vehicle (UAV) imagery, validate using data-splitting or some kind of cross-validation (e.g. k-fold), and say that it is a ‘prediction’ or ‘forecast’ of crop yield. However, this poses a problem as the approach is not testing the forecasting ability of the model, as it is built on the same season that it is then validating with, thus giving a substantial overestimation of the value for decision-making, such as an application of fertiliser in-season. This is an all-too-common flaw in the logic construct of many published studies. Moving forward, it is essential that clear definitions and guidelines for data-driven yield modelling and validation are outlined so that there is a greater connection between the goal of the study, and the actual study outputs/outcomes. To demonstrate this, the current study uses a case study dataset from a collection of large neighbouring farms in New South Wales, Australia. The dataset includes 160 yield maps of winter wheat (Triticum aestivum) covering 26,400 hectares over a 10-year period (2014–2023). Machine learning crop yield models are built at 30 m spatial resolution with a suite of predictor data layers that relate to crop yield. This includes datasets that represent soil variation, terrain, weather, and satellite imagery of the crop. Predictions are made at both the within-field (30 m), and field resolution. Crop yield predictions are useful for an array of applications, so four different experiments were set up to reflect different scenarios. This included Experiment 1: forecasting yield mid-season (e.g. for mid-season fertilisation), Experiment 2: forecasting yield late-season (e.g. for late-season logistics/forward selling), Experiment 3: predicting yield in a previous season for a field with no yield data in a season, and Experiment 4: predicting yield in a previous season for a field with some yield data (e.g. two combine harvesters, but only one was fitted with a yield monitor). This study showcases how different model calibration and validation approaches clearly impact prediction quality, and therefore how they should be interpreted in data-driven crop yield modelling
{"title":"On crop yield modelling, predicting, and forecasting and addressing the common issues in published studies","authors":"Patrick Filippi, Si Yang Han, Thomas F.A. Bishop","doi":"10.1007/s11119-024-10212-2","DOIUrl":"https://doi.org/10.1007/s11119-024-10212-2","url":null,"abstract":"<p>There has been a recent surge in the number of studies that aim to model crop yield using data-driven approaches. This has largely come about due to the increasing amounts of remote sensing (e.g. satellite imagery) and precision agriculture data available (e.g. high-resolution crop yield monitor data), as well as the abundance of machine learning modelling approaches. However, there are several common issues in published studies in the field of precision agriculture (PA) that must be addressed. This includes the terminology used in relation to crop yield modelling, predicting, forecasting, and interpolating, as well as the way that models are calibrated and validated. As a typical example, many studies will take a crop yield map or several plots within a field from a single season, build a model with satellite or Unmanned Aerial Vehicle (UAV) imagery, validate using data-splitting or some kind of cross-validation (e.g. k-fold), and say that it is a ‘prediction’ or ‘forecast’ of crop yield. However, this poses a problem as the approach is not testing the forecasting ability of the model, as it is built on the same season that it is then validating with, thus giving a substantial overestimation of the value for decision-making, such as an application of fertiliser in-season. This is an all-too-common flaw in the logic construct of many published studies. Moving forward, it is essential that clear definitions and guidelines for data-driven yield modelling and validation are outlined so that there is a greater connection between the goal of the study, and the actual study outputs/outcomes. To demonstrate this, the current study uses a case study dataset from a collection of large neighbouring farms in New South Wales, Australia. The dataset includes 160 yield maps of winter wheat (<i>Triticum aestivum</i>) covering 26,400 hectares over a 10-year period (2014–2023). Machine learning crop yield models are built at 30 m spatial resolution with a suite of predictor data layers that relate to crop yield. This includes datasets that represent soil variation, terrain, weather, and satellite imagery of the crop. Predictions are made at both the within-field (30 m), and field resolution. Crop yield predictions are useful for an array of applications, so four different experiments were set up to reflect different scenarios. This included Experiment 1: forecasting yield mid-season (e.g. for mid-season fertilisation), Experiment 2: forecasting yield late-season (e.g. for late-season logistics/forward selling), Experiment 3: predicting yield in a previous season for a field with no yield data in a season, and Experiment 4: predicting yield in a previous season for a field with some yield data (e.g. two combine harvesters, but only one was fitted with a yield monitor). This study showcases how different model calibration and validation approaches clearly impact prediction quality, and therefore how they should be interpreted in data-driven crop yield modelling ","PeriodicalId":20423,"journal":{"name":"Precision Agriculture","volume":"8 1","pages":""},"PeriodicalIF":6.2,"publicationDate":"2024-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142788509","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-12-04DOI: 10.1007/s11119-024-10204-2
Zeeshan Haydar, Travis J. Esau, Aitazaz A. Farooque, Farhat Abbas, Andrew Fraser
Efficient mechanical harvesting of wild blueberries across uneven topographies calls for precise header height adjustments to optimize fruit picking. Conventionally, an operator requires manual adjustment of the harvester header to accommodate the spatial variations in plant height, fruit zone, and field terrain. This can result in inadequate header positioning, which leads to berry losses and increased operator stress. This study aimed to investigate the integration of machine learning techniques with real-time geo-location data to develop an innovative system to automate harvesting operations. A supervised machine learning Random Forest (RF) model was trained based on pre-defined header setting data and integrated with the harvester’s controller to predict and position the header height using real-time geo-location data from the Starfire (SF) 6000 Global Positioning System (GPS) receiver. During harvesting, the system’s performance was evaluated at tractor ground speeds (0.31, 0.45, and 0.58 ms−1) and segment lengths (5, 10, and 15 m). Results indicated that segment size minimally affected the system’s ability to adjust header height. However, at the lowest segment length, 5 m, the coefficient of determination was 97.24, 98.12, and 82.71% for the 0.31, 0.45, and 0.58 ms−1, respectively. This study provided convincing results for automating the harvester header based on pre-defined settings, marking a significant step toward complete automation of the wild blueberry harvester. Automation of wild blueberry harvesting can help to increase picking efficiency and enhance profit margins for growers to justify the ever-increasing cost of production.
{"title":"Integration of machine learning models with real-time global positioning data to automate the wild blueberry harvester","authors":"Zeeshan Haydar, Travis J. Esau, Aitazaz A. Farooque, Farhat Abbas, Andrew Fraser","doi":"10.1007/s11119-024-10204-2","DOIUrl":"https://doi.org/10.1007/s11119-024-10204-2","url":null,"abstract":"<p>Efficient mechanical harvesting of wild blueberries across uneven topographies calls for precise header height adjustments to optimize fruit picking. Conventionally, an operator requires manual adjustment of the harvester header to accommodate the spatial variations in plant height, fruit zone, and field terrain. This can result in inadequate header positioning, which leads to berry losses and increased operator stress. This study aimed to investigate the integration of machine learning techniques with real-time geo-location data to develop an innovative system to automate harvesting operations. A supervised machine learning Random Forest (RF) model was trained based on pre-defined header setting data and integrated with the harvester’s controller to predict and position the header height using real-time geo-location data from the Starfire (SF) 6000 Global Positioning System (GPS) receiver. During harvesting, the system’s performance was evaluated at tractor ground speeds (0.31, 0.45, and 0.58 ms<sup>−1</sup>) and segment lengths (5, 10, and 15 m). Results indicated that segment size minimally affected the system’s ability to adjust header height. However, at the lowest segment length, 5 m, the coefficient of determination was 97.24, 98.12, and 82.71% for the 0.31, 0.45, and 0.58 ms<sup>−1</sup>, respectively. This study provided convincing results for automating the harvester header based on pre-defined settings, marking a significant step toward complete automation of the wild blueberry harvester. Automation of wild blueberry harvesting can help to increase picking efficiency and enhance profit margins for growers to justify the ever-increasing cost of production.</p>","PeriodicalId":20423,"journal":{"name":"Precision Agriculture","volume":"12 1","pages":""},"PeriodicalIF":6.2,"publicationDate":"2024-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142763339","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2024-12-02DOI: 10.1007/s11119-024-10200-6
Bertin Takoutsing, Gerard B. M. Heuvelink, Ermias Aynekulu, Keith D. Shepherd
Crop models can improve our understanding of crop responses to environmental conditions and farming practices. However, uncertainties in model inputs can notably impact the quality of the outputs. This study aimed at quantifying the uncertainty in soil information and analyse how it propagates through the Quantitative Evaluation of Fertility of Tropical Soils model to affect yield and fertilizer recommendation rates using Monte Carlo simulation. Additional objectives were to analyse the uncertainty contributions of the individual soil inputs to model output uncertainty and discuss strategies to communicate uncertainty to end-users. The results showed that the impact of soil input uncertainty on model output uncertainty was significant and varied spatially. Comparison of the results of a deterministic model run with the mean of the Monte Carlo simulation runs showed systematic differences in yield predictions, with Monte Carlo simulations on average predicting a yield that was 0.62 tonnes ha−1 lower than the deterministic run. Similar systematic differences were observed for fertilizer recommendations, with Monte Carlo simulations recommending up to 59, 42, and 20 kg ha−1 lower nitrogen (N), phosphorous (P), and potassium (K) fertilizer applications, respectively. Stochastic sensitivity analysis showed that pH was the main source of uncertainty for K fertilizer (81.6%) and that soil organic carbon contributed most to the uncertainty of N fertilizer application (97%). Uncertainty in P fertilizer application mostly came from uncertainty in extractable phosphorus (55%) and exchangeable potassium (20%). A threshold probability map designed using statistical predictions served as a visual aid that could enable farmers to swiftly make informed decisions about fertilizer application locations. The study highlights the importance of refining the accuracy of soil maps as well as incorporating uncertainty in input data, which improves QUEFTS model predictions and offers valuable insights into the relationship between soil information accuracy and reliable crop modeling for sustainable agricultural decisions.
作物模型可以提高我们对作物对环境条件和耕作方式的反应的理解。然而,模型输入中的不确定性会显著影响输出的质量。本研究旨在量化土壤信息中的不确定性,并利用蒙特卡洛模拟分析其如何通过热带土壤肥力定量评估模型传播,从而影响产量和肥料推荐率。其他目标是分析单个土壤输入对模型输出不确定性的不确定性贡献,并讨论将不确定性传达给最终用户的策略。结果表明:土壤输入不确定性对模型输出不确定性的影响显著,且存在空间差异。将确定性模型运行的结果与蒙特卡罗模拟运行的平均值进行比较,显示出产量预测的系统性差异,蒙特卡罗模拟平均预测的产量比确定性运行低0.62吨ha - 1。在肥料建议方面也观察到类似的系统差异,蒙特卡罗模拟建议分别减少氮肥(N)、磷(P)和钾(K)施用59、42和20 kg ha - 1。随机敏感性分析表明,pH是钾肥不确定性的主要来源(81.6%),土壤有机碳对氮肥不确定性的贡献最大(97%)。磷肥施用的不确定性主要来自可提取磷(55%)和交换性钾(20%)的不确定性。使用统计预测设计的阈值概率图作为视觉辅助工具,可以使农民迅速做出有关施肥地点的明智决定。该研究强调了提高土壤图准确性以及将不确定性纳入输入数据的重要性,这可以提高QUEFTS模型的预测,并为土壤信息准确性和可靠的作物建模之间的关系提供有价值的见解,以促进可持续农业决策。
{"title":"Modelling and mapping maize yields and making fertilizer recommendations with uncertain soil information","authors":"Bertin Takoutsing, Gerard B. M. Heuvelink, Ermias Aynekulu, Keith D. Shepherd","doi":"10.1007/s11119-024-10200-6","DOIUrl":"https://doi.org/10.1007/s11119-024-10200-6","url":null,"abstract":"<p>Crop models can improve our understanding of crop responses to environmental conditions and farming practices. However, uncertainties in model inputs can notably impact the quality of the outputs. This study aimed at quantifying the uncertainty in soil information and analyse how it propagates through the Quantitative Evaluation of Fertility of Tropical Soils model to affect yield and fertilizer recommendation rates using Monte Carlo simulation. Additional objectives were to analyse the uncertainty contributions of the individual soil inputs to model output uncertainty and discuss strategies to communicate uncertainty to end-users. The results showed that the impact of soil input uncertainty on model output uncertainty was significant and varied spatially. Comparison of the results of a deterministic model run with the mean of the Monte Carlo simulation runs showed systematic differences in yield predictions, with Monte Carlo simulations on average predicting a yield that was 0.62 tonnes ha<sup>−1</sup> lower than the deterministic run. Similar systematic differences were observed for fertilizer recommendations, with Monte Carlo simulations recommending up to 59, 42, and 20 kg ha<sup>−1</sup> lower nitrogen (N), phosphorous (P), and potassium (K) fertilizer applications, respectively. Stochastic sensitivity analysis showed that pH was the main source of uncertainty for K fertilizer (81.6%) and that soil organic carbon contributed most to the uncertainty of N fertilizer application (97%). Uncertainty in P fertilizer application mostly came from uncertainty in extractable phosphorus (55%) and exchangeable potassium (20%). A threshold probability map designed using statistical predictions served as a visual aid that could enable farmers to swiftly make informed decisions about fertilizer application locations. The study highlights the importance of refining the accuracy of soil maps as well as incorporating uncertainty in input data, which improves QUEFTS model predictions and offers valuable insights into the relationship between soil information accuracy and reliable crop modeling for sustainable agricultural decisions.</p>","PeriodicalId":20423,"journal":{"name":"Precision Agriculture","volume":"8 1","pages":""},"PeriodicalIF":6.2,"publicationDate":"2024-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142758160","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}