Early detection of tomato spotted wilt virus infection in tobacco using the hyperspectral imaging technique and machine learning algorithms
Qing Gu, Li Sheng, Tianhao Zhang, Yuwen Lu, Zhijun Zhang, Kefeng Zheng, Hao Hu, Hongkui Zhou
Computers and Electronics in Agriculture, Volume 167, Article 105066. Published 2019-12-01. DOI: 10.1016/j.compag.2019.105066
https://www.sciencedirect.com/science/article/pii/S0168169919304089
Impact factor: 7.7. JCR: Q1 (Agriculture, Multidisciplinary).
Citations: 51
Abstract
Hyperspectral imaging was used for the non-destructive, early-stage detection of tomato spotted wilt virus (TSWV) infection in tobacco. Spectra from inoculated and healthy tobacco plants, covering 400 to 1000 nm in 128 bands, were analyzed using three wavelength selection methods (successive projections algorithm (SPA), boosted regression tree (BRT), and genetic algorithm (GA)) and four machine learning (ML) techniques (BRT, support vector machine (SVM), random forest (RF), and classification and regression trees (CART)). Models built with the BRT algorithm on the wavelengths selected by SPA performed best in 10-fold cross-validation, with a mean overall accuracy of 85.2% and an area under the receiver operating characteristic curve (AUC) of 0.932. The band selection results and the variable contribution analysis in BRT modeling jointly showed that the near-infrared (NIR) spectral region is informative for differentiating infected from healthy tobacco leaves. The post-inoculation period was divided into stages according to molecular identification and visual observation. The classification results at these stages indicated that hyperspectral imaging data combined with ML methods and wavelength selection algorithms can be used for the early detection of TSWV in tobacco, both at the presymptomatic stage and during the period before systemic infection can be detected by molecular identification.
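The abstract names the successive projections algorithm (SPA) as the wavelength selection step behind the best model but does not describe it. The following is a minimal pure-Python sketch of the textbook projection loop, not the authors' implementation. It starts from one band and repeatedly adds the band whose spectrum is least explained by the band chosen just before it. The function name `spa_select`, the fixed `start` band, and the toy data are assumptions made for illustration. Full SPA also chooses the starting band and the number of bands by validation error, which is omitted here.

```python
def _dot(u, v):
    """Inner product of two equal-length sequences."""
    return sum(a * b for a, b in zip(u, v))


def spa_select(spectra, n_select, start=0):
    """Pick up to n_select band indices with low mutual collinearity.

    spectra  : list of samples, each a list of reflectance values per band
    n_select : number of bands to pick (hypothetical parameter name)
    start    : index of the first band (assumed fixed here, not optimized)
    """
    if not spectra or not spectra[0]:
        raise ValueError("spectra must contain at least one sample and one band")
    n_bands = len(spectra[0])
    if not 1 <= n_select <= n_bands:
        raise ValueError("n_select must be between 1 and the number of bands")
    if not 0 <= start < n_bands:
        raise ValueError("start must be a valid band index")

    # Work on band vectors (columns), one value per sample.
    columns = [[row[j] for row in spectra] for j in range(n_bands)]
    selected = [start]

    while len(selected) < n_select:
        ref = columns[selected[-1]]
        ref_norm2 = _dot(ref, ref)
        if ref_norm2 == 0:
            break  # remaining bands carry no independent information

        # Project every band onto the subspace orthogonal to the last pick.
        projected = []
        for col in columns:
            coef = _dot(col, ref) / ref_norm2
            projected.append([a - coef * b for a, b in zip(col, ref)])
        columns = projected

        # The band with the largest residual norm is the least collinear one.
        candidates = [j for j in range(n_bands) if j not in selected]
        best = max(candidates, key=lambda j: _dot(columns[j], columns[j]))
        selected.append(best)

    return selected


# Toy data (invented): band 1 is exactly twice band 0, band 2 is independent.
demo_spectra = [
    [1, 2, 1],
    [2, 4, 0],
    [3, 6, 1],
    [4, 8, 0],
]
print(spa_select(demo_spectra, 2, start=0))  # band 1 is skipped as collinear
```

In a pipeline like the one summarized above, the returned indices would select a subset of the 128 bands before a classifier is trained. The classifier and the cross-validation are outside the scope of this sketch.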
About the journal:
Computers and Electronics in Agriculture provides international coverage of advances in computer hardware, software, electronic instrumentation, and control systems applied to agricultural challenges. Encompassing agronomy, horticulture, forestry, aquaculture, and animal farming, the journal publishes original papers, reviews, and application notes. It explores the use of computers and electronics in plant or animal agricultural production, covering topics such as agricultural soils, water, pests, controlled environments, and waste. The scope extends to on-farm post-harvest operations and relevant technologies, including artificial intelligence, sensors, machine vision, robotics, networking, and simulation modeling. Its companion journal, Smart Agricultural Technology, continues the focus on smart applications in production agriculture.