{"title":"Identification of high-oil content soybean using hyperspectral reflectance and one-dimensional convolutional neural network","authors":"Yue-shuang Yang, Jianxin Liao, Hongbo Li, Kezhu Tan, Xihai Zhang","doi":"10.1080/00387010.2022.2160463","DOIUrl":null,"url":null,"abstract":"Abstract It is of great significance to identify soybean seeds with high oil content since the oil content of soybean seeds decides oil yield. At present, related researches mostly used machine learning algorithm to identify soybean varieties with small samples. In this study, 5800 spectral data samples of 58 varieties in the range of 400–1000 nm were obtained. An acceptable method that combines hyperspectral imaging with one-dimensional convolutional neural network was proposed to distinguish high oilcontent soybean seeds. Moreover, traditional machine learning models, including support vector machine, k-nearest neighbor algorithm, and partial squares discriminant analysis, were also established in the experimental study. The effects of four preprocessing methods, namely moving window smoothing, standard normal variate, multivariate scattering correction, and Savitzky–Golay, were compared when building support vector machine-based identification models. The results showed that the model using multivariate scattering correction gave better test accuracy (94.5%), indicating that for this study, multivariate scattering correction was a more suitable method than others. Meanwhile, the study compared the performance of the four models by expanding the number of samples. The results showed that the proposed one-dimensional convolutional neural network model was more stable. The average accuracy of the training set and test set was 96% and 93%, respectively. Therefore, hyperspectral data combined with one-dimensional convolutional neural network was effective in identifying soybean seeds with high oil content.","PeriodicalId":1,"journal":{"name":"Accounts of Chemical Research","volume":null,"pages":null},"PeriodicalIF":16.4000,"publicationDate":"2022-12-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Accounts of Chemical Research","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1080/00387010.2022.2160463","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 4
Abstract
Abstract It is of great significance to identify soybean seeds with high oil content since the oil content of soybean seeds decides oil yield. At present, related researches mostly used machine learning algorithm to identify soybean varieties with small samples. In this study, 5800 spectral data samples of 58 varieties in the range of 400–1000 nm were obtained. An acceptable method that combines hyperspectral imaging with one-dimensional convolutional neural network was proposed to distinguish high oilcontent soybean seeds. Moreover, traditional machine learning models, including support vector machine, k-nearest neighbor algorithm, and partial squares discriminant analysis, were also established in the experimental study. The effects of four preprocessing methods, namely moving window smoothing, standard normal variate, multivariate scattering correction, and Savitzky–Golay, were compared when building support vector machine-based identification models. The results showed that the model using multivariate scattering correction gave better test accuracy (94.5%), indicating that for this study, multivariate scattering correction was a more suitable method than others. Meanwhile, the study compared the performance of the four models by expanding the number of samples. The results showed that the proposed one-dimensional convolutional neural network model was more stable. The average accuracy of the training set and test set was 96% and 93%, respectively. Therefore, hyperspectral data combined with one-dimensional convolutional neural network was effective in identifying soybean seeds with high oil content.
期刊介绍:
Accounts of Chemical Research presents short, concise and critical articles offering easy-to-read overviews of basic research and applications in all areas of chemistry and biochemistry. These short reviews focus on research from the author’s own laboratory and are designed to teach the reader about a research project. In addition, Accounts of Chemical Research publishes commentaries that give an informed opinion on a current research problem. Special Issues online are devoted to a single topic of unusual activity and significance.
Accounts of Chemical Research replaces the traditional article abstract with an article "Conspectus." These entries synopsize the research affording the reader a closer look at the content and significance of an article. Through this provision of a more detailed description of the article contents, the Conspectus enhances the article's discoverability by search engines and the exposure for the research.