Classical and machine learning tools for identifying yellow-seeded Brassica napus by fusion of hyperspectral features.

IF 2.8 3区 生物学 Q2 GENETICS & HEREDITY Frontiers in Genetics Pub Date : 2025-01-15 eCollection Date: 2024-01-01 DOI:10.3389/fgene.2024.1518205
Fan Liu, Fang Wang, Zaiqi Zhang, Liang Cao, Jinran Wu, You-Gan Wang
{"title":"Classical and machine learning tools for identifying yellow-seeded <i>Brassica napus</i> by fusion of hyperspectral features.","authors":"Fan Liu, Fang Wang, Zaiqi Zhang, Liang Cao, Jinran Wu, You-Gan Wang","doi":"10.3389/fgene.2024.1518205","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Due to its favorable traits-such as lower lignin content, higher oil concentration, and increased protein levels-the genetic improvement of yellow-seeded rapeseed has attracted more attention than other rapeseed color variations. Traditionally, yellow-seeded rapeseed has been identified visually, but the complex variability in the seed coat color of <i>Brassica napus</i> has made manual identification challenging and often inaccurate. Another method, using the RGB color system, is frequently employed but is sensitive to photographic conditions, including lighting and camera settings.</p><p><strong>Methods: </strong>We present four data-driven models to identify yellow-seeded <i>B. napus</i> using hyperspectral features combined with simple yet intelligent techniques. One model employs partial least squares regression (PLSR) to predict the R, G, and B color channels, effectively distinguishing yellow-seeded varieties from others according to globally accepted yellow-seed classification protocols. Another model uses logistic regression (Logit-R) to produce a probability-based assessment of yellow-seeded status. Additionally, we implement two intelligent models, random forest and support vector classifier to evaluate features selected through lasso-penalized logistic regression.</p><p><strong>Results and discussion: </strong>Our findings indicate significant recognition accuracies of 96.55% and 98% for the PLSR and Logit-R models, respectively, aligning closely with the accuracy of previous methods. This approach represents a meaningful advancement in identifying yellow-seeded rapeseed, with high recognition accuracy demonstrating the practical applicability of these models.</p>","PeriodicalId":12750,"journal":{"name":"Frontiers in Genetics","volume":"15 ","pages":"1518205"},"PeriodicalIF":2.8000,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11774891/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Genetics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.3389/fgene.2024.1518205","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

Abstract

Introduction: Due to its favorable traits-such as lower lignin content, higher oil concentration, and increased protein levels-the genetic improvement of yellow-seeded rapeseed has attracted more attention than other rapeseed color variations. Traditionally, yellow-seeded rapeseed has been identified visually, but the complex variability in the seed coat color of Brassica napus has made manual identification challenging and often inaccurate. Another method, using the RGB color system, is frequently employed but is sensitive to photographic conditions, including lighting and camera settings.

Methods: We present four data-driven models to identify yellow-seeded B. napus using hyperspectral features combined with simple yet intelligent techniques. One model employs partial least squares regression (PLSR) to predict the R, G, and B color channels, effectively distinguishing yellow-seeded varieties from others according to globally accepted yellow-seed classification protocols. Another model uses logistic regression (Logit-R) to produce a probability-based assessment of yellow-seeded status. Additionally, we implement two intelligent models, random forest and support vector classifier to evaluate features selected through lasso-penalized logistic regression.

Results and discussion: Our findings indicate significant recognition accuracies of 96.55% and 98% for the PLSR and Logit-R models, respectively, aligning closely with the accuracy of previous methods. This approach represents a meaningful advancement in identifying yellow-seeded rapeseed, with high recognition accuracy demonstrating the practical applicability of these models.

Abstract Image

Abstract Image

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于融合高光谱特征的黄种甘蓝型油菜的经典和机器学习识别方法。
黄籽油菜籽由于其木质素含量低、含油量高、蛋白质含量高等优良性状,其遗传改良比其他油菜籽颜色变异更受关注。传统上,黄籽油菜籽是通过视觉识别的,但由于甘蓝型油菜种皮颜色的复杂变化,使得人工识别具有挑战性,而且往往不准确。另一种方法,使用RGB色彩系统,经常被采用,但对摄影条件很敏感,包括照明和相机设置。方法:利用高光谱特征与简单智能技术相结合,建立了四种数据驱动的黄籽甘蓝型油菜识别模型。其中一个模型采用偏最小二乘回归(PLSR)来预测R、G和B颜色通道,根据全球公认的黄种子分类协议,有效区分黄种子品种。另一个模型使用逻辑回归(Logit-R)对黄种子状态进行基于概率的评估。此外,我们实现了随机森林和支持向量分类器两个智能模型来评估通过套索惩罚逻辑回归选择的特征。结果与讨论:我们的研究结果表明,PLSR和Logit-R模型的识别准确率分别为96.55%和98%,与之前方法的准确率密切相关。该方法在黄籽油菜籽识别方面取得了有意义的进展,具有较高的识别精度,证明了该模型的实用性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Frontiers in Genetics
Frontiers in Genetics Biochemistry, Genetics and Molecular Biology-Molecular Medicine
CiteScore
5.50
自引率
8.10%
发文量
3491
审稿时长
14 weeks
期刊介绍: Frontiers in Genetics publishes rigorously peer-reviewed research on genes and genomes relating to all the domains of life, from humans to plants to livestock and other model organisms. Led by an outstanding Editorial Board of the world’s leading experts, this multidisciplinary, open-access journal is at the forefront of communicating cutting-edge research to researchers, academics, clinicians, policy makers and the public. The study of inheritance and the impact of the genome on various biological processes is well documented. However, the majority of discoveries are still to come. A new era is seeing major developments in the function and variability of the genome, the use of genetic and genomic tools and the analysis of the genetic basis of various biological phenomena.
期刊最新文献
Research advancements on the role of CircRNAs in cartilage injury within osteoarthritis. Identification of gene expression signatures associated with neuroinflammation in discogenic sciatica using machine learning and experimental validation. Defects in PDIA4 increase individuals' susceptibility to congenital heart disease. Co-occurring DMD, GJA1, and novel FYCO1 variants in a proband from a consanguineous oculodentodigital dysplasia family: a rare multi-locus case report. Exploring the strengths and limitations of AI-driven variant prioritization versus manual curation in inborn errors of immunity.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1