Tomasz Mróz, Sahameh Shafiee, Jose Crossa, Osval A. Montesinos-Lopez, Morten Lillemo
{"title":"通过预算相机获得的多光谱基因型相似性,可在单一和多环境情景下预测春小麦的籽粒产量并进行基因组选择扩增","authors":"Tomasz Mróz, Sahameh Shafiee, Jose Crossa, Osval A. Montesinos-Lopez, Morten Lillemo","doi":"10.1007/s11032-024-01449-w","DOIUrl":null,"url":null,"abstract":"<p>With abundant available genomic data, genomic selection has become routine in many plant breeding programs. Multispectral data captured by UAVs showed potential for grain yield (GY) prediction in many plant species using machine learning; however, the possibilities of utilizing this data to augment genomic prediction models still need to be explored. We collected high-throughput phenotyping (HTP) multispectral data in a genotyped multi-environment large-scale field trial using two cost-effective cameras to fill this gap. We tested back to back the prediction ability of GY prediction models, including genomic (G matrix), multispectral-derived (M matrix), and environmental (E matrix) relationships using best linear unbiased predictor (BLUP) methodology in single and multi-environment scenarios. We discovered that M allows for GY prediction comparable to the G matrix and that models using both G and M matrices show superior accuracies and errors compared with G or M alone, both in single and multi-environment scenarios. We showed that the M matrix is not entirely environment-specific, and the genotypic relationships become more robust with more data capture sessions over the season. We discovered that the optimal time for data capture occurs during grain filling and that camera bands with the highest heritability are important for GY prediction using the M matrix. We showcased that GY prediction can be performed using only an RGB camera, and even a single data capture session can yield valuable data for GY prediction. This study contributes to a better understanding of multispectral data and its relationships. It provides a flexible framework for improving GS protocols without significant investments or software customization.</p>","PeriodicalId":18769,"journal":{"name":"Molecular Breeding","volume":"65 1","pages":""},"PeriodicalIF":2.6000,"publicationDate":"2024-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multispectral-derived genotypic similarities from budget cameras allow grain yield prediction and genomic selection augmentation in single and multi-environment scenarios in spring wheat\",\"authors\":\"Tomasz Mróz, Sahameh Shafiee, Jose Crossa, Osval A. Montesinos-Lopez, Morten Lillemo\",\"doi\":\"10.1007/s11032-024-01449-w\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>With abundant available genomic data, genomic selection has become routine in many plant breeding programs. Multispectral data captured by UAVs showed potential for grain yield (GY) prediction in many plant species using machine learning; however, the possibilities of utilizing this data to augment genomic prediction models still need to be explored. We collected high-throughput phenotyping (HTP) multispectral data in a genotyped multi-environment large-scale field trial using two cost-effective cameras to fill this gap. We tested back to back the prediction ability of GY prediction models, including genomic (G matrix), multispectral-derived (M matrix), and environmental (E matrix) relationships using best linear unbiased predictor (BLUP) methodology in single and multi-environment scenarios. We discovered that M allows for GY prediction comparable to the G matrix and that models using both G and M matrices show superior accuracies and errors compared with G or M alone, both in single and multi-environment scenarios. We showed that the M matrix is not entirely environment-specific, and the genotypic relationships become more robust with more data capture sessions over the season. We discovered that the optimal time for data capture occurs during grain filling and that camera bands with the highest heritability are important for GY prediction using the M matrix. We showcased that GY prediction can be performed using only an RGB camera, and even a single data capture session can yield valuable data for GY prediction. This study contributes to a better understanding of multispectral data and its relationships. It provides a flexible framework for improving GS protocols without significant investments or software customization.</p>\",\"PeriodicalId\":18769,\"journal\":{\"name\":\"Molecular Breeding\",\"volume\":\"65 1\",\"pages\":\"\"},\"PeriodicalIF\":2.6000,\"publicationDate\":\"2024-01-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Molecular Breeding\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://doi.org/10.1007/s11032-024-01449-w\",\"RegionNum\":3,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AGRONOMY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecular Breeding","FirstCategoryId":"97","ListUrlMain":"https://doi.org/10.1007/s11032-024-01449-w","RegionNum":3,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRONOMY","Score":null,"Total":0}
引用次数: 0
摘要
随着基因组数据的不断丰富,基因组选择已成为许多植物育种项目的常规做法。无人机采集的多光谱数据显示了利用机器学习预测许多植物物种谷物产量(GY)的潜力;然而,利用这些数据增强基因组预测模型的可能性仍有待探索。为了填补这一空白,我们在一项基因分型多环境大规模田间试验中使用两台经济高效的相机收集了高通量表型(HTP)多光谱数据。我们采用最佳线性无偏预测(BLUP)方法,在单一环境和多环境情景下背靠背测试了 GY 预测模型的预测能力,包括基因组(G 矩阵)、多光谱衍生(M 矩阵)和环境(E 矩阵)关系。我们发现,M 矩阵可以预测与 G 矩阵相当的 GY,而同时使用 G 矩阵和 M 矩阵的模型在单一环境和多环境场景下的准确度和误差都优于单独使用 G 矩阵或 M 矩阵的模型。我们的研究表明,M 矩阵并不完全针对特定环境,随着季节中数据采集次数的增加,基因型关系会变得更加稳健。我们发现,数据采集的最佳时间是在谷粒灌浆期,而且遗传率最高的相机带对使用 M 矩阵预测 GY 非常重要。我们展示了仅使用 RGB 摄像机就能进行 GY 预测,甚至单次数据采集也能为 GY 预测提供有价值的数据。这项研究有助于更好地理解多光谱数据及其关系。它提供了一个灵活的框架,可在无需大量投资或软件定制的情况下改进 GS 协议。
Multispectral-derived genotypic similarities from budget cameras allow grain yield prediction and genomic selection augmentation in single and multi-environment scenarios in spring wheat
With abundant available genomic data, genomic selection has become routine in many plant breeding programs. Multispectral data captured by UAVs showed potential for grain yield (GY) prediction in many plant species using machine learning; however, the possibilities of utilizing this data to augment genomic prediction models still need to be explored. We collected high-throughput phenotyping (HTP) multispectral data in a genotyped multi-environment large-scale field trial using two cost-effective cameras to fill this gap. We tested back to back the prediction ability of GY prediction models, including genomic (G matrix), multispectral-derived (M matrix), and environmental (E matrix) relationships using best linear unbiased predictor (BLUP) methodology in single and multi-environment scenarios. We discovered that M allows for GY prediction comparable to the G matrix and that models using both G and M matrices show superior accuracies and errors compared with G or M alone, both in single and multi-environment scenarios. We showed that the M matrix is not entirely environment-specific, and the genotypic relationships become more robust with more data capture sessions over the season. We discovered that the optimal time for data capture occurs during grain filling and that camera bands with the highest heritability are important for GY prediction using the M matrix. We showcased that GY prediction can be performed using only an RGB camera, and even a single data capture session can yield valuable data for GY prediction. This study contributes to a better understanding of multispectral data and its relationships. It provides a flexible framework for improving GS protocols without significant investments or software customization.
期刊介绍:
Molecular Breeding is an international journal publishing papers on applications of plant molecular biology, i.e., research most likely leading to practical applications. The practical applications might relate to the Developing as well as the industrialised World and have demonstrable benefits for the seed industry, farmers, processing industry, the environment and the consumer.
All papers published should contribute to the understanding and progress of modern plant breeding, encompassing the scientific disciplines of molecular biology, biochemistry, genetics, physiology, pathology, plant breeding, and ecology among others.
Molecular Breeding welcomes the following categories of papers: full papers, short communications, papers describing novel methods and review papers. All submission will be subject to peer review ensuring the highest possible scientific quality standards.
Molecular Breeding core areas:
Molecular Breeding will consider manuscripts describing contemporary methods of molecular genetics and genomic analysis, structural and functional genomics in crops, proteomics and metabolic profiling, abiotic stress and field evaluation of transgenic crops containing particular traits. Manuscripts on marker assisted breeding are also of major interest, in particular novel approaches and new results of marker assisted breeding, QTL cloning, integration of conventional and marker assisted breeding, and QTL studies in crop plants.