通过结合光谱和热信息的多变量和多环境基因组预测模型,提高适应美国东南部地区的小麦品系的谷物产量预测准确性。

IF 3.9 2区 生物学 Q1 GENETICS & HEREDITY Plant Genome Pub Date : 2024-11-19 DOI:10.1002/tpg2.20532
Jordan McBreen, Md Ali Babar, Diego Jarquin, Naeem Khan, Steve Harrison, Noah DeWitt, Mohamed Mergoum, Ben Lopez, Richard Boyles, Jeanette Lyerly, J Paul Murphy, Ehsan Shakiba, Russel Sutton, Amir Ibrahim, Kimberly Howell, Jared H Smith, Gina Brown-Guedira, Vijay Tiwari, Nicholas Santantonio, David A Van Sanford
{"title":"通过结合光谱和热信息的多变量和多环境基因组预测模型,提高适应美国东南部地区的小麦品系的谷物产量预测准确性。","authors":"Jordan McBreen, Md Ali Babar, Diego Jarquin, Naeem Khan, Steve Harrison, Noah DeWitt, Mohamed Mergoum, Ben Lopez, Richard Boyles, Jeanette Lyerly, J Paul Murphy, Ehsan Shakiba, Russel Sutton, Amir Ibrahim, Kimberly Howell, Jared H Smith, Gina Brown-Guedira, Vijay Tiwari, Nicholas Santantonio, David A Van Sanford","doi":"10.1002/tpg2.20532","DOIUrl":null,"url":null,"abstract":"<p><p>Enhancing predictive modeling accuracy in wheat (Triticum aestivum) breeding through the integration of high-throughput phenotyping (HTP) data with genomic information is crucial for maximizing genetic gain. In this study, spanning four locations in the southeastern United States over 3 years, models to predict grain yield (GY) were investigated through different cross-validation approaches. The results demonstrate the superiority of multivariate comprehensive models that incorporate both genomic and HTP data, particularly in accurately predicting GY across diverse locations and years. These HTP-incorporating models achieve prediction accuracies ranging from 0.59 to 0.68, compared to 0.40-0.54 for genomic-only models when tested under different prediction scenarios both across years and locations. The comprehensive models exhibit superior generalization to new environments and achieve the highest accuracy when trained on diverse datasets. Predictive accuracy improves as models incorporate data from multiple years, highlighting the importance of considering temporal dynamics in modeling approaches. The study reveals that multivariate prediction outperformed genomic prediction methods in predicting lines across years and locations. The percentage of top 25% lines selected based on multivariate prediction was higher compared to genomic-only models, indicated by higher specificity, which is the proportion of correctly identified top-yielding lines that matched the observed top 25% performance across different sites and years. Additionally, the study addresses the prediction of untested locations based on other locations within the same year and in new years at previously tested locations. Findings show the comprehensive models effectively extrapolate to new environments, highlighting their potential for guiding breeding strategies.</p>","PeriodicalId":49002,"journal":{"name":"Plant Genome","volume":" ","pages":"e20532"},"PeriodicalIF":3.9000,"publicationDate":"2024-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Enhancing prediction accuracy of grain yield in wheat lines adapted to the southeastern United States through multivariate and multi-environment genomic prediction models incorporating spectral and thermal information.\",\"authors\":\"Jordan McBreen, Md Ali Babar, Diego Jarquin, Naeem Khan, Steve Harrison, Noah DeWitt, Mohamed Mergoum, Ben Lopez, Richard Boyles, Jeanette Lyerly, J Paul Murphy, Ehsan Shakiba, Russel Sutton, Amir Ibrahim, Kimberly Howell, Jared H Smith, Gina Brown-Guedira, Vijay Tiwari, Nicholas Santantonio, David A Van Sanford\",\"doi\":\"10.1002/tpg2.20532\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Enhancing predictive modeling accuracy in wheat (Triticum aestivum) breeding through the integration of high-throughput phenotyping (HTP) data with genomic information is crucial for maximizing genetic gain. In this study, spanning four locations in the southeastern United States over 3 years, models to predict grain yield (GY) were investigated through different cross-validation approaches. The results demonstrate the superiority of multivariate comprehensive models that incorporate both genomic and HTP data, particularly in accurately predicting GY across diverse locations and years. These HTP-incorporating models achieve prediction accuracies ranging from 0.59 to 0.68, compared to 0.40-0.54 for genomic-only models when tested under different prediction scenarios both across years and locations. The comprehensive models exhibit superior generalization to new environments and achieve the highest accuracy when trained on diverse datasets. Predictive accuracy improves as models incorporate data from multiple years, highlighting the importance of considering temporal dynamics in modeling approaches. The study reveals that multivariate prediction outperformed genomic prediction methods in predicting lines across years and locations. The percentage of top 25% lines selected based on multivariate prediction was higher compared to genomic-only models, indicated by higher specificity, which is the proportion of correctly identified top-yielding lines that matched the observed top 25% performance across different sites and years. Additionally, the study addresses the prediction of untested locations based on other locations within the same year and in new years at previously tested locations. Findings show the comprehensive models effectively extrapolate to new environments, highlighting their potential for guiding breeding strategies.</p>\",\"PeriodicalId\":49002,\"journal\":{\"name\":\"Plant Genome\",\"volume\":\" \",\"pages\":\"e20532\"},\"PeriodicalIF\":3.9000,\"publicationDate\":\"2024-11-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Plant Genome\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1002/tpg2.20532\",\"RegionNum\":2,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"GENETICS & HEREDITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Plant Genome","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1002/tpg2.20532","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

摘要

在小麦(Triticum aestivum)育种中,通过整合高通量表型(HTP)数据与基因组信息来提高预测模型的准确性,对于最大限度地提高遗传增益至关重要。本研究跨越美国东南部四个地点,历时三年,通过不同的交叉验证方法对预测谷物产量(GY)的模型进行了研究。结果表明,结合基因组和 HTP 数据的多变量综合模型具有优越性,尤其是在准确预测不同地点和年份的谷物产量方面。在不同年份和地点的不同预测情况下进行测试时,这些包含 HTP 的模型的预测准确率在 0.59 至 0.68 之间,而纯基因组模型的预测准确率在 0.40 至 0.54 之间。综合模型对新环境表现出卓越的泛化能力,在不同数据集上进行训练时可获得最高准确率。当模型纳入多年数据时,预测准确率也会提高,这凸显了在建模方法中考虑时间动态的重要性。研究表明,在预测不同年份和地点的品系方面,多元预测优于基因组预测方法。与纯基因组模型相比,基于多元预测方法选出的前 25% 品系的比例更高,这体现在更高的特异性上,特异性是指正确识别出的最高产量品系与不同地点和年份观察到的前 25% 表现相匹配的比例。此外,该研究还根据同年其他地点和以前测试地点新年份的情况,对未经测试的地点进行了预测。研究结果表明,综合模型能有效地推断新环境,突出了其指导育种策略的潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Enhancing prediction accuracy of grain yield in wheat lines adapted to the southeastern United States through multivariate and multi-environment genomic prediction models incorporating spectral and thermal information.

Enhancing predictive modeling accuracy in wheat (Triticum aestivum) breeding through the integration of high-throughput phenotyping (HTP) data with genomic information is crucial for maximizing genetic gain. In this study, spanning four locations in the southeastern United States over 3 years, models to predict grain yield (GY) were investigated through different cross-validation approaches. The results demonstrate the superiority of multivariate comprehensive models that incorporate both genomic and HTP data, particularly in accurately predicting GY across diverse locations and years. These HTP-incorporating models achieve prediction accuracies ranging from 0.59 to 0.68, compared to 0.40-0.54 for genomic-only models when tested under different prediction scenarios both across years and locations. The comprehensive models exhibit superior generalization to new environments and achieve the highest accuracy when trained on diverse datasets. Predictive accuracy improves as models incorporate data from multiple years, highlighting the importance of considering temporal dynamics in modeling approaches. The study reveals that multivariate prediction outperformed genomic prediction methods in predicting lines across years and locations. The percentage of top 25% lines selected based on multivariate prediction was higher compared to genomic-only models, indicated by higher specificity, which is the proportion of correctly identified top-yielding lines that matched the observed top 25% performance across different sites and years. Additionally, the study addresses the prediction of untested locations based on other locations within the same year and in new years at previously tested locations. Findings show the comprehensive models effectively extrapolate to new environments, highlighting their potential for guiding breeding strategies.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Plant Genome
Plant Genome PLANT SCIENCES-GENETICS & HEREDITY
CiteScore
6.00
自引率
4.80%
发文量
93
审稿时长
>12 weeks
期刊介绍: The Plant Genome publishes original research investigating all aspects of plant genomics. Technical breakthroughs reporting improvements in the efficiency and speed of acquiring and interpreting plant genomics data are welcome. The editorial board gives preference to novel reports that use innovative genomic applications that advance our understanding of plant biology that may have applications to crop improvement. The journal also publishes invited review articles and perspectives that offer insight and commentary on recent advances in genomics and their potential for agronomic improvement.
期刊最新文献
Soybean genomics research community strategic plan: A vision for 2024-2028. Enhancing prediction accuracy of grain yield in wheat lines adapted to the southeastern United States through multivariate and multi-environment genomic prediction models incorporating spectral and thermal information. Genome-wide association of an organic naked barley diversity panel identified quantitative trait loci for disease resistance. Association mapping and genomic prediction for processing and end-use quality traits in wheat (Triticum aestivum L.). Characterization of a new barley greenbug resistance gene Rsg4 in the Chinese landrace CI 2458.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1